Skip to main content

AI, Bots, and Canvases, Part II: Cortana to rule the world

Back in 2014 I posited that Cortana would be the intelligent, voice-engaged UI for our increasingly mobile experiences. As a mediator between our "will" and our apps, I thought that she would effectively make apps "invisible" as she became the ambient user interface of our mobile computing lifestyles.

As Microsoft continues to evolve Cortana toward the vision the company sees for its AI digital assistant, I've seen that my 2014 analysis was not entirely correct. Nor was it entirely wrong.

I did not foresee bots. Nor did I identify human language as the next user interface as Satya Nadella, Microsoft's CEO, did this year. My analysis identified Cortana (at least for Microsoft's ecosystem) as the next personal computing UI, not human language.

Still, in 2014 I was two years early with my claim that Cortana would be the next big thing. It wasn't until Build 2016 that Satya Nadella laid out the ambitious details of Microsoft's strategy to enhance Cortana's abilities through extensibility, making her a potential meta-app to a host of intelligent apps, or bots. He made it known that this cross-platform AI was being positioned as a platform for the next stage in the evolution of the personal computing user interface.

Cortana may emerge as "The Next Big Thing" in the age of AI and bots.

Through this vision, we may see Cortana emerge as The Next Big Thing, among the industry's seeming acknowledgment that AI and bots are "The Next Big Thing." As a platform company, Microsoft's approach to Conversations as a Platform certainly plays to its strengths.

I want to give a better picture of how Microsoft is one of the leaders heralding us into the new world in which we may soon find ourselves interacting. A world where bots and AI digital assistants may very well be a part of the verbal conversations and text-based dialogues we engage in daily.

Beginning the conversation

By approaching the personal computing user interaction from the position of conversations as a platform, Microsoft has positioned itself advantageously for the shift to AI and bots. As one of its core strengths, Microsoft is the provider of a host of cross-platform apps and services, including conversation tools such as Skype and Cortana. Additionally, with the Bot Framework, Redmond has extended upon the tools it offers to developers that make Microsoft's ecosystem a platform for app development. Windows is indeed a devbox.

Conversations as a Platform and the Bot Framework strategy are a comprehensive approach.

Moreover, Microsoft's Bot Framework provides the development tools that allow developers to target a range of first- and third-party Conversation Canvases. This broad AI and bot strategy gives Microsoft a comprehensive solution and a potential preemptive position of authority in relation to the possible industry adoption of bots.

Furthermore, Redmond's addition of a bot "storefront" or registry to complement its bot development platform provides a central repository for anyone to find bots in these very early stages of this transition.

Building the foundation

Microsoft is, of course, leading this charge with its own Conversation Canvases, while also providing a platform for third-party canvases such as Line, Slack and others. As an agent in Microsoft's Conversation Canvas strategy developers will be able to use Microsoft's Bot Framework to build bots (or experts/intelligent apps) that will communicate proactively with Cortana. As a meta-app that knows the user across contexts, Cortana will be able to mediate a dialogue between the user and bots to accomplish a task which would typically require user initiation, execution and completion.

In this forward-looking scenario the inefficient "warehouse of apps" model that is popular today is replaced in large part with bots as apps and AI digital assistants as meta-apps. Marcus Ash, Group Program Manager for Cortana for Windows demonstrates this at Build 2016 in the video below:

This scenario is Microsoft's vision. It is not a new vision. Despite all of the fanfare, headlines and the rush of companies that seem to be flocking to AI, we are simply seeing the fruits of years of labor and planning. Technology has only recently reached the point where the beginning of this transition to a world infused with AI and proactive bots is possible.

Microsoft: We weren't t late we were way too early

Despite the concert of raised voices throughout the industry hailing the merits and potential of the UI's evolution, AI and the transition of apps to bots, our voices are merely joining a cacophony of voices echoing from the past. Microsoft has invested many years and millions of dollars in research and development into AI, natural language processing and machine learning. Don't let Cortana's 2014 arrival to Window Phone — three years after Siri and two years after Google Now — fool you. Microsoft's been at this game for a long time. Before there was Cortana, there was Bob.

Microsoft Bob was an assistant that was before his time.

Microsoft's desktop-restricted Microsoft Bob, which was introduced in 1995, was an early attempt at an AI assistant that didn't have the benefit of an "always-connected" internet, the ubiquity of smartphones, nor the maturity of natural language processing and machine learning that is available today. Clearly, Microsoft was not a company without ambition; Bob was just way ahead of his time.

Beginning at the 3:25 mark in the 21-year old Microsoft Bob launch video below, then CEO of Microsoft, Bill Gates, promises and demos a digital assistant that we would be able to talk to and that would also talk back to us.

Twenty years later the future that Gates predicted is here. As we were then given a peek at a future that has been realized, a foundation has been laid for the future of AI and bots that we are now glimpsing.

In the video below, scientist Eric Horvitz of Microsoft Research talks about the new age of artificial intelligence. He talks about where we are, where we are headed and how we will get there. He shares some of the accomplishments in machine intelligence and other technologies that Redmond's massive investments have brought to the industry:

Horvitz's vision for this technology anticipates a practical and widespread application of artificial intelligence. He actually has an AI secretary, Monica, that helps organize his day and greets visitors to his office. Though AI will take many forms and perform many obscure and back-end functions, Microsoft's most consumer-facing AI and the personal "face" (like Monica) for the technology will be Cortana.

It's personal

Cortana in Halo

Cortana in Halo (Image credit: Microsoft)

Once, Microsoft entered the digital assistant space; they approached it from an intentionally deeply personal perspective. This intense personal approach was in stark contrast to how rivals Apple and Google positioned their digital assistants Siri and Google Now respectively.

First, her name connects her to a beloved character (opens in new tab) from one of the most popular game franchises, Halo. As a result, Microsoft's digital assistant indirectly benefits from the personality, back story and even the face (which neither Siri or Google Now have) of her fictional counterpart.

Second, and more directly, Cortana was modeled after real personal assistants, Notebook and all. This user-controlled notebook (opens in new tab) is where she, like a real assistant, records what she learns about a user.

Additionally, Redmond believes that personality is essential to encouraging interaction with an AI to facilitate wide-spread adoption. Deborah Harrison, Editorial writer for Cortana at Microsoft, elaborated on this point at the 2016 Re-Work Virtual Assistant Summit.

If Microsoft's ambitions for Cortana ended with the basic reminder-type scenarios that are at the core of what our digital assistants are popularly used for today, Redmond's investments in natural language processing, deep neural networks and machine learning that Eric Horvitz talked about above would be in vain. Thankfully that is not the case.

Cortana as a platform makes her "CEO" rather than a mere assistant

When looking at Cortana from just her most user-facing features, it's easy to fall into the trap of comparing her feature vs. feature with competitors Siri and Google Now. But as we delve into Cortana's foundations, her history, her present and her strategically planned future, its becomes clearer that Microsoft is far more interested in creating a ubiquitous and pervasive intelligent platform than a mere voice assistant.

Nadella, emphasized the boundless nature of Cortana compared to her competitors this way in an interview with Business Insider:

"So I think of Cortana, its uniqueness will come because it can take your personal data, your Office 365 data, and be available across all platforms. No one else, at least as far as I can tell, is taking that unbounded approach to something like personal digital assistants. Anyone else who has a personal digital assistant, it's mostly a personal digital assistant that just sits and resides in either their software or their device or what-have-you.

He continues with expounding on the company's unique approach to third-party bots in conjunction with Redmond's competitive advantage with cloud technologies:

Take bots — no one else is talking about bots that can be built using all the rich cognitive cloud services we have. How did one teach a bot how to have a conversation with a human? That requires conversational understanding, dialogue understanding. We have APIs for doing all of that in our cloud in Azure. And you can, in fact, build a Slack bot, or you can build a bot for Line, or you can build a bot even for Facebook. We don't know yet exactly what Facebook does, but our back end is independent. And you can just use your back end to build bots, like building mobile apps or building websites in the past.

He sums up that portion of the conversation by sharing that (in addition to Cortana's role) Microsoft has first-party Conversation Canvases that expand on Microsoft's position as a duo user (personal and professional) platform for the new age of bots (intelligent apps):

Then we have our own set of conversational canvases like Skype that they're opening up for these bots. So the approach we're taking is much more of a platform company approach, much more of an approach that says that it's both your personal and professional data. And if I take those two dimensions, I don't think anyone else comes at it that way.

This broad scope of positioning Cortana as a platform within the context of Microsoft "Conversations as a Platform" strategy potentially positions Microsoft to benefit greatly from developers who adopt Microsoft's Bot Framework.

Microsoft has a platform for that

When observing Microsoft's personal computing strategy, we see that the company is positioning an array of its products and services as the "computing function" or platform that others use to get things done. Some examples include Azure as a cloud-based computing environment, a host of cross-platform apps including Office, Windows as a developer's platform and even Cortana across various platforms and devices. Matt Rosoff articulated it this way during his April 2016 interview with Nadella: "When I heard platform, I used to think operating system. And this sounds like it's a platform as a particular type of computing function, regardless of operating system."

Microsoft's cross-platform ambitions position platforms like Cortana as industry-wide tools.

Microsoft is clearly positioning its vast range of tools and services as the go-to brand for a wide variety of computing functions. By leveraging Cortana, as an industry-wide platform that developers can tap into by connecting apps as bots (and voice-interaction as traditional apps in Windows) Microsoft's AI digital assistant is positioned more aggressively as both a consumer and enterprise tool than Siri or Google Now.

Cortana is the platform for that

Personal computing is increasingly mobile, and users often transition between devices. Thus Microsoft's strategy for an unbounded AI as a platform and Conversation Canvas agent is likely a strategic advantage over competitors offerings. In a current paradigm where many developers have not yet embraced developing Universal Windows Store apps, Cortana is a cross-platform "platform" that can potentially be targeted by developers with platform-agnostic bots.

Rather than simply courting developers to target Windows, Microsoft can court developers to target Conversation Canvases and Cortana (through Skype and other canvases), with bots that run on all platforms. If successful, this would help to close the app gap. Indeed, Microsoft's push to improve Cortana and Skype on Android and iOS may be an indication of this strategy.

The Untold App Gap Story Part IV: Going from apps to bots

Cortana on other platforms may represent a Conversation Canvas strategy to bridge the app gap.

Microsoft's strategic positioning of Cortana as a platform and a target for intelligent apps/bots can potentially overcome the weaknesses developers perceive in Windows as the platform and a target for Modern apps.

The motivator for most developers in developing apps is a large mobile install base, Windows simply does not offer that incentive. Despite the fact that the weight of the 1.5 billion PC install base is intended to assure developers that their Universal Windows apps will have an audience, many have not yet embraced that message.

In a near future where bots are expected to be popular, Cortana on other platforms is intended to interact with a developer's bots. Thus Cortana as a platform may be strategically positioned to benefit from the massive iOS and Android install bases. In essence Cortana (and Conversation Canvases) become an extension of Microsoft's platform, "canvasing" iOS and Android, ultimately benefiting from the install base of those platforms as developers build bots for the canvases.

Such a scenario where a developer uses Microsoft tools to create bots may also lead to his using Redmond's developer's platform to reach users with Universal Windows 10 apps as well.

Just the beginning

We are still at the beginning of this journey. Cortana has yet to reach some regions, and at just two years old is seeing constant improvement. There is no expectation that the fulfillment of the strategy would occur overnight. Yet, she is progressing. Our own Jez Corden, recently wrote about her progress on Xbox and her coming interaction with UWP apps there.

One thing is clear: Microsoft is positioning Cortana as much more than a mere digital assistant. As an unbounded, cross-platform Conversation Canvas agent, her role in the potential age of bots may be central to how users on any platform interact with their digital experiences? Will developers embrace the Conversations as a Platform vision? And if so, will Cortana emerge as The Next Big Thing? Time will tell.

So what are your thoughts? Should Microsoft have kept Cortana as a Windows-only AI digital assistant? Will bots help to bridge the app gap? Are you looking forward to a transition from traditional apps to bots? Sound off in comments and on Twitter!

And if you missed Part I or these related pieces check them out here!:

Jason L Ward is a columnist at Windows Central. He provides unique big picture analysis of the complex world of Microsoft. Jason takes the small clues and gives you an insightful big picture perspective through storytelling that you won't find *anywhere* else. Seriously, this dude thinks outside the box. Follow him on Twitter at @JLTechWord. He's doing the "write" thing!

  • Thanks for reading folks. We're watching a very ambitious plan for Cortana unfold. Microsoft has the scale and resources to make Cortana an industry-wide cross-platform tool. It will be interesting to see if bots are embraced by the industry, and if so how big a role Microsoft's Bot Framework and first-party Conversation Canvases will play. One thing is clear, Microsoft is positioning itself for a power position for this potential shift. With development tools, Canvases, connection to third-party Canvases and a centralized bot registry Microsoft is building the foundation and infrastructure to be the go-to source for developers and users in a potential AI and bot ruled industry. So, will competitors such as Google, Apple, Facebook and Viv hamper Redmond's flow? Will the shift to bots even happen? Is Cortana's cross-platform play the way to go? Well, you know the drill - LET'S TALK!
  • I hope MS is working on a stand-alone Cortana appliance, like the Amazon Echo.  My Echo is amazing, but I'd love something that plays better with the MS ecosystem.  And instead of the cylindrical shape of the Echo, they could actually make something that looked like Cortana.  How cool would that be?
  • I like your idea for a Cortana appliance, but I want it to look like 343 Guilty Spark.
  • YES!  They could even sell different skins.  I'll take a Claptrap, please!
  • Jason, you are like the knight in Monty Python. Lose an arm...just a flesh wound! Credit where it is due though, you are a true believer and your articles are always thorough.
  • Interesting to read as I watch terminator genesis.
  • "Microsoft is positioning itself for a power position" With regards to Cortana, this would be believable if Microsoft had a mobile phone platform that wasn't on life support. Up take for her usage on iOS and Android hasn't been notable, so what does that leave? PC and XBox? Hardly a power position Jason, sometimes I wonder if you recognize how unrealistic your choice of terminology can be. Posted via the Windows Central App for Android
  • Hi Visa Declined, thanks for participating. This might help. :-) The paragraph just before the excerpt you excised and responded to says: "We're watching a very ambitious plan for Cortana unfold. Microsoft has the scale and resources to make Cortana an industry-wide cross-platform tool. It will be interesting to see if bots are embraced by the industry, and if so how big a role Microsoft's Bot Framework and first-party Conversation Canvases will play." We then flow right into the excerpt you chose to focus on: One thing is clear, Microsoft is positioning itself for a power position ***for THIS potential shift.** Note you left out the rest of the sentence. There was no period after **position**lol. This one point changes the tone of the interpretation you present, because that excerpt in its entirety, not just the portion you chose, then filled in with a limited application to Cortana, is actually referring to the potential industry shift to bots. I emphasized the word *this* in my paste of the actual statement above to stress that the *this* in the portion of the excerpt you left out specifically refers back to the subject, (industry shift to bots) stated in the previous paragraph. As you continue to read on beyond that you see that I expound on Microsoft's development tools, Canvases and Bot registry as what helps position them in a power position for this shift. Thanks for participating!:-)
  • Very interesting to discover that Microsoft BOB was the beginning of AI that works :) I always thought that Apple came first.
    Not that it really matters, because all competition bring forth evolution to a service
  • I *did* have a longer reply... But the bug of End of Line arrow key press in the WC app killed my longer comment :(
  • i will wait to Hebrew Cortana.
  • That, offcourse, will never happen.
  • Cortana is the next big fail. I said it many times before. But i think you US people don't want to know and I am sure you going to downvote....Cortana is for the happy few and other people don't want Cortana for privacy reasons.
  • Well I guess people shouldn't use Google Now and Siri Gerard. Personal assistants need your info to do their jobs. Anyone using one really shouldn't have any expectation to privacy.
  • Thats what I said. Thats why Cortana will never rule the world. It will fail.
  • Tin foil hat syndrome???
  • I for one am an avid user of Cortana... I have Favorite places logged in etc. Daily reminders etc. The more I use it the more I love it. Its very good... the simple fact that I can ask Cortana how long it will take me to get home towards the end of a work day and she replies with a time is awesome. Siri does not do that nor does the Google thing. My friends have tested it against their phones. I wish Cortana had intellegence built in... learning capabilities... instead of me logging in info, it should be smart enough to "know" that if I am some place from midnight till early morning that that is my home... and log it in as such. Cortana can ask me to confirm it. Same for work.. if randomly for a few days every other week she senses I am at a place from 9 to 5 that that is most probably my work. Its smart enough to tell me its time to go home.. why not learn where work and home is. Same is calling my spouse. etc etc. Its a great product... I believe it will become better but I dont have faith in the MS today. Its still the king of BSOD and randome reboots(shut the face up if readers are going to say that since Windows 7 never had a reboot). WM10 is still in beta as is Windows 10 OS. I dont knwo how long it will be in beta but MS needs to get its act together... BTW: DId you read that Anniversery update is freezeing some computers? MS???!!!! Have you not learned anything?
  • Google now automatically shows you your drive home at the end of the day. You don't even have to ask it, but you can. Just tried and it worked fine. Posted via the Windows Central App for Android
  • The automatic 'work/home thing, Cortana confirms' happened for me. The only I had to do was manually change work location when I moved elsewhere.
  • From what I understood, Cortana actually does suggest a place as home or work based on certain factors. However, it currently might take a while. When she does, she asks you to confirm it. At least that is what I heard/read.
    I haven't experienced it myself yet.
  • Yeah, it worked for me, the automatic thing, but then it doesn't adapt at all from that.
  • That may be true. It may not. Time will tell. But I do acknowledge that the general user will only use Cortana after maybe 5-10 years, if and when it becomes the norm.
  • Sounds all good, but I'm still waiting to get Cortana! Maybe they should launch it for all first...
  • Last I heard. Next big thing, is to enable Cortana for all in English. Does that means they will give up localizing her for now...? I HOPE not! But I fear it.
  • Hopefully we wont have to read thru this in the near future. we could just ask - Hey Cortana, what is Jason Ward on about this morning? cortana please summerize
  • LOL
  • There is a Skype bot that tackles this already
  • I'm dubious about normal people talking to their pc's. I would love to see actual numbers from MS on the matter. I read recently that even on IOS and Android the actual usage numbers for Siri and Google Now are disappointing.
  • An example: My wife's grandmother talks to her phone often, because of her failing eyesight.
    By the way, she bought herself an iPhone; I would have steered her towards, say, a Lumia 640, but Cortana isn't available here...
  • The matrix, terminator, which ones is better?
  • Cortana the best digital assistant so far!
  • Yes, like Windows Mobile is the best.... Now, go find some people that will use it. Again, like Windows Mobile.
  • What's your problem?? I use Windows, I am happy with it, I use Cortana and I do not have privacy issues. Why are you here??
  • I don't use Cortana and if I would use Cortana I, like you, would not have privacy issues. But many people do. If I would order a pizza, and Cortana was to remind me, I have no privacy issues. I don't mind MS knowing. But many people do mind. O, and why are you here? (like I care).
  • Mr. Corbier, you have good points, but your replies are worded in a childish way. For instance "O, and why are you here?" Would be a great way to break the ice with other users and not be interpreted as a jerk, but then you add the notice of you don't care for the answer. Please, if you only have negativity to post, just keep it to yourself :)
  • I don't think this will work for a number of reasons: -Two years after release, most countries can't use the dictation feature in Cortana. How can Microsoft build a platform around dictation if they haven't expanded dictation access to other languages/nationalities? -How many services and apps would this really make sense for? It might make sense for Starbucks ("Order my usual coffee") or Domino's ("Order a large pepperoni pizza to be delivered at 530"), but beyond those? The bots aren't going to replace games, e-books, comics, browsing social media or browsing the web. They *might* be okay for productivity apps, but I think most users would prefer to read/type their content themselves. -Are developers actually going to invest in this for Cortana? With Windows 10 Mobile usage so low and dictation support limited world wide, there's a fairly narrow potential market for devlopers to integrate their bots into Cortana. I think most developers are going to invest in a cross platform app like Facebook Messenger, and possibly Siri and Google Assistant since iOS and Android are so big. -Would use