Microsoft is spending half a billion dollars to make sure that when you hear the word Kinect you think âthe future of video games.â
But Microsoft believes that the real potential for the voice recognition, motion-based technology is how they plan to use it with cell phones, computers and perhaps in the military and health industries.
âThink about a world where machines understand what people want from them,â said Kinect Creative Director Kudo Tsunoda. âYou can see that extrapolate out to a host of other devices.â
Microsoft promised when they launched the $150 Xbox 360 Kinect add-on this week that it would make you the controller. That means it allows gamers to play soccer, race on a hoverboard and raft down rapids all by simply going through the motions in front of your television. The Kinect also allows you to video chat with friends, flip through the consoleâs menus with a swipe of your hand and zip through movies by plucking at the air in front of your face.
Despite the sometimes magical moments it creates, Kinect is really just a collection of off-the-shelf cameras, sensors and microphones packed into an array that sits atop or under your television. It looks surprisingly like the head of the lovable, titular robot star of Pixar movie Wall-E.
What Microsoft is betting on isnât that plastic peripheral, itâs the software and algorithms that Kinect uses to do its job. A job that includes not just watching a gamerâs movements and translating it into on-screen video game action, but also using the spoken word to take direction, recognizing who is playing a game and, perhaps ultimately, what that person wants from their gaming experience.
âFrom a broad Microsoft perspective,â Microsoftâs director of the Kinect technology, Alex Kipman added, âwe believe in a more natural way for people to interact with technology. People donât like to hold gadgets in their hands, that includes keyboards and mice.
âThe Kinect is the start of that journey.â
Kinect Isnât Just Watching, Itâs Listening
The first thing youâre likely to notice when you play a Kinect game is how, like with Wii and the Playstation Move, the game recognizes your movements. Unlike its competitors, with the Kinect you donât need to hold anything in your hand to play games, but on some level the experience feels very similar across all of the platforms.
I pointed this out to Kipman, the brains behind Kinect. Heâs quick to explain that the neatest bit of technology to come out of Kinect has nothing to do with motion tracking. Itâs the audio work, he said.
âIt is totally different than some of the things people have experienced before,â he said.
While there already are clocks, phones and other electronics that actively listen for voice commands, most current voice recognition systems are âpush to talk.â That means you have to push a button to get them to listen to you. And typically, the device is being used in a place where the user is a foot or two in front of the microphone and not moving around.
Thatâs the case if youâre in a car, or using a PCâs software or on a phone. But not the case with Kinect.
âIn our world, first of all, people are sitting all over the room,â he said. âThere are many different people sitting 10 to 12 feet away from the system. People are having fun in the living room, so itâs not a quiet environment. There is lots of ambient noises.â
And lots of noises pumping out of the speakers thanks to the Xbox 360 as well. But you donât have to quiet down the room and then push a button to get the Kinectâs attention. Itâs always listening for you.
To get that to work Microsoft combined several different types of technology.
âThe first one is how advanced our multichannel echo cancellation is,â he said. âWe are able to cancel out all of the noise coming out of the speakers because the Kinect is synced with the Xbox and can cancel it out the instant it comes out.
âThe second issue is the ambient noise of people talking. We use sophisticated algorithms to figure out who is talking and where they are so we can hone in on them.â
Kipman says Kinect uses its software to essentially form a virtual cone around the speakers mouth and listen to that one spot for commands. And it can do that even if the person is moving.
âOnce you combine and fuse all of these things together you get a system that transforms how you interact with it,â Kipman said.
Tested in my home, the voice recognition technology struck me as the most impressive. I can sit on my couch, more than nine feet from the television and that Wall-E sensor, lean back into the cushions and say in a normal voice âXbox.â Occasionally I have to repeat myself, but typically it hears me. It can even hear me through the roar of Last.FMâs music or the sounds of movies or ESPN sports. There are some stumbling points. It does, for instance, seem to have issues recognizing my 9-year-old son. But when it works itâs impressive, a little transformational and, yes, a tiny bit creepy
On its surface Video Kinect is a typical video chat program. The difference is that the users donât need to wear a headphone or sit close to the screen to use it. Nor do they have to sit still. A person can pace in front of the camera and speak in a normal tone and the Kinect will follow the user using digital zoom and hear what he or she says with its audio algorithms.
âIn a way what you see with Video Kinect is where we are transforming not just gaming but entertainment,â Kipman said. âYou donât have to understand technology, but technology understands you.â
From Games To Phones, To Computers, To Cars
Kipman says he receives calls on a daily basis from people involved in the military, healthcare, mobile phones and computer spaces about Kinect and its potential in those fields.
âThere is a wide amount of interest from many, many industries,â he said. âI love the interest, but if you want to create something transformational you need focus. And our focus has really been, letâs nail it in the living room on the Xbox 360.â
But Microsoft is already looking past this weekâs launch of Kinect, to the technologies uses outside of gaming.
âI think that both with the gesture tech and the voice recognition there are so many different types of machines with some sort of input paradigm associated with them that this could be applied to,â Tsunoda said. âIs it coming to the PC and mobile spaces? Sure, but anything with an input device, keypads or things you are punching numbers into, can use this.
âThis is much bigger than something just for the mobile phone and PC.â
I asked Kipman and Tsunoda if we would see this technology in computers, cell phones and cars in the next decade.
âIf it takes a decade to get there I should fire myself today,â Kipman responded.
While the two declined to be more specific about when we might see the technology behind Kinect on phones and computers, they did make it a point to say that the existing cameras and microphones in cell phones and laptops are âwithin spitting distanceâ of the tech needed to get the job done.
Microsoftâs commitment to extending this technology can be seen in who worked on the project.
âWhat you see in Kinect is a movement across all of Microsoftâ Kipman said, âwhere people are galvanized behind this vision about transforming the world from what it was to what it will be.
âFrom Steve Ballmer on down, everyone at Microsoft is waiting to see if the consumers will love this as much as we love itâ
And Kipman says that Microsoft is already working with companies theyâve partnered with in the past. While he declined to name those companies, past efforts have included bringing television to the Xbox 360 with AT&Tâs Uverse and push-to-talk voice commands to cars with Ford, Kia and Fiatâs take on Windows Embedded Automotive 7. Thatâs better known to drivers as SYNC, UVO and Blue&Me
https://lastchance.cc/u-s-xbox-360s-get-iptv-through-at-t-uverse-this-year-5442751%3C/a%3E%3C/p%3E
When I met with Microsoft co-founder Bill Gates in 2007, he also talked to me about the importance of giving the Xbox 360 the ability to see and hear. Even back then Gates called vision and audio for the Xbox 360 the future.
http://lastchance.cc/226819/the-bill-gates-interview-wrap+up%3C/a%3E%3C/p%3E
This notion, of combining television with a gaming console that is always listening and watching seems to form the basis for Kinectâs next big leap.
For instance, while Kipman said that using motions and voice to control channel surfing would be a good use of the technology, but added that âthere are other massive uses that are not as obvious. You can use those inputs in more subtle ways.â
âImagine a world where we sit in our living room watching a Brazil versus England football game and the Xbox notices you are wearing Brazilian t-shirt and makes it a more Brazilian experience,â he said. âWe could offer a two-way discourse as youâre watching news.â
The Xbox 360 already allows you to use Kinect to watch live and recorded sports with ESPN. That experience includes the ability to hang out with fellow sports fans in avatar form and do things like vote in sports polls.
The key to doing all of that, Kipman says, is to make technology disappear and put the person at the center of the experience.
The Future of Kinect and Gaming
As Microsoft moves forward to spread the technology first seen with Kinect to other areas, they will continue to extend its use with gaming as well as they strive to change they way people play games.
âI think this is as big as going from black and white to color,â Kipman said. âIt changes the discourse for how people treat interactive entertainment, and even making the more linear static kind more interactive.â
âKinect is not an âorâ technology, itâs an âandâ technology,â he said. âItâs an and of identity, of voice, of gestures and the controller. You can have a controller experience augmented with voice, augmented with who I know you are. I can have any number of hybrid experiences.â
Imagine a game you play with an Xbox controller that also listens for your voice commands or watches for you to lean or make gestures to do things like fling a grenade or peek around a corner.
âFrom a creative perspective the stuff Iâm most excited about isnât how can we take existing games and update them with Kinect, itâs how can we create new experiences that are Kinect enabled,â Tsunoda said. âYou have games that are very story based, that have you build a relationship with your characters. With Kinect you could do stuff like combine not just motion-based stuff, but voice, like being able to talk to people in the game. Human recognition stuff. You can start building completely unique experiences in game like you do in real life.â
Tsunoda says Kinect reminds him of his childhood dreams of having a holodeck to play with.
âI remember dreaming of going into a fully immersive world where you are able to interact with everything as you would in real life,â he said. âThis is far from a fully immersive world, but I do think that Kinect is one of the very first stepping stones toward making that dream a reality.â
War games, Zombie massage, Farmvilleâs Creator on the future of gaming, read the rest of this monthâs edition of K Monthly here.
Illustration by Kotaku illustrator Sam Spratt. Check out Samâs portfolio and become a fan of his Facebook Artistâs Page
https://gizmodo.com/meet-our-contributing-illustrator-sam-spratt-5620127