A.I. Speech Sounds More Human than Ever
With its stilted phrasing and flat tone, computer-generated speech is still pretty robotic.
A new technique from Google, called WaveNet, brings us closer to a world where computers not only understand everything we say but can also communicate as naturally and smoothly as a real person.
Some of the best computer-generated speech comes from a method called concatenative text-to-speech (TTS for short), which knits together short, recorded phrases that were spoken by a human and saved to a database.
But although each individual phrase sounds humanlike, the overall dialogue sounds unnatural because the combined phrases can't be varied to represent inflection or emotion.
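To make the idea concrete, here is a minimal sketch of concatenative synthesis in Python. It is an illustration only, not how any production TTS system is built: the "recordings" are made-up lists of numbers standing in for audio, and the names `DATABASE`, `synthesize` and `crossfade_join` are hypothetical.

```python
# Toy concatenative synthesis: look up prerecorded units and join them.
# The "recordings" are made-up number lists standing in for real audio.

def crossfade_join(left, right, overlap):
    """Join two sample lists, linearly blending `overlap` samples at the seam."""
    overlap = min(overlap, len(left), len(right))
    if overlap == 0:
        return left + right
    blended = []
    for i in range(overlap):
        w = (i + 1) / (overlap + 1)  # weight shifts from the left unit to the right
        blended.append((1 - w) * left[len(left) - overlap + i] + w * right[i])
    return left[:len(left) - overlap] + blended + right[overlap:]


def synthesize(phrases, database, overlap=2):
    """Concatenate the stored waveform for each phrase, in order."""
    audio = []
    for phrase in phrases:
        unit = database[phrase]  # raises KeyError if the phrase was never recorded
        audio = crossfade_join(audio, unit, overlap) if audio else list(unit)
    return audio


# Hypothetical database: phrase -> recorded samples
DATABASE = {
    "good": [0.0, 0.2, 0.4, 0.2],
    "morning": [0.1, 0.3, 0.5, 0.3, 0.1],
}

speech = synthesize(["good", "morning"], DATABASE)
```

Because `synthesize` can only replay what is stored in the database, every "morning" comes out with the same pitch and emphasis. That is the limitation described above: the pieces sound human, but they cannot be varied to fit the sentence.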
Another method, parametric TTS, generates audio using signal-processing algorithms known as vocoders. But the results sound robotic, too.
Google's WaveNet is different. Like other deep neural networks, it learns from the audio samples fed into it.
As with all learning, there is a training period. During this phase, WaveNet is fed sound recorded from human speakers. Using statistics, the A.I. predicts which piece of audio it needs next based on the ones that came before.
The animation above demonstrates how the inputs move through various layers of the neural network as the A.I. predicts the next sound in order to produce an output.
After training, the researchers sampled the network to generate synthetic utterances. The results were analyzed and new values were generated that could be fed back into WaveNet to improve its predictions. Over time, the speech got better and better.
WaveNet is so good at predicting the sound it needs that it can even generate its own raw audio samples. This means it can synthesize speech -- and even music -- without any human inputting the sound. To hear some examples, click over to the DeepMind site and scroll down.
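The predict-the-next-sound loop can be sketched in a few lines of Python. This is a toy under loud assumptions, not WaveNet itself: it swaps the deep neural network for a simple count table, looks back only one value, and uses a sine wave as a stand-in for recorded speech. The names `LEVELS`, `quantize`, `train` and `generate` are made up for the illustration.

```python
import math
import random
from collections import Counter, defaultdict

LEVELS = 16  # assumed number of quantization levels for this toy


def quantize(x):
    """Map a sample in [-1, 1] to an integer level in [0, LEVELS - 1]."""
    return min(LEVELS - 1, int((x + 1) / 2 * LEVELS))


def train(samples):
    """Count how often each level follows each previous level."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(samples, samples[1:]):
        counts[prev][nxt] += 1
    return counts


def generate(counts, start, length, rng):
    """Sample one level at a time, each conditioned on the one before it."""
    out = [start]
    for _ in range(length - 1):
        options = counts[out[-1]]
        if not options:  # this level was never followed by anything in training
            break
        levels, weights = zip(*options.items())
        out.append(rng.choices(levels, weights=weights)[0])
    return out


# Stand-in "recording": a quantized sine wave (not real speech)
recording = [quantize(math.sin(2 * math.pi * t / 20)) for t in range(400)]
model = train(recording)
synthetic = generate(model, start=recording[0], length=50, rng=random.Random(0))
```

Each new value in `synthetic` is drawn from the statistics gathered during training and then becomes the context for the next draw, so the output is generated rather than replayed from a database. A real system would replace the count table with a trained network and a far longer look-back.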
It's just a matter of time before machines can understand our every word and respond with a human-sounding voice.
TASTY TECH EYE CANDY OF THE MONTH: PHOTOS
Rising 86 feet up the wall of the lobby of the Wells Fargo Bank Building in Denver, Colo., are five high-resolution LED displays. The eight-story screens are part of a digital installation developed by NYC-based ESI Design. The firm didn't want to fill the cavernous glass atrium with one huge screen and so they decided to split it into five separate panels to accentuate the verticality of the lobby. The screens display video inspired by the nature surrounding Denver. Credit: ESI Design
Using a high-speed camera and volunteers standing against a black backdrop, scientists recorded sneezes. Their goal was to better understand what happens when a person sneezes. The video, which was recorded at 1000 frames per second, showed that the motion and particle distribution of sneezes are more complicated than previously known.
For example, the particles in the cloud range in size and travel through the air in swirls and eddies. The biggest surprise was the discovery that the finest droplets stayed suspended in the air for up to several minutes and could travel as far as several feet, which shows the potential for a sick person to infect other people in the room. Credit: New England Journal of Medicine (2016). DOI: 10.1056/NEJMicm1501197
Amsterdam-based DUS Architects 3D-printed this small, 86-sq-ft cabin to demonstrate the potential of 3D printing in the construction industry. The Urban Cabin was printed over a period of about four weeks and is made from a bio-plastic that can be shredded and reused on another 3D-printed project. Credit: Ossip
Scientists at the University of Zurich have made a remarkable breakthrough in Alzheimer's research. They found that an antibody called aducanumab targets beta-amyloid plaques known to build up in the brain 10 to 15 years before symptoms of the disease begin to show. In tests, patients who received the highest dose of the antibody had almost no beta-amyloid plaque. Wider clinical trials are now underway. Credit: University of Zurich/Nature
China, already known for its artificial island building, is planning a man-made tourism hub for the country's Haikou Bay. This week they announced that the New York City-based architectural firm Diller Scofidio + Renfro won the competition to design the hub, called the South Sea Pearl Eco-Island.
Amtrak's Acela Express trains, which run between Boston, New York City and Washington, D.C., are getting an upgrade. A sleek new exterior will enclose improved interior comforts, including more comfortable seating and better Wi-Fi and charging ports. The new trains will be capable of faster speeds, too. After track improvements, the train could top out at 186 mph, a decent improvement over the current 150 mph. Credit: Alstom SA, 2016; Meconopsis by Trimaran
Twelve unattractive and unused electricity pylons in Stockholm's Norra Djurgården national park are getting a makeover. Swedish architect Anders Berensson is proposing converting at least two of the pylons into "picnic towers" that would offer people views of the park and nearby city. Although Berensson's plans for the so-called Power Towers are conceptual at the moment, the firm is investigating ways to fund them and make them a reality. Credit: Anders Berensson Architects
Self-driving cars are not just for the streets. Corn rows may be seeing autonomous tractors, too. UK-based CNH Industrial's NH Drive is a self-driving tractor chock-full of software, LiDAR, cameras, GPS and other equipment that allows a farmer to plot the most efficient course through a field, all from the comfort of his office. Credit: CNH Industrial
At the 11th World Wheelie Championship held in Yorkshire, UK, Egbert Van Popta set a new world record for riding a wheelie one kilometer at 213 mph, beating the previous record of 210 mph, set by Gary Rothwell the year before. Credit: Phil Evans
One of the fat-tired Batpod motorcycles that appeared in the Batman movies "The Dark Knight" and "The Dark Knight Rises" is up for auction on September 27. The UK-based Prop Store, a movie-lover's paraphernalia company, is handling the deal. Expect to plunk down $80,000 to $106,000 for the machine. Credit: Prop Store