June 18, 2013
LEIPZIG, Germany, June 18 -- NVIDIA today announced that it has collaborated with a research team at Stanford University to create the world’s largest artificial neural network built to model how the human brain learns. The network is 6.5 times bigger than the previous record-setting network developed by Google in 2012.
Computer-based neural networks are capable of “learning” how to model the behavior of the brain – including recognizing objects, characters, voices and audio in the same way that humans do.
Yet creating large-scale neural networks is extremely computationally expensive. For example, Google used approximately 1,000 CPU-based servers, or 16,000 CPU cores, to develop its neural network, which taught itself to recognize cats in a series of YouTube videos. The network included 1.7 billion parameters, the virtual representation of connections between neurons.
In contrast, the Stanford team, led by Andrew Ng, Director of the university’s Artificial Intelligence Lab, created an equally large network with only three servers using NVIDIA GPUs to accelerate the processing of the big data generated by the network. With 16 NVIDIA GPU-accelerated servers, the team then created an 11.2 billion-parameter neural network – 6.5 times bigger than a network Google announced in 2012.
The bigger and more powerful the neural network, the more accurate it is likely to be in tasks such as object recognition, enabling computers to model more human-like behavior. A paper on the Stanford research was published yesterday at the International Conference on Machine Learning.
“Delivering significantly higher levels of computational performance than CPUs, GPU accelerators bring large-scale neural network modeling to the masses,” said Sumit Gupta, General Manager of the Tesla Accelerated Computing Business Unit at NVIDIA. “Any researcher or company can now use machine learning to solve all kinds of real-life problems with just a few GPU-accelerated servers.”
GPU Accelerators Power Machine Learning
Machine learning, a fast-growing branch of the artificial intelligence (AI) field, is the science of getting computers to act without being explicitly programmed. In the past decade, machine learning has given us self-driving cars, effective web search and a vastly improved understanding of the human genome. Many researchers believe that it is the best way to make progress towards human-level AI.
One of the companies using GPUs in this area is Nuance, a leader in the development of speech recognition and natural language technologies. Nuance trains its neural network models to understand users’ speech by using terabytes of audio data. Once the models are trained, they can then recognize the pattern of spoken words by relating them to the patterns that the model learned earlier.
“GPUs significantly accelerate the training of our neural networks on very large amounts of data, allowing us to rapidly explore novel algorithms and training techniques,” said Vlad Sejnoha, Chief Technology Officer at Nuance. “The resulting models improve accuracy across all of Nuance’s core technologies in healthcare, enterprise and mobile-consumer markets.”
NVIDIA will be exhibiting at the 2013 International Supercomputing Conference (ISC) in Leipzig, Germany, this week, June 16-20, at booth #220.
Since 1993, NVIDIA has pioneered the art and science of visual computing. The company’s technologies are transforming a world of displays into a world of interactive discovery — for everyone from gamers to scientists, and consumers to enterprise customers. More information at http://nvidianews.nvidia.com and http://blogs.nvidia.com.