The deep learning speech recognition acceleration solution leverages an Altera Arria 10 FPGA, iFLYTEK's deep neural network (DNN) recognition algorithms and Inspur's FPGA-based DNN parallel design, migration and optimization with OpenCL. The hardware platform is a heterogeneous CPU+Arria 10 FPGA architecture, and the software uses the high-level OpenCL programming model to enable migration of code from CPUs to FPGAs.
"Software algorithms for deep learning models need to be fine-tuned and optimized continuously. Server accelerators with fixed functionalities will have increasingly low efficiency over time and waste space and electricity," said Yu Zhenhua, director of technology, iFLYTEK Co., Ltd. "In contrast, FPGAs are flexible, customizable and power-efficient. This is also an important reason that iFLYTEK decided to migrate DNN algorithms to an FPGA platform."
Field-programmable gate arrays (FPGAs), which combine characteristics of application-specific integrated circuits (ASICs) and general-purpose chips, can perform data-parallel and task-parallel computing simultaneously, which makes them more efficient for specific applications. FPGAs are currently used in logic control, signal processing and image processing, and more recently in online recognition systems.
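The difference between the two styles of parallelism mentioned above can be illustrated with a minimal Python sketch. It is only a structural model, not FPGA or OpenCL code: Python runs it sequentially, whereas on an FPGA the work-items or pipeline stages would operate concurrently in hardware. All function names and the toy scale-then-clamp computation are hypothetical and chosen purely for illustration.

```python
from typing import Callable, Iterable, Iterator, List


def data_parallel(kernel: Callable[[int], float], global_size: int) -> List[float]:
    """Data-parallel model: the same kernel runs once per index,
    and no index depends on the result of another."""
    return [kernel(i) for i in range(global_size)]


def stage_scale(samples: Iterable[float], gain: float) -> Iterator[float]:
    """Pipeline stage 1: scale each sample as it streams through."""
    for s in samples:
        yield s * gain


def stage_clamp(samples: Iterable[float]) -> Iterator[float]:
    """Pipeline stage 2: clamp negative values to zero."""
    for s in samples:
        yield max(s, 0.0)


inputs = [-1.0, 0.5, 2.0]

# Data-parallel form: one kernel invocation per element of the index space.
dp_result = data_parallel(lambda i: max(inputs[i] * 2.0, 0.0), len(inputs))

# Task-parallel form: different stages chained so samples flow through them.
pipe_result = list(stage_clamp(stage_scale(inputs, 2.0)))

print(dp_result, pipe_result)
```

Both forms compute the same values. What differs is how the work is decomposed: by data element in the first case, by processing stage in the second.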
"Inspur's Arria 10 FPGA-based deep learning speech recognition solution further demonstrates the performance-per-watt advantages that FPGA accelerators provide," said David Gamba, general manager of the computer & storage business unit at Altera. "This success in solution development will become an important reference for FPGAs in the deep learning field."
Meanwhile, Inspur is also expanding its software cooperation on the speech recognition system, designing OpenCL programming frameworks combined with iFLYTEK's applications, to increase the efficiency of application programming. With these efforts, Inspur can enable the migration of more applications to FPGA-based platforms and foster an FPGA ecosystem, which includes FPGA software, hardware and an applied algorithms library.
Speaking about further cooperation, Hu Leijun, vice president of Inspur, said that Inspur is committed to providing clients with computing solutions that best suit their needs. Given the great advantages of FPGA-based solutions in terms of performance per watt, Inspur will expand its software cooperation with iFLYTEK and Altera on FPGA-based deep learning online speech recognition applications. Moreover, Inspur will develop FPGA-based system solutions, covering full cabinet computing, Internet and storage solutions, with the aim of making these solutions available for applications and clients in other fields.
In the future, CPU+FPGA will probably become a new heterogeneous computing model for HPC, and more and more HPC applications, data center applications and Internet deep learning applications will use CPU+FPGA solutions.
Highlights of the solution include:
- High performance: When processing 100 bounds data, the DNN running time on two Intel Xeon E5-2650 V2 CPUs (16 cores) is 242.027s, while the DNN running time on Altera's Arria 10 FPGA is 84.312s, a speedup of 2.871 times.
- Low power consumption: Altera's Arria 10 FPGA consumes 30W and the two Intel Xeon E5-2650 V2 CPUs consume 190W, so the power consumption of the FPGA-based system is 15.7 percent that of the CPU system. In an actual test of the DNN algorithms, the FPGA-based system achieved performance per watt of up to 30 GFLOPS/W, greatly reducing application power costs.
- Easy to program: It took software engineers only four man-months to develop the FPGA-based DNN parallel program with the OpenCL programming model. With traditional hardware description languages, such as Verilog and VHDL, similar development would take at least 12 man-months and require collaboration between software engineers and hardware engineers.
- High adaptability: FPGAs can execute data-parallel computing with an NDRange model or task-parallel computing with a pipeline model, which lets them address a wider range of applications and improve overall performance for more applications and software.
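The ratios behind the figures in the list above can be checked with a few lines of Python. The sketch uses only the numbers quoted in the list. The energy estimate is an added assumption rather than a reported result: it treats each quoted wattage as a constant draw over the whole run, which the announcement does not state.

```python
# Figures quoted in the highlights list.
cpu_time_s = 242.027    # two Intel Xeon E5-2650 V2 CPUs
fpga_time_s = 84.312    # Altera Arria 10 FPGA
cpu_power_w = 190.0
fpga_power_w = 30.0
opencl_effort_months = 4.0
hdl_effort_months = 12.0  # quoted as a minimum for Verilog/VHDL

# Speedup is the ratio of running times.
speedup = cpu_time_s / fpga_time_s

# Power ratio of the FPGA relative to the CPU pair.
power_ratio = fpga_power_w / cpu_power_w

# ASSUMPTION (not in the announcement): energy = quoted power x running time.
cpu_energy_j = cpu_power_w * cpu_time_s
fpga_energy_j = fpga_power_w * fpga_time_s
energy_ratio = cpu_energy_j / fpga_energy_j

# Development effort ratio, using the quoted minimum for HDL development.
effort_ratio = hdl_effort_months / opencl_effort_months

print(f"speedup:      {speedup:.3f}x")
print(f"power ratio:  {power_ratio:.1%}")
print(f"energy ratio: {energy_ratio:.1f}x (under the constant-power assumption)")
print(f"effort ratio: {effort_ratio:.0f}x")
```

The ratio of the two quoted wattages comes out slightly under 15.8 percent, close to the 15.7 percent quoted, so the published wattages are presumably rounded or the percentage truncated.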
About Inspur, iFLYTEK and Altera
- Inspur Group leverages its world-leading research, development and innovation systems, including the National Key Lab for High-Performance Server and Storage Technology, the National Engineering Center for Information Storage Technology, the Inspur-Intel China Parallel Computing Joint Lab and the Inspur-NVIDIA Cloud Supercomputing Application Innovation Center. Drawing on its own R&D, system building, operation & maintenance and service capabilities in supercomputers from teraflops to petaflops, as well as a complete set of HPC hardware and software product lines, Inspur provides industry-leading supercomputing systems and application services across many sectors in China. Inspur's products and technologies are applied in universities and scientific research institutions, oil & gas exploration, weather forecasting, aerospace and aviation, manufacturing design, animation rendering and environmental monitoring.
- iFLYTEK Co., Ltd., the largest intelligent speech technology provider in China, has long engaged in developing intelligent speech technologies and has world-leading achievements in many technologies, including speech synthesis, speech recognition, speech evaluation and natural language processing. The accuracy of iFLYTEK's industry-leading speech recognition technology is 98 percent.
- Altera programmable solutions enable designers of electronic systems to rapidly and cost-effectively innovate, differentiate and win in their markets. Altera offers FPGAs and complementary technologies, such as power solutions, to provide high-value solutions to customers worldwide.
To view the original version on PR Newswire, visit: http://www.prnewswire.com/news-releases/inspur-and-altera-launch-speech-recognition-fpga-solution-with-opencl-300180790.html
SOURCE Inspur Group Co., Ltd.