End-to-End Speech Recognition in English and Mandarin

Speech recognition, noise, accents, algorithm iteration.

https://arxiv.org/abs/1512.02595

We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages. Because it replaces entire pipelines of hand-engineered components with neural networks, end-to-end learning allows us to handle a diverse variety of speech including noisy environments, accents and different languages. Key to our approach is our application of HPC techniques, resulting in a 7x speedup over our previous system. Because of this efficiency, experiments that previously took weeks now run in days. This enables us to iterate more quickly to identify superior architectures and algorithms. As a result, in several cases, our system is competitive with the transcription of human workers when benchmarked on standard datasets. Finally, using a technique called Batch Dispatch with GPUs in the data center, we show that our system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.
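The abstract names Batch Dispatch but does not describe it here. As a rough illustration only, the sketch below simulates one plausible reading of the idea: whenever the GPU worker becomes free, it takes every request that is already waiting and runs them as a single batch. The function name `simulate_eager_batching`, the fixed `service_time`, and the `max_batch` cap are assumptions made for this example and are not taken from the paper.

```python
from typing import List, Tuple


def simulate_eager_batching(
    arrivals: List[float], service_time: float, max_batch: int
) -> List[Tuple[float, List[int]]]:
    """Simulate a single worker that batches whatever requests are waiting.

    arrivals     -- request arrival times, sorted ascending (hypothetical input)
    service_time -- assumed fixed time to process one batch, whatever its size
    max_batch    -- assumed upper bound on requests per batch (must be >= 1)

    Returns a list of (start_time, request_indices), one entry per batch.
    """
    batches: List[Tuple[float, List[int]]] = []
    next_req = 0    # index of the first request not yet dispatched
    free_at = 0.0   # time at which the worker becomes idle
    n = len(arrivals)

    while next_req < n:
        # The worker starts once it is idle and at least one request is waiting.
        start = max(free_at, arrivals[next_req])

        # Take every request that has arrived by `start`, up to the cap.
        end = next_req
        while end < n and arrivals[end] <= start and end - next_req < max_batch:
            end += 1

        batches.append((start, list(range(next_req, end))))
        free_at = start + service_time
        next_req = end

    return batches


if __name__ == "__main__":
    # Four requests arrive close together, one arrives later.
    for start, ids in simulate_eager_batching([0.0, 0.1, 0.2, 0.3, 1.5], 1.0, 8):
        print(f"t={start:.1f}s  batch={ids}")
```

In this toy run the first request is served alone because nothing else has arrived yet, while the next three are grouped into one batch that starts as soon as the worker is free. Batch size therefore grows with load without adding a fixed waiting delay, which is the kind of trade-off between throughput and latency that the abstract alludes to.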

Original post: https://www.cnblogs.com/rsapaper/p/6286082.html