245 / 2018-08-30 15:48:36
Speech Certification Based on Bidirectional Long Short-Term Memory Neural Network
CTC,speech certification,speech recognition
摘要录用
Mu Guo / BeiHang Unversity
Li Wei / BeiHang Unversity
Guo Dongbin / BeiHang Unversity
Chen Xu / Tsinghua University
Li Liang / Beijing Lanuage and Culture University
Abstract:
With the continuous improvement of computer hardware performance and the development of neural network technology, the DNN (deep neural networks) used for speech recognition have made considerable headway and have outperformed the speech recognition model based on the HMM. The LSTM technology is a variant of the RNN neural network model, and it has been successful in semantic recognition and reading comprehension. In this paper, a bidirectional LSTM model combined with the CTC (connectionist temporal classification) loss function is proposed to verify the accuracy of speech recognition on the open LibriSpeech corpus. The CER (character error rate) on the training data set is reduced to 0.04 and the test data set to 0.19. The model is extended and applied based on the results of the pre-trained speech model to realize the evaluation of English pronunciation at a phonetic level, and the results of the pronunciation test can be given in real time.
Key Words: LSTM, CTC, speech certification, speech recognition
重要日期
  • 会议日期

    12月01日

    2018

    12月02日

    2018

  • 08月15日 2018

    摘要截稿日期

  • 08月15日 2018

    初稿截稿日期

  • 12月02日 2018

    注册截止日期

联系方式
移动端
在手机上打开
小程序
打开微信小程序
客服
扫码或点此咨询