TY  - EJOU
AU  - Zhao, Yue
AU  - Yue, Jianjian
AU  - Song, Wei
AU  - Xu, Xiaona
AU  - Li, Xiali
AU  - Wu, Licheng
AU  - Ji, Qiang

TI  - Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode
T2  - Journal on Internet of Things

PY  - 2019
VL  - 1
IS  - 1
SN  - 2579-0080

AB  - We propose a method that uses a latent regression Bayesian network (LRBN) to extract shared speech features as input to an end-to-end speech recognition model. The structure of the LRBN is compact and its parameter learning is fast. Compared with a convolutional neural network, it has a simpler, more easily understood structure and fewer parameters to learn. Experimental results show the advantage of the hybrid LRBN/Bidirectional Long Short-Term Memory-Connectionist Temporal Classification architecture for Tibetan multi-dialect speech recognition and demonstrate that the LRBN helps differentiate among multiple language speech sets.
KW  - Multi-dialect speech recognition
KW  - Tibetan language
KW  - latent regression Bayesian network
KW  - end-to-end model

DO  - 10.32604/jiot.2019.05866