TY - EJOU AU - Zhao, Yue AU - Yue, Jianjian AU - Song, Wei AU - Xu, Xiaona AU - Li, Xiali AU - Wu, Licheng AU - Ji, Qiang TI - Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode T2 - Journal on Internet of Things PY - 2019 VL - 1 IS - 1 SN - 2579-0080 AB - We proposed a method using latent regression Bayesian network (LRBN) to extract the shared speech feature for the input of end-to-end speech recognition model. The structure of LRBN is compact and its parameter learning is fast. Compared with Convolutional Neural Network, it has a simpler and understood structure and less parameters to learn. Experimental results show that the advantage of hybrid LRBN/Bidirectional Long Short-Term Memory-Connectionist Temporal Classification architecture for Tibetan multi-dialect speech recognition, and demonstrate the LRBN is helpful to differentiate among multiple language speech sets. KW - Multi-dialect speech recognition KW - Tibetan language KW - latent regression bayesian network KW - end-to-end model DO - 10.32604/jiot.2019.05866