Discriminatively trained continuous Hindi speech recognition using integrated acoustic features and recurrent neural network language modeling

Journal of Intelligent Systems 30 (1):165-179 (2020)
  Copy   BIBTEX

Abstract

This paper implements the continuous Hindi Automatic Speech Recognition (ASR) system using the proposed integrated features vector with Recurrent Neural Network (RNN) based Language Modeling (LM). The proposed system also implements the speaker adaptation using Maximum-Likelihood Linear Regression (MLLR) and Constrained Maximum likelihood Linear Regression (C-MLLR). This system is discriminatively trained by Maximum Mutual Information (MMI) and Minimum Phone Error (MPE) techniques with 256 Gaussian mixture per Hidden Markov Model(HMM) state. The training of the baseline system has been done using a phonetically rich Hindi dataset. The results show that discriminative training enhances the baseline system performance by up to 3%. Further improvement of ~7% has been recorded by applying RNN LM. The proposed Hindi ASR system shows significant performance improvement over other current state-of-the-art techniques.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 93,891

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Recurrent Neural Network Based Speech emotion detection using Deep Learning.P. Pavithra - 2022 - Journal of Science Technology and Research (JSTAR) 3 (1):65-77.

Analytics

Added to PP
2020-08-01

Downloads
9 (#1,268,194)

6 months
2 (#1,446,842)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Aditya Kumar
KOLHAN UNIVERSITY INDIA

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references