Joint optimization of spectro-temporal features and deep neural nets for robust automatic speech recognition

Kovács György; Tóth László: Joint optimization of spectro-temporal features and deep neural nets for robust automatic speech recognition. In: Acta cybernetica, (22) 1. pp. 117-134. (2015)

Előnézet

Cikk, tanulmány, mű
actacyb_22_1_2015_8.pdf
Letöltés (247kB) | Előnézet

Absztrakt (kivonat)

In speech recognition, feature extraction and acoustical model training are traditionally done in two separate steps. Here, instead, we use a framework that combines spectro-temporal feature extraction and the training of neural network based acoustic models into a single process. We found earlier that this approach can be successfully applied for the recognition of speech. In this paper, we propose two further improvements to our method based on recent advances in neural net technology and extend our evaluation to speech contaminated with new types of noise. By repeating our experiments on TIMIT phone recognition tasks using clean and noise contaminated speech, we can compare the recognition performance of the original framework with our new, modified framework. The results indicate that both these modifications significantly improve the recognition performance of our framework. Moreover, we will show that these modifications allow us to achieve a substantially better performance than what we got earlier.

Mű típusa:	Cikk, tanulmány, mű
Befoglaló folyóirat/kiadvány címe:	Acta cybernetica
Dátum:	2015
Kötet:	22
Szám:	1
ISSN:	0324-721X
Oldalak:	pp. 117-134
Nyelv:	angol
Kiadás helye:	Szeged
Befoglaló mű URL:	http://acta.bibl.u-szeged.hu/38539/
DOI:	10.14232/actacyb.22.1.2015.8
Kulcsszavak:	Számítógép alkalmazása - beszédfelismerés
Megjegyzések:	Bibliogr.: p. 132-134. ; összefoglalás angol nyelven
Szakterület:	01. Természettudományok 01. Természettudományok > 01.02. Számítás- és információtudomány
Feltöltés dátuma:	2016. okt. 17. 10:36
Utolsó módosítás:	2022. jún. 20. 10:52
URI:	http://acta.bibl.u-szeged.hu/id/eprint/36260

Bővebben:

Tétel nézet