A Discriminative Segmental Speech Model and its Application to Hungarian Number Recognition
|Title||A Discriminative Segmental Speech Model and its Application to Hungarian Number Recognition|
|Publication Type||Conference Paper|
|Year of Publication||2000|
|Authors||Tóth L, Kocsor A, Kovács K|
|Conference Name||Text, Speech and Dialogue: Third International Workshop, TSD 2000, LNAI vol. 1902|
|Place Published||Brno, Czech Republic|
This paper presents a stochastic segmental speech recogniser that models the a posteriori probabilities directly. The main issues concerning the system are segmental phoneme classification, utterance-level aggregation and the pruning of the search space. For phoneme classification, artificial neural networks and support vector machines are applied. Phonemic segmentation and utterance-level aggregation is performed with the aid of anti-phoneme modelling. At the phoneme level, the system convincingly outperforms the HMM system trained on the same corpus, while at the word level it attains the performance of the HMM system trained without embedded training.