Idioma:

Speech recognition algorithms using weighted finite-state transducers

Takaaki Hori Atsushi Nakamura; Morgan & Claypool Publishers

San Rafael, Calif. Morgan & Claypool Publishers c2013

Localização: EPELM - Esc. Politécnica-Bib Eng Elet., Mec. e Naval (004.934 H782s )(Acessar)

Enviar para

Título:
Speech recognition algorithms using weighted finite-state transducers
Autor: Takaaki Hori
Atsushi Nakamura; Morgan & Claypool Publishers
Assuntos: Automatic speech recognition; Speech processing systems; PROCESSAMENTO DE VOZ; RECONHECIMENTO DE VOZ
Notas: Includes bibliographical references
Descrição: 1. Introduction -- 2. Brief overview of speech recognition -- 3. Introduction to weighted finite-state transducers -- 4. Speech recognition by weighted finite-state transducers -- 5. Dynamic decoders with on-the-fly WFST operations -- 6. Summary and perspective
This book introduces the theory, algorithms, and implementation techniques for efficient decoding in speech recognition mainly focusing on the Weighted Finite-State Transducer (WFST) approach. The decoding process for speech recognition is viewed as a search problem whose goal is to find a sequence of words that best matches an input speech signal. Since this process becomes computationally more expensive as the system vocabulary size increases, research has long been devoted to reducing the computational cost. Recently, the WFST approach has become an important state-of-the-art speech recognition technology, because it offers improved decoding speed with fewer recognition errors compared with conventional methods. However, it is not easy to understand all the algorithms used in this framework, and they are still in a black box for many people. In this book, we review the WFST approach and aim to provide comprehensive interpretations of WFST operations and decoding algorithms to help anyone who wants to understand, develop, and study WFST-based speech recognizers. We also mention recent advances in this framework and its applications to spoken language processing
Títulos relacionados: Série:Synthesis lectures on speech and audio processing #10
Editor: San Rafael, Calif. Morgan & Claypool Publishers
Data de criação/publicação: c2013
Formato: xii, 150 p. ill. 24 cm.
Idioma: Inglês

Links

Este item no Dedalus

Voltar para lista de resultados

Resultado 1 Avançar Ir para próxima página

Realização: Logos de Redes Sociais:

Speech recognition algorithms using weighted finite-state transducers

Takaaki Hori Atsushi Nakamura; Morgan & Claypool Publishers

San Rafael, Calif. Morgan & Claypool Publishers c2013

Buscando em bases de dados remotas. Favor aguardar.