skip to main content

Non-linear encoding of the excitation source using neural networks for transition mode coding in CELP

Joseph, M. A ; Yegnanarayana, B

2012 International Conference on Signal Processing and Communications (SPCOM), July 2012, pp.1-5

Texto completo disponível

Citações Citado por
  • Título:
    Non-linear encoding of the excitation source using neural networks for transition mode coding in CELP
  • Autor: Joseph, M. A ; Yegnanarayana, B
  • Assuntos: Speech ; Decoding ; Speech Coding ; Neural Networks ; Synchronization ; Bit Rate ; Speech Coding ; Gci ; Neural Network ; Transition Mode Coding ; Celp ; Engineering
  • É parte de: 2012 International Conference on Signal Processing and Communications (SPCOM), July 2012, pp.1-5
  • Descrição: When a frame suffers erasure, the adaptive codebook at the decoder is no longer in sync with the one at the encoder. When the frame that is erased is a frame following the voice-onset frame, this loss of synchronization of the codebooks severely degrades the quality of the decoded speech. This degradation is primarily because no meaningful excitation signal is present in the adaptive codebook. In this paper, an autoassociative neural network (AANN) with a compression layer is used to capture the characteristics of the excitation source around the GCIs. A transition mode frame that differs from the conventional CELP frame without altering the bit-rate is proposed to deal with this problem of frame drops during transition regions. In this transition mode frames, the compressed representation of the excitation source around the GCIs obtained through AANNs is used to reconstruct the adaptive codebook at the receiver. It is shown that the proposed method improves the quality of the decoded speech.
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.