100904 - Speech and Audio Processing
Introduction- Speech production and modeling - Human Auditory System; General structure of speech coders; Classification of speech coding techniques – parametric, waveform and hybrid ; Requirements of speech codecs –quality, coding delays, robustness.
Speech Signal Processing- Pitch-period estimation, all-pole and all-zero filters, convolution; Power spectral density, periodogram, autoregressive model, autocorrelation estimation.
Linear Prediction of Speech- Basic concepts of linear prediction; Linear Prediction Analysis of non- stationary signals –prediction gain, examples; Levinson-Durbin algorithm; Long term and short-term linear prediction models; Moving average prediction.
Speech Quantization- Scalar quantization–uniform quantizer, optimum quantizer, logarithmic quantizer, adaptive quantizer, differential quantizers; Vector quantization – distortion measures, codebook design, codebook types.
Scalar Quantization of LPC- Spectral distortion measures, Quantization based on reflection coefficient and log area ratio, bit allocation; Line spectral frequency – LPC to LSF conversions, quantization based on LSF.
Linear Prediction Coding- LPC model of speech production; Structures of LPCencoders and decoders; Voicing detection; Limitations of the LPC model.
Code Excited Linear Prediction- CELP speech production model; Analysis-by-synthesis; Generic CELP encoders and decoders; Excitation codebook search – state-save method, zero- input zero-state method; CELP based on adaptive codebook, Adaptive Codebook search; Low Delay CELP and algebraic CELP.
Speech Coding Standards-An overview of ITU-T G.726, G.728 and G.729standards
Text/Reference Books:
1. “Digital Speech” by A.M.Kondoz, Second Edition (Wiley Students‟ Edition), 2004.
2. “Speech Coding Algorithms: Foundation and Evolution of Standardized Coders”, W.C. Chu, Wiley Inter science, 2003.