Skip to main content
eScholarship
Open Access Publications from the University of California

Transform Representation of the Spectra of Acoustic Speech Segments with Applications, Part 2: Speech Analysis, Synthesis and Coding

  • Author(s): Algazi, Ralph
  • Cadwell, C.
  • Irvine, D.
  • Ready, M.
  • Brown, K.
  • Chung, S.
  • et al.
Abstract

In Part I of this paper, we introduced a new approach to the representation of the speech spectral envelope which makes use of the Karhunen-Loeve transformation of acoustic subword segments. This new signal-dependent representation captures, with a few KL vectors and transform coefficients, the perceptually and phonetically important structure of the spectral envelope. In this second part, we study the application of this new representation to the analysis, synthesis, and coding of speech. We propose simple quantization and coding strategies for the KL representation vectors as well as for the resulting transform coefficients. The resulting technique is a variable rate encoding scheme which achieves good speech quality at an average rate of 3.5 kilobits per second.

Main Content
Current View