Dr. Guillaume Fuchs, Jeremie Lecomte, Dr. Christian Uhle, from Fraunhofer IIS
Summer Term 2014, New time: Fridays at 14:15-16:00
NOTE: It seems that this schedule overlaps with a compulsory lab course of some CME students. We will therefore try to find an alternative time-slot on the first lecture. Please notify me if you cannot attend.
Am Wolfsmantel 33, Erlangen-Tennenlohe, Room 3R4.04
Please come to the first lecture on Thursday, 10.04.2014, 16:15 - 18:00, Room 3R4.04 (Am Wolfsmantel 33). If you are unable to attend, please contact Prof. Dr. Tom Bäckström.
Mobile phones – everyone has one. With 7 billion mobile phones in use, digital speech transmission is a truly global technology. Your grandma has one, Prince Charles has one and the poorest village in Africa has one. While the technology clearly works already, with such a market, the smallest improvement, when multiplied by 7 billion, has a huge impact worldwide.
Speech coding refers to digital compression and transmission of speech. This course provides an in-depth perspective to ACELP, the most commonly used speech coding algorithm. We will study the speech production models on which it is based, the perceptual models which are used for its optimization, and most importantly, go through the theory and practice of the most important concepts, linear prediction (LP), long time prediction (LTP), algebraic codebooks, line spectral frequencies (LSFs) and windowing. In addition, we will look at the big picture, the additional challenges that emerge when building a commercial speech coding product.
The goal of this course is to provide a strong foundation for researchers, engineers, and graduate students who are interested in the problem of speech coding.
This course is the most advanced course offered by the university on this topic, and serves as an excellent basis from which to commence research in the area. Various aspects of the course bring students up to date with the very latest developments in the field, as seen in recent international standards, conferences and journals. This course builds on Sprach- und Audiosignalverarbeitung (by Prof. Kellermann), and is well complimented by Mensch-Maschine-Schnittstelle (by Prof. Rabenstein), Praxis der Audiodatenkompression (Dr. Grill), Speech Enhancement (Prof. Habets) and Selected Topics in Perceptual Audio Coding (Prof. Herre), which deal with many other signal processing methods and gives an understanding of human auditory perception (also a key part of speech coding) and audio compression techniques.