Phase-Aware Spectral Speech Enhancement Using Deep Learning Techniques
Seiten
2025
Shaker (Verlag)
978-3-8191-0312-4 (ISBN)
Shaker (Verlag)
978-3-8191-0312-4 (ISBN)
In everyday environments, speech is often degraded by background noise, reverberation, echo, or transmission losses. These distortions reduce quality and intelligibility, impairing communication. Speech enhancement techniques aim to overcome these challenges by improving the perceptual quality and clarity of speech under adverse conditions.
This dissertation advances the emerging field of phase-aware speech enhancement, which extends conventional magnitude-based methods by also processing the often-overlooked phase spectrum. Novel concepts for deep learning-based approaches are proposed and evaluated, with a particular focus on phase estimation and its integration into speech enhancement. Beyond theoretical investigations that highlight the potential of phase processing, methods for estimating the phase with deep neural networks are introduced, and strategies for jointly optimizing magnitude and phase estimation are proposed. Objective measures and subjective listening experiments confirm the effectiveness of the proposed approaches, underlining their relevance for the next generation of speech enhancement systems.
This dissertation advances the emerging field of phase-aware speech enhancement, which extends conventional magnitude-based methods by also processing the often-overlooked phase spectrum. Novel concepts for deep learning-based approaches are proposed and evaluated, with a particular focus on phase estimation and its integration into speech enhancement. Beyond theoretical investigations that highlight the potential of phase processing, methods for estimating the phase with deep neural networks are introduced, and strategies for jointly optimizing magnitude and phase estimation are proposed. Objective measures and subjective listening experiments confirm the effectiveness of the proposed approaches, underlining their relevance for the next generation of speech enhancement systems.
| Erscheinungsdatum | 24.10.2025 |
|---|---|
| Reihe/Serie | Aachen Series on Communication Systems ; 7 |
| Verlagsort | Düren |
| Sprache | englisch |
| Maße | 148 x 210 mm |
| Gewicht | 256 g |
| Themenwelt | Technik ► Elektrotechnik / Energietechnik |
| Technik ► Nachrichtentechnik | |
| Schlagworte | Deep learning • Phase Estimation • Speech Enhancement |
| ISBN-10 | 3-8191-0312-0 / 3819103120 |
| ISBN-13 | 978-3-8191-0312-4 / 9783819103124 |
| Zustand | Neuware |
| Informationen gemäß Produktsicherheitsverordnung (GPSR) | |
| Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
Grundlagen, Systemtechnik und Analysen ausgeführter Beispiele …
Buch | Softcover (2025)
Springer Vieweg (Verlag)
CHF 55,95
Wegweiser für Elektrofachkräfte
Buch | Hardcover (2024)
VDE VERLAG
CHF 67,20