Zum Hauptinhalt springen
Nicht aus der Schweiz? Besuchen Sie lehmanns.de
Real-time Speech and Music Classification by Large Audio Feature Space Extraction - Florian Eyben

Real-time Speech and Music Classification by Large Audio Feature Space Extraction

(Autor)

Buch | Softcover
XXXVIII, 298 Seiten
2018 | Softcover reprint of the original 1st ed. 2016
Springer International Publishing (Verlag)
978-3-319-80111-7 (ISBN)
CHF 224,65 inkl. MwSt
  • Versand in 10-15 Tagen
  • Versandkostenfrei
  • Auch auf Rechnung
  • Artikel merken

This book reports on an outstanding thesis thathas significantly advanced the state-of-the-art in the automated analysis andclassification of speech and music.  Itdefines several standard acoustic parameter sets and describes theirimplementation in a novel, open-source, audio analysis framework calledopenSMILE, which has been accepted and intensively used worldwide. The bookoffers extensive descriptions of key methods for the automatic classificationof speech and music signals in real-life conditions and reports on theevaluation of the framework developed and the acoustic parameter sets that wereselected. It is not only intended as a manual for openSMILE users, but also andprimarily as a guide and source of inspiration for students and scientists involvedin the design of speech and music analysis methods that can robustly handlereal-life conditions.

Abstract.- Introduction.- Acoustic Features and Modelling.- Standard Baseline Feature Sets.- Real-time Incremental Processing.- Real-life Robustness.- Evaluation.- Discussion and Outlook.- Appendix.- Mel-frequency Filterbank Parameters.

Erscheinungsdatum
Reihe/Serie Springer Theses
Zusatzinfo XXXVIII, 298 p. 41 illus., 39 illus. in color.
Verlagsort Cham
Sprache englisch
Maße 155 x 235 mm
Gewicht 516 g
Themenwelt Informatik Software Entwicklung User Interfaces (HCI)
Technik Elektrotechnik / Energietechnik
Schlagworte Acoustic Feature Extraction • Affective computing • Computational Paralinguistics • Music Information Retrieval • openSMILE • speech emotion recognition • Voice Analytics
ISBN-10 3-319-80111-2 / 3319801112
ISBN-13 978-3-319-80111-7 / 9783319801117
Zustand Neuware
Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Kindersachbuch über die Welt von Morgen

von Christoph Drösser

Buch | Hardcover (2025)
Gabriel in der Thienemann-Esslinger Verlag GmbH
CHF 24,90
Wissensverarbeitung - Neuronale Netze

von Uwe Lämmel; Jürgen Cleve

Buch | Hardcover (2023)
Carl Hanser (Verlag)
CHF 48,95
was alle wissen sollten, die Websites und Apps entwickeln

von Jens Jacobsen; Lorena Meyer

Buch | Hardcover (2024)
Rheinwerk (Verlag)
CHF 55,85