Mathematical Models for Speech Technology - Stephen Levinson

Blick ins Buch

Mathematical Models for Speech Technology (eBook)

Stephen Levinson (Autor)

eBook Download: PDF

2005
John Wiley & Sons (Verlag)
978-0-470-02090-6 (ISBN)

Lese- und Medienproben

Ebook-Leseprobe (PDF)

Mathematical Models of Spoken Language presents the motivations for, intuitions behind, and basic mathematical models of natural spoken language communication. A comprehensive overview is given of all aspects of the problem from the physics of speech production through the hierarchy of linguistic structure and ending with some observations on language and mind.

The author comprehensively explores the argument that these modern technologies are actually the most extensive compilations of linguistic knowledge available.Throughout the book, the emphasis is on placing all the material in a mathematically coherent and computationally tractable framework that captures linguistic structure.

It presents material that appears nowhere else and gives a unification of formalisms and perspectives used by linguists and engineers. Its unique features include a coherent nomenclature that emphasizes the deep connections amongst the diverse mathematical models and explores the methods by means of which they capture linguistic structure.

This contrasts with some of the superficial similarities described in the existing literature; the historical background and origins of the theories and models; the connections to related disciplines, e.g. artificial intelligence, automata theory and information theory; an elucidation of the current debates and their intellectual origins; many important little-known results and some original proofs of fundamental results, e.g. a geometric interpretation of parameter estimation techniques for stochastic models and finally the author's own unique perspectives on the future of this discipline.

There is a vast literature on Speech Recognition and Synthesis however, this book is unlike any other in the field. Although it appears to be a rapidly advancing field, the fundamentals have not changed in decades. Most of the results are presented in journals from which it is difficult to integrate and evaluate all of these recent ideas. Some of the fundamentals have been collected into textbooks, which give detailed descriptions of the techniques but no motivation or perspective. The linguistic texts are mostly descriptive and pictorial, lacking the mathematical and computational aspects. This book strikes a useful balance by covering a wide range of ideas in a common framework. It provides all the basic algorithms and computational techniques and an analysis and perspective, which allows one to intelligently read the latest literature and understand state-of-the-art techniques as they evolve.

Stephen Levinson is the author of Mathematical Models for Speech Technology, published by Wiley.

Mathematical Models of Spoken Language presents the motivations for, intuitions behind, and basic mathematical models of natural spoken language communication. A comprehensive overview is given of all aspects of the problem from the physics of speech production through the hierarchy of linguistic structure and ending with some observations on language and mind. The author comprehensively explores the argument that these modern technologies are actually the most extensive compilations of linguistic knowledge available.Throughout the book, the emphasis is on placing all the material in a mathematically coherent and computationally tractable framework that captures linguistic structure. It presents material that appears nowhere else and gives a unification of formalisms and perspectives used by linguists and engineers. Its unique features include a coherent nomenclature that emphasizes the deep connections amongst the diverse mathematical models and explores the methods by means of which they capture linguistic structure. This contrasts with some of the superficial similarities described in the existing literature; the historical background and origins of the theories and models; the connections to related disciplines, e.g. artificial intelligence, automata theory and information theory; an elucidation of the current debates and their intellectual origins; many important little-known results and some original proofs of fundamental results, e.g. a geometric interpretation of parameter estimation techniques for stochastic models and finally the author's own unique perspectives on the future of this discipline. There is a vast literature on Speech Recognition and Synthesis however, this book is unlike any other in the field. Although it appears to be a rapidly advancing field, the fundamentals have not changed in decades. Most of the results are presented in journals from which it is difficult to integrate and evaluate all of these recent ideas. Some of the fundamentals have been collected into textbooks, which give detailed descriptions of the techniques but no motivation or perspective. The linguistic texts are mostly descriptive and pictorial, lacking the mathematical and computational aspects. This book strikes a useful balance by covering a wide range of ideas in a common framework. It provides all the basic algorithms and computational techniques and an analysis and perspective, which allows one to intelligently read the latest literature and understand state-of-the-art techniques as they evolve.

Stephen Levinson is the author of Mathematical Models for Speech Technology, published by Wiley.

Author's preface.

1 Introduction

2 Preliminaries

2.1 The physics of speech production

2.2 The source-filter model

2.3 Information-bearing features of the speech signal

2.4 Time-frequency representations

2.5 Classifications of acoustic patterns in speech

2.6 Temporal invariance and stationarity

2.7 Taxonomy of linguistic structure

3 Mathematical models of linguistic structure

3.1 Probabilistic functions of a discrete Markov process

3.2 Formal grammars and abstract automata

4 Syntactic analysis

4.1 Deterministic parsing algorithms

4.2 Probabilistic parsing algorithms

4.3 Parsing natural language

5 Grammatical inference

5.1 Exact inference and Gold's theorem

5.2 Baum's algorithm for regular grammars

5.3 Event counting in parse trees

5.4 Baker's algorithm for context-free grammars

6 Information-theoretic analysis of speech communication

6.1 The Miller et al. experiments

6.2 Entropy of an information source

6.3 Recognition error rates and entropy

7 Automatic speech recognition and constructive theories of language

7.1 Integrated architectures

7.2 Modular architectures

7.3 Parameter estimation from fluent speech

7.4 System performance

7.5 Other speech technologies

8 Automatic speech understanding and semantics

8.1 Transcription and comprehension

8.2 Limited domain semantics

8.3 The semantics of natural language

8.4 System architectures

8.5 Human and machine performance

9 Theories of mind and language

9.1 The challenge of automatic natural language understanding

9.2 Metaphors for mind

9.3 The artificial intelligence program

10 A speculation on the prospects for a science of the mind

10.1 The parable of the thermos bottle: measurements and symbols

10.2 The four questions of science

10.3 A constructive theory of the mind

10.4 The problem of consciousness

10.5 The role of sensorimotor function, associative memory and reinforcement learning in automatic acquisition of spoken language by an autonomous robot

10.6 Final thoughts: predicting the course of discovery

"...a succinct presentation of the most important mathematical technology of speech technology and the author's ideas for overcoming the limitations of these techniques..." (Mathematical Reviews, 2005j)

Erscheint lt. Verlag	13.5.2005
Sprache	englisch
Themenwelt	Geisteswissenschaften ► Sprach- / Literaturwissenschaft ► Sprachwissenschaft
	Mathematik / Informatik ► Informatik
	Mathematik / Informatik ► Mathematik ► Angewandte Mathematik
	Technik ► Elektrotechnik / Energietechnik
Schlagworte	Ability • Audio & Speech Processing & Broadcasting • Audio-, Sprachverarbeitung u. Ãbertragung • Audio-, Sprachverarbeitung u. Übertragung • competence • consists • convey • Electrical & Electronics Engineering • Elektrotechnik u. Elektronik • Express • humans • invaluable reference • language • lexical • Linguistic • many • Meaningful • Messages • Modell (Math.) • Models • motivations • Product • Research • Signal Processing • Signalverarbeitung • Simply • Sprachverarbeitung • Technology • Tool • transcriptions • use • years experience
ISBN-10	0-470-02090-3 / 0470020903
ISBN-13	978-0-470-02090-6 / 9780470020906

Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?

PDF (Adobe DRM)

Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seitenlayout eignet sich die PDF besonders für Fachbücher mit Spalten, Tabellen und Abbildungen. Eine PDF kann auf fast allen Geräten angezeigt werden, ist aber für kleine Displays (Smartphone, eReader) nur eingeschränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine Adobe-ID und die Software Adobe Digital Editions (kostenlos). Von der Benutzung der OverDrive Media Console raten wir Ihnen ab. Erfahrungsgemäß treten hier gehäuft Probleme mit dem Adobe DRM auf.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine Adobe-ID sowie eine kostenlose App.
Geräteliste und zusätzliche Hinweise

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Print-Ausgabe

Buch | Hardcover

CHF 199,95