Joint Training for Neural Machine Translation - Yong Cheng

Joint Training for Neural Machine Translation

Yong Cheng (Author)

Book | Hardcover
78 pages
2019 | 2019 ed.
Springer Verlag, Singapore
978-981-329-747-0 (ISBN)
CHF 74.85 incl. VAT
This book presents four approaches to jointly training bidirectional neural machine translation (NMT) models. First, to improve the accuracy of the attention mechanism, it proposes an agreement-based joint training approach that encourages the two complementary models (source-to-target and target-to-source) to agree on word alignment matrices for the same training data. Second, it presents a semi-supervised approach that uses an autoencoder to reconstruct monolingual corpora, thereby incorporating these corpora into NMT training. Third, it introduces a joint training algorithm for pivot-based NMT, which can be used to mitigate the data scarcity problem. Lastly, it describes an end-to-end bidirectional NMT model that connects the source-to-target and target-to-source translation models, allowing the parameters of the two directional models to interact.

Yong Cheng is currently a software engineer engaged in research at Google. Before joining Google, he worked as a senior researcher at Tencent AI Lab. He obtained his Ph.D. from the Institute for Interdisciplinary Information Sciences (IIIS) at Tsinghua University in 2017. His research interests focus on neural machine translation and natural language processing.

1. Introduction
2. Neural Machine Translation
3. Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation
4. Semi-supervised Learning for Neural Machine Translation
5. Joint Training for Pivot-based Neural Machine Translation
6. Joint Modeling for Bidirectional Neural Machine Translation with Contrastive Learning
7. Related Work
8. Conclusion

Publication date
Series: Springer Theses
Additional info: 9 illustrations, color; 14 illustrations, black and white; XIII, 78 p. 23 illus., 9 illus. in color.
Place of publication: Singapore
Language: English
Dimensions: 155 x 235 mm
Subject area: Computer Science, Theory / Studies, Artificial Intelligence / Robotics
ISBN-10: 981-329-747-6 / 9813297476
ISBN-13: 978-981-329-747-0 / 9789813297470
Condition: New