Zum Hauptinhalt springen
Nicht aus der Schweiz? Besuchen Sie lehmanns.de
Für diesen Artikel ist leider kein Bild verfügbar.

Textual and contextual data analysis

A multivariate statistical approach using R
Buch | Hardcover
248 Seiten
2026
Chapman & Hall/CRC (Verlag)
9781032502267 (ISBN)
CHF 174,55 inkl. MwSt
  • Noch nicht erschienen (ca. Juli 2026)
  • Versandkostenfrei
  • Auch auf Rechnung
  • Artikel merken
Multidimensional statistical analysis of textual data is a powerful technique that enables researchers to uncover deeper insights into the context and meaning of documents. This book addresses the challenge of jointly analyzing textual and contextual data, presenting rigorous theoretical foundations alongside practical methodologies. By incorporating metadata and contextual information, readers can extract richer, more nuanced information from textual corpora, making this book an essential resource for statisticians, data scientists, and linguistics experts.

The book explores a wide range of textual data, from open-ended survey responses and political speeches to legal texts, literary works, and technical reports. It also examines the diverse contextual variables that shape these texts, such as sociodemographic characteristics, chronology, political affiliations, and external influences. Through real-world examples, readers will learn how to apply exploratory multivariate statistical methods to compare, characterize, and reveal the underlying structure of textual data. Each chapter builds on the previous one, offering a systematic approach to encoding, analyzing, and visualizing textual and contextual data. Topics include machine learning methods like latent semantic analysis and correspondence analysis, clustering techniques, restricted clustering defined by contextual data, and advanced visualization tools. The book also introduces methodologies for analyzing multilingual corpora and isolated texts, emphasizing the importance of discourse strategies and thematic contrasts.

This book is not only a guide to advanced statistical methods but also a practical toolkit for researchers working with diverse corpora. Whether analyzing legal databases, sensory evaluations, or political speeches, readers will find robust techniques to uncover patterns, relationships, and strategies within their data. By combining textual and contextual analysis, this book empowers readers to make meaningful comparisons and draw actionable conclusions.

KEY FEATURES:

• Comprehensive coverage of methods for jointly analyzing textual and contextual data.

• Practical applications to diverse corpora, including legal texts, political speeches, and sensory evaluations.

• Systematic comparison of machine learning methods like latent semantic analysis and correspondence analysis.

• Advanced visualization techniques, including interactive, 3D, and animated graphics.

• Methodologies for analyzing multilingual corpora and isolated texts, with a focus on discourse strategies.

Dr. Mónica Bécue-Bertaut taught statistics and data science at the Universitat Politènica de Catalunya and offered numerous guest lectures on textual data science in different countries. She has published several books and chapters on this topic, and she has helped design software related to textual data science, including SPAD.T and the R package Xplortext. She is an elected fellow of the International Statistical Institute and a Chevalier des Palmes Académiques, a distinction bestowed by the French government. Dr. Ramón Alvarez-Esteban is an associate professor at the University of León (Spain), where he teaches multivariate data analysis and R. His research interests include textual data analysis, climate change models, and integrated statistical and geospatial techniques. He is an author and the maintainer of the Xplortext R package (Statistical Analysis of Textual Data), which has been available on the CRAN website since 2017.

Preface 1. Consideration of Additional Information Called Contextual Data 2. SVD-Based Methods in Textual Analysis: An Overview 3. Clustering Methods 4. Constrained Clustering Defined by the Contextual Data into the Analysis 5. Textual Data Visualization 6. Textual Data and Contextual Data Playing a Symmetric Role 7. Correspondence Analysis on a Generalized Aggregate Lexical Table 8. Structure and Organization of a Text 9. Extension of Multivariate Statistical Methods to Multilingual Corpus Bibliography Index List of Figures List of Tables

Erscheint lt. Verlag 16.7.2026
Reihe/Serie Chapman & Hall/CRC Data Science Series
Zusatzinfo 36 Tables, black and white; 3 Line drawings, color; 64 Line drawings, black and white; 2 Halftones, black and white; 3 Illustrations, color; 66 Illustrations, black and white
Sprache englisch
Maße 156 x 234 mm
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Mathematik / Informatik Informatik Theorie / Studium
Mathematik / Informatik Mathematik
ISBN-13 9781032502267 / 9781032502267
Zustand Neuware
Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
eine Einführung mit Python, Scikit-Learn und TensorFlow

von Oliver Zeigermann; Chi Nhan Nguyen

Buch | Softcover (2024)
O'Reilly (Verlag)
CHF 27,85
Von den Grundlagen bis zum Produktiveinsatz

von Anatoly Zelenin; Alexander Kropp

Buch (2025)
Hanser (Verlag)
CHF 69,95