Zum Hauptinhalt springen
Nicht aus der Schweiz? Besuchen Sie lehmanns.de
Machine Learning for Microbiome Statistics - Yinglin Xia, Jun Sun

Machine Learning for Microbiome Statistics

, (Autoren)

Buch | Hardcover
736 Seiten
2026
CRC Press (Verlag)
978-1-041-00524-7 (ISBN)
CHF 239,95 inkl. MwSt
  • Noch nicht erschienen (ca. Februar 2026)
  • Versandkostenfrei
  • Auch auf Rechnung
  • Artikel merken
Machine learning fundamentally learns from the past experiences (seen data) to make predictions about future (unseen data). Predictions in nature are often uncertain. Microbiome data have unique characteristics, including high-dimensionality, over-dispersion, sparsity and zero-inflation, and heterogeneity. Thus, machine learning microbiome data for predicting the outcome of phenotypes is even more uncertain than learning those data from other fields. Machine Learning for Microbiome Statistics poses many challenges for evaluating the prediction performance using appropriate metrics and independent data validation.

This unique book aims to address the challenges of machine learning statistics, emphasize the importance of performance valuation by appropriate metrics and independent data, and describe several important concepts of machine learning statistics, such as feature engineering and overfitting. It comprehensively reviews commonly used and newly developed machine learning models for microbiome research. Specifically, this book provides the step-by-step procedures to perform machine learning microbiome data, including feature engineering, algorithm selection and optimization, performance evaluation and model testing. It comments the benefits and limitations of using machine learning for microbiome statistics and remarks on the advantages and disadvantages of each machine learning algorithm.

It will be an excellent reference book for students and academics in the field.



Presents a thorough overview of machine learning algorithms for microbiome statistics.
Performs step-by-step procedures to perform machine learning microbiome data, using important supervised learning algorithms, including classical, ensemble learning and tree-based models.
Describes important concepts of machine learning, including bias and variance tradeoff, accuracy and precision, overfitting and underfitting, model complexity and interpretability, and feature engineering,
Investigates and applies various cross-validation techniques step-by-step.
Introduces confusion matrix and its derived measures. Comprehensively describes the properties of F1, Matthews’ correlation coefficient (MCC), area under the receiver operating characteristic curve (AUC-ROC), and area under the precision-recall curve (AUC-PR), as well as discusses their advantages and disadvantages when using for microbiome data.
Offers all related R codes and the datasets from the authors’ first-hand microbiome research and publicly available data.

Dr. Yinglin Xia is a Clinical Professor in the Department of Medicine at the University of Illinois Chicago. He has published six books on statistical analysis of microbiome and metabolomics data and more than 180 statistical methodology and research papers in peer-reviewed journals. He serves on the editorial boards of several scientific journals including as an Associate Editor of Gut Microbes and has served as a reviewer for over 100 scientific journals. Dr. Jun Sun is a tenured Professor of Medicine at the University of Illinois Chicago and an internationally recognized expert on microbiome and human diseases, e.g., vitamin D receptor in inflammation, dysbiosis and intestinal dysfunction in amyotrophic lateral sclerosis (ALS). Her lab is the first to discover that chronic effects and molecular mechanisms of Salmonella infection and risk of colon cancer. Dr. Sun has published over 260 scientific articles in peer-reviewed journals and 10 books on microbiome.

Preface Acknowledgements About the Authors Chapter 1 Introduction to Machine Learning Chapter 2 Overview of Machine Learning in Microbiome Research Chapter 3 Accessing Model Accuracy and Goodness of Fit Tests for Normality Chapter 4 Overfitting and Underfitting Chapter 5 Assessing Model Accuracy Using Cross-Validation Chapter 6 Feature Engineering and Model Selection Chapter 7 Logistic Regression Chapter 8 Support Vector Machines Chapter 9 Classification Trees Chapter 10 Random Forest Chapter 11 The Evolution of Tree-Based Algorithms Chapter 12 Extreme Gradient Boosting (XGBoost) Chapter 13 Artificial Neural Networks and Deep Learning Chapter 14 Machine Learning Microbiome with SIAMCAT Chapter 15 Basic Performance Metrics for Machine Learning Models Chapter 16 Matthews Correlation Coefficient Chapter 17 Area Under the Receiver Operating Characteristic Curve (AUC-ROC) Chapter 18 Area Under the Precision-Recall Curve (AUC-PR) Chapter 19 Comparisons of Machine Learning Classification Models with Tidymodels

Erscheint lt. Verlag 26.2.2026
Reihe/Serie Chapman & Hall/CRC Biostatistics Series
Zusatzinfo 49 Tables, black and white; 56 Line drawings, color; 35 Line drawings, black and white; 2 Halftones, color; 58 Illustrations, color; 35 Illustrations, black and white
Verlagsort London
Sprache englisch
Maße 156 x 234 mm
Themenwelt Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Mathematik / Informatik Mathematik Statistik
ISBN-10 1-041-00524-5 / 1041005245
ISBN-13 978-1-041-00524-7 / 9781041005247
Zustand Neuware
Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Eine kurze Geschichte der Informationsnetzwerke von der Steinzeit bis …

von Yuval Noah Harari

Buch | Hardcover (2024)
Penguin (Verlag)
CHF 39,95
die materielle Wahrheit hinter den neuen Datenimperien

von Kate Crawford

Buch | Hardcover (2024)
C.H.Beck (Verlag)
CHF 44,75