Building and Training Generative AI Models
Apress (Verlag)
979-8-8688-2331-2 (ISBN)
- Titel nicht im Sortiment
- Artikel merken
The book starts by guiding readers on selecting the right model architecture for their application, be it text generation, image synthesis, or multimodal tasks. It then walks through essential components of model training, including dataset handling, self-supervised learning, and core optimisation techniques such as backpropagation, gradient descent, and learning rate scheduling. It also delves into large-scale training infrastructure, covering GPU/TPU usage, distributed computing frameworks, and system-level strategies for scaling performance. Practical guidance is provided on fine-tuning models with domain-specific data and applying reinforcement learning from human feedback (RLHF), model quantisation, and pruning to improve efficiency. Key challenges in generative AI—such as overfitting, bias, hallucination, and data efficiency—are addressed through proven techniques and emerging best practices. Readers will also gain insight into model interpretability and generalisation, ensuring robust and trustworthy outputs. The book demonstrates how to build scalable, production-ready generative systems across domains like media, healthcare, scientific simulation, and design through real-world examples and applied case studies.
By the end, readers will gain an understanding of how to architect, optimise, and apply generative models across diverse domains such as media creation, healthcare, design, scientific simulation, and beyond.
What you will learn;
Learn how to choose and implement generative models—VAEs, GANs, transformers, and diffusion models—for specific use cases.
Master training optimization techniques such as backpropagation, gradient descent, adaptive learning rates, and regularization.
Apply best practices for large-scale training using GPUs, TPUs, and distributed computing frameworks for performance scaling.
Boost model efficiency through quantization, pruning, fine-tuning, and RLHF to enhance output quality and reduce overhead.
Who this book is for:
AI Engineers and Machine Learning Practitioners looking to build and deploy generative models in real-world applications. Data Scientists working on deep learning projects involving text, vision, audio, or multimodal generation.
Irena Cronin is SVP of Product for DADOS Technology, which is making an app for the Apple Vision Pro that does data analytics and visualization. She is also the CEO of Infinite Retina, which provides research to help companies develop and implement AI, AR and other new technologies for their businesses. Prior to this, she worked for several years as an equity research analyst and gained extensive experience in evaluating both public and private companies. Irena has a joint MBA/MA from the University of Southern California and an MS with Distinction in Management and Systems from New York University. She graduated with a BA from the University of Pennsylvania with a major in Economics (summa cum laude). Irena Cronin is the author of the book "Understanding Generative AI Business Applications' published by Apress.
Chapter 1: Introduction to Generative AI Systems.- Chapter 2: Choosing the Right Architecture.- Chapter 3: Data Collection and Preparation.- Chapter 4: Training Fundamentals and Self-Supervised Learning.- Chapter 5: Optimization and Learning Strategies.- Chapter 6: Scaling Training with Infrastructure and Distributed Systems.- Chapter 7: Fine-Tuning and Domain Adaptation.- Chapter 8: Reinforcement Learning with Human Feedback (RLHF).- Chapter 9: Model Compression and Inference Optimization.- Chapter 10: Addressing Bias, Hallucinations, and Failure Modes.- Chapter 11: Evaluation and Benchmarking of Generative Models.- Chapter 12: Future Directions and Deployment in Production.
| Erscheint lt. Verlag | 27.5.2026 |
|---|---|
| Zusatzinfo | Approx. 350 p. |
| Verlagsort | Berkley |
| Sprache | englisch |
| Maße | 155 x 235 mm |
| Themenwelt | Mathematik / Informatik ► Informatik ► Programmiersprachen / -werkzeuge |
| Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik | |
| Schlagworte | Deep Learning for Generative Models • Diffusion models in AI • Generative Adversarial Networks (GANs) • generative AI • Training large-scale AI systems • Transformer-based AI models • Variational Autoencoders (VAEs) |
| ISBN-13 | 979-8-8688-2331-2 / 9798868823312 |
| Zustand | Neuware |
| Informationen gemäß Produktsicherheitsverordnung (GPSR) | |
| Haben Sie eine Frage zum Produkt? |
aus dem Bereich