Transformers in Action
Manning Publications (Verlag)
978-1-63343-788-3 (ISBN)
- Noch nicht erschienen (ca. März 2026)
- Versandkostenfrei
- Auch auf Rechnung
- Artikel merken
Transformer models power the chatbots, coders, and translators reshaping every industry today. Yet their architecture, math, and tuning often remain an intimidating black box. Stop copy pasting tutorials and start truly understanding what happens under the hood. Transformers in Action walks you through every layer with practical Python and clear analogies. Master small, large, and multimodal models, then optimize them for speed and cost. Build solutions that translate, summarize, and generate with confidence, efficiency, and rigor.
Layer-by-layer walkthrough: See how attention, embeddings, and positional encodings produce fluent output.
Task adaptation recipes: Fine-tune models for summarization, classification, or translation in minutes.
Optimization strategies: Reduce latency, shrink memory, and cut cloud bills without sacrificing accuracy.
Reinforcement learning techniques: Refine text generation quality using reward models and policy gradients.
Multimodal expansion: Combine text and vision to build next-generation, cross-media applications.
Complete code repository: Experiment instantly, tweak hyperparameters, and validate concepts on real datasets.
Transformers in Action, by Quantmate CEO and Chief AI Officer Nicole Koenigstein, has clear math walkthroughs, annotated Python, and production-ready patterns that you can trust.
The journey starts with encoder-only, decoder-only, and encoder-decoder variants, then moves to small language models for constrained environments. Each chapter couples theory with runnable notebooks, visual explanations, and performance benchmarks. Finish knowing exactly when to deploy a lightweight model, how to tune hyperparameters, and how to monitor costs. You will ship faster, safer, and leaner LLM solutions that impress users and stakeholders.
Ideal for software engineers and data scientists comfortable with Python and basic machine learning, eager to unlock transformer power.
Nicole Koenigstein is a CEO and Chief AI Officer renowned for transforming raw research into profitable AI systems. With years leading Quantmate’s agentic intelligence platform, Nicole brings clarity, precision, and business focus to every page. She distills deep model-building expertise into accessible guidance that helps readers deliver faster, smarter transformer solutions.
PART 1 FOUNDATIONS OF MODERN TRANSFORMER MODELS
1 THE NEED FOR TRANSFORMERS
2 A DEEPER LOOK INTO TRANSFORMERS
PART 2: GENERATIVE TRANSFORMERS
3 MODEL FAMILIES AND ARCHITECTURE VARIANTS
4 TEXT GENERATION STRATEGIES AND PROMPTING TECHNIQUES
5 PREFERENCE ALIGNMENT AND RAG
PART 3: SPECIALIZED MODELS
6 MULTIMODAL MODELS
7 EFFICIENT AND SPECIALIZED LARGE LANGUAGE MODELS
8 TRAINING AND EVALUATING LARGE LANGUAGE MODELS
9 OPTIMIZING AND SCALING LARGE LANGUAGE MODELS
10 ETHICAL AND RESPONSIBLE LARGE LANGUAGE MODELS
| Erscheint lt. Verlag | 4.3.2026 |
|---|---|
| Verlagsort | New York |
| Sprache | englisch |
| Themenwelt | Mathematik / Informatik ► Informatik ► Software Entwicklung |
| Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik | |
| ISBN-10 | 1-63343-788-4 / 1633437884 |
| ISBN-13 | 978-1-63343-788-3 / 9781633437883 |
| Zustand | Neuware |
| Informationen gemäß Produktsicherheitsverordnung (GPSR) | |
| Haben Sie eine Frage zum Produkt? |
aus dem Bereich