Hands-On RAG for Production
Design, Develop, and Deploy Production-Ready RAG Applications
Seiten
2026
O'Reilly Media (Verlag)
979-8-3416-2171-8 (ISBN)
O'Reilly Media (Verlag)
979-8-3416-2171-8 (ISBN)
- Titel nicht im Sortiment
- Artikel merken
Authors Ofer Mendelevitch and Forrest Bao guide you through every phase of development, from data ingestion, embeddings, and vector search to advanced techniques like agentic RAG, multimodal RAG, and GraphRAG. This book simplifies the process, offering a comprehensive road map to building, refining, and scaling production-grade RAG applications.
Retrieval-augmented generation (RAG) is the go-to strategy for integrating large language models with your organization's unique knowledge. However, the market is full of RAG pipelines and components, making it hard to choose the right solution for your enterprise's needs. This book simplifies the process, offering a comprehensive road map to building, refining, and scaling production-grade RAG applications.
Authors Ofer Mendelevitch and Forrest Bao guide you through every phase of development, from data ingestion, embeddings, and vector search to advanced techniques like agentic RAG, multimodal RAG, and GraphRAG. Engineers and architects will learn how to tackle the challenges they'll encounter when building RAG applications at enterprise scale: ensuring high accuracy with minimal hallucinations, maintaining low-latency performance, safeguarding data privacy, and providing transparent, explainable responses among them.
Determine whether to build RAG yourself or deploy a RAG-as-a-service platform
Build a basic RAG stack that maximizes performance and cost-effectiveness
Measure key metrics such as hallucinations, response quality, latency, and cost
Address challenges in enterprise deployment, such as compliance with data security and privacy requirements, explainability, and prompt design
Implement advanced techniques such as multimodal RAG, agentic RAG, and GraphRAG
Retrieval-augmented generation (RAG) is the go-to strategy for integrating large language models with your organization's unique knowledge. However, the market is full of RAG pipelines and components, making it hard to choose the right solution for your enterprise's needs. This book simplifies the process, offering a comprehensive road map to building, refining, and scaling production-grade RAG applications.
Authors Ofer Mendelevitch and Forrest Bao guide you through every phase of development, from data ingestion, embeddings, and vector search to advanced techniques like agentic RAG, multimodal RAG, and GraphRAG. Engineers and architects will learn how to tackle the challenges they'll encounter when building RAG applications at enterprise scale: ensuring high accuracy with minimal hallucinations, maintaining low-latency performance, safeguarding data privacy, and providing transparent, explainable responses among them.
Determine whether to build RAG yourself or deploy a RAG-as-a-service platform
Build a basic RAG stack that maximizes performance and cost-effectiveness
Measure key metrics such as hallucinations, response quality, latency, and cost
Address challenges in enterprise deployment, such as compliance with data security and privacy requirements, explainability, and prompt design
Implement advanced techniques such as multimodal RAG, agentic RAG, and GraphRAG
| Erscheint lt. Verlag | 30.6.2026 |
|---|---|
| Verlagsort | Sebastopol |
| Sprache | englisch |
| Maße | 178 x 232 mm |
| Themenwelt | Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik |
| ISBN-13 | 979-8-3416-2171-8 / 9798341621718 |
| Zustand | Neuware |
| Informationen gemäß Produktsicherheitsverordnung (GPSR) | |
| Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
Eine praxisorientierte Einführung
Buch | Softcover (2025)
Springer Vieweg (Verlag)
CHF 53,15
Buch | Softcover (2025)
Reclam, Philipp (Verlag)
CHF 11,20
die materielle Wahrheit hinter den neuen Datenimperien
Buch | Hardcover (2024)
C.H.Beck (Verlag)
CHF 44,75