Large Scale Machine Learning with Spark - Md. Rezaul Karim, Md. Mahedi Kaysar

Large Scale Machine Learning with Spark (eBook)

Md. Rezaul Karim, Md. Mahedi Kaysar (Authors)

eBook Download: EPUB

2016
476 pages
Packt Publishing (Publisher)
978-1-78588-371-2 (ISBN)

Discover everything you need to build robust machine learning applications with Spark 2.0

About This Book

Get the most up-to-date book on the market that focuses on design, engineering, and scalable solutions in machine learning with Spark 2.0.0
Use Spark's machine learning library in a big data environment
Learn how to develop high-value applications at scale with ease and develop a personalized design

Who This Book Is For

This book is for data science engineers and scientists who work with large and complex data sets. You should be familiar with the basics of machine learning concepts, statistics, and computational mathematics. Knowledge of Scala and Java is advisable.

What You Will Learn

Gain a solid theoretical understanding of ML algorithms
Configure Spark on cluster and cloud infrastructure to develop applications using Scala, Java, Python, and R
Scale up ML applications on large cluster or cloud infrastructures
Use Spark ML and MLlib to develop ML pipelines for recommendation systems, classification, regression, clustering, sentiment analysis, and dimensionality reduction
Handle large texts for developing ML applications with a strong focus on feature engineering
Use Spark Streaming to develop ML applications for real-time streaming
Tune ML models with cross-validation, hyperparameter tuning, and train-validation split
Enhance ML models to make them adaptable for new data in dynamic and incremental environments

In Detail

Data processing, implementing related algorithms, tuning, scaling up and finally deploying are some crucial steps in the process of optimising any application.

Spark can handle large-scale batch and streaming data, working out when to cache data in memory and processing it up to 100 times faster than Hadoop-based MapReduce. This means predictive analytics can be applied to both streaming and batch data to develop complete machine learning (ML) applications much more quickly, making Spark an ideal candidate for large, data-intensive applications.

This book focuses on design, engineering, and scalable solutions using ML with Spark. First, you will learn how to install Spark with all the new features from the latest Spark 2.0 release. Moving on, you'll explore important concepts such as advanced feature engineering with RDDs and Datasets. After studying how to develop and deploy applications, you will see how to use external libraries with Spark.

In summary, you will be able to develop complete and personalised ML applications, from data collection, model building, tuning, and scaling up to deployment on a cluster or the cloud.

Style and approach

This book takes a practical approach where all the topics explained are demonstrated with the help of real-world use cases.

Publication date (per publisher)	27.10.2016
Language	English
Subject area	Mathematics / Computer Science ► Computer Science
ISBN-10	1-78588-371-2 / 1785883712
ISBN-13	978-1-78588-371-2 / 9781785883712

EPUB (Adobe DRM)

Copy protection: Adobe DRM
Adobe DRM is a copy-protection scheme intended to protect the eBook against misuse. The eBook is authorized to your personal Adobe ID at download time. You can then read the eBook only on devices that are also registered to your Adobe ID.

File format: EPUB (Electronic Publication)
EPUB is an open standard for eBooks and is particularly well suited to fiction and non-fiction. The body text adapts dynamically to the display and font size, so EPUB also works well on mobile reading devices.

System requirements:
PC/Mac: You can read this eBook on a PC or Mac. You need an Adobe ID and the Adobe Digital Editions software (free). We advise against using the OverDrive Media Console; in our experience, it frequently causes problems with Adobe DRM.
eReader: This eBook can be read on (almost) all eBook readers. It is not, however, compatible with the Amazon Kindle.
Smartphone/Tablet: Whether Apple or Android, you can read this eBook. You need an Adobe ID and a free app.

Buying eBooks from abroad
For tax reasons, we can sell eBooks only within Germany and Switzerland. Regrettably, we cannot fulfill eBook orders from other countries.