O'Reilly Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark

Description

Apache Spark's speed, ease of use, sophisticated analytics, and multilanguage support make practical knowledge of this cluster-computing framework a required skill for data engineers and data scientists. With this hands-on guide, anyone looking for an introduction to Spark will learn practical algorithms and examples using PySpark. In each chapter, author Mahmoud Parsian shows you how to solve a data problem with a set of Spark transformations and algorithms. You'll learn how to tackle problems involving ETL, design patterns, machine learning algorithms, data partitioning, and genomics analysis. Each detailed recipe includes PySpark algorithms using the PySpark driver and shell script.

With this book, you will:

Learn how to select Spark transformations for optimized solutions
Explore powerful transformations and reductions, including reduceByKey(), combineByKey(), and mapPartitions()
Understand data partitioning for optimized queries
Build and apply a model using PySpark design patterns
Apply motif-finding algorithms to graph data
Analyze graph data by using the GraphFrames API
Apply PySpark algorithms to clinical and genomics data
Learn how to use and apply feature engineering in ML algorithms
Understand and use practical and pragmatic data design patterns

Product Specifications

Format: Paperback
Domain: Amazon UK
Release Date: 30 April 2022
Listed Since: 28 May 2021

Similar Products You Might Like

Data Analysis with Python and PySpark
94% match

£33.73 07 Dec 2025
Advanced Analytics with Spark, 2e: Patterns for Learning from Data at Scale
94% match

O'Reilly

£34.38 02 Mar 2026
Learning Spark 2e: Lightning-Fast Data Analytics
94% match

O'Reilly

£41.80 14 Jan 2026
Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis
94% match

Apress

£41.77 21 Feb 2026
High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark
93% match

O'Reilly

£41.15 14 Jan 2026
Learn PySpark: Build Python-based Machine Learning and Deep Learning Models
93% match

Apress

£36.13 12 Mar 2026
Spark – The Definitive Guide: Big data processing made simple
93% match

O'Reilly

£39.98 14 Jan 2026
Scaling Machine Learning with Spark: Distributed ML with MLlib, TensorFlow, and PyTorch
93% match

O'Reilly

£44.14 21 Apr 2026
Spark in Action, Second Edition
93% match

Manning Publications

£45.39 12 Apr 2026
Modern Data Engineering with Apache Spark: A Hands-On Guide for Building Mission-Critical Streaming Applications
93% match

Apress

£34.70 21 Feb 2026
Hands-on Guide to Apache Spark 3: Build Scalable Computing Engines for Batch and Stream Data Processing
93% match

Apress

£44.22 24 Feb 2026
Beginning Apache Spark 3: With DataFrame, Spark SQL, Structured Streaming, and Spark Machine Learning Library
93% match

Apress

£45.57 22 Feb 2026
Large Scale Machine Learning with Python
93% match

Packt Publishing

£41.99 07 Mar 2026
Stream Processing with Apache Spark: Mastering Structured Streaming and Spark Streaming
92% match

O'Reilly

£40.67 15 Apr 2026
Graph Algorithms: Practical Examples in Apache Spark and Neo4j
92% match

O'Reilly

£41.80 06 Jan 2026
Scala for Machine Learning - Second Edition: Build systems for data processing, machine learning, and deep learning
92% match

Packt Publishing

£48.99 25 Feb 2026
Algorithms for Data Science
92% match

Springer

£62.75 23 Feb 2026
Big Data Science & Analytics: A Hands-On Approach
92% match

£45.11 17 Feb 2026
An Architecture for Fast and General Data Processing on Large Clusters (ACM Books)
92% match

Morgan & Claypool

£62.00 07 Mar 2026
The Azure Data Lakehouse Toolkit: Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snowflake
91% match

Apress

£42.76 15 Apr 2026
Machine Learning with Python Cookbook: Practical Solutions from Preprocessing to Deep Learning
91% match

O'Reilly

£44.67 06 Jan 2026
Big Data Processing Using Spark in Cloud (Studies in Big Data, 43)
91% match

Springer

£72.12 08 Mar 2026
Big Data Architect's Handbook: A guide to building proficiency in tools and systems used by leading big data experts
91% match

Packt Publishing

£45.99 19 Feb 2026
Next-Generation Machine Learning with Spark: Covers XGBoost, LightGBM, Spark NLP, Distributed Deep Learning with Keras, and More
91% match

Apress

£38.50 07 Mar 2026