O'Reilly High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Description

Apache Spark is amazing when everything clicks. But if you haven't seen the performance improvements you expected or still don't feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau, Rachel Warren, and Anya Bida walk you through the secrets of the Spark code base, and demonstrate performance optimizations that will help your data pipelines run faster, scale to larger datasets, and avoid costly antipatterns.

Ideal for data engineers, software engineers, data scientists, and system administrators, the second edition of High Performance Spark presents new use cases, code examples, and best practices for Spark 3.x and beyond. This book gives you a fresh perspective on this continually evolving framework and shows you how to work around bumps on your Spark and PySpark journey.

With this book, you'll learn how to:

Accelerate your ML workflows with integrations including PyTorch
Handle key skew and take advantage of Spark's new dynamic partitioning
Make your code reliable with scalable testing and validation techniques
Make Spark high performance
Deploy Spark on Kubernetes and similar environments
Take advantage of GPU acceleration with RAPIDS and resource profiles
Get your Spark jobs to run faster
Use Spark to productionize exploratory data science projects
Handle even larger datasets with Spark
Gain faster insights by reducing pipeline running times

Product Specifications

Format
Paperback
Domain
Amazon UK
Release Date
31 January 2026
Listed Since
30 June 2025

Similar Products You Might Like

Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark
93% match

O'Reilly

£43.85 09 Jan 2026
Spark – The Definitive Guide: Big data processing made simple
93% match

O'Reilly

£39.98 14 Jan 2026
Learning Spark 2e: Lightning-Fast Data Analytics
93% match

O'Reilly

£41.80 14 Jan 2026
Advanced Analytics with Spark, 2e: Patterns for Learning from Data at Scale
93% match

O'Reilly

£34.38 02 Mar 2026
Scaling Machine Learning with Spark: Distributed ML with MLlib, TensorFlow, and PyTorch
93% match

O'Reilly

£44.14 21 Apr 2026
Modern Data Engineering with Apache Spark: A Hands-On Guide for Building Mission-Critical Streaming Applications
93% match

Apress

£34.70 21 Feb 2026
Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis
93% match

Apress

£41.77 21 Feb 2026
Hands-on Guide to Apache Spark 3: Build Scalable Computing Engines for Batch and Stream Data Processing
93% match

Apress

£44.22 24 Feb 2026
O'Reilly High Performance Python - Practical Programming Book
93% match

O'Reilly

£38.17 18 Apr 2026
Spark in Action, Second Edition
92% match

Manning Publications

£45.39 12 Apr 2026
High Performance Python 2e: Practical Performant Programming for Humans
92% match

O'Reilly

£39.80 14 Jan 2026
Beginning Apache Spark 3: With DataFrame, Spark SQL, Structured Streaming, and Spark Machine Learning Library
92% match

Apress

£45.57 22 Feb 2026
Data Analysis with Python and PySpark
92% match

£33.73 07 Dec 2025
Database Performance at Scale: A Practical Guide
92% match

Apress

£38.05 20 Feb 2026
Large Scale Machine Learning with Python
92% match

Packt Publishing

£41.99 07 Mar 2026
Stream Processing with Apache Spark: Mastering Structured Streaming and Spark Streaming
91% match

O'Reilly

£40.67 15 Apr 2026
Scala for Machine Learning - Second Edition: Build systems for data processing, machine learning, and deep learning
91% match

Packt Publishing

£48.99 25 Feb 2026
Learn PySpark: Build Python-based Machine Learning and Deep Learning Models
91% match

Apress

£36.13 12 Mar 2026
An Architecture for Fast and General Data Processing on Large Clusters (ACM Books)
91% match

Morgan & Claypool

£62.00 07 Mar 2026
Scaling Python with Dask: From Data Science to Machine Learning
91% match

O'Reilly

£24.08 18 Mar 2026
Data Engineering with Advanced Python: Learn to Build Production Data applications using Modern Cloud Data tools (Data Engineering with Python cookbook series)
91% match

£43.20 30 Jan 2026
The Azure Data Lakehouse Toolkit: Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snowflake
91% match

Apress

£42.76 15 Apr 2026
In-Memory Analytics with Apache Arrow: Perform fast and efficient data analytics on both flat and hierarchical structured data
91% match

Packt Publishing

£46.12 09 Mar 2026
Data Pipelines with Apache Airflow, Second Edition
91% match

Manning

£42.23 19 Feb 2026