We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Price loading...
O'Reilly High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark
Price data last checked 101 day(s) ago - refreshing...
Price History & Forecast
No Price Data Available
Price history will appear here once data is collected from Amazon.
Price Distribution
No price data available for histogram
Description
Apache Spark is amazing when everything clicks. But if you haven't seen the performance improvements you expected or still don't feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau, Rachel Warren, and Anya Bida walk you through the secrets of the Spark code base, and demonstrate performance optimizations that will help your data pipelines run faster, scale to larger datasets, and avoid costly antipatterns. Ideal for data engineers, software engineers, data scientists, and system administrators, the second edition of High Performance Spark presents new use cases, code examples, and best practices for Spark 3.x and beyond. This book gives you a fresh perspective on this continually evolving framework and shows you how to work around bumps on your Spark and PySpark journey. With this book, you'll learn how to: Accelerate your ML workflows with integrations including PyTorch Handle key skew and take advantage of Spark's new dynamic partitioning Make your code reliable with scalable testing and validation techniques Make Spark high performance Deploy Spark on Kubernetes and similar environments Take advantage of GPU acceleration with RAPIDS and resource profiles Get your Spark jobs to run faster Use Spark to productionize exploratory data science projects Handle even larger datasets with Spark Gain faster insights by reducing pipeline running times
Product Specifications
- Brand
- O'Reilly
- Format
- Paperback
- ASIN
- 1098145852
- Category
- Books > Subjects > Computing & Internet > Databases > Data Storage & Management > Data Mining
- Domain
- Amazon UK
- Release Date
- 31 January 2026
- Listed Since
- 30 June 2025
Barcode
No barcode data available
Similar Products You Might Like
93% match
Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark
O'Reilly
£43.85
09 Jan 2026
93% match
Spark – The Definitive Guide: Big data processing made simple
O'Reilly
£39.98
14 Jan 2026
93% match
Learning Spark 2e: Lightning-Fast Data Analytics
O'Reilly
£41.80
14 Jan 2026
93% match
Advanced Analytics with Spark, 2e: Patterns for Learning from Data at Scale
O'Reilly
£34.38
02 Mar 2026
93% match
Scaling Machine Learning with Spark: Distributed ML with MLlib, TensorFlow, and PyTorch
O'Reilly
£44.14
21 Apr 2026
93% match
Modern Data Engineering with Apache Spark: A Hands-On Guide for Building Mission-Critical Streaming Applications
Apress
£34.70
21 Feb 2026
93% match
Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis
Apress
£41.77
21 Feb 2026
93% match
Hands-on Guide to Apache Spark 3: Build Scalable Computing Engines for Batch and Stream Data Processing
Apress
£44.22
24 Feb 2026
93% match
O'Reilly High Performance Python - Practical Programming Book
O'Reilly
£38.17
18 Apr 2026
92% match
Spark in Action, Second Edition
Manning Publications
£45.39
12 Apr 2026
92% match
High Performance Python 2e: Practical Performant Programming for Humans
O'Reilly
£39.80
14 Jan 2026
92% match
Beginning Apache Spark 3: With DataFrame, Spark SQL, Structured Streaming, and Spark Machine Learning Library
Apress
£45.57
22 Feb 2026
92% match
Data Analysis with Python and PySpark
£33.73
07 Dec 2025
92% match
Database Performance at Scale: A Practical Guide
Apress
£38.05
20 Feb 2026
92% match
Large Scale Machine Learning with Python
Packt Publishing
£41.99
07 Mar 2026
91% match
Stream Processing with Apache Spark: Mastering Structured Streaming and Spark Streaming
O'Reilly
£40.67
15 Apr 2026
91% match
Scala for Machine Learning - Second Edition: Build systems for data processing, machine learning, and deep learning
Packt Publishing
£48.99
25 Feb 2026
91% match
Learn PySpark: Build Python-based Machine Learning and Deep Learning Models
Apress
£36.13
12 Mar 2026
91% match
An Architecture for Fast and General Data Processing on Large Clusters (ACM Books)
Morgan & Claypool
£62.00
07 Mar 2026
91% match
Scaling Python with Dask: From Data Science to Machine Learning
O'Reilly
£24.08
18 Mar 2026
91% match
Data Engineering with Advanced Python: Learn to Build Production Data applications using Modern Cloud Data tools (Data Engineering with Python cookbook series)
£43.20
30 Jan 2026
91% match
The Azure Data Lakehouse Toolkit: Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snowflake
Apress
£42.76
15 Apr 2026
91% match
In-Memory Analytics with Apache Arrow: Perform fast and efficient data analytics on both flat and hierarchical structured data
Packt Publishing
£46.12
09 Mar 2026
91% match
Data Pipelines with Apache Airflow, Second Edition
Manning
£42.23
19 Feb 2026