Biological data exploration with Python, pandas and seaborn: Clean, filter, reshape and visualize complex biological datasets using the scientific Python stack
Description
In biological research, we're currently in a golden age of data. It's never been easier to assemble large datasets to probe biological questions. But these large datasets come with their own problems. How to clean and validate data? How to combine datasets from multiple sources? And how to look for patterns in large, complex datasets and display your findings?
The solution to these problems comes in the form of Python's scientific software stack. The combination of a friendly, expressive language and high quality packages makes a fantastic set of tools for data exploration. But the packages themselves can be hard to get to grips with. It's difficult to know where to get started, or which sets of tools will be most useful.
Learning to use Python effectively for data exploration is a superpower that you can learn. With a basic knowledge of Python, pandas (for data manipulation) and seaborn (for data visualization) you'll be able to understand complex datasets quickly and mine them for biological insight. You'll be able to make beautiful, informative charts for posters, papers and presentations, and rapidly update them to reflect new data or test new hypotheses. You'll be able to quickly make sense of datasets from other projects and publications - millions of rows of data will no longer be a scary prospect!
In this book, Dr. Jones draws on years of teaching experience to give you the tools you need to answer your research questions. Starting with the basics, you'll learn how to use Python, pandas, seaborn and matplotlib effectively using biological examples throughout. Rather than overwhelm you with information, the book concentrates on the tools most useful for biological data. Full color illustrations show hundreds of examples covering dozens of different chart types, with complete code samples that you can tweak and use for your own work.
This book will help you get over the most common obstacles when getting started with data exploration in Python.
You'll learn about pandas' data model, how to deal with errors in input files, and how to fit large datasets in memory. The chapters on visualization will show you how to make sophisticated charts with minimal code, how to best use color to make clear charts, and how to deal with visualization problems involving large numbers of data points.
Chapters include:
- Getting data into pandas: series and dataframes, CSV and Excel files, missing data, renaming columns
- Working with series: descriptive statistics, string methods, indexing and broadcasting
- Filtering and selecting: boolean masks, selecting in a list, complex conditions, aggregation
- Plotting distributions: histograms, scatterplots, custom columns, using size and color
- Special scatter plots: using alpha, hexbin plots, regressions, pairwise plots
- Conditioning on categories: using color, size and marker, small multiples
- Categorical axes: strip/swarm plots, box and violin plots, bar plots and line charts
- Styling figures: aspect, labels, styles and contexts, plotting keywords
- Working with color: choosing palettes, redundancy, highlighting categories
- Working with groups: groupby, types of categories, filtering and transforming
- Binning data: creating categories, quantiles, reindexing
- Long and wide form: tidying input datasets, making summaries, pivoting data
- Matrix charts: summary tables, heatmaps, scales and normalization, clustering
- Complex data files: cleaning data, merging and concatenating, reducing memory
- FacetGrids: laying out multiple charts, custom charts, multiple heat maps
- Unexpected behaviours: bugs and missing groups, fixing odd scales
- High performance pandas: vectorization, timing and sampling
- Further reading: dates and times, alternative syntax
Product Specifications
- Format: Paperback
- ASIN: B089M41Y1F
- Domain: Amazon UK
- Release Date: 03 June 2020
- Listed Since: 05 June 2020
Similar Products You Might Like
95% match
Hands-On Data Analysis with Pandas: A Python data science handbook for data collection, wrangling, analysis, and visualization
Packt Publishing
£48.13
13 Jan 2026
95% match
Data Analysis and Visualization Using Python: Analyze Data to Create Visualizations for BI Systems
Apress
£48.20
20 Feb 2026
94% match
Hands on Data Science for Biologists Using Python
CRC Press
£71.95
13 Apr 2026
94% match
Hands on Data Science for Biologists Using Python
CRC Press
£175.47
13 Jan 2026
94% match
Pandas 1.x Cookbook: Practical recipes for scientific computing, time series analysis, and exploratory data analysis using Python
Packt Publishing
£52.70
09 Jan 2026
94% match
The Data Science Manual: A Comprehensive Guide to Tools and Techniques for Data Analysis, Modeling, and Deployment with Python
£95.43
21 Feb 2026
94% match
Learning Python for Data: Fundamental Python Skills for Starting with Data
£50.00
23 Jan 2026
94% match
Bioinformatics with Python Cookbook: Use modern Python libraries and applications to solve real-world computational biology problems, 3rd Edition
Packt Publishing
£41.99
12 Jan 2026
94% match
Managing Your Biological Data with Python (Chapman & Hall/CRC Computational Biology Series)
CRC Press
£158.50
09 Mar 2026
94% match
Computational Biology with Python: Modeling and Visualizing Macromolecular Complexes
£75.35
09 Jan 2026
94% match
Data Visualization in Python with Pandas and Matplotlib
£41.89
19 Feb 2026
94% match
Python for Bioinformatics (Jones and Bartlett Series in Biomedical Informatics)
Jones & Bartlett Learning
£72.93
27 Feb 2026
94% match
Advanced Data Science and Analytics with Python (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series)
CRC Press
£93.97
09 Mar 2026
94% match
Data Engineering Foundations: Core Techniques for Data Analysis with Pandas, NumPy, and Scikit-Learn (Advanced Data Analysis Series)
£43.09
18 Feb 2026
94% match
Applied Univariate, Bivariate, and Multivariate Statistics Using Python: A Beginner's Guide to Advanced Data Analysis
Wiley
£83.96
24 Feb 2026
94% match
Bioinformatics with Python Cookbook: Learn how to use modern Python bioinformatics libraries and applications to do cutting-edge research in computational biology, 2nd Edition
Packt Publishing
£49.44
08 Mar 2026
94% match
Extending Power BI with Python and R: Perform advanced analysis using the power of analytical languages
Packt Publishing
£41.99
07 Jan 2026
93% match
Data Science and Analytics with Python (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series)
CRC Press
£51.99
20 Feb 2026
93% match
Python Programming for Data Analysis
Springer
£41.26
16 Feb 2026
93% match
Computational Molecular Bioscience: Concepts
Delve Publishing
£90.37
09 Mar 2026
93% match
Research Software Engineering with Python: Building software that makes research possible
£121.00
12 Dec 2025
93% match
Effective Polars: Optimized Data Manipulation for Polars 1.0
£49.00
22 Feb 2026
93% match
Python for Bioinformatics (Chapman & Hall/CRC Computational Biology Series)
CRC Press
£132.17
10 Mar 2026
93% match
Introduction to Data Science for Social and Policy Research: Collecting and Organizing Data with R and Python
Cambridge University Press
£87.98
09 Mar 2026