Scatter Plot
Plot individual data points on two axes to reveal correlations, clusters, and outliers.
What is a scatter plot?
A scatter plot (also called an XY chart) places individual data points on a two-dimensional grid where each axis represents a different variable. Each point shows the intersection of two measurements for a single observation. The resulting pattern reveals whether and how the two variables are related.
Scatter plots are the primary tool for exploring relationships between variables. A tight diagonal cluster suggests strong correlation. A random spread suggests none. Outliers stand out visually, and natural groupings often emerge without any statistical analysis.
When to use a scatter plot
- Exploring correlation between two continuous variables
- Identifying outliers in a dataset
- Spotting natural clusters or groupings
- Showing the distribution of data points across two dimensions
- Comparing groups by plotting each with a different color
Best practices
- Use both axes for continuous (numeric) variables, not categories
- Add axis labels that include units (e.g., "Hours studied" not just "X")
- Use semi-transparent dots when many points overlap
- Color-code groups to compare subpopulations
- Consider adding a trend line if the relationship is meaningful
- Keep dot sizes consistent unless encoding a third variable (bubble chart)
Example
Study hours vs. test scores, showing the positive correlation between time invested and results.
Make this in Claude
With ChartPane installed, just describe what you want:
Create a scatter plot of study hours vs test scores for these students: (2,55), (3,62), (4,68), (5,74), (6,78), (7,85), (8,88), (9,92)