Visualization Reference
Univariate
Core tools for distribution shape, spread, and outliers.
- Histogram β bin-based distribution profile.
- Kernel density estimate β smoothed distribution.
- Rug plot β point-level distribution cue.
- Empirical CDF / CCDF β cumulative distribution structure.
- Box plot β robust summary + outlier detection.
- Violin plot β boxβdensity hybrid.
- Stem-and-leaf display β quick numeric snapshot.
- Bean plot β density + individual observations.
- Quantile plots (QβQ, PβP, worm plot) β distributional deviation diagnostics.
- Ridgeline plot β stacked density comparison across groups.
Bivariate
Characterize relationships, patterns, and dependence.
- Scatter plot β baseline relational structure.
- Hexbin plot β high-N scatter aggregation.
- 2D KDE / contour plot β continuous joint density.
- Joint histogram β binned joint distribution.
- Two-sample QβQ plot β direct distribution comparison.
- Correlogram β correlation mapping.
- Scatter with marginal densities β joint + marginal structure.
- Alpha-blended scatter β density-enhanced point cloud.
Multivariate
Expose patterns in higher-dimensional spaces.
- Scatterplot matrix (SPLOM) β all-pairs relationships.
- Parallel coordinates β multi-feature path structure.
- Andrews curves β function-based multivariate encoding.
- Star coordinates β radial multi-axis encoding.
- Glyph encodings (Chernoff faces, radial glyphs) β shape-encoded multivariate values.
- Biplot β PCA scores and loadings.
- t-SNE / UMAP embeddings β nonlinear manifold projections.
- Radar chart β profile comparison across features.
- Heatmap β matrix visualization for features Γ samples or correlations.
- Mosaic plot β multiway categorical dependence.
Categorical
Compare counts, proportions, and structure.
- Bar chart β category comparisons.
- Stacked bar chart β partβwhole segmentation.
- Grouped bar chart β multi-factor comparison.
- Dot plot (Cleveland) β high-resolution categorical comparison.
- Lollipop chart β stemβvalue alternative to bars.
- Pie chart β coarse proportion cue.
- Marimekko (Mekko) chart β joint categorical proportions.
- Contingency table β joint distribution summary.
- Alluvial plot β category transitions.
- Sankey diagram β categorical flow structure.
Time Series
Reveal temporal evolution, stability, and autocorrelation.
- Line plot β baseline continuous-time trajectory.
- Run chart β level shifts over time.
- Control chart β stability and process limits.
- Seasonal subseries plot β seasonal pattern decomposition.
- Lag plot β serial dependence diagnostics.
- ACF/PACF β autocorrelation structure.
- Spectral density plot β frequency-domain behavior.
- Horizon chart β compressed multi-band amplitude view.
- Fan chart β forecast paths with uncertainty bands.
- Calendar heatmap β long-horizon temporal distribution.
- Rolling-window metrics β local variance/mean/volatility.
Model and Residual Diagnostics
Assess fit quality, calibration, and assumptions.
- Residuals vs fitted β bias or heteroscedasticity.
- Residual QβQ β normality deviations.
- Scaleβlocation plot β variance structure.
- Influence/leverage plots β Cookβs distance and hat values.
- Partial dependence plots β marginal effect visualization.
- ICE curves β individual conditional effects.
- Calibration plot β probabilistic reliability.
- ROC / PR curves β classification discrimination.
- Lift chart β ranked predictive performance.
- Variable-importance heatmap β feature contribution profiling.
- Forest plot β effect sizes with intervals.
Network and Non-Hierarchical Structures
For relational data lacking parentβchild nesting.
- Force-directed graph β topology-driven layouts.
- Adjacency matrix heatmap β matrix-encoded network structure.
- Edge-bundled network β reduced visual clutter via bundling.
- Chord diagram β directional/bilateral flows.
- Sankey diagram β flow-oriented network representation.
Hierarchical Structures
For nested, tree-based, or clustered data.
- Tree diagram β explicit hierarchy.
- Cluster dendrogram β agglomerative/divisive clustering.
- Treemap β space-filling hierarchy.
- Sunburst β radial hierarchical partition.
- Icicle plot β stacked hierarchical partition.
- Partition diagram β recursive splitting structure.
- Packed circles β containment hierarchy.
Text Visualization
Frequency- and structure-oriented tools for tokenized text.
- Word cloud β size-encoded term frequency; coarse quantitative precision.
- Termβdocument heatmap β frequency matrix structure.
- Co-occurrence network β relational term structure.
- Topic-embedding scatter (t-SNE/UMAP) β semantic-space projection of documents/topics.
- Concordance / KWIC strips β positional term context.
- N-gram bar charts β frequency of multi-token sequences.
Spatial
Geospatial distribution, density, and structure.
- Choropleth β region-level value encoding.
- Proportional symbol map β magnitude-encoded markers.
- Dot-density map β discrete-event spatial distribution.
- Hex-map β normalized spatial binning.
- Kernel density surface / heatmap β smoothed spatial intensity.
- Cartogram β value-distorted geography.
- Flow map β directional movement.
- Voronoi tessellation β spatial partition via nearest regions.
- Contour/elevation map β gradient-based spatial fields.
A box plot encodes the following elements:
- Range β the span from the minimum to the maximum observed values.
- Interquartile range (IQR) β the central 50% of the data, defined as IQR = Q3 - Q1.
- Q1: 25th percentile, median of the lower half.
- Q3: 75th percentile, median of the upper half.
Outliers follow the standard 1.5 * IQR rule: any observation below Q1 - 1.5 IQR or above Q3 + 1.5, IQR is classified as an outlier.
Graphical Integrity
Source: https://www.amazon.com/Visual-Display-Quantitative-Information/dp/1930824130
there are six principles to ensure Graphical Integrity:
- Make the representation of numbers proportional to quantities
- Use clear, detailed, and thorough labeling
- Show data variation, not design variation
- Use standardized units, not nominal values
- Depict βnβ data dimensions with less than or equal to βnβ variable dimensions
- Quote data in full context
Source: https://realpython.com/python-data-visualization-bokeh
Histograms and Density Plots
Histograms work very well for display a single variable from one category.
For displaying multiple categories use
- Side-by-Side Histograms
- Stacked Histograms
- Density Plots
- Rug Plot