Global Predictions by Inputs
(Predictive space by input variables)Predictive space by input variables
What it is: A grid of partial dependence plots that marginalize over other inputs to show the model’s average predicted output: 1D PDPs per input (diagonal) and 2D PDPs for input pairs (lower triangle).
How to use it:
- Diagonal (1D PDP):
- • **Continuous** inputs → line for mean PDP with a shaded ±1σ band hidden in this mean-only matrix (your 1D function draws the band; in mean mode it reflects local variation across the marginalization set).
- • **Categorical/Chemical/Discrete/Temporal** inputs → bar chart with error bars; bar height is the mean PDP at each level.
- Lower triangle (2D PDP):
- • **Continuous–Continuous** → contour/heatmap (Viridis) of mean predictions; ridges/peaks indicate favorable regions; a colorbar encodes the output level.
- • **Categorical–Categorical** → heatmap over level combinations; bright cells mark promising pairs.
- • **Mixed (Categorical×Continuous)** → heatmap with categories on one axis and the continuous grid on the other; compare rows/columns to spot category-dependent trends.
- Overlays (when provided): white circles = training data density; red stars = recommended experiments.
- Use consistent color scale across cells (global zmin/zmax) to make intensities comparable.
Why it matters:
- Reveals **global response structure**—monotonic trends, thresholds, plateaus, and interactions—to guide constraints and target regions before BO.
- Highlights **robust operating windows** (broad high plateaus) versus sharp spikes that may be fragile.
- Supports **feature prioritization**: steep gradients or strong contrasts indicate influential inputs.
Limitations:
- **Marginalization bias**: PDPs assume independence while averaging; with correlated inputs, surfaces can be misleading.
- **Extrapolation**: grid edges may extend beyond data support—interpret faint/empty regions cautiously.
- **Resolution & smoothing**: coarse grids can alias features; too-smooth contours can hide narrow optima.
- **Category imbalance** can skew bar/heatmap expectations; check sample counts per level.