A Satellite-based Synthetic Control Analysis of Wildfire Mitigation in California
Does low-intensity prescribed fire reduce subsequent high-intensity wildfire risk? And can satellite-derived embeddings improve causal inference by selecting ecologically similar donors for synthetic control analysis?
Average treatment effect on the treated (ATT) for VMP-treated pixels (n=3,966) relative to synthetic controls constructed from untreated conifer pixels.
Pre-period: 2003–2011 (9 years) | Post-period: 2012–2018 (6 years)
Maximum Fire Radiative Power (FRP) in MW from MODIS FIRMS, capturing fire intensity at 1 km² resolution. Higher FRP indicates more intense fires.
Random statewide (500 pixels) vs. Manual covariate screens (284 pixels) vs. Embedding-based K-nearest neighbors (50 pixels in latent space).
Block bootstrap over treated units (500 resamples) for standard errors and 95% confidence intervals. Parallel trends assumption validated by near-flat pre-treatment gaps.
Explore how different donor selection strategies affect pre-treatment balance and causal inference quality. Click buttons to switch between strategies and see how donor pools differ in embedding space.
Randomly sample 500 untreated conifer pixels statewide. No explicit constraint on ecological similarity; relies on large sample to approximate covariate distributions.
RMSPE: Root Mean Squared Prediction Error Measures how well synthetic control tracks treated pixels pre-treatment. Lower = better pre-fit.
Red: treated pixel. Gray/Orange/Green: donors from Random/Manual/Embedding strategies. Embedding-based selection yields tighter clusters in latent similarity space, improving comparability.
Treated pixels (n=3,966) vs. Synthetic Controls: Pre-treatment alignment (2003–2011) and post-treatment divergence (2012–2018) by donor strategy. All strategies share identical pre-window and post-window definitions.
Interpretation: All three strategies track pre-treatment trajectories closely, supporting parallel-trends assumption. Post-2012, Embedding-based (green) exhibits the largest and most sustained divergence from synthetic control, followed by Manual (orange), then Random (gray). This ordering aligns with pre-treatment balance quality.
Average Treatment Effect on the Treated (ATT): Post-treatment reduction in FRP (MW) aggregated over 2012–2018. Negative values indicate wildfire mitigation.
| Strategy | Pre-RMSPE | ATT (MW) | 95% CI | % ATT | 95% CI % |
|---|---|---|---|---|---|
| Random (500) | 0.45 | −0.53 | [−0.88, −0.18] | −3.2% | [−3.9%, −2.5%] |
| Manual (284) | 0.36 | −0.67 | [−0.98, −0.36] | −3.8% | [−4.6%, −3.0%] |
| Embeddings (K=50) | 0.23 | −0.81 | [−1.08, −0.54] | −4.4% | [−5.1%, −3.7%] |
Key Finding: Embedding-based donors deliver the largest negative ATT (−0.81 MW) with the smallest uncertainty (SE=0.14). This translates to a −4.4% reduction in post-treatment FRP relative to baseline. The ordering is consistent: Embeddings > Manual > Random, mirroring pre-treatment balance quality.
Gap Plot: Median treated − synthetic FRP over time. Zero or near-zero pre-gaps indicate good parallel-trends validity. Sustained negative post-gaps indicate treatment effect credibility.
Interpretation: All strategies show near-flat pre-trends (flat gray band 2003–2011), supporting the assumption that treated and control outcomes would have followed parallel paths absent treatment. Post-2012, Embeddings (green) exhibits the largest and most stable negative gap, approximately twice the magnitude of Manual and Random strategies. The absence of systematic pre-existing divergence strengthens causal attribution.
Sensitivity Analysis: ATT estimates under alternative specifications (pre-period length, donor count K, outlier exclusion, spatial buffers). Robustness to design choices supports credibility of main results.
| Specification | Pre-RMSPE | ATT (MW) | 95% CI | % ATT |
|---|---|---|---|---|
| Embeddings: Baseline (K=50) | 0.23 | −0.81 | [−1.08, −0.54] | −4.4% |
| Embeddings: Shorter pre (5y) | 0.29 | −0.68 | [−1.03, −0.33] | −3.7% |
| Embeddings: K=25 | 0.26 | −0.65 | [−1.05, −0.25] | −3.5% |
| Embeddings: K=100 | 0.22 | −0.85 | [−1.11, −0.59] | −4.6% |
| Embeddings: Exclude FRP > 500 MW | 0.24 | −0.54 | [−0.97, −0.11] | −2.9% |
| Embeddings: Buffer 5 km | 0.23 | −0.87 | [−1.16, −0.58] | −4.7% |
Robustness Finding: ATT estimates remain stable across design variations. Embedding-based ATT ranges from −0.54 to −0.87 MW across all specifications, with tight confidence intervals. Even when relaxing donor constraints (K=100) or using only 5-year pre-periods, estimates remain consistent. This demonstrates the method's reliability and reduces concerns about design-specific artifacts.
Low-intensity prescribed fire reduces subsequent high-intensity wildfire risk by 4.4% (95% CI: [−5.1%, −3.7%]) over a six-year horizon (2012–2018), with effects concentrated in embedding-matched control pixels that exhibit superior pre-treatment balance.
The core innovation is using satellite-derived embeddings from foundation models to constrain donor pools in synthetic control analysis, dramatically improving pre-treatment balance:
500 untreated pixels, no ecological constraint
284 pixels filtered by slope, elevation, forest type
49% improvement via K-nearest in latent space
Key Insight: Satellite embeddings capture ecological similarity better than explicit covariates alone. The PrithVi V2 foundation model learns spectral patterns that proxy unmeasured ecological heterogeneity (soil quality, microclimate, vegetation structure). K-nearest neighbors in embedding space yields pre-treatment RMSPE of 0.23—a 49% reduction—while maintaining interpretability.
First satellite-based causal evidence that low-intensity prescribed fire is an effective wildfire mitigation tool, with effects robust to design variations.
Demonstrates how foundation models improve causal inference by learning ecological similarity from satellite imagery, with 49% RMSPE improvement.
Provides quantitative evidence to justify continued investment in prescribed fire programs as part of wildfire risk management strategy.
Shows practical value of satellite embeddings for high-dimensional policy evaluation in environmental governance without traditional covariate data.