California Wildfire Prediction

TAM AIR Club × CAL FIRE × UCI × UCSF

Our research initiative aims to build a pixel-wise wildfire risk prediction system for California at 800m × 800m resolution using machine learning. By analyzing over 147 years of fire perimeter data, we're developing tools to help predict and prevent catastrophic wildfires.

This collaboration brings together high school researchers, university academics, and California's fire management professionals to tackle one of the state's most pressing environmental challenges.

Dataset Overview

  • 22,000+ fire perimeters analyzed
  • 147 years of historical data (1878-2025)
  • 97%+ data completeness for key fields (post-1993)
  • GPS-based high-quality perimeter data

Project Roadmap

Our multi-phase approach to building a wildfire prediction model

Phase 1: CAL FIRE Historical Perimeters

COMPLETED

Comprehensive analysis of 22,000+ fire perimeters. Created 19 publication-quality visualizations. Identified key patterns and data quality thresholds.

Phase 2: Fire Ignition Points

IN PROGRESS

Analyze fire ignition point dataset to understand fire origin patterns, causes, and spatial distribution across California.

Phase 3: Vegetation Data

PLANNED

Analyze vegetation and fuel load data to understand how different land cover types influence fire behavior and spread.

Phase 4: Topography & Lakes

PLANNED

Incorporate elevation, slope, aspect, and water body data to understand terrain influences on fire risk and natural fire breaks.

Phase 5: PRISM Weather Data

PLANNED

Integrate PRISM climate data including temperature, precipitation, and vapor pressure deficit to model weather-driven fire risk.

Phase 6: Census, Roads & Power Lines

PLANNED

Analyze human infrastructure data including population density, road networks, and electrical lines as ignition risk factors.

Phase 7: Hypothesis & Methodology

PLANNED

Synthesize findings from all datasets to form research hypotheses and develop methodology for statistical analyses.

Phase 8: Statistical Analyses

PLANNED

Conduct multiple statistical analyses to identify significant predictors and build predictive models for wildfire risk.

Phase 9: Web Application for CAL FIRE

FUTURE

Develop backend and frontend web application to deliver research results and predictive tools to CAL FIRE personnel.

Key Findings

Critical insights from our exploratory data analysis

4.2 Million

Acres burned in 2020 alone - California's worst fire year on record

84%

Of burned acreage occurs during June-September fire season

Top 1%

Of fires cause 58% of all damage - extreme concentration of risk

38 Mega-fires

Fires exceeding 100,000 acres recorded since 1993

Research Visualizations

Key findings from our comprehensive exploratory data analysis

Executive Dashboard - 10-panel overview of California wildfire data

Executive Dashboard

Comprehensive 10-panel overview showing fire counts, burned acreage trends, seasonal patterns, size distributions, causes, and responding agencies over the past 30+ years.

Fire Clock - Seasonal fire patterns visualization

Fire Clock

Polar visualization showing seasonal fire patterns. The outer ring displays acres burned by month, while the inner ring shows fire counts - clearly demonstrating the June-September peak.

Cumulative Fire Risk Map of California

Cumulative Fire Risk Map

Burn frequency heatmap at 1000m resolution showing high-risk zones across California. This visualization helps identify areas that have experienced repeated fire activity.

Trend Analysis showing wildfire acceleration

Trend Analysis

Statistical visualization from 1950-present with fitted regression line providing clear evidence of wildfire acceleration, particularly pronounced after 2000.

Interactive Mega-Fires Map

Explore all 38 mega-fires (>100,000 acres) recorded since 1993

Interactive Folium Map

Click below to explore the full interactive map with all mega-fire perimeters, details, and filtering options.

Open Interactive Map

Note: Large file (~35MB). Best viewed on desktop.

Research Documents

Download our comprehensive analysis report and explore the methodology

Full Analysis Report

Complete exploratory data analysis with 19 high-resolution figures, statistical findings, and methodology documentation.

Download PDF (3.6 MB)

Jupyter Notebooks

View our analysis notebooks with full code, methodology explanations, and reproducible results using Python and GeoPandas.

View on GitHub

Research Partners

This research is made possible through collaboration

TAM AIR Club

Student-led research team from Tamalpais High School

David Passovoy

CAL FIRE - California Department of Forestry and Fire Protection

Shu Li

UC Irvine - Environmental Science

Evan Porter

Research Collaborator - UCSF

Dante Capaldi

Research Collaborator - UCSF

Hui Lin

Research Collaborator - UCSF