WildfireDB: An Open-Source Dataset ConnectingWildfire Spread with Relevant Determinants

Samriddhi Singla, Ayan Mukhopadhyay, Michael Wilbur, Tina Diao, Vinayak Gajjewar, Ahmed Eldawy, Mykel Kochenderfer, Ross Shachter, Abhishek Dubey

35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks 2021

Why This Matters

Modeling wildfire spread is critical for emergency management and fire risk assessment, but comprehensive datasets relating fires to relevant environmental covariates have been lacking. Traditional fire spread models rely on physics-based approaches that require detailed parameter specification, while data-driven approaches are limited by insufficient data. This work is innovative because it provides the research community with a large-scale, multi-source dataset that enables machine learning approaches to improve fire spread forecasting and supports more sophisticated understanding of fire dynamics.

What We Did

WildFireDB is a comprehensive open-source dataset connecting wildfire spread with relevant environmental determinants including weather, vegetation, and topography. The dataset comprises over 17.8 million data points covering wildfire occurrences in the continental United States from 2012-2017, integrating fire detection data from satellite imagery with spatial vegetation and topographic information. The work presents algorithmic approaches for merging large-scale raster and vector data to create a spatially and temporally coherent dataset for modeling fire spread.

Key Results

The dataset successfully integrates fire occurrence data from satellite sensors with vegetation, topographic, and weather information across a standardized spatial grid. The publicly available dataset enables development of data-driven models for fire spread forecasting. The work demonstrates the feasibility of creating large-scale integrated datasets that combine multiple data types at different spatial resolutions, providing a foundation for advancing fire risk management through improved predictive models.

@inproceedings{wildfiredb2021, author = {Singla, Samriddhi and Mukhopadhyay, Ayan and Wilbur, Michael and Diao, Tina and Gajjewar, Vinayak and Eldawy, Ahmed and Kochenderfer, Mykel and Shachter, Ross and Dubey, Abhishek}, booktitle = {35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks}, title = {WildfireDB: An Open-Source Dataset ConnectingWildfire Spread with Relevant Determinants}, year = {2021}, acceptance = {26}, abstract = {Modeling fire spread is critical in fire risk management. Creating data-driven models to forecast spread remains challenging due to the lack of comprehensive data sources that relate fires with relevant covariates. We present the first comprehensive and open-source dataset that relates historical fire data with relevant covariates such as weather, vegetation, and topography. Our dataset, named \textit{WildfireDB}, contains over 17 million data points that capture how fires spread in the continental USA in the last decade. In this paper, we describe the algorithmic approach used to create and integrate the data, describe the dataset, and present benchmark results regarding data-driven models that can be learned to forecast the spread of wildfires.}, contribution = {minor}, tag = {ai4cps,incident}, keywords = {wildfire, fire spread modeling, dataset, geospatial analysis, machine learning, risk management, environmental data integration} }

WildfireDB: An Open-Source Dataset ConnectingWildfire Spread with Relevant Determinants

Why This Matters

What We Did

Key Results

Full Abstract

Cite This Paper

Quick Info

Keywords

Research Areas

Search Tags