Monday, December 15, 2025

Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI

In the past we have written about how one can use social media to monitor dust storms along with how multi-modal large language models (MLLMs) can be used to analyze images. At the recent American Geophysical Union (AGU) Fall Meeting we (Sage Keidel, Stuart Evans and myself) brought these two strands of research together in a poster entitled "Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI."

In this work we showcase how MLLMs are providing new opportunities and accessible methods for information extraction from imagery data, using geo-located Flickr images that have a dust keyword tag associated with them in multiple languages (e.g., Arabic, English, Spanish). We run these images through ChatGPT, which classifies them as dust storms or not, and compare this classification with human-classified images. If this sounds of interest, below you can read the abstract and see the poster, along with a selection of images that have been labeled as a dust storm or not and ChatGPT's confidence in its classification. The dust storm database itself can be found here.

Abstract:

Complete observations of dust events are difficult, as dust’s spatial and temporal variability means satellites may miss dust due to overpass time or cloud coverage, while ground stations may miss dust due to not being in the plume. As a result, an unknown number of dust events go unrecorded in traditional datasets. Dust’s importance both for atmospheric processes and as a health and travel hazard makes detecting dust events whenever possible important, and in particular, studies of the health impacts of dust are limited by a lack of detailed exposure information.

In recent years, social media platforms have emerged as a valuable source of unconventional data to study events such as earthquakes and flooding around the world. However, one challenge with respect to using such data is classifying and labeling it (i.e., is it a dust storm or not?). While it is relatively simple to classify textual data through natural language processing, it is not the case with imagery data. Traditionally, classifying imagery data was a complex computer vision task. However, recent advancements in generative artificial intelligence (AI), especially multi-modal large language models (MLLMs), are opening up new opportunities and offering accessible methods for information extraction from imagery data. Therefore, in this study we collected geotagged Flickr images referencing dust from around the globe in multiple languages (e.g., English, Spanish, Arabic) and used generative AI (i.e., ChatGPT) to classify the images as dust storms or not. Furthermore, we compare a sample of these classified images from ChatGPT with human classified images to assess its accuracy in classification. Our results suggest that ChatGPT can relatively accurately detect dust storms from Flickr images and thus helps us create an unconventional global database of dust storm events that might otherwise go unobserved from more traditional datasets.
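
To make the comparison between ChatGPT and human labels more concrete, here is a minimal sketch of how agreement between two sets of binary labels could be scored. It is not the procedure used for the poster: the function names, the choice of metrics (accuracy, precision, recall and Cohen's kappa) and the example labels are all assumptions made for illustration.

```python
from collections import Counter


def confusion_counts(human, model):
    """Count TP/FP/FN/TN, treating True as 'dust storm' and human labels as reference."""
    if len(human) != len(model):
        raise ValueError("label lists must be the same length")
    if not human:
        raise ValueError("label lists must not be empty")
    counts = Counter()
    for h, m in zip(human, model):
        if h and m:
            counts["tp"] += 1
        elif m:
            counts["fp"] += 1  # model says dust storm, human says not
        elif h:
            counts["fn"] += 1  # human says dust storm, model says not
        else:
            counts["tn"] += 1
    return counts


def agreement_metrics(human, model):
    """Return accuracy, precision, recall and Cohen's kappa for two label lists."""
    c = confusion_counts(human, model)
    tp, fp, fn, tn = c["tp"], c["fp"], c["fn"], c["tn"]
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # Agreement expected by chance, from each rater's marginal label rates.
    p_yes = ((tp + fn) / n) * ((tp + fp) / n)
    p_no = ((tn + fp) / n) * ((tn + fn) / n)
    p_e = p_yes + p_no
    kappa = (accuracy - p_e) / (1 - p_e) if p_e < 1 else 1.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "kappa": kappa,
    }


# Hypothetical labels for eight images (True = dust storm); not real data.
human_labels = [True, True, True, False, False, False, True, False]
model_labels = [True, True, False, False, False, True, True, False]
metrics = agreement_metrics(human_labels, model_labels)
print(metrics)
```

Reporting kappa alongside accuracy is one reasonable choice here, since keyword-tagged images may be heavily skewed toward one class and raw accuracy alone can then look better than it is.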



Workflow

Poster

Dust storm database (click here to go to it)



Full Reference:
Keidel, S., Evans, S. and Crooks, A.T. (2025), Creating and Assessing an Unconventional Global Database of Dust Storms Utilizing Generative AI, American Geophysical Union (AGU) Fall Meeting, 15th–19th December, New Orleans, LA. (pdf of poster).

Integration of Community Level Data into Mathematical Models

In the past we have posted about how we can utilize data and models to explore pandemics and people's reactions to them. And while interest in COVID might have waned, there will be future pandemics.

To this end, at the 53rd Annual Meeting of NAPCRG we (Laurene Tumiel Berhalter, Sanchit Goel, Dawn Vanderkooi, Bruce Pitman, Yinyin Ye, Jennifer Surtees and myself) had a poster entitled "Integration of Community Level Data into Mathematical Models to Predict Future Public Health Emergencies." The objective of the poster is to showcase how one can integrate 211 data into models to predict future public health emergencies. If this sounds of interest, below you can see the poster and at the bottom of the post you can access the abstract.


Full Reference:

Tumiel, L.M., Goel, S., Vanderkooi, D., Pitman, E.B., Crooks, A.T., Ye, Y. and Surtees, J. (2025), Integration of Community Level Data into Mathematical Models to Predict Future Public Health Emergencies, North American Primary Care Research Group (NAPCRG) 53rd Annual Meeting, 21st–25th November, Atlanta, GA (pdf).

Saturday, November 08, 2025

New Paper: Modeling Wildfire Evacuation with Embedded Fuzzy Cognitive Maps

While we have explored disasters in the past through agent-based models and other computational social science approaches, one area we have not explored is how one can use agent-based models to explore evacuations during a wildfire event. This has now changed with a new paper with Zhongyu Zhou and myself entitled "Modeling Wildfire Evacuation with Embedded Fuzzy Cognitive Maps: An Agent-Based Simulation of Emotion and Social Contagion" which was recently presented at the 2025 International Conference of the Computational Social Science Society of the Americas (CSSSA).

In the paper we present an agent-based model combined with an embedded fuzzy cognitive map (FCM) to simulate residents’ evacuation behavior during a wildfire event. If this sounds of interest, below we provide the abstract to the paper along with some of the figures that showcase the model logic and some of its results. A detailed ODD, the model and the data needed to run the model can be found at: https://github.com/ozzyzhou99/LA-Wildfire-Model/. Finally, at the bottom of the post you can find the full reference to the paper and a link to it.

Abstract: 

Wildfires are becoming increasingly dangerous, especially in densely populated fire-prone areas like Los Angeles. People’s evacuation decisions during wildfire events are influenced by many factors, including emotions such as fear or panic, which often affect people’s choices to evacuate. Traditional evacuation models often assume that individuals behave rationally. As a result, these models tend to overlook the influence of emotional factors on evacuation behavior. To address this issue, this study develops an agent-based model (ABM) combined with an embedded fuzzy cognitive map (FCM) to simulate residents’ evacuation behavior during a wildfire event. The model covers two types of agents: evacuees and rescuers. It focuses on how emotions change over time and how they spread among people. We also expect to observe how these emotional changes will affect evacuation decisions. This research also considers differences between different income groups to explore whether low-income residents are more likely to panic. Results from the model show that agents with different emotions behave differently during the evacuation process. Emotional changes clearly affect how agents choose routes and whether they can respond quickly. In addition, the results suggest that income level affects emotional responses, and low-income groups are more likely to feel fear. This study highlights the value of using ABM and FCM together to better understand evacuation behavior and provides a new idea for developing fairer and more effective disaster response plans.

Keywords: Agent-Based Modeling, Emotional decision-making, GIS, Fuzzy Cognitive Map, Wildfire Evacuation.
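
For readers unfamiliar with fuzzy cognitive maps, the following is a minimal sketch of what one FCM update step followed by a simple social contagion step could look like for a single agent. It is not the model from the paper: the concept names, the weights, the sigmoid squashing function and the contagion rate are all hypothetical, chosen only to illustrate the general idea. The actual model logic is documented in the ODD in the GitHub repository linked above.

```python
import math


def sigmoid(x, steepness=1.0):
    """Squash any real value into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-steepness * x))


def fcm_step(state, weights, inputs=()):
    """One FCM update.

    Each non-input concept keeps its own value, adds the weighted values of
    the concepts that influence it, and is squashed back into (0, 1).
    Concepts listed in `inputs` are treated as external drivers and held fixed.
    """
    new_state = {}
    for target in state:
        if target in inputs:
            new_state[target] = state[target]
            continue
        influence = sum(
            weights.get((source, target), 0.0) * state[source]
            for source in state
            if source != target
        )
        new_state[target] = sigmoid(state[target] + influence)
    return new_state


def contagion(own_fear, neighbor_fears, rate=0.3):
    """Blend an agent's fear with the mean fear of its neighbors."""
    if not neighbor_fears:
        return own_fear
    mean_fear = sum(neighbor_fears) / len(neighbor_fears)
    return (1.0 - rate) * own_fear + rate * mean_fear


# Hypothetical concepts and causal weights (source, target) -> weight.
state = {"fire_proximity": 0.9, "fear": 0.2, "evacuate_intent": 0.1}
weights = {
    ("fire_proximity", "fear"): 0.8,
    ("fear", "evacuate_intent"): 0.9,
}

next_state = fcm_step(state, weights, inputs=("fire_proximity",))
# Fear is then adjusted toward what nearby agents feel (made-up neighbor values).
next_state["fear"] = contagion(next_state["fear"], [0.8, 0.6])
print(next_state)
```

In an agent-based setting, each agent would hold its own state dictionary and the contagion step would draw on whichever neighbors the model defines, for example agents within some distance on the road network.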
Data used in setting up the model experiment. (A) is household income data, (B) is location of previously affected houses, and (C) is evacuation road data.

Agent-level embedded FCM loop with social contagion.
Evacuees’ Workflow (A), Rescuers’ Workflow (B).




Box plots of average emotions for three groups of experiments (50 repetitions each). From left to right, the number of people in each income group increases progressively. Low income (LI), middle income (MI), and high income (HI).

Full Reference:
Zhou, Z. and Crooks, A.T. (2025), Modeling Wildfire Evacuation with Embedded Fuzzy Cognitive Maps: An Agent-Based Simulation of Emotion and Social Contagion, Proceedings of the 2025 International Conference of the Computational Social Science Society of the Americas, Santa Fe, NM. (pdf)

Thursday, November 06, 2025

HD-GEN: A Software System for Large-Scale Human Mobility Data Generation Based on Patterns of Life


 
Human mobility datasets are essential for investigating human behavior, mobility patterns, and traffic dynamics. In the past we have written about how one can use agent-based models to generate patterns-of-life trajectory datasets. Building on this work, at the ACM SIGSPATIAL 2025 conference we (Hossein Amiri, Richard Yang, Shiyang Ruan, Joon-Seok Kim, Hamdi Kavak, Andrew Crooks, Dieter Pfoser, Carola Wenk and Andreas Züfle) had a paper entitled "HD-GEN: A Software System for Large-Scale Human Mobility Data Generation Based on Patterns of Life."

In this paper, we extend our previous work by introducing a software system that provides a new suite of tools built on top of the Patterns of Life simulation framework. Specifically, this work consolidates our contributions into a unified data generation pipeline that includes:

  1. additional discussion of the motivation and applications of large-scale simulated trajectory data, 
  2. detailed instructions on running the simulation and generating datasets, 
  3. extended analysis of the shared dataset, and 
  4. an integrated GitHub repository

The proposed system enables large-scale synthetic dataset generation, either by statistically replicating real-world data or by creating datasets with user-defined properties. If this sounds of interest, below you can read the abstract to the paper and see the poster that accompanies it. We have also provided detailed instructions on how to reproduce the generated datasets, and made the code and data available at https://github.com/onspatial/large-scale-dataset-generator.

Abstract

Understanding individual human mobility is critical for a wide range of applications. Real-world trajectory datasets provide valuable insights into actual movement behaviors but are often constrained by data sparsity and participant bias. Synthetic data, by contrast, offer scalability and flexibility but frequently lack realism. To address this gap, we introduce a comprehensive software pipeline for generating, calibrating, and processing large-scale human mobility datasets that integrate the realism of empirical data with the control and extensibility of Patterns-of-Life simulations. Our system consists of three integrated components. First, a genetic algorithm–based calibration module fine-tunes simulation parameters to align with real-world mobility characteristics, such as daily trip counts and radius of gyration, enabling realistic behavioral modeling. Second, a data generation engine constructs geographically grounded simulations using OpenStreetMap data to produce diverse mobility logs. Third, a data processing suite transforms raw simulation logs into structured formats suitable for downstream applications, including model training and benchmarking. 

Keywords: GeoLife, Patterns of Life, Simulation, Realistic Trajectory Datasets
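
The abstract mentions radius of gyration and daily trip counts as mobility characteristics that the calibration module aligns with real-world data. As a rough illustration of what such a comparison might involve, the sketch below computes the radius of gyration of a set of visited locations and a simple error score between simulated and target metrics. This is not code from HD-GEN: the function names, the use of projected (x, y) coordinates in meters, the squared relative error, and the example numbers are all assumptions.

```python
import math


def radius_of_gyration(points):
    """Root-mean-square distance of visited points from their centroid.

    `points` is a list of (x, y) pairs, assumed to be in a projected
    coordinate system (e.g., meters) so Euclidean distance is meaningful.
    """
    if not points:
        raise ValueError("need at least one point")
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    mean_sq = sum((x - cx) ** 2 + (y - cy) ** 2 for x, y in points) / n
    return math.sqrt(mean_sq)


def calibration_error(simulated, target):
    """Sum of squared relative errors over the metrics listed in `target`."""
    return sum(
        ((simulated[name] - target[name]) / target[name]) ** 2
        for name in target
    )


# Hypothetical visited locations for one simulated agent (meters).
visits = [(0.0, 0.0), (4.0, 0.0), (4.0, 3.0), (0.0, 3.0)]
rg = radius_of_gyration(visits)

# Made-up simulated vs. target metrics; a search procedure such as a
# genetic algorithm would try parameter sets that reduce this score.
simulated = {"trips_per_day": 3.0, "radius_of_gyration": rg}
target = {"trips_per_day": 4.0, "radius_of_gyration": 2.5}
print(rg, calibration_error(simulated, target))
```

A relative error is used here so that metrics on different scales (trip counts versus distances) contribute comparably; other weightings would be equally plausible.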

Dataset creation phases with HD-GEN software.

Full Reference: 

Amiri, H., Yang, R., Ruan, S., Kim, J-S., Kavak, H., Crooks, A.T., Pfoser, D., Wenk, C. and Züfle, A. (2025), HD-GEN: A Software System for Large-Scale Human Mobility Data Generation Based on Patterns of Life. In The 33rd ACM International Conference on Advances in Geographic Information Systems (SIGSPATIAL ’25), November 3–6, 2025, Minneapolis, MN. pp. 407-410. (pdf) (poster)

Thursday, October 09, 2025

Call for Papers: Geosimulation and Its Emerging Directions with AI




As part of the GeoAI and Deep Learning Symposium at the 2026 AAG Annual Meeting in San Francisco, California, we have a call for papers for sessions entitled "Geosimulation and Its Emerging Directions with AI."

Call for Papers:

Simulating past, present, and future events can empower humans to understand the composition and interactions in complex systems and explain their emergence and evolution from bottom up. In practice, geosimulations constitute a powerful tool in engaging different stakeholders, exploring what-if scenarios, and evaluating alternative policy outcomes.

We invite interdisciplinary works for the exploration and understanding of complex social and environmental processes by means of computer simulation. We focus on all aspects of simulation and agent societies, including multi-agent systems, agent-based modeling, microsimulation, artificial intelligence (AI) agents, and the integration of Generative AI with simulation.

As GenAI is impacting all aspects of our lives, we are wondering how it will impact geospatial simulations. How do multimodal large language models (MLLMs) help with agent-decision making in the form of generating agent-personas or scheduling agent activities? Can MLLMs reduce coding barriers for beginners? Will GenAI lead to a new generation of modeling toolkits? What are the challenges brought by MLLMs in model design, validation, and computing costs?

We welcome a wide range of studies exploring simulation theories, data, methodologies, and frameworks. We are also interested in case studies applying geosimulations to address real-world challenges. Potential topic areas include, but are not limited to:
  • Geosimulation Models and Applications
  • Conceptual Geosimulation Models
  • General-Purpose Geosimulation Framework
  • AI and Geosimulation
  • Agents’ Behaviors, Decision-making and AI Agents
  • Data Generation Framework
  • Validation and Verification for Geosimulation
  • Digital Twins
  • Microsimulation
  • Multi-agent Systems

If you are interested, please email your title and 250-word abstract to Fuzhen Yin (fyin@uccs.edu) and Jeon-Young Kang (geokang@khu.ac.kr) by October 30th.

Chairs:

Organizers:
Sponsor Groups: