Crooks, A.T. (2024), Cities and Disasters: What can Urban Analytics Do?, Environment and Planning B, 52(3): 523-526. (pdf)
Monday, March 10, 2025
New Editorial: Cities and disasters: What can urban analytics do?
Monday, March 03, 2025
Call for Papers: Integrating LLMs and Geospatial Foundation Models to Enhance Spatial Reasoning in ABMs
- Long paper (10-12 pages, excluding references – long oral presentations, will be included in the post-proceedings)
- Short paper (6-9 pages, excluding references – short oral presentations, will be included in the post-proceedings)
- Extended abstract (3-4 pages, excluding references – short oral presentations, will not be included in the post-proceedings)
- Nick Malleson, University of Leeds, UK
- Alison Heppenstall, University of Glasgow, UK
- Ed Manley, University of Leeds, UK
- Andrew Crooks, University of Buffalo, US
Please feel free to get in touch with any of us in case of questions.
Thursday, February 06, 2025
From print to perspective: A mixed-method analysis of the convergence and divergence of COVID-19 topics in newspapers and interviews
The rationale behind this was that the COVID-19 pandemic has led to diverse experiences influenced by public health measures like lockdowns and social distancing. To explore these dynamics, we introduce a novel ’big-thick’ data approach that integrates extensive U.S. newspaper data with detailed interviews. By employing natural language processing (NLP) and geoparsing techniques, we identify key topics related to the pandemic and vaccinations both in newspapers and personal narratives from interviews, and compare the (spatial) convergences and divergences between them.
Abstract:
In the face of the unprecedented COVID-19 pandemic, various government-led initiatives and individual actions (e.g., lockdowns, social distancing, and masking) have resulted in diverse pandemic experiences. This study aims to explore these varied experiences to inform more proactive responses for future public health crises. Employing a novel “big-thick” data approach, we analyze and compare key pandemic-related topics that have been disseminated to the public through newspapers with those collected from the public via interviews. Specifically, we utilized 82,533 U.S. newspaper articles from January 2020 to December 2021 and supplemented this “big” dataset with “thick” data from interviews and focus groups for topic modeling. Identified key topics were contextualized, compared and visualized at different scales to reveal areas of convergence and divergence. We found seven key topics from the “big” newspaper dataset, providing a macro-level view that covers public health, policies and economics. Conversely, three divergent topics were derived from the “thick” interview data, offering a micro-level view that focuses more on individuals’ experiences, emotions and concerns. A notable finding is the public’s concern about the reliability of news information, suggesting the need for further investigation on the impacts of mass media in shaping the public’s perception and behavior. Overall, by exploring the convergence and divergence in identified topics, our study offers new insights into the complex impacts of the pandemic and enhances our understanding of key issues both disseminated to and resonating with the public, paving the way for further health communication and policy-making.
![]() |
An overview of the research workflow. |
The monthly distribution of collected articles in the United States from January 2020 to December 2021. |
An example of identified entities labeled with predefined entity types. |
The spatial distribution of newspaper articles by different scales. |
The spatial distribution of identified newspaper topics across different regions in New York State. |
Ordered rank of identified topics by percentage from interviews. |
Chen, Q., Crooks, A.T., Sullivan, A.J., Surtees, J.A. and Tumiel-Berhalter, L. (2025). From Print to Perspective: A mixed-method analysis of the convergence and divergence of COVID-19 topics in newspapers and interviews, PLOS Digital Health. Available at https://doi.org/10.1371/journal.pdig.0000736. (pdf)
Friday, January 31, 2025
New Directions in Mapping the Earth’s Surface with Citizen Science and Generative
As more satellite imagery has become openly available, efforts in mapping the Earth’s surface have accelerated. Yet the accuracy of these maps is still limited by the lack of in-situ data needed to train machine learning algorithms. Citizen science has proven to be a valuable approach for collecting in-situ data through applications like Geo-Wiki and Picture Pile, but better approaches for optimizing volunteer time are still required. Although machine learning is being used in some citizen science projects, advances in generative Artificial Intelligence (AI) are yet to be fully exploited. This paper discusses how generative AI could be harnessed for land cover/land use mapping by enhancing citizen science approaches with multi-modal large language models (MLLMs), including improvements to the spatial awareness of AI.
![]() |
Visual interpretation tasks undertaken by ChatGPT for (a) a wetland/mangrove landscape in South America (b) an agricultural area in central Europe. |
![]() |
Integrating multi-modal Large Language Models (MLLMs) in a citizen science visual interpretation workflow. |
See, L., Chen, Q., Crooks, A., Bayas, J.C.L., Fraisl, D., Fritz, S., Georgieva, I., Hager, G., Hofer, M., and Lesiv, M., Malek, Ž., Milenković, M., Moorthy, I., Orduña-Cabrera, F., Pérez-Guzmán, K., Schepaschenko, D., Shchepashchenko, M., Steinhauser, J.and McCallum, I. (2025), New Directions in Mapping the Earth’s Surface with Citizen Science and Generative AI, iScience, doi: https://doi.org/10.1016/j.isci.2025.111919. (pdf)
Saturday, December 14, 2024
AGU
Over the last few years, considerable efforts have been placed in creating digital twins from diverse fields ranging from engineering to urban planning and many things in-between. These digital twins have benefited from the growth and availability of computational power and data. For example, in urban planning the growth of computational resources and the explosion of spatial data sources(e.g. remote sensing) has lead to the creation and widespread adoption of detailed virtual urban environments or urban digital twins. However, we would argue that many of such works emphasize only the physical infrastructure or the built environment of the city instead of considering the key actors of urban systems: the people who live in them. In this work we aim to remedy this by introducing a framework that utilizes agent-based modeling to add humans to such urban digital twins. This framework consists of two components: 1)synthetic populations generated with census data; and 2) pipeline of using the population datasets for agent-based modeling applications within the urban digital twins domain. To demonstrate the utility of this framework, we have representative applications that showcase how digital twins can be created to study various urban phenomena (e.g., evacuation scenarios, traffic congestion and disease transmission). By doing so, we believe this framework will benefit researchers wishing to build urban digital twins and to explore complex urban issues with realistic populations.
![]() |
Workflow of utilizing synthetic populations within agent-based models. |
In a different presentation, we return to how one can use social media to monitor the world around us, in this case dust storms. This work entitled "Mining unconventional data sources: creating a social media-based catalog of dust events in the Western US" is collaboration with Stuart Evans and Festus Adegbola. Generally speaking we explore how social media has the potential for a new unconventional source of observations of windblown dust. If this sounds of interest, below you can read the abstract to the paper and see the visual overlap between social media posts about dust events and official National Weather Service (NWS) dust storm warning coverage.
Abstract
Complete observations of dust events are difficult, as dust’s spatial and temporal variability means satellites may miss dust due to overpass time or cloud coverage, while ground stations may miss dust due to not being in the plume. As a result, an unknown number of dust events go unrecorded in traditional datasets. Dust’s importance both for atmospheric processes and as a health and travel hazard makes detecting dust events whenever possible important, and in particular, studies of the health impacts of dust are limited by detailed exposure information, i.e. where is there dust and when. In recent years, social media platforms have provided an opportunity to access vast user-generated data. This research utilizes geotagged Flickr and Twitter posts referencing dust in the western US, and compares it to traditional datasets including blowing dust reports from the National Weather Service and satellite observations from Suomi-VIIRS. Results show that this unconventional dataset broadly recreates the observed spatial and seasonal distributions of dust. Daily analysis of the locations of the social media posts creates a novel catalog of dust events in the western US that can be used for further research. While this catalog is necessarily incomplete, it nonetheless provides a complementary list of events to those detected by traditional means. Analysis of individual events in this catalog shows that social media captures many dust events that previously went undetected by traditional datasets.
References:
Crooks, A.T., Jiang, N., Yin, F. and Wang, B. (2024), A Framework for Populating Urban Digital Twins with Agents, American Geophysical Union (AGU) Fall Meeting, 9th–13th December, Washington, DC. (pdf)
Evans, S., Adegbola, F. and Crooks, A.T. (2024), Mining Unconventional Data Sources: Creating a Social Media-based Catalog of Dust Events in the Western US, American Geophysical Union (AGU) Fall Meeting, 9th–13th December, Washington, DC. (pdf)