Sign In or Create an Account.

By continuing, you agree to the Terms of Service and acknowledge our Privacy Policy

Climate Tech

Google Is Using AI to Fill a Flood Risk Data Gap

Researchers at the hyperscaler say they can predict flash floods with a new Gemini-produced dataset.

A flooded house and the Gemini logo.
Heatmap Illustration/Getty Images

Flash floods, when stormwater pools and rises rapidly in an area within just a few hours of a storm's onset, are one of the more dangerous hazards of a warming planet prone to heavier rainfall. They are also notoriously difficult to predict. But research out of Google on Thursday shows how artificial intelligence could unlock better forecasts and help communities prepare.

Google researchers used Gemini, the tech giant’s signature AI agent, to process millions of news articles from around the world about past floods and extract data on when and where the deluges occurred. After assembling this vast new dataset — the largest of its kind to date — they used it to train a flood prediction model that uses local, hourly meteorological data to produce 24-hour forecasts for urban flash floods in more than 150 countries.

The dataset, which Google has named Groundsource, is free for anyone to download and use, and the forecasts are now live on Google’s Flood Hub, an online portal that also predicts river-related flood events. The tool is somewhat crude — it simply indicates whether there is a medium or high likelihood of a flash flood occurring in the next 24 hours in a given area. It only covers urban areas, and it doesn’t tell you how severe the flood could be. The resolution is also pretty coarse, indicating risks at the scale of a city rather than a street or neighborhood.

Still, the researchers said the forecasts would be useful for alerting authorities to potential risks.

“People have been very interested, even at that level of granularity,” Gila Loike, a product manager at Google Research, told reporters in a press conference this week.

According to Google, a regional disaster authority in Southern Africa caught a flash flood alert while the tool was still in beta, confirmed the flood on the ground, and then deployed a humanitarian worker to oversee the response. “We’re still in the early days of seeing the impact of Groundsource, but that chain of events from a prediction in Flood Hub to boots on the ground is exactly what Flood Hub was built for,” Juliet Rothenberg, the product director for Google’s crisis resilience work, said.

One of the key reasons it’s so hard to predict flash floods is the lack of historical data. We have decent flood models for “riverine” flooding, when rivers overflow, because of physical gauges in rivers around the world that have collected water levels for decades, but there’s no equivalent for city streets.

News articles present a largely untapped source to fill this gap. The challenge is that the key bits of information, such as where and when the flood occurred, are buried in narrative texts and expressed in wildly inconsistent formats. It would take human experts untold hours and resources to wade through each one and record the data in a standardized manner. An AI agent such as Gemini, however, can do it much faster.

Google’s research team started out by crawling the web for news articles describing flood events going back to the year 2000, gathering an initial pool of more than 9 million stories from around the world. After getting rid of ads and menus and the like and translating the articles that were in other languages to English, they fed them to Gemini.

“You are a meticulous flood event analyst,” the researchers told the AI agent. The rest of the elaborate prompt is included in a non-peer-reviewed preprint paper detailing the group’s methods for producing the dataset. In essence, they goaded Gemini to take a sentence such as “Main Street flooded on Tuesday,” and interpret where, exactly, this Main Street was located, and which Tuesday the article was referring to.

The resulting dataset contains 2.6 million historical flood events across more than 150 countries. As a comparison, the next largest public dataset, the National Oceanic and Atmospheric Administration’s Storm Events database, contains about 2 million storm events from 1950 to the present, only about 230,000 of which are flood events. The biggest global dataset, the United Nations Office for Disaster Risk Reduction’s DesInventar system, contains 500,000 events, only a fraction of which are records of floods. It’s also restricted to participating nations and inconsistently updated.

“Oftentimes, the first question our researchers will ask when we talk about going into a new domain within crisis resilience is, what data do you have? How many data entries do you have?” Rothenberg said. “That’s what really unlocks the ability to make breakthroughs here.”

Humberto Vergara, an assistant professor of civil and environmental engineering at the University of Iowa who studies flash floods, agreed that the lack of flood observation data has been a significant obstacle for the field. He told me the Groundsource dataset will “definitely be of great interest” and that there is “definitely great need for things like this.” Using news reports to fill out the global picture of flooding is something researchers have been thinking about doing for a while, he added.

While Vergara was cautiously optimistic the data would be useful, he was quick to note that it would take additional efforts to validate. His lab is working on its own dataset based on satellite estimates of rainfall that could be used to prove out Google’s records, he said.

The Google team already made some efforts to validate Groundsource, cross-checking it with manual annotations of the news reports as well as with other existing databases. It found that about 82% of the events were labeled with the correct location and timeframe. “From a research perspective, using an 82% accurate dataset is actually acceptable,” Loike said. “A well-trained model can smooth out the inconsistencies and thereby learn the dominant patterns while ignoring the 18% of labeling errors.”

They also validated the Flood Hub predictions by comparing its U.S. outputs to flood and flash flood warnings produced by the National Weather Service. “Achieving performance metrics comparable to such a sophisticated, instrumentation-rich framework demonstrates how AI can bridge the warning gap in underserved regions that lack equivalent infrastructure,” the researchers wrote in a second non-peer-reviewed preprint describing the model development.

Part of the reason Vergara was cautious in praising the effort is that predicting flash floods is challenging for reasons beyond the lack of historical data. “Most of the driving force is rainfall,” he said. “Everybody in the community knows that predicting rainfall is extremely difficult. The best models out there cannot predict rainfall with the accuracy that is needed for flash floods with more than one or two hours of lead time.”

The utility of Google’s Flood Hub depends on who will be consuming the information, he said. It’s probably not high-resolution enough to be useful for emergency responders, but there might be agencies at the city or regional level that can use it as a situational awareness tool.

Rothenberg, of Google, is optimistic that this same method can produce useful predictions for other kinds of extreme events.

“Applying this methodology to flash flood reports is just the beginning,” Rothenberg told reporters at the press conference. “We think there’s an immense opportunity in thinking about how we could use publicly available information to help predict heat waves or landslides, for example — other events that are hard to predict because the data hasn’t been centralized or it doesn’t exist.”

Blue

You’re out of free articles.

Subscribe today to experience Heatmap’s expert analysis 
of climate change, clean energy, and sustainability.
To continue reading
Create a free account or sign in to unlock more free articles.
or
Please enter an email address
By continuing, you agree to the Terms of Service and acknowledge our Privacy Policy
Adaptation

Anti-Mask Sentiment Is Making It Hard to Protect People From Wildfire Smoke

The COVID-era political divide is still having ripple effects.

Taking off a mask during a wildfire.
Heatmap Illustration/Getty Images

Six years ago this month, the Centers for Disease Control and Prevention began advising that even healthy individuals to wear face coverings to protect themselves against the spread of what we were then still calling the “novel coronavirus.” Mask debates, mandates, bans, and confrontations followed. To this day, in the right parts of the country, covering your face will still earn you dirty looks, or worse.

If there were ever another year to have an N95 on hand, though, it’s this one. This winter was the warmest on record in nine U.S. states; Oregon, Colorado, Utah, and Montana have also recorded some of their lowest snowpacks since record-keeping began. That cues up the landscape in the West for “above normal significant fire potential,” in the words of the National Interagency Fire Center, which issues predictive outlooks for the season ahead. And it’s not just the West: the 642,000-acre Morrill grass fire, which ignited in early March, was the largest in Nebraska’s history, while exceptional drought conditions stretching from East Texas through Florida have set the stage for “well above normal fire activity” heading into the spring lightning season. As of the end of March, wildfires have already burned more than 1.6 million acres in the U.S., or 231% of the previous 10-year average.

Keep reading...Show less
AM Briefing

Total Waste

On Eli Lilly’s nuclear, Sunrise Wind, and Brazil’s minerals

Offshore wind.
Heatmap Illustration/Getty Images

Current conditions: Temperatures in the Northeast are swinging from last week’s record 90 degrees Fahrenheit to a cold snap with the risk of freezing • After a sunny weekend, the United States’ southernmost capital — Pago Pago, American Samoa — is facing a week of roaring thunderstorms • It’s nearing 100 degrees in Bangui as the Central African Republic’s capital and largest city braces for another day of intense storms.


THE TOP FIVE

1. Oil prices jump as fragile Iran War ceasefire crumbles

The price of crude spiked nearly 7% in pre-market trading Sunday after the fragile ceasefire between Iran and the U.S.-Israeli alliance. Things had been looking up on Friday, when President Donald Trump announced what appeared to be a breakthrough in talks with Tehran in a post on Truth Social, saying Iran would “fully reopen” the Strait of Hormuz. By Sunday, however, the U.S. commander in chief was accusing Tehran of firing bullets at French and British vessels in the waterway in “a total violation of our ceasefire agreement,” adding: “That wasn’t nice, was it?” On Sunday afternoon, Trump posted again to announce that the U.S. had seized an Iranian-flagged cargo ship attempting to traverse the strait. The prolonged conflict will only harden the historic rupture the severe contraction of oil and gas supply to the global market in modern history has triggered in global energy planning. “As happened with Russia’s war against Ukraine, the consequences of the Hormuz closure cannot simply be undone. That leaves countries — especially poorer countries dependent on fossil fuel imports — with a stark choice about how to fuel their future economic growth,” Heatmap’s Matthew Zeitlin wrote last week. “The crisis may have tipped the balance towards renewable and storage technology from China over oil and natural gas from the Persian Gulf, Russia, or the United States.”

Keep reading...Show less
Blue
Climate Tech

Exclusive: Where We’re At in the Race to Save the Planet

Investor and philanthropist John Doerr shares a refresh to his Speed & Scale climate action tracker.

From heat to coolness.
Heatmap Illustration/Getty Images

John Doerr thinks it’s time to refresh his grand plan for decarbonization. The Kleiner Perkins chairman and climate-focused philanthropist published his book Speed & Scale: An Action Plan for Solving Our Climate Crisis Now five years ago; then a year later, he introduced an online tracker to measure global progress across the book’s core objectives, which includes sectoral targets such as electrifying transport as well as execution-related goals that cut across all sectors such as winning on politics and policy and increasing investment investing.

But in the time since, both the world and the climate outlook have shifted significantly. So Doerr, alongside his co-author and advisor Ryan Panchadsaram, concluded that both the action plan and the metrics used to assess progress were due for a major revamp.

Keep reading...Show less
Blue