VIDEO: University of Pennsylvania and Microsoft Research develop machine-learning weather prediction model

A team of researchers from the University of Pennsylvania and Microsoft Research has developed Aurora, a machine-learning model with predictive capabilities for air quality, ocean waves, tropical cyclone tracks and weather.

Research team

The research was supported by the Department of Energy’s Advanced Scientific Computing Research program (DE-SC0024563) and the Engineering & Physical Sciences Research Council Prosperity Partnership between Microsoft Research and the University of Cambridge (EP/T005386/1).

Paris Perdikaris, associate professor in the in the School of Engineering and Applied Science. Credit: Sylvia Zhang

Their findings have been published in Nature. The team was led by Paris Perdikaris, associate professor in the Department of Mechanical Engineering and Applied Mechanics in the School of Engineering and Applied Science at the University of Pennsylvania. Co-first author Megan Stanley is a senior researcher in the Machine Intelligence group at Microsoft Research. Other authors include Johannes Brandstetter of Johannes Kepler University Linz and Microsoft Research, Chun-Chieh Wu of National Taiwan University; Ana Lucic and Max Welling of the University of Amsterdam and Microsoft Research; Anna Allen and Alexander T. Archibald of the University of Cambridge; Richard E. Turner of the University of Cambridge and Microsoft Research; Haiyu Dong, Kit Thambiratnam, and Jonathan A. Weyn of Microsoft Corporation; Wessel P. Bruinsma, Patrick Garvan, Elizabeth Heider, and Maik Riechert of Microsoft Research; and Cristian Bodnar and Jayesh K. Gupta of Microsoft Research and Silurian AI.

Machine-learning model for extreme weather

According to the researchers, the model outperforms existing traditional systems at a fraction of the cost and could help emergency service providers better prepare for extreme weather events.

“Earth’s climate is perhaps the most complex system we study – with interactions spanning from quantum scales to planetary dynamics,” said Perdikaris. “With Aurora, we addressed a fundamental challenge in Earth system prediction – how to create forecasting tools that are both more accurate and dramatically more computationally efficient.”

For example, the team’s model correctly predicted landfall of 2023’s Typhoon Doksuri – the costliest Pacific typhoon to date – in the northern Philippines four days ahead of the event, while official forecasts erroneously predicted landfall off the coast of northern Taiwan.

Perdikaris says that instead of solving equations like numerical models, Aurora identifies complex relationships in historical Earth system data and uses these to generate predictions.

“This makes Aurora dramatically faster – generating predictions in seconds rather than hours – while maintaining or even exceeding the accuracy of traditional models,” he said.

Better, faster, stronger

To achieve better results for less time and money, the team turned to a “foundation model”, an artificial intelligence (AI) model trained on a wide variety of data, much like OpenAI’s GPT. Aurora is trained on more than one million hours of diverse geophysical data, including temperature, wind speeds, humidity, ocean wave heights and atmospheric chemical compositions. These came from weather analyses, reconstructions of historical weather, forecasts and climate simulations.

The learning process involved two key phases. First, Aurora was fed this diverse data, learning to predict the evolution of Earth system variables with a six-hour lead time and providing the model with fundamental insights into planetary dynamics. Then, during the fine-tuning phase, this pretrained model was adapted to perform specific tasks, such as using chemical composition data to predict air quality or pressure patterns associated with storm systems to track tropical storms.

The model achieves faster predictions, the researchers explained, because it learns patterns directly from extensive observational and simulation datasets – bypassing the need for explicit mathematical equations typically required in traditional models – and employs AI techniques specifically designed to leverage parallel processing capabilities of graphics processing units.

The enhanced accuracy of their approach arises from several key factors, the academics say. First, the model identifies and uses subtle patterns and correlations within data that conventional physics-based approaches might miss or not explicitly represent. Secondly, its neural network architecture is particularly well-suited for capturing complex physical processes occurring simultaneously at multiple scales.

Perdikaris states that Aurora also employs transfer learning, which means that knowledge gained from one area, such as atmospheric dynamics used in weather forecasting, enhances its predictive performance in other domains, including air quality modeling or predicting tropical cyclone formation.

“This cross-domain learning is central to the foundation model philosophy that guides my broader research program,” he said.

Beating the supercomputers

In testing Aurora’s predictive abilities, the team looked at a series of recent weather events as case studies and pitted their new AI against extant systems.

Perdikaris said that Aurora’s hurricane forecasting achievements are particularly remarkable: “When we compared Aurora to official forecasts from agencies like the National Hurricane Center, China Meteorological Administration and others, Aurora outperformed all of them across different basins worldwide.”

To examine air quality, the team looked at a sandstorm that took place in Iraq in June 2022, one of a series that resulted in more than 5,000 hospitalizations. Their AI accurately predicted it one day in advance at a fraction of the cost it takes to run a forecast on the Copernicus Atmosphere Monitoring Services, the gold standard in Earth observation and atmospheric monitoring.

Perdikaris adds that what’s particularly impressive is the model’s ability to handle the challenges of air quality data – sparse observations, large dynamic ranges in pollutant concentrations and complex chemical reactions through hundreds of equations – while accounting for human-generated emission pattern changes, like those seen during Covid-19.

Aurora “did not have any prior knowledge about atmospheric chemistry or how nitrogen dioxide, for instance, interacts with sunlight – that wasn’t part of the original training,” said Stanley. “And yet, in fine-tuning, Aurora was able to adapt to that because it had already learned enough about all of the other processes.”

In testing Aurora’s predictive capabilities for the heights and directions of ocean waves, the team conducted a case study of Typhoon Nanmadol, which struck the southern coast of Japan in 2022 and was the most intense typhoon that year. Their model exceeded expectations by perceiving intricate wave patterns in greater detail, drawing from prevailing wind patterns and accurately capturing the typhoon’s waves, they said.

The forecast for Aurora

“What makes these results particularly exciting is that they demonstrate how a single foundational approach can be applied across diverse domains,” said Perdikaris. “It’s something we’re now expanding to other scientific applications in my group.”

The researchers are interested in extending their model to generate predictions on Earth systems such as local and regional weather, seasonal weather and extreme weather events like floods and wildfires. Perdikaris says that he believes that this may represent a potential paradigm shift in how information on Earth systems is disseminated to key decision makers.

“The most transformative aspect is democratizing access to high-quality forecasts,” he commented. “Traditional systems require supercomputers and specialized teams, putting them out of reach for many communities worldwide. Aurora can run on modest hardware while matching or exceeding traditional model performance.”

For cities and local governments, Perdikaris notes that this means having localized, high-resolution predictions for air quality, extreme rainfall or heat waves without relying on downscaled global models. He said that the computational efficiency enables more frequent updates and forecasts that better quantify uncertainty, which is critical for risk management.

“What excites me most about this technology is its broader applicability,” concluded Perdikaris. “At Penn, we’re exploring how similar foundation model approaches can address other prediction challenges beyond weather – from urban flooding to renewable energy forecasting to air quality management – making powerful predictive tools accessible to communities that need them most.”

In related news, NOAA’s US$3m experimental Next Generation Fire System (NGFS) is undergoing a second evaluation in the Fire Weather Testbed, which is managed by the Global Systems Laboratory in Boulder, Colorado. The follow-up visit is intended to evaluate how to best send NGFS fire detections directly to land management partners across the western USA. Click here to read the full story

Cookie	Duration	Description
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

In this Issue – January 2026

In this Issue – January 2026

In this Issue – September 2025

In this Issue – April 2025

VIDEO: University of Pennsylvania and Microsoft Research develop machine-learning weather prediction model

Robotic Argo float sends back first ocean transect beneath East Antarctic ice shelf

US Navy trials Meteomatics weather drones to enhance maritime weather observation

StormGeo and Sedna partner to use advanced weather intelligence to deliver enhanced communication and operations management for shipping industry

Robotic Argo float sends back first ocean transect beneath East Antarctic ice shelf

Second phase of CREWS project launched to strengthen early warning systems in Southeast Asia

US Navy trials Meteomatics weather drones to enhance maritime weather observation

VIDEO: University of Pennsylvania and Microsoft Research develop machine-learning weather prediction model

Research team

Machine-learning model for extreme weather

Better, faster, stronger

Beating the supercomputers

The forecast for Aurora

Read Similar Stories