DARPA-ACTM: Assessing Risks of High-Impact Climate Changes and Tipping Points with a Data-Informed Climate Model
Climate change is poised to reshape our world, from challenging limits of water management infrastructure to necessitating adjustments to DoD’s strategic posture. For example, climate change may push outdoor temperature and humidity in regions such as the Persian Gulf beyond what humans can endure, or it may disrupt freshwater supply in sub-Saharan Africa, in either case possibly triggering mass migrations and geopolitical shifts.
The overall goal of this project is to achieve a step change in the accuracy of representation of clouds in climate models, to quantify uncertainties in the models, and to use the models with quantified uncertainties to assess risks of high-impact climate changes and tipping points. This project is funded by the Defense Advanced Research Projects Agency (Agreement No. HR00112290030), as a part of the AI-assisted Climate Tipping-point Modeling program.
Current Earth System models (ESMs) are not able to quantify the risks especially of low-probability, high-impact climate changes with irreversible consequences. These models suffer from biases that are traced back to the representation sub grid scale (SGS) variability that cannot be resolved in a global simulations. One of the most notorious of these processes is the dynamics of clouds and turbulence, which play a crucial role in controlling weather and climate. ESMs rely on parameterizations for representing clouds and turbulence and inadequacies in these parameterizations have hampered ESMs for decades; they are the largest source of physical uncertainties in climate predictions. Moreover, the correct representation of SGS variability is essential for representing extreme conditions.
Machine learning (ML) represents an exciting opportunity in Earth system modeling and specifically for the parameterization problem. However, the application of ML brings its own set of challenges. These include the difficulty in enforcing conservation laws and resolution dependence of ML models (which requires retraining for each ESM resolution); and the sparse and indirect data available for training such 'data hungry' models.
We address these ML issues by employing a hybrid model that combines a physical parameterization with ML closures, and training a Fourier Neural Operator (FNO) closure using a gradient free training method. The physical model is the Extended EDMF, which was shown to represent the entire spectrum of turbulent and convective motions using physical closures only [1][2][3] blog-post. Learning closures (i.e. terms in the equations) of a model greatly reduces the degrees of freedom (compared to learning the entire parameterization) thus reducing the amount of data needed to constrain the model. The gradient free - hybrid model - approach we plan to use is successfully applied for training a neural network as closures for the extended EDMF, and improves its generalizability to global warming scenarios [4]. We plan to apply this method using FNOs, which have been shown to be resolution agnostic in classical turbulent flow problems [5], blog-post. As training data we run a library of Large Eddy Simulations (LES) driven by the conditions found in state of the art climate simulations at selected location on the glob [6], blog-post.
Code repositories and data libraries associated with the project
The code-base for carrying out this research is part of the CLiMA model. This code is distributed among a large number of packages, and several of these are interacting to provide a modeling and learning framework in this project.
| Package | code | docs | Purpose |
|---|---|---|---|
| CalibrateEDMF.jl | code | docs | Framework to learn about cloud processes from data |
| EnsembleKalmanProcesses.jl | code | docs | Implementation of gradient-free optimization techniques |
| TurbulenceConvection.jl | code | docs | Implementation of EDMF scheme of turbulence, convection and clouds |
| OperatorFlux.jl | code | A machine learning package for Fourier Neural Operators | |
| LES library | data | LES generated training data at current climate and 4K warming simulations |
The interaction between these components is illustrated in the figure below, where CalibrateEDMF.jl is the central package that communicates between the model TurbulenceConvection.jl, the calibration algorithm EnsembleKalmanProcesses.jl, and the data from the LES library. The algorithm starts with a current parameter set (green $\theta$) which is passed to the EDMF model, and using FNOs, produces the closures ($\epsilon$, $\delta$). Using these closures the EDMF model computes profiles of observed variables, which are used to compute a loss. The loss informs the inversion algorithm which predicts a better set of parameters (orange $\theta$) and so on until the loss converges.
Further information about the individual packages can be found in their documentation pages:
CalibrateEDMF.jlis a software package that leverages the Bayesian optimization techniques housed inEnsembleKalmanProcesses.jlto learn cloud process parameterizations within an eddy-diffusivity mass-flux scheme (EDMF) of atmospheric turbulence and convection, and quantify their uncertainty. You can access the repository here, and the documentation here.TurbulenceConvection.jl(TC) is a software package that implements the extended eddy-diffusivity mass-flux scheme (EDMF) of atmospheric turbulence and convection. The package also provides standalone drivers to simulate the lower troposphere on a single-column basis. You can access the repository here and the documentation here.OperatorFlux.jlis a software package for FNOs. You can access the repository here.EnsembleKalmanProcesses.jl(EKP) is a library of derivative-free Bayesian optimization techniques based on the Ensemble Kalman Filters, a well known family of approximate filters used for data assimilation. You can access the repository here, and the documentation here.The LES library use for training data is available here.
Principal Investigators
- Tapio Schneider, California Institute of Technology
- Anima Anandkumar, California Institute of Technology
References
- 1Zhihong Tan, Colleen M Kaul, Kyle G Pressel, Yair Cohen, Tapio Schneider, Joao Teixeira (2018)
- 2Yair Cohen, Ignacio Lopez-Gomez, Anna Jaruga, Jia He, Colleen M Kaul, Tapio Schneider (2020)
- 3Ignacio Lopez-Gomez, Yair Cohen, Jia He, Anna Jaruga, Tapio Schneider (2020)
- 4Ignacio Lopez-Gomez, Costa D Christopoulos, Haakon Ludvig Ervik, Oliver R Dunbar, Yair Cohen, Tapio Schneider (2022)
- 5Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar (2020)
- 6Zhaoyi Shen, Akshay Sridhar, Zhihong Tan, Anna Jaruga, Tapio Schneider (2022)