Long Short-term Memory Network
Long short-term memory (LSTM) networks are a deep learning (DL) algorithm that employs sequences, or time series, as inputs, can produce sequences of predicted outputs, and can learn system dynamics because of time-related learning and prediction.
Dynamic Integration of AI-ML Predictions with Process-Based Model Simulations
Data assimilation (DA) is used to integrate artificial intelligence including machine learning (AI-ML) and process-based models to produce a dynamic operational water balance tool for groundwater management. The management tool is a three-step calculation. In the first step, a traditional process-based water budget model provides forward model predictions of aquifer storage from meteorological observations, estimates of pumping and diversion discharge, and estimates of recharge. A Kalman filter-based DA approach is the second step and generates updated storage volumes by combining a trained AI-ML model, providing replacement 'measurements' for missing observations, with forward model predictions. The third 'correction' step uses modified recharge and pumping, adjusted to account for the difference between Kalman update storage and forward model predicted storage, in forward model re-simulation to approximate updated storage volume. Use of modified inputs in the correction provides a mass conservative water budget framework based on AI-ML predictions. Pumping and recharge values are uncertain and unobserved in the study region and can be adjusted without contradicting measurements.
AI-ML Accounting for Uncertain Water Resources Data Sets
Artificial intelligence (AI) is the effort to automate intellectual tasks normally performed by humans, and it includes machine learning (ML) and deep learning (DL) approaches. AI-ML is based on statistical learning, which tries to learn statistics-based rules for data analyses from known examples of inputs and corresponding outcomes. Data sets that are noisy, include significant uncertainty, and have extreme values hinder statistical learning. ML and DL aquifer recharge predictors are developed to: (1) examine prediction skill when trained using noisy and uncertain data and (2) identify advantages of AI-ML relative to traditional physics- and process-based calculations. Recharge was selected as the learning outcome because it is not observed and is inherently uncertain. A common-sense baseline is developed and implemented to account for uncertainty and noise in AI-ML predictions. The baseline provides a lower goodness-of-fit threshold that identifies when trained AI-ML generates prediction skill and an upper goodness-of-fit threshold above which the AI-ML is learning to reproduce noise and bias in the training data set (and is likely overfitting). Identified advantages for AI-ML (relative to physics- or process-based calculations) are the ability to use dimensionless trends for features and to represent a complex scenario with the same level of effort as for a simple case.