Abstract
Real engineering and scientific applications often involve one or more qualitative inputs. Standard Gaussian processes (GPs), however, cannot directly accommodate qualitative inputs. The recently introduced latent variable Gaussian process (LVGP) overcomes this issue by first mapping each qualitative factor to underlying latent variables (LVs) and then uses any standard GP covariance function over these LVs. The LVs are estimated similarly to the other GP hyperparameters through maximum likelihood estimation and then plugged into the prediction expressions. However, this plug-in approach will not account for uncertainty in estimation of the LVs, which can be significant especially with limited training data. In this work, we develop a fully Bayesian approach for the LVGP model and for visualizing the effects of the qualitative inputs via their LVs. We also develop approximations for scaling up LVGPs and fully Bayesian inference for the LVGP hyperparameters. We conduct numerical studies comparing plug-in inference against fully Bayesian inference over a few engineering models and material design applications. In contrast to previous studies on standard GP modeling that have largely concluded that a fully Bayesian treatment offers limited improvements, our results show that for LVGP modeling it offers significant improvements in prediction accuracy and uncertainty quantification over the plug-in approach.
Original language | English (US) |
---|---|
Pages (from-to) | 1357-1381 |
Number of pages | 25 |
Journal | SIAM-ASA Journal on Uncertainty Quantification |
Volume | 11 |
Issue number | 4 |
DOIs | |
State | Published - 2023 |
Funding
Acknowledgment. This research was supported in part by the computational resources and staff contributions provided for the Quest high-performance computing facility at North-western University, which is jointly supported by the Office of the Provost, the Office for Research, and Northwestern University Information Technology. \ast Received by the editors September 29, 2022; accepted for publication (in revised form) July 28, 2023; published electronically December 11, 2023. https://doi.org/10.1137/22M1525600 Funding: This work was supported by the Advanced Research Projects Agency-Energy (ARPA-E), U.S. Department of Energy, under award DE-AR0001209. \dagger Department of Industrial Engineering \& Management Sciences, Northwestern University, Evanston, IL 60208 USA ([email protected], [email protected]). \ddagger Department of Mechanical Engineering, Northwestern University, Evanston, IL 60208 USA (akshayiyer2021@ u.northwestern.edu, [email protected]).
Keywords
- categorical variables
- fully Bayesian inference
- Gaussian process
- latent variables
- uncertainty quantification
ASJC Scopus subject areas
- Statistics and Probability
- Modeling and Simulation
- Statistics, Probability and Uncertainty
- Discrete Mathematics and Combinatorics
- Applied Mathematics