Active Learning for Computationally Efficient Distribution of Binary Evolution Simulations

Kyle Akira Rocha, Jeff J. Andrews, Christopher Philip Luke Berry, Zoheyr Doctor, Aggelos K. Katsaggelos, Juan Gabriel Serra Pérez, Pablo Marchant, Vicky Kalogera, Scott Coughlin, Simone S. Bavera, Aaron Dotter, Tassos Fragos, Konstantinos Kovlakas, Devina Misra, Zepei Xing, Emmanouil Zapartas

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Binary stars undergo a variety of interactions and evolutionary phases, critical for predicting and explaining observations. Binary population synthesis with full simulation of stellar structure and evolution is computationally expensive, requiring a large number of mass-transfer sequences. The recently developed binary population synthesis code POSYDON incorporates grids of MESA binary star simulations that are interpolated to model large-scale populations of massive binaries. The traditional method of computing a high-density rectilinear grid of simulations is not scalable for higher-dimension grids, accounting for a range of metallicities, rotation, and eccentricity. We present a new active learning algorithm, psy-cris, which uses machine learning in the data-gathering process to adaptively and iteratively target simulations to run, resulting in a custom, high-performance training set. We test psy-cris on a toy problem and find the resulting training sets require fewer simulations for accurate classification and regression than either regular or randomly sampled grids. We further apply psy-cris to the target problem of building a dynamic grid of MESA simulations, and we demonstrate that, even without fine tuning, a simulation set of only ∼1/4 the size of a rectilinear grid is sufficient to achieve the same classification accuracy. We anticipate further gains when algorithmic parameters are optimized for the targeted application. We find that optimizing for classification only may lead to performance losses in regression, and vice versa. Lowering the computational cost of producing grids will enable new population synthesis codes such as POSYDON to cover more input parameters while preserving interpolation accuracies.

Original languageEnglish (US)
Article number64
JournalAstrophysical Journal
Volume938
Issue number1
DOIs
StatePublished - Oct 1 2022

Funding

We thank Monica Gallegos-Garcia for useful conversation during the development of psy-cris. K.A.R. is supported by the Gordon and Betty Moore Foundation (PI Kalogera, grant award GBMF8477). K.A.R. also thanks the LSSTC Data Science Fellowship Program, which is funded by LSSTC, NSF Cybertraining grant No. 1829740, the Brinson Foundation, and the Moore Foundation; their participation in the program has benefited this work. J.J.A. acknowledges support from CIERA and Northwestern University through a Postdoctoral Fellowship. P.M. acknowledges support from the FWO junior postdoctoral fellowship No. 12ZY520N. C.P.L.B. and Z.D. acknowledge support from the CIERA Board of Visitors Research Professorship. V.K. was partially supported through a CIFAR Senior Fellowship and a Guggenheim Fellowship. S.B., T.F., K.K., D.M., Z.X., and E.Z. were supported by a Swiss National Science Foundation Professorship grant (PI Fragos, project No. PP00P2 176868). K.K. was partially supported by the Federal Commission for Scholarships for Foreign Students for the Swiss Government Excellence Scholarship (ESKAS No. 2021.0277). Z.X. was supported by the Chinese Scholarship Council (CSC). The computations were performed at Northwestern University on the Trident computer cluster (funded by the GBMF8477 award). This research was supported in part through the computational resources and staff contributions provided for the Quest high-performance computing facility at Northwestern University, which is jointly supported by the Office of the Provost, the Office for Research, and Northwestern University Information Technology.

ASJC Scopus subject areas

  • Astronomy and Astrophysics
  • Space and Planetary Science

Fingerprint

Dive into the research topics of 'Active Learning for Computationally Efficient Distribution of Binary Evolution Simulations'. Together they form a unique fingerprint.

Cite this