TY - JOUR
T1 - Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters
AU - Agrawal, Ankit
AU - Deshpande, Parijat D.
AU - Cecen, Ahmet
AU - Basavarsu, Gautham P.
AU - Choudhary, Alok N.
AU - Kalidindi, Surya R.
N1 - Funding Information:
Tbe authors are grateful to NIMS to make the raw data on fatigue steel strength publicly available. This work is supported in part by the following grants: NSF awards CCF-0833131, CNS-0830927, IIS-0905205, CCF-0938000, CCF-1029166, ACI-1144061, and IIS-1343639; DOE awards DE-FG02-08ER25848, DE-SC0001283, DE-SC0005309, DESC0005340, and DESC0007456; AFOSR award FA9550-12-1-0458.
Funding Information:
Tbe authors are grateful to NIMS to make the raw data on fatigue steel strength publicly available. This work is supported in part by the following grants: NSF awards CCF-0833131, CNS-0830927, IIS-0905205, CCF-0938000, CCF-1029166, ACI-1144061, and IIS-1343639; DOE awards DE-FG02-08ER25848, DE-SC0001283, DE-SC0005309, DESC0005340, and DESC0007456; AFOSR award FA9550-12-1-0458.
Publisher Copyright:
© 2014, Agrawal et al.; licensee Springer.
PY - 2014/12/1
Y1 - 2014/12/1
N2 - This paper describes the use of data analytics tools for predicting the fatigue strength of steels. Several physics-based as well as data-driven approaches have been used to arrive at correlations between various properties of alloys and their compositions and manufacturing process parameters. Data-driven approaches are of significant interest to materials engineers especially in arriving at extreme value properties such as cyclic fatigue, where the current state-of-the-art physics based models have severe limitations. Unfortunately, there is limited amount of documented success in these efforts. In this paper, we explore the application of different data science techniques, including feature selection and predictive modeling, to the fatigue properties of steels, utilizing the data from the National Institute for Material Science (NIMS) public domain database, and present a systematic end-to-end framework for exploring materials informatics. Results demonstrate that several advanced data analytics techniques such as neural networks, decision trees, and multivariate polynomial regression can achieve significant improvement in the prediction accuracy over previous efforts, with R2 values over 0.97. The results have successfully demonstrated the utility of such data mining tools for ranking the composition and process parameters in the order of their potential for predicting fatigue strength of steels, and actually develop predictive models for the same.
AB - This paper describes the use of data analytics tools for predicting the fatigue strength of steels. Several physics-based as well as data-driven approaches have been used to arrive at correlations between various properties of alloys and their compositions and manufacturing process parameters. Data-driven approaches are of significant interest to materials engineers especially in arriving at extreme value properties such as cyclic fatigue, where the current state-of-the-art physics based models have severe limitations. Unfortunately, there is limited amount of documented success in these efforts. In this paper, we explore the application of different data science techniques, including feature selection and predictive modeling, to the fatigue properties of steels, utilizing the data from the National Institute for Material Science (NIMS) public domain database, and present a systematic end-to-end framework for exploring materials informatics. Results demonstrate that several advanced data analytics techniques such as neural networks, decision trees, and multivariate polynomial regression can achieve significant improvement in the prediction accuracy over previous efforts, with R2 values over 0.97. The results have successfully demonstrated the utility of such data mining tools for ranking the composition and process parameters in the order of their potential for predicting fatigue strength of steels, and actually develop predictive models for the same.
KW - Data mining
KW - Materials informatics
KW - Processing-property linkages
KW - Regression analysis
UR - http://www.scopus.com/inward/record.url?scp=85075417319&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85075417319&partnerID=8YFLogxK
U2 - 10.1186/2193-9772-3-8
DO - 10.1186/2193-9772-3-8
M3 - Article
AN - SCOPUS:85075417319
SN - 2193-9764
VL - 3
SP - 90
EP - 108
JO - Integrating Materials and Manufacturing Innovation
JF - Integrating Materials and Manufacturing Innovation
IS - 1
ER -