Analyzing the Mutation Frequencies and Correlation of Genetic Diseases in Worldwide Populations Using Big Data Processing, Clustering, and Predictive Analytics

Kae Sawada, Michael W. Clark, Nabil Alshurafa, Mohammad Pourhomayoun

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

In this paper, we use Big Data processing and develop predictive analytics models to examine mutations associated with osteoporosis and cardiovascular disease (CVD). The dataset consists of the genomic information of over 2,500 individuals, collected from around the world. Data visualization revealed geographical/regional clustering patterns in these mutations. The visualized data shows a high correlation between a person's regional background and the occurrence of 35 single nucleotide polymorphisms (SNPs) that are specifically associated with osteoporosis and/or CVD. A predictive analytics model based on machine learning algorithms was developed to predict the risk of an individual manifesting osteoporosis in later life. The results of this predictive model confirmed the links between osteoporosis and cardiovascular-related parameters such as high-density lipoprotein (HDL) and systolic blood pressure (SBP), as determined by preceding studies.

Original language: English (US)
Title of host publication: Proceedings - 2017 International Conference on Computational Science and Computational Intelligence, CSCI 2017
Editors: Fernando G. Tinetti, Quoc-Nam Tran, Leonidas Deligiannidis, Mary Qu Yang, Hamid R. Arabnia
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1459-1464
Number of pages: 6
ISBN (Electronic): 9781538626528
DOIs: https://doi.org/10.1109/CSCI.2017.255
State: Published - Dec 4 2018
Event: 2017 International Conference on Computational Science and Computational Intelligence, CSCI 2017 - Las Vegas, United States
Duration: Dec 14 2017 → Dec 16 2017

Publication series

Name: Proceedings - 2017 International Conference on Computational Science and Computational Intelligence, CSCI 2017

Other

Other: 2017 International Conference on Computational Science and Computational Intelligence, CSCI 2017
Country: United States
City: Las Vegas
Period: 12/14/17 → 12/16/17

Keywords

  • 1000 Genome Project
  • Classifiers
  • Clustering
  • Data Visualization
  • Genome Wide Association Study (GWAS)
  • Machine Learning
  • Predictive Model
  • Supervised Learning
  • Osteoporosis

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Safety, Risk, Reliability and Quality

  • Cite this

    Sawada, K., Clark, M. W., Alshurafa, N., & Pourhomayoun, M. (2018). Analyzing the Mutation Frequencies and Correlation of Genetic Diseases in Worldwide Populations Using Big Data Processing, Clustering, and Predictive Analytics. In F. G. Tinetti, Q-N. Tran, L. Deligiannidis, M. Q. Yang, & H. R. Arabnia (Eds.), Proceedings - 2017 International Conference on Computational Science and Computational Intelligence, CSCI 2017 (pp. 1459-1464). [8561018] (Proceedings - 2017 International Conference on Computational Science and Computational Intelligence, CSCI 2017). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/CSCI.2017.255