A novel dimensionality reduction technique based on independent component analysis for modeling microarray gene expression data

Han Liu*, Rafal Kustra, Ji Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

DNA microarray experiments generating thousands of gene expression measurements, are being used to gather information from tissue and cell samples regarding gene expression differences that will be useful in diagnosing disease. But one challenge of microarray studies is the fact that the number n of samples collected is relatively small compared to the number p of genes per sample which are usually in thousands. In statistical terms this very large number of predictors compared to a small number of samples or observations makes the classification problem difficult. This is known as the "curse of dimensionality problem". An efficient way to solve this problem is by using dimensionality reduction techniques. Principle Component Analysis(PCA) is a leading method for dimensionality reduction of gene expression data which is optimal in the sense of least square error. In this paper we propose a new dimensionality reduction technique for specific bioinformatics applications based on Independent component Analysis(ICA). Being able to exploit higher order statistics to identify a linear model result, this ICA based dimensionality reduction technique outperforms PCA from both statistical and biological significance aspects. We present experiments on NCI 60 dataset to show this result.

Original languageEnglish (US)
Title of host publicationProceedings of the International Conference on Artificial Intelligence, IC-AI'04 and Proceedings of the International Conference on Machine Learning; Models, Technologies and Applications, MLMTA'04)
EditorsH.R. Arabnia, M. Youngsong
Pages1133-1139
Number of pages7
Volume2
StatePublished - Dec 1 2004
EventProceedings of the International Conference on Artificial Intelligence, IC-AI'04 - Las Vegas, NV, United States
Duration: Jun 21 2004Jun 24 2004

Other

OtherProceedings of the International Conference on Artificial Intelligence, IC-AI'04
Country/TerritoryUnited States
CityLas Vegas, NV
Period6/21/046/24/04

Keywords

  • Dimensionality reduction
  • Gene expression data
  • Independent component analysis
  • Latent regulatory factors

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'A novel dimensionality reduction technique based on independent component analysis for modeling microarray gene expression data'. Together they form a unique fingerprint.

Cite this