Multiple gene sets for cancer classification using gene range selection based on random forest

Kohbalan Moorthy, Mohd Saberi Bin Mohamad, Safaai Deris

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Citations (Scopus)

Abstract

The advancement of microarray technology allows obtaining genetic information from cancer patients, as computational data and cancer classification through computation software, has become possible. Through gene selection, we can identify certain numbers of informative genes that can be grouped into a smaller sets or subset of genes; which are informative genes taken from the initial data for the purpose of classification. In most available methods, the amount of genes selected in gene subsets are dependent on the gene selection technique used and cannot be fine-tuned to suit the requirement for particular number of genes. Hence, a proposed technique known as gene range selection based on a random forest method allows selective subset for better classification of cancer datasets. Our results indicate that various gene sets assist in increasing the overall classification accuracy of the cancer related datasets, as the amount of genes can be further scrutinized to create the best subset of genes. Moreover, it can assist the gene-filtering technique for further analysis of the microarray data in gene network analysis, gene-gene interaction analysis and many other related fields.

Original languageEnglish
Title of host publicationIntelligent Information and Database Systems - 5th Asian Conference, ACIIDS 2013, Proceedings
Pages385-393
Number of pages9
EditionPART 1
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event5th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2013 - Kuala Lumpur, Malaysia
Duration: Mar 18 2013Mar 20 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume7802 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2013
Country/TerritoryMalaysia
CityKuala Lumpur
Period3/18/133/20/13

Keywords

  • Cancer Classification
  • Gene Expression
  • Gene Selection
  • Microarray Data
  • Random Forest

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Multiple gene sets for cancer classification using gene range selection based on random forest'. Together they form a unique fingerprint.

Cite this