TY - GEN
T1 - Multiple gene sets for cancer classification using gene range selection based on random forest
AU - Moorthy, Kohbalan
AU - Bin Mohamad, Mohd Saberi
AU - Deris, Safaai
PY - 2013
Y1 - 2013
N2 - The advancement of microarray technology allows obtaining genetic information from cancer patients, as computational data and cancer classification through computation software, has become possible. Through gene selection, we can identify certain numbers of informative genes that can be grouped into a smaller sets or subset of genes; which are informative genes taken from the initial data for the purpose of classification. In most available methods, the amount of genes selected in gene subsets are dependent on the gene selection technique used and cannot be fine-tuned to suit the requirement for particular number of genes. Hence, a proposed technique known as gene range selection based on a random forest method allows selective subset for better classification of cancer datasets. Our results indicate that various gene sets assist in increasing the overall classification accuracy of the cancer related datasets, as the amount of genes can be further scrutinized to create the best subset of genes. Moreover, it can assist the gene-filtering technique for further analysis of the microarray data in gene network analysis, gene-gene interaction analysis and many other related fields.
AB - The advancement of microarray technology allows obtaining genetic information from cancer patients, as computational data and cancer classification through computation software, has become possible. Through gene selection, we can identify certain numbers of informative genes that can be grouped into a smaller sets or subset of genes; which are informative genes taken from the initial data for the purpose of classification. In most available methods, the amount of genes selected in gene subsets are dependent on the gene selection technique used and cannot be fine-tuned to suit the requirement for particular number of genes. Hence, a proposed technique known as gene range selection based on a random forest method allows selective subset for better classification of cancer datasets. Our results indicate that various gene sets assist in increasing the overall classification accuracy of the cancer related datasets, as the amount of genes can be further scrutinized to create the best subset of genes. Moreover, it can assist the gene-filtering technique for further analysis of the microarray data in gene network analysis, gene-gene interaction analysis and many other related fields.
KW - Cancer Classification
KW - Gene Expression
KW - Gene Selection
KW - Microarray Data
KW - Random Forest
UR - http://www.scopus.com/inward/record.url?scp=84874606619&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84874606619&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-36546-1_40
DO - 10.1007/978-3-642-36546-1_40
M3 - Conference contribution
AN - SCOPUS:84874606619
SN - 9783642365454
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 385
EP - 393
BT - Intelligent Information and Database Systems - 5th Asian Conference, ACIIDS 2013, Proceedings
T2 - 5th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2013
Y2 - 18 March 2013 through 20 March 2013
ER -