TY - GEN
T1 - Classification of colorectal cancer using clustering and feature selection approaches
AU - Nies, Hui Wen
AU - Daud, Kauthar Mohd
AU - Remli, Muhammad Akmal
AU - Mohamad, Mohd Saberi
AU - Deris, Safaai
AU - Omatu, Sigeru
AU - Kasim, Shahreen
AU - Sulong, Ghazali
N1 - Publisher Copyright:
© Springer International Publishing AG 2017.
PY - 2017
Y1 - 2017
N2 - Accurate cancer classification and responses to treatment are important in clinical cancer research since cancer acts as a family of gene-based diseases. Microarray technology has widely developed to measure gene expression level changes under normal and experimental conditions. Normally, gene expression data are high dimensional and characterized by small sample sizes. Thus, feature selection is needed to find the smallest number of informative genes and improve the classification accuracy and the biological interpretability results. Due to some feature selection methods neglect the interactions among genes, thus, clustering is used to group the similar genes together. Besides, the quality of the selected data can determine the effectiveness of the classifiers. This research proposed clustering and feature selection approaches to classify the gene expression data of colorectal cancer. Subsequently, a feature selection approach based on centroid clustering provide higher classification accuracy compared with other approaches.
AB - Accurate cancer classification and responses to treatment are important in clinical cancer research since cancer acts as a family of gene-based diseases. Microarray technology has widely developed to measure gene expression level changes under normal and experimental conditions. Normally, gene expression data are high dimensional and characterized by small sample sizes. Thus, feature selection is needed to find the smallest number of informative genes and improve the classification accuracy and the biological interpretability results. Due to some feature selection methods neglect the interactions among genes, thus, clustering is used to group the similar genes together. Besides, the quality of the selected data can determine the effectiveness of the classifiers. This research proposed clustering and feature selection approaches to classify the gene expression data of colorectal cancer. Subsequently, a feature selection approach based on centroid clustering provide higher classification accuracy compared with other approaches.
KW - Artificial intelligence
KW - Bioinformatics
KW - Cancer classification
KW - Clustering
KW - Feature selection
KW - Gene expression data
UR - http://www.scopus.com/inward/record.url?scp=85025141998&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85025141998&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-60816-7_8
DO - 10.1007/978-3-319-60816-7_8
M3 - Conference contribution
AN - SCOPUS:85025141998
SN - 9783319608150
T3 - Advances in Intelligent Systems and Computing
SP - 58
EP - 65
BT - 11th International Conference on Practical Applications of Computational Biology and Bioinformatics, 2017
A2 - Rocha, Miguel
A2 - De Paz, Juan F.
A2 - Pinto, Tiago
A2 - Fdez-Riverola, Florentino
A2 - Mohamad, Mohd Saberi
PB - Springer Verlag
T2 - 11th International Conference on Practical Applications of Computational Biology and Bioinformatics, PACBB 2017
Y2 - 21 June 2017 through 23 June 2017
ER -