TY - GEN
T1 - Pathway-Based Analysis Using SVM-RFE for Gene Selection and Classification
AU - Rahman, Nurazreen Afiqah A.
AU - Nasarudin, Nurul Athirah
AU - Mohamad, Mohd Saberi
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
PY - 2024
Y1 - 2024
N2 - The pathway-based analysis is one method for selecting and classifying genes by incorporating pathway information. Integration of pathway knowledge into microarray data significantly advances researchers in the analysis of complex diseases. Microarray data involves thousands of genes to be selected, and therefore, a suitable method to eliminate noisy and uninformative genes is needed. Selecting significant genes for a specific disease is crucial to identifying genes highly related to disease production. Previous research shows that using both pathway information and gene expression data is more significant in disease identification. Therefore, pathway-based analysis using Support Vector Machine Recursive Feature Elimination (SVM-RFE) is introduced in this research to identify significant genes associated with analyzing the targeted phenotype. The datasets involved in this research are lung cancer and gender dataset. The results from the proposed method performed better than previous work as the significant genes are selected from the highest rank of genes in the highest rank of the pathway. The performance of the proposed method was evaluated using 10-fold cross-validation in terms of accuracy. Finally, a biological validation was conducted on selected genes in the top 5 pathways based on biological literature.
AB - The pathway-based analysis is one method for selecting and classifying genes by incorporating pathway information. Integration of pathway knowledge into microarray data significantly advances researchers in the analysis of complex diseases. Microarray data involves thousands of genes to be selected, and therefore, a suitable method to eliminate noisy and uninformative genes is needed. Selecting significant genes for a specific disease is crucial to identifying genes highly related to disease production. Previous research shows that using both pathway information and gene expression data is more significant in disease identification. Therefore, pathway-based analysis using Support Vector Machine Recursive Feature Elimination (SVM-RFE) is introduced in this research to identify significant genes associated with analyzing the targeted phenotype. The datasets involved in this research are lung cancer and gender dataset. The results from the proposed method performed better than previous work as the significant genes are selected from the highest rank of genes in the highest rank of the pathway. The performance of the proposed method was evaluated using 10-fold cross-validation in terms of accuracy. Finally, a biological validation was conducted on selected genes in the top 5 pathways based on biological literature.
KW - Artificial intelligence
KW - Bioinformatics
KW - Data science
KW - Gene selection
KW - Pathway-based analysis
KW - Support vector machine recursive feature elimination (SVM-RFE)
UR - http://www.scopus.com/inward/record.url?scp=85190682643&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85190682643&partnerID=8YFLogxK
U2 - 10.1007/978-981-99-9018-4_27
DO - 10.1007/978-981-99-9018-4_27
M3 - Conference contribution
AN - SCOPUS:85190682643
SN - 9789819990177
T3 - Smart Innovation, Systems and Technologies
SP - 369
EP - 379
BT - AI Technologies and Virtual Reality - Proceedings of 7th International Conference on Artificial Intelligence and Virtual Reality AIVR 2023
A2 - Nakamatsu, Kazumi
A2 - Patnaik, Srikanta
A2 - Kountchev, Roumen
PB - Springer Science and Business Media Deutschland GmbH
T2 - 7th International Conference on Artificial Intelligence and Virtual Reality, AIVR 2023
Y2 - 21 July 2023 through 23 July 2023
ER -