Abstract
In the bioinformatics and clinical research areas, microarray technology has been widely used to distinguish a cancer dataset between normal and tumour samples. However, the high dimensionality of gene expression data affects the classification accuracy of an experiment. Thus, feature selection is needed to select informative genes and remove non-informative genes. Some of the feature selection methods, yet, ignore the interaction between genes. Therefore, the similar genes are clustered together and dissimilar genes are clustered in other groups. Hence, to provide a higher classification accuracy, this research proposed k-means clustering and infinite feature selection for identifying informative genes in the selected subset. This research has been applied to colorectal cancer and small round blue cell tumors datasets. Eventually, this research successfully obtained higher classification accuracy than the previous work.
| Original language | English |
|---|---|
| Title of host publication | 11th International Conference on Practical Applications of Computational Biology and Bioinformatics, 2017 |
| Editors | Miguel Rocha, Juan F. De Paz, Tiago Pinto, Florentino Fdez-Riverola, Mohd Saberi Mohamad |
| Publisher | Springer Verlag |
| Pages | 50-57 |
| Number of pages | 8 |
| ISBN (Print) | 9783319608150 |
| DOIs | |
| Publication status | Published - 2017 |
| Externally published | Yes |
| Event | 11th International Conference on Practical Applications of Computational Biology and Bioinformatics, PACBB 2017 - Porto, Portugal Duration: Jun 21 2017 → Jun 23 2017 |
Publication series
| Name | Advances in Intelligent Systems and Computing |
|---|---|
| Volume | 616 |
| ISSN (Print) | 2194-5357 |
Conference
| Conference | 11th International Conference on Practical Applications of Computational Biology and Bioinformatics, PACBB 2017 |
|---|---|
| Country/Territory | Portugal |
| City | Porto |
| Period | 6/21/17 → 6/23/17 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Artificial intelligence
- Cancer classification
- Gene expression data
- Infinite feature selection
- Informative genes
- K-means clustering
- Small round blue cell tumors
ASJC Scopus subject areas
- Control and Systems Engineering
- General Computer Science
Fingerprint
Dive into the research topics of 'K-means clustering with infinite feature selection for classification tasks in gene expression data'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS