A TRIZ-inspired bat algorithm for gene selection in cancer classification
Abstract
Gene expression data are expected to make a great contribution in the producing of efficient cancer diagnosis
and prognosis. Gene expression data are coded by large measured genes, and only of a few number of them carry
precious information for different classes of samples. Recently, several researchers proposed gene selection
methods based on metaheuristic algorithms for analysing and interpreting gene expression data. However, due
to large number of selected genes with limited number of patient's samples and complex interaction between
genes, many gene selection methods experienced challenges in order to approach the most relevant and reliable
genes. Hence, in this paper, a hybrid filter/wrapper, called rMRMR-MBA is proposed for gene selection problem.
In this method, robust Minimum Redundancy Maximum Relevancy (rMRMR) as filter to select the most promising genes and an modified bat algorithm (MBA) as search engine in wrapper approach is proposed to identify
a small set of informative genes. The performance of the proposed method has been evaluated using ten gene
expression datasets. For performance evaluation, MBA is evaluated by studying the convergence behaviour of
MBA with and without TRIZ optimisation operators. For comparative evaluation, the results of the proposed
rMRMR-MBA were compared against ten state-of-arts methods using the same datasets. The comparative study
demonstrates that the proposed method produced better results in terms of classification accuracy and number of
selected genes in two out of ten datasets and competitive results on the remaining datasets. In a nutshell, the
proposed method is able to produce very promising results with high classification accuracy which can be
considered a promising contribution for gene selection domain.
Volume
112Issue
1Collections
The following license files are associated with this item: