ISBN-13: 9783639170245 / Angielski / Miękka / 2009 / 80 str.
Microarray expression data contain expression levels of a large number of genes and have been used in many scientific research and clinical studies. Due to its high dimensionalities, selecting a small number of genes has shown to be beneficial for many tasks such as building prediction models for a particular disease or gene regulatory network discovery. Traditional gene selection methods, however, fail to take the class distribution into the selection process. In Biomedical science, it is very common to have microarray expression data severely biased having very small number of diseased samples. These biased sample sets require special attention from researchers for identification of genes responsible for a particular disease. In this work, we propose three feature filtering techniques, Higher Weight ReliefF, ReliefF with Differential Minority Repeat and ReliefF with Balanced Minority Repeat to identify genes responsible for fatal diseases from biased microarray expression data. Our solutions will help Bioinformatics, Computer Science and Biomedical Research groups to filter potentially hazardous genes in an efficient way.