ISBN-13: 9783659588280 / Angielski / Miękka / 2014 / 232 str.
Through Genome Wide Association Studies (GWAS) many SNP-complex disease relations have been investigated so far. GWAS presents high amount - high dimensional data and relations between SNPs, phenotypes and diseases are most likely to be nonlinear. In order to handle high volume-high dimensional data and to be able to find the nonlinear relations, data mining approaches are needed. In this work, a hybrid feature selection model of support vector machine and decision tree has been designed. This model also combines the genotype and phenotype information to increase the diagnostic performance. The model is tested on prostate cancer and melanoma data and shows promising results.
Through Genome Wide Association Studies (GWAS) many SNP-complex disease relations have been investigated so far. GWAS presents high amount - high dimensional data and relations between SNPs, phenotypes and diseases are most likely to be nonlinear. In order to handle high volume-high dimensional data and to be able to find the nonlinear relations, data mining approaches are needed. In this work, a hybrid feature selection model of support vector machine and decision tree has been designed. This model also combines the genotype and phenotype information to increase the diagnostic performance. The model is tested on prostate cancer and melanoma data and shows promising results.