Publications

Publications

Reporting and analyzing alternative clustering solutions by employing multi-objective genetic algorithm and conducting experiments on cancer data

By:
Contributors: Reda Alhajj, PhD, Mohamad Elzohbi, Peter Peng

Knowledge-Based Systems 56 (2004) 108-122

Peter Penga, Omer Addama, Mohamad Elzohbia, Sibel T. Özyerb, Ahmad Elhajjc, Shang Gaoa, Yimin Liua, Tansel Özyerd, Mehmet Kayae, Mick Ridleyc, Jon Roknea, Reda Alhajja, f

Abstract

Clustering is an essential research problem which has received considerable attention in the research community for decades. It is a challenge because there is no unique solution that fits all problems and satisfies all applications. We target to get the most appropriate clustering solution for a given application domain. In other words, clustering algorithms in general need prior specification of the number of clusters, and this is hard even for domain experts to estimate especially in a dynamic environment where the data changes and/or become available incrementally. In this paper, we described and analyze the effectiveness of a robust clustering algorithm which integrates multi-objective genetic algorithm into a framework capable of producing alternative clustering solutions; it is called Multi-objective K-Means Genetic Algorithm (MOKGA). We investigate its application for clustering a variety of datasets, including microarray gene expression data. The reported results are promising. Though we concentrate on gene expression and mostly cancer data, the proposed approach is general enough and works equally to cluster other datasets as demonstrated by the two datasets Iris and Ruspini. After running MOKGA, a pareto-optimal front is obtained, and gives the optimal number of clusters as a solution set. The achieved clustering results are then analyzed and validated under several cluster validity techniques proposed in the literature. As a result, the optimal clusters are ranked for each validity index. We apply majority voting to decide on the most appropriate set of validity indexes applicable to every tested dataset. The proposed clustering approach is tested by conducting experiments using seven well cited benchmark data sets. The obtained results are compared with those reported in the literature to demonstrate the applicability and effectiveness of the proposed approach.

Download PDF

 

goes to…APCaRI member Russ Greiner

Image of DREAM challenge winners, Russ Greiner pictured on far left.

Dr. Russ Greiner, Canada CIFAR AI Chair, Fellow-in-Residence at Amii, University of Alberta Professor, and APCaRI member, received the CAIAC Lifetime Achievement Award announced at the Canadian AI Conference on May 27, 2021. This the highest honour bestowed by CAIAC, given in recognition to researchers who have distinguished themselves through outstanding research excellence in AI during the course of their academic career. APCaRI congratulates Russ Greiner for his well-deserved CAIAC Lifetime Achievement Award!

“Using machine learning techniques to produce effective, evidence-based personalized treatment”

The main foci of Russ Greiner’s current work are (1) bioinformatics and medical informatics; (2) learning and using effective probabilistic models and (3) formal foundations of learnability. He has published over 200 refereed papers and patents, most in the areas of machine learning and knowledge representation, including 4 that have been awarded Best Paper prizes.

One of these four papers was an entry into an international machine learning competition hosted by DREAM, an open-science effort dedicated to improving health and health care through crowdsourcing problem-solving. DREAM’s challenge was to develop an algorithm to predict which prostate cancer patients would respond to certain treatments and which would follow the medication regimen. The algorithm could be used by clinicians to help chose the best treatment plans for the patient.

Greiner and a team of students tied for the top place in the competition against over 50 teams from around the world. Then the winners collaborated to create an even better solution to the problem!

 

 

 

 

 

 

- Perrin Beatty