Analysis and visualization of gene expression data using biclustering: A comparative study
In the last few years the gene expression microarray technology has become a central tool in the field of functional genomics in which the expression levels of thousands of genes in a biological sample are determined in a single experiment. Several clustering and biclustering methods have been introduced to analyze the gene expression data by identifying the similar patterns and grouping genes into subsets that share biological significance. However, it is not clear how the different methods compare with each other with respect to the biological relevance of the biclusters and clusters, as well as with other characteristics such as robustness and predictability. This research described the development of an automatic comparative tool called BicAT-plus that was designed to help researchers in evaluating the results of different biclustering methods. It also compared the results against each other and allowed a comparison of results via convenient graphical displays. BicAT-plus incorporates a reasonable biological comparative methodology based on the enrichment of the output biclusters with gene ontology functional categories. No exact algorithm can be considered the optimum one. Instead, biclustering algorithms can be used as integrated techniques to highlight the most enriched biclusters that help biologists to draw biological prediction about the unknown genes.
Key words: Bioinformatics, functional genomics, gene expression analysis, microarrays data, comparison, clustering, biclustering, functional analysis, gene ontology.