14. Clustering Analysis of SAGE Transcription Profiles Using a Poisson Approach
| Abstract |
|
|
To gain insights into the biological function and relevance of genes using serial analysis of gene expression (SAGE) transcription
profiles, one essential method is to perform clustering analysis on genes. A successful clustering analysis depends on the
use of effective distance or similarity measures. For this purpose, by considering the specific properties of SAGE technology,
we modeled the SAGE data by Poisson statistics and developed two Poisson-based measures to assess similarity of gene expression
profiles. By employing these two distances into a K-means clustering procedure, we further developed a software package to
perform clustering analysis on SAGE data. The software implementing our Poisson-based algorithms can be downloaded from http://genome.dfci.harvard.edu/sager. Our algorithm is guaranteed to converge to a local maximum when Poisson likelihood-based measure is used. The results from
simulation and experimental mouse retina data demonstrate that the Poisson-based distances are more appropriate and reliable
for analyzing SAGE data compared to other commonly used distances or similarity measures.
Affiliation(s): (2) Department of Statistics, University of California at Berkeley, Berkeley, CA
(3) University of California at Berkeley, Berkeley, CA
(4) University of California at Berkeley, Berkeley, CA
(3) University of California at Berkeley, Berkeley, CA
(4) University of California at Berkeley, Berkeley, CA
Series: Methods in Molecular Biology | Volume: 387 | Pub. Date: Jun-05-2007 | Page Range: 185-198 | DOI: 10.1007/978-1-59745-454-4_14
Subject: Genetics/Genomics
Key Words: Clustering analysis - (Dis)similarity measures - Poisson statistics - K-means clustering - SAGE data.
Comments (Loading...) |
||
Loading... |





















