The Classification of HLA Supertypes by GRID/CPCA and Hierarchical Clustering Methods
| Abstract |
|
|
Biological experiments often produce enormous amount of data, which are usually analyzed by data clustering. Cluster analysis
refers to statistical methods that are used to assign data with similar properties into several smaller, more meaningful groups.
Two commonly used clustering techniques are introduced in the following section: principal component analysis (PCA) and hierarchical
clustering. PCA calculates the variance between variables and groups them into a few uncorrelated groups or principal components
(PCs) that are orthogonal to each other. Hierarchical clustering is carried out by separating data into many clusters and
merging similar clusters together. Here, we use an example of human leukocyte antigen (HLA) supertype classification to demonstrate
the usage of the two methods. Two programs, Generating Optimal Linear Partial Least Square Estimations (GOLPE) and Sybyl,
are used for PCA and hierarchical clustering, respectively. However, the reader should bear in mind that the methods have
been incorporated into other software as well, such as SIMCA, statistiXL, and R.
Affiliation(s): (3) Computational Biology Group, John Innes Centre, Norwich, UK
(4) The Jenner Institute, University of Oxford, Berkshire, UK
(4) The Jenner Institute, University of Oxford, Berkshire, UK
Series: Methods in Molecular Biology | Volume: 409 | Pub. Date: Jun-21-2007 | Page Range: 143-154 | DOI: 10.1007/978-1-60327-118-9_9
Subject: Immunology
Comments (Loading...) |
||
Loading... |






















