Lawrence Berkeley National Laboratory
Phenotype Clustering of Breast Epithelial Cells in Confocal Images based on Nuclear Protein
- Author(s): Long, Fuhui
- Peng, Hanchuan
- Sudar, Damir
- Levievre, Sophie A.
- Knowles, David W.
- et al.
Background: The distribution of the chromatin-associated proteins plays a key role in directing nuclear function. Previously, we developed an image-based method to quantify the nuclear distributions of proteins and showed that these distributions depended on the phenotype of human mammary epithelial cells. Here we describe a method that creates a hierarchical tree of the given cell phenotypes and calculates the statistical significance between them, based on the clustering analysis of nuclear protein distributions. Results: Nuclear distributions of nuclear mitotic apparatus protein were previously obtained for non-neoplastic S1 and malignant T4-2 human mammary epithelial cells cultured for up to 12 days. Cell phenotype was defined as S1 or T4-2 and the number of days in cultured. A probabilistic ensemble approach was used to define a set of consensus clusters from the results of multiple traditional cluster analysis techniques applied to the nuclear distribution data. Cluster histograms were constructed to show how cells in any one phenotype were distributed across the consensus clusters. Grouping various phenotypes allowed us to build phenotype trees and calculate the statistical difference between each group. The results showed that non-neoplastic S1 cells could be distinguished from malignant T4-2 cells with 94.19 percent accuracy; that proliferating S1 cells could be distinguished from differentiated S1 cells with 92.86 percent accuracy; and showed no significant difference between the various phenotypes of T4-2 cells corresponding to increasing tumor sizes. Conclusion: This work presents a cluster analysis method that can identify significant cell phenotypes, based on the nuclear distribution of specific proteins, with high accuracy.