The Isolation Principle of Clustering: Structural Characteristics and Implementation

Gregorius, Hans-Rolf

doi:10.1007/s10441-006-8255-3

The Isolation Principle of Clustering: Structural Characteristics and Implementation

Published: September 2006

Volume 54, pages 219–233, (2006)
Cite this article

Acta Biotheoretica Aims and scope Submit manuscript

Hans-Rolf Gregorius¹

44 Accesses
3 Citations
Explore all metrics

Abstract

The isolation principle rests on defining internal and external differentiation for each subset of at least two objects. Subsets with larger external than internal differentiation form isolated groups in the sense that they are internally cohesive and externally isolated. Objects that do not belong to any isolated group are termed solitary. The collection of all isolated groups and solitary objects forms a hierarchical (encaptic) structure. This ubiquitous characteristic of biological organization provides the motivation to identify universally applicable practical methods for the detection of such structure, to distinguish primary types of structure, to quantify their distinctiveness, and to simplify interpretation of structural aspects. A method implementing the isolation principle (by generating all isolated groups and solitary objects) is proven to be specified by single-linkage clustering. Basically, the absence of structure can be stated if no isolated groups exist, the condition for which is provided. Structures that allow for classifications in the sense of complete partitioning into disjoint isolated groups are characterized, and measures of distinctiveness of classification are developed. Among other primary types of structure, chaining (complete nesting) and ties (isolated groups without internal structure) are considered in more detail. Some biological examples for the interpretation of structure resulting from application of the isolation principle are outlined.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Arabie, P. and L.J. Hubert (1996). An overview of combinatorial data analysis. In: P., L.J. Arabie, Hubert and G. DeSoete (eds). pp. 5–63.
Arabie, P., L.J. Hubert and G. De Soete (eds.), (1996). Clustering and Classification. World Scientific, Singapore etc.
Google Scholar
Barthélemy, J.-P. and F. Brucker (2001). NP-hard approximation problems in overlapping clustering. Journal of Classification 18: 159–183.
Google Scholar
Estabrook, G.F. (1966). A mathematical model in graph theory for biological classification. Journal of Theoretical Biology 12: 297–310.
Article Google Scholar
Gordon, A.D. (1996). Hierarchical classification. In: P. Arabie and L.J. Hubert, G. De Soete (eds.),. pp. 65–121.
Gregorius, H.-R. (2004). The isolation approach to hierarchical clustering. Journal of Classification 21: 51–69.
Article Google Scholar
Jain, A.K. and R.C. Dubes (1988). Algorithms for Clustering Data. Prentice Hall.
Jardine, N.J. and R. Sibson (1971). Mathematical Taxonomy. John Wiley & Sons, London etc.
Google Scholar
Kaufman, L. and P.J. Rousseeuw (1990). Finding Groups in Data. An Introduction to Cluster analysis. John Wiley & Sons, New York etc.
Google Scholar
Ludwig, J.A. and J.F. Reynolds (1988). Statistical Ecology – A Primer on Methods and Computing. John Wiley & Sons, New York, etc.
Google Scholar
Milligan, G.W. (1996). Clustering validation: results and implications for applied analysis. In: P. Arabie, L.J. Hubert and G. De Soete (eds.),. pp. 341–375.
Muchnik, I.B. and I.A. Rybina (1989). Definitive conditions for isolation of classes in empiric classifications. Automatic Documentation and Mathematical Linguistics 23: 97–107.
Google Scholar
Olman, V., D. Xu and Y. Xu (2003). CUBIC: Identification of regulatory binding sites through data clustering. Journal of Bioinformatics and Computational Biology 1: 21–40.
Article Google Scholar
Prim, R.C. (1957). Shortest connection networks and some generalizations. Bell System Technical Journal 36: 1389–1401.
Google Scholar
Xu, Y., V. Olman and D. Xu (2002). Clustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees. Bioinformatics 18: 536–545.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institut für Forstgenetik und Forstpflanzenzüchtung, Universität Göttingen, Büsgenweg 2, 37077, Göttingen, Germany
Hans-Rolf Gregorius

Authors

Hans-Rolf Gregorius
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gregorius, HR. The Isolation Principle of Clustering: Structural Characteristics and Implementation. Acta Biotheor 54, 219–233 (2006). https://doi.org/10.1007/s10441-006-8255-3

Download citation

Received: 08 December 2005
Accepted: 04 May 2006
Issue Date: September 2006
DOI: https://doi.org/10.1007/s10441-006-8255-3

Key Words:

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Isolation Principle of Clustering: Structural Characteristics and Implementation

Abstract

Access this article

Similar content being viewed by others

Versatile Linkage: a Family of Space-Conserving Strategies for Agglomerative Hierarchical Clustering

A Heuristic Automatic Clustering Method Based on Hierarchical Clustering

Clustering Models

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Key Words:

Navigation

The Isolation Principle of Clustering: Structural Characteristics and Implementation

Abstract

Access this article

Similar content being viewed by others

Versatile Linkage: a Family of Space-Conserving Strategies for Agglomerative Hierarchical Clustering

A Heuristic Automatic Clustering Method Based on Hierarchical Clustering

Clustering Models

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Key Words:

Search

Navigation