Skip to main content
eScholarship
Open Access Publications from the University of California

UC Irvine

UC Irvine Electronic Theses and Dissertations bannerUC Irvine

Potentially near-optimal community discovery via stochastic graphlet sampling

Creative Commons 'BY-NC-SA' version 4.0 license
Abstract

Graph theory has extensive applications across various fields, such as social science, biology, and physics. One prominent graph-related task is the search for subgraphs possessing significantly higher edge densities than the mean; such subgraphs are referred to as communities. Community detection is an NP-hard problem, and existing algorithms addressing this problem are categorized between those that try to find a partition of non-overlapping communities, and those that find overlapping communities.

In this work, we reimagine the definition of community and propose a novel algorithm for overlapping community detection based on graphlet sampling using BLANT, a subgraph sampling method developed at UCI by the Wayne Hayes's Lab. By adopting a definition that emphasizes uniformly dense subgraphs and allows for edges extending beyond the community boundaries, our algorithm offers an extensive collection of dense overlapping communities. Furthermore, it introduces a community overlap graph, providing users with insights into the degree of overlap and empowering them to identify relevant communities based on their specific use cases. Our algorithm, BLANT-clusters, demonstrates the ability to discover larger and denser communities compared to existing state-of-the-art methods. We discuss the strengths and weaknesses of our algorithm and provide a comparative analysis with current approaches. Finally, we conclude by outlining potential future work and applications for subsequent iterations of this method, highlighting the potential capability of achieving near-optimal community discovery.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View