Efficient Maximal Biclique Enumeration on Large Signed Bipartite Graphs

J Wang, J Yang, Z Gu, D Ouyang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
J Wang, J Yang, Z Gu, D Ouyang, Z Tian, X Lin
IEEE Transactions on Knowledge and Data Engineering, 2024ieeexplore.ieee.org
In this paper, we study the problem of maximal biclique enumeration on large signed
bipartite graphs. Given a signed bipartite graph, a parameter, our goal is to efficiently
enumerate all maximal-bicliques in, where a maximal-biclique is a complete subgraph of
with (1) the proportion of positive neighbors for each vertex in is at least, and (2) is not
contained in another biclique, while also satisfies (1). This problem has many applications,
such as biclustering for genes, recommendation of similar groups, collaboration in …
In this paper, we study the problem of maximal biclique enumeration on large signed bipartite graphs. Given a signed bipartite graph , a parameter , our goal is to efficiently enumerate all maximal -bicliques in , where a maximal -biclique is a complete subgraph of with (1) the proportion of positive neighbors for each vertex in is at least , and (2) is not contained in another biclique , while also satisfies (1). This problem has many applications, such as biclustering for genes, recommendation of similar groups, collaboration in communities, etc. However, it is computationally challenging due to its #P-completeness. Besides, we prove that even determining the maximality of a -biclique is NP-hard. To the best of our knowledge, there is no efficient and scalable solution to this problem in the literature. In this paper, we first propose a branch-and-bound framework, namely , which enumerates all maximal -bicliques in a depth-first manner. Then, we develop three effective optimizations to improve the performance of . (1) The local information of each search space is utilized to enhance the pruning capacity. (2) When expanding the partial biclique, we always focus on the side with fewer candidates first, by which fruitless search branches can be skipped early. (3) We implement with efficient array reordering techniques and set intersection strategy. To further accelerate the computation, we introduce useful graph reduction techniques. Comprehensive performance studies on 10 real datasets demonstrate that our proposals can significantly outperform the baseline methods by up to 3 orders of magnitude.
ieeexplore.ieee.org
Showing the best result for this search. See all results