Reachability-state encoding: bridging tradeoffs between the minimal storage and the fastest query for reachability query
Hu-Quan KANG , Xing-Li WANG , Zhou-Yang JIN , Yu-Ang DING , Yi-Ming LIU , Luo-Yi FU , Jia-Xin DING , Xin-Bing WANG
Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (10) : 2010502
Reachability-state encoding: bridging tradeoffs between the minimal storage and the fastest query for reachability query
Reachability query, which determines whether a path exists between any two nodes in a graph, has been the cornerstone of graph operation research for decades. Traditional algorithms, such as depth-first search and transitive closure, are often limited by either slow query or substantial storage. Extensive efforts have increasingly focused on designing reachability indexes to balance the tradeoff between index space and query time. However, existing solutions are limited to specific tradeoff points and lack theory for adjustable, quantifiable tradeoffs. In this work, we introduce Reachability-state Encoding (RE), which adjusts the number of encoded node pairs per chunk to affect compression ratio and decoding time, thus allowing for a smooth transition between the minimal storage and the fastest query. We establish theoretical upper bounds for the minimum encoding length of RE in relation to the size and entropy of reachability-states, which quantifies the limits of compression within the encoding process and ensures superior performance in dense graphs compared to traditional methods. We have implemented a prototype algorithm, RE-toy, to validate the effectiveness of RE. Through experiments conducted on both synthetic and real-world graphs, RE-toy consistently achieves numerous optimal tradeoff points in space-time, demonstrating robust performance across graphs of varying densities. RE thus provides a practical and adaptable approach for managing the balance between storage demands and computational efficiency in reachability queries.
reachability query / reachability index / reachability-state encoding / graph algorithm
| [1] |
|
| [2] |
Zhang C, Bonifati A, Özsu M T. An overview of reachability indexes on graphs. In: Proceedings of Companion of 2023 International Conference on Management of Data. 2023, 61−68 |
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
van Schaik S J, de Moor O. A memory efficient reachability data structure through bit vector compression. In: Proceedings of 2011 ACM SIGMOD International Conference on Management of Data. 2011, 913−924 |
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
in R, Xiang Y, Ruan N, Wang H. Efficiently answering reachability queries on very large directed graphs. In: Proceedings of 2008 ACM SIGMOD International Conference on MANAGEMENT of Data. 2008, 595−608 |
| [12] |
Trißl S, Leser U. Fast and practical indexing and querying of very large graphs. In: Proceedings of 2007 ACM SIGMOD International Conference on Management of Data. 2007, 845−856 |
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
Jin R, Xiang Y, Ruan N, Fuhry D. 3-HOP: a high-compression indexing scheme for reachability query. In: Proceedings of 2009 ACM SIGMOD International Conference on Management of Data. 2009, 813−826 |
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
Cheng J, Huang S, Wu H, Fu A W C. TF-Label: a topological-folding labeling scheme for reachability querying in a large graph. In: Proceedings of 2013 ACM SIGMOD International Conference on Management of Data. 2013, 193−204 |
| [22] |
|
| [23] |
Zhu A D, Lin W, Wang S, Xiao X. Reachability queries on large dynamic graphs: a total order approach. In: Proceedings of 2014 ACM SIGMOD International Conference on Management of Data. 2014, 1323−1334 |
| [24] |
|
| [25] |
Su J, Zhu Q, Wei H, Yu J X. Reachability querying: can it be even faster? IEEE Transactions on Knowledge and Data Engineering, 2017, 29(3): 683−697 |
| [26] |
Lyu Q, Li Y, He B, Gong B. DBL: efficient reachability queries on dynamic graphs. In: Proceedings of the 26th International Conference on Database Systems for Advanced Applications. 2021, 761−777 |
| [27] |
Zhang C, Bonifati A, Kapp H, Haprian V I, Lozi J P. A reachability index for recursive label-concatenated graph queries. In: Proceedings of the 39th IEEE International Conference on Data Engineering (ICDE). 2023, 67−81 |
| [28] |
Valstar L D J, Fletcher G H L, Yoshida Y. Landmark indexing for evaluation of label-constrained reachability queries. In: Proceedings of 2017 ACM International Conference on Management of Data. 2017, 345−358 |
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
Zhou J, Zhou S, Yu J X, Wei H, Chen Z, Tang X. Dag reduction: fast answering reachability queries. In: Proceedings of 2017 ACM International Conference on Management of Data. 2017, 375−390 |
| [34] |
Zhou J, Yu J X, Qiu Y, Tang X, Chen Z, Du M. Fast reachability queries answering based on RCN reduction (extended abstract). In: Proceedings of the 38th IEEE International Conference on Data Engineering (ICDE). 2022, 1543−1544 |
| [35] |
|
| [36] |
|
| [37] |
Williams V V, Xu Y, Xu Z, Zhou R. New bounds for matrix multiplication: from alpha to omega. In: Proceedings of 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). 2024, 3792−3835 |
| [38] |
|
| [39] |
|
| [40] |
|
Higher Education Press
/
| 〈 |
|
〉 |