Budgeted spatial data acquisition: when coverage and connectivity matter
Wenzhe YANG , Shixun HUANG , Sheng WANG , Zhiyong PENG
Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (3) : 2003612
Budgeted spatial data acquisition: when coverage and connectivity matter
Data is undoubtedly becoming a commodity like oil, land, and labour in the 21st century. Although there have been many successful marketplaces for data trading, the existing data marketplaces lack consideration of the case where buyers want to acquire a collection of datasets (instead of one), and the overall spatial coverage and connectivity matter. In this paper, we make the first attempt to formulate this problem as Budgeted Maximum Coverage with Connectivity Constraint (BMCC), which aims to acquire a dataset collection with the maximum spatial coverage under a limited budget while maintaining spatial connectivity. To solve the problem, we propose two approximate algorithms with detailed theoretical guarantees and time complexity analysis, followed by two acceleration strategies to further improve the efficiency of the algorithm. Experiments are conducted on five real-world spatial dataset collections to verify the efficiency and effectiveness of our algorithms.
spatial data acquisition / data marketplace / budget / spatial coverage / spatial connectivity
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
Wang S, Sun Y, Musco C, Bao Z. Public transport planning: When transit network connectivity meets commuting demand. In: Proceedings of 2021 International Conference on Management of Data. 2021, 1906−1919 |
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
Agarwal A, Dahleh M, Sarkar T. A marketplace for data: an algorithmic solution. In: Proceedings of 2019 ACM Conference on Economics and Computation. 2019, 701−726 |
| [20] |
|
| [21] |
|
| [22] |
Koutris P, Upadhyaya P, Balazinska M, Howe B, Suciu D. Toward practical query pricing with QueryMarket. In: Proceedings of 2013 ACM SIGMOD International Conference on Management of Data. 2013, 613−624 |
| [23] |
|
| [24] |
|
| [25] |
Chen L, Koutris P, Kumar A. Towards model-based pricing for machine learning in a data marketplace. In: Proceedings of 2019 International Conference on Management of Data. 2019, 1535−1552 |
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
Chen C, Yuan Y, Wen Z, Wang G, Li A. GQP: a framework for scalable and effective graph query-based pricing. In: Proceedings of the 38th International Conference on Data Engineering. 2022, 1573−1585 |
| [32] |
Tong Y, Wang L, Zhou Z, Chen L, Du B, Ye J. Dynamic pricing in spatial crowdsourcing: A matching-based approach. In: Proceedings of 2018 International Conference on Management of Data. 2018, 773−788 |
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
Deep S, Koutris P. QIRANA: A framework for scalable query pricing. In: Proceedings of 2017 ACM International Conference on Management of Data. 2017, 699−713 |
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
Kempe D, Kleinberg J, Tardos É. Maximizing the spread of influence through a social network. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2003, 137−146 |
| [47] |
Tang Y, Xiao X, Shi Y. Influence maximization: near-optimal time complexity meets practical efficiency. In: Proceedings of 2014 ACM SIGMOD International Conference on Management of Data. 2014, 75−86 |
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
|
| [61] |
|
| [62] |
|
| [63] |
|
| [64] |
|
| [65] |
|
| [66] |
|
| [67] |
|
| [68] |
|
| [69] |
|
| [70] |
|
| [71] |
|
| [72] |
|
| [73] |
Peng J, Wang H, Li J, Gao H. Set-based similarity search for time series. In: Proceedings of 2016 International Conference on Management of Data. 2016, 2039−2052 |
| [74] |
|
| [75] |
|
| [76] |
|
| [77] |
Corral A, Manolopoulos Y, Theodoridis Y, Vassilakopoulos M. Closest pair queries in spatial databases. In: Proceedings of 2000 ACM SIGMOD International Conference on Management of Data. 2000, 189−200 |
| [78] |
|
| [79] |
|
Higher Education Press
/
| 〈 |
|
〉 |