A visual analysis approach for data imputation via multi-party tabular data correlation strategies

Haiyang ZHU, Dongming HAN, Jiacheng PAN, Yating WEI, Yingchaojie FENG, Luoxuan WENG, Ketian MAO, Yuankai XING, Jianshu LV, Qiucheng WAN, Wei CHEN

Front. Inform. Technol. Electron. Eng, 2024, Vol. 25, Issue 3: 398-414. DOI: 10.1631/FITEE.2300480

Abstract

Data imputation is an essential pre-processing task for data governance that aims to fill in incomplete data. However, conventional data imputation methods work on isolated tabular data, so they can only partly alleviate data incompleteness, and they fail to achieve the best balance between accuracy and efficiency. In this paper, we present a novel visual analysis approach for data imputation. We first develop a multi-party tabular data association strategy that uses intelligent algorithms to identify similar columns and establish column correlations across multiple tables. We then perform an initial imputation of the incomplete data using correlated data entries from other tables. Finally, we develop a visual analysis system for refining the data imputation candidates. This interactive system combines the multi-party data imputation approach with expert knowledge, giving users a better understanding of the relational structure of the data. The combination significantly improves the accuracy and efficiency of data imputation, and thereby the quality of data governance and the intrinsic value of data assets. Experimental validation and user surveys demonstrate that the method supports users in verifying and judging the associated columns and similar rows using their domain knowledge.
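
The abstract does not say which algorithms identify similar columns, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that column similarity can be approximated by Jaccard overlap of cell values and that rows in different tables are linked through a shared key column. The names `jaccard`, `match_columns`, and `impute`, the threshold of 0.5, and the toy tables are all hypothetical.

```python
from typing import Dict, List, Optional

# A table maps column name -> list of cell values; None marks a missing cell.
Table = Dict[str, List[Optional[str]]]


def jaccard(a: List[Optional[str]], b: List[Optional[str]]) -> float:
    """Jaccard similarity between the sets of non-missing values of two columns."""
    sa = {v for v in a if v is not None}
    sb = {v for v in b if v is not None}
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def match_columns(left: Table, right: Table, threshold: float = 0.5) -> Dict[str, str]:
    """Map each left column to its most similar right column, if any reaches the threshold."""
    matches: Dict[str, str] = {}
    for lname, lvals in left.items():
        best_name, best_score = None, threshold
        for rname, rvals in right.items():
            score = jaccard(lvals, rvals)
            if score >= best_score:
                best_name, best_score = rname, score
        if best_name is not None:
            matches[lname] = best_name
    return matches


def impute(left: Table, right: Table, key: str, matches: Dict[str, str]) -> Table:
    """Fill missing cells of `left` from the `right` row that shares the same key value."""
    right_key = matches[key]
    row_of = {k: i for i, k in enumerate(right[right_key]) if k is not None}
    filled = {name: list(vals) for name, vals in left.items()}  # copy, do not mutate input
    for lname, rname in matches.items():
        for i, value in enumerate(filled[lname]):
            k = left[key][i]
            if value is None and k in row_of:
                filled[lname][i] = right[rname][row_of[k]]
    return filled


# Toy data (assumed for illustration): two parties hold overlapping records.
left = {"id": ["a1", "a2", "a3", "a4"], "city": ["Hangzhou", None, "Beijing", None]}
right = {"uid": ["a1", "a2", "a3", "a5"],
         "location": ["Hangzhou", "Shanghai", "Beijing", "Shenzhen"]}

matches = match_columns(left, right)
filled = impute(left, right, key="id", matches=matches)
print(matches)
print(filled["city"])
```

In this sketch the second missing `city` cell stays empty because key `a4` has no counterpart in the other table. Cells like that, along with borderline column matches, are the kind of candidates the abstract leaves to expert review in the visual analysis system.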

Keywords

Data governance / Data incompleteness / Data imputation / Data visualization / Interactive visual analysis

Cite this article

Haiyang ZHU, Dongming HAN, Jiacheng PAN, Yating WEI, Yingchaojie FENG, Luoxuan WENG, Ketian MAO, Yuankai XING, Jianshu LV, Qiucheng WAN, Wei CHEN. A visual analysis approach for data imputation via multi-party tabular data correlation strategies. Front. Inform. Technol. Electron. Eng, 2024, 25(3): 398‒414 https://doi.org/10.1631/FITEE.2300480

RIGHTS & PERMISSIONS

2024 Zhejiang University Press