A Practical Data Quality Assessment Method for Raw Data in Vessel Operations
Gang Chen , Jie Cai , Niels Gorm Maly Rytter , Marie Lützen
Journal of Marine Science and Application ›› 2023, Vol. 22 ›› Issue (2) : 370 -380.
A Practical Data Quality Assessment Method for Raw Data in Vessel Operations
With the current revolution in Shipping 4.0, a tremendous amount of data is accumulated during vessel operations. Data quality (DQ) is becoming more and more important for the further digitalization and effective decision-making in shipping industry. In this study, a practical DQ assessment method for raw data in vessel operations is proposed. In this method, specific data categories and data dimensions are developed based on engineering practice and existing literature. Concrete validation rules are then formed, which can be used to properly divide raw datasets. Afterwards, a scoring method is used for the assessment of the data quality. Three levels, namely good, warning and alarm, are adopted to reflect the final data quality. The root causes of bad data quality could be revealed once the internal dependency among rules has been built, which will facilitate the further improvement of DQ in practice. A case study based on the datasets from a Danish shipping company is conducted, where the DQ variation is monitored, assessed and compared. The results indicate that the proposed method is effective to help shipping industry improve the quality of raw data in practice. This innovation research can facilitate shipping industry to set a solid foundation at the early stage of their digitalization journeys.
Data quality / Vessel operations / Shipping / Validation rules / Noon reports
| [1] |
Ahn K, Rakha H, Hill D (2008) Data quality white paper. Technical Report. United States. Federal Highway Administration. Office of Operations |
| [2] |
|
| [3] |
Bates MJ (2019) Understanding information retrieval systems: management, types, and standards. Auerbach Publications |
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
De Mauro A, Greco M, Grimaldi M (2015) What is big data? A consensual definition and a review of key research topics, in: AIP Conference Proceedings, American Institute of Physics, 97–104. https://doi.org/10.1063/1.4907823 |
| [11] |
|
| [12] |
|
| [13] |
Falge C, Otto B, Österle H (2012) Data quality requirements of collaborative business processes, in: 2012 IEEE 45th Hawaii International Conference on System Sciences, 4316–4325. https://doi.org/10.1109/HICSS.2012.8 |
| [14] |
FORCE Technology (2021) Onboard decision support system. URL: https://forcetechnology.com/en/services/onboard-decision-support-system |
| [15] |
|
| [16] |
Hermann M, Pentek T, Otto B (2016) Design principles for industrie 4.0 scenarios, in: 2016 49th Hawaii international conference on system sciences (HICSS), IEEE. pp. 3928–3937. https://doi.org/10.1109/HICSS.2016.488 |
| [17] |
|
| [18] |
|
| [19] |
Knight Sa, Burn J (2005) Developing a framework for assessing information quality on the world wide web. Informing Science 8 KONGSBERG (2021) KONGSBERG Vessel Performance. URL: https://www.kongsberg.com/digital/kognifaiecosystem/kognifai-marketplace/maritime/vessel-performance/ |
| [20] |
|
| [21] |
Liao CF, Davis GA (2012) Traffic data quality verification and sensor calibration for weigh-in-motion (wim) systems |
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
Røseth ØJ (2016) Integrating iec and iso information models into the s-100 common maritime data structure |
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
Tejay G, Dhillon G, Chin AG (2004) Data quality dimensions for information systems security: A theoretical exposition, in: Working Conference on Integrity and Internal Control in Information Systems, Springer. pp. 21–39 |
| [32] |
TORM (1889) TORM SHIPPING. URL: https://torm.com/ |
| [33] |
|
| [34] |
US Department of Transportation (2021) Bureau of Transportation Statistics. URL: http://ntl.bts.gov/lib/jpodocs/reptste/14058files/chap3.htm |
| [35] |
VPS (2021) Vessel Performance Solutions. URL: https://www.vpsolutions.dk/ |
| [36] |
|
| [37] |
Wang RY, Ziad M, Lee YW (2006) Data quality. volume 23. Springer Science & Business Media |
| [38] |
|
| [39] |
|
/
| 〈 |
|
〉 |