Introduction
Background
With the development of the Internet of Things (IoT), cloud computing, wearable devices, and social media, big data has become ubiquitous. At present, the global data volume is growing exponentially. Advancements in technologies such as cluster computing and cloud computing have made the storage, analysis, sharing, and distribution of big data easier and cheaper (
Wani and Ashtankar, 2017). Many companies have realized the huge business value contained in big data and have combined it with their respective strengths. For example, Amazon analyzes customers’ previous orders, shopping cart information, browsing history, store collections, and other information to predict what customers are likely to order, so that goods can be shipped in advance to areas near potential customers to reduce shipping time. ZARA promptly relays to its designers and production departments the large volume of customer feedback collected from its online store, together with experience information from offline brick-and-mortar customers, to adjust product styles and output. Using big data technology to support management decisions has become a hot topic in current management research.
As one of the most important functions of management, evaluation is the process of measuring the attributes of objects and drawing reliable conclusions from data and methods, based on certain goals and standards. In a big data environment, data have the distinctive features of being “massive, high-dimensional, heterogeneous, unstructured, incomplete, noisy, and erroneous”. Traditional evaluation methods are suited to limited, static sample data and struggle to adapt to big data with high-frequency dynamic changes. Nevertheless, evaluation retains substantial value. First, evaluation can directly support management decisions, such as organizational performance evaluation, employee performance evaluation, asset value evaluation, academic evaluation, qualification evaluation, credit rating, etc. Second, evaluation can provide information support for other methods and indirectly assist in complex management decisions, such as risk assessment for insurance pricing decisions, reliability assessment for equipment management, customer perceived value assessment for marketing decisions, and health assessment for diagnosis and treatment decisions. Therefore, evaluation methods in a big data environment urgently need to be studied.
Furthermore, evaluation is widely used in the improvement of various systems, for example, performance evaluation of supply chain systems, strategic analysis of inventory systems, management of risk systems, sales forecasting in marketing systems, ranking recommendation on online platforms, assessment of disease diagnosis and treatment in medical systems, road prediction in transportation systems, and safety assessment of fire protection systems. Combining big data analysis techniques with system evaluation yields a more scientific understanding of systems, thereby improving them and creating greater benefits. There are still difficulties and challenges in applying system evaluation and decision making in the big data environment. Taking advantage of the development opportunities brought by the big data environment, studying system evaluation in depth, and exploring methods for system evaluation and improvement therefore have important theoretical and practical significance.
At present, comprehensive analysis of evaluation methods and systematic improvement in a big data environment is lacking. This article reviews systematic evaluation methods and improvement research in the big data environment from four aspects. First, we review evaluation methods based on big data analytics techniques. Second, focusing on different data characteristics, we review evaluation methods based on the characteristics of big data. Third, we examine improvement methods for different systems in the big data environment. Fourth, we survey the application areas of evaluation activities in the big data environment. Finally, we summarize new application areas of evaluation methods in the big data environment and future research directions. Through this research, we hope to break through the limitations of traditional evaluation, gain a clear understanding of the current state of system evaluation and improvement research in the big data environment, and provide a reference for future researchers.
Literature review search strategies
A search within the timeframe ranging from 2011 to 2019 was considered to represent the period covering the emergence of “big data”. We first focused on searching the following databases: Informs, Elsevier, Taylor and Francis, Wiley, and IEEE Xplore. We used major keywords such as “big data, data-driven, data analytic techniques, evaluation methods, assess, assessment, systematic improvement, data mining, statistics, optimization, simulation, deep learning, association rules, missing data, data loss, data noise, visualization, systematic assessment” to search related literature, supplemented by secondary keywords like “clustering, classification, operations, supply chain management, inventory management, risk assessment, risk analysis, marketing, forecast, system”. Based on our experience in the field and related research in the literature, we also obtained some additional papers. We focused on a list of journals considered to be the leading journals in the operations and management fields: Management Science, Journal of Operations Management, Operations Research, Production and Operations Management, Decision Sciences, Manufacturing and Service Operations Management, Marketing Science, and Information Systems Research. The search results show that evaluation methods are widely used in business value, retail forecasting, risk analysis, inventory management, transportation route planning, consumer preferences, product categories, and disease forecasting and assessment.
Our search started on November 13, 2019, and ended on December 25, 2019. Regarding the literature retrieval process, we take the literature review in Section 2.1 as an example. Specifically, we first focused on searching via Informs advanced search using major keywords such as “big data, data mining”, supplemented by secondary keywords like “clustering, classification, association rule”. For example, when we input the primary keyword “data mining” with the timeframe ranging from 2011 to 2019, the initial search resulted in 32 articles. These references, including the abstracts of all articles, were downloaded into Endnote, a reference management software package, for further analysis. A co-author then screened the abstract of each article to assess its relevance to our research goals. We obtained the related literature (Lutu and Engelbrecht, 2013;
Bai et al., 2015;
Das et al., 2016;
Liu et al., 2016b). Afterward, the secondary keywords “clustering, classification, association rule” were input, and several additional papers were retained (
Jagabathula et al., 2018;
Kopcso and Pachamanova, 2018;
Roy et al., 2019). Subsequently, supplemented by Google Scholar searches, we found further relevant references (
Hastie et al., 2005;
Choi et al., 2018;
Sato et al., 2019). At the end of this process, 11 articles were deemed relevant for evaluation methods based on data mining. The literature search process in the other parts is similar; the only differences are the search platforms and the journals of focus. For example, Section 3 relied more on Google Scholar, while in Sections 4 and 5, more articles were retrieved from Taylor and Francis and Wiley.
Evaluation methods based on big data analytics techniques
Big data processing techniques differ depending on the data source. For example, data can come from mobile devices, the web, social media, and cloud platforms, and their formats can be text, graphics, images, and videos. Hence the terms text analytics, web analytics, social analytics, and multimedia analytics (
Hu et al., 2014). Big data techniques involve a number of disciplines, including statistics, data mining, machine learning, neural networks, optimization methods, and visualization approaches (
Chen and Zhang, 2014). There are many specific techniques in these disciplines, and they overlap with each other to some extent. Traditionally, data mining and deep learning techniques are widely used in forecasting, revenue management, marketing, and risk analysis. Statistical methods are powerful tools for business intelligence analytics (
Sivarajah et al., 2017). Optimization and simulation are widely accepted tools to improve system performance, applying to inventory management and supply chain analytics (
Wang et al., 2016). We focus on the application of big data analytic techniques in business intelligence, supply chain management, and operations management. Following
Choi et al. (2018), we review evaluation methods in the big data environment based on four analytic techniques: data mining, statistics, optimization and simulation, and deep learning.
Evaluation methods based on data mining
Data mining extracts information from a data set and transforms it into an understandable structure for further use. Commonly used data mining methods are classification, clustering, association rule analysis, estimation, and prediction (
Hastie et al., 2005). Data mining plays an important role in business intelligence and big data analytics (
Choi et al., 2018).
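To make these core tasks concrete, here is a minimal sketch (our own illustration, not drawn from any of the surveyed papers; it assumes scikit-learn is installed and substitutes synthetic data for a real business data set) of classification and clustering, two of the methods listed above.

```python
# Minimal illustration of two core data mining tasks: classification and
# clustering. Synthetic blobs stand in for a real business data set.
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.cluster import KMeans
from sklearn.metrics import accuracy_score

X, y = make_blobs(n_samples=500, centers=3, n_features=4, random_state=0)

# Classification: learn labels from examples, then evaluate on held-out data.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
clf = DecisionTreeClassifier(max_depth=3).fit(X_tr, y_tr)
print("classification accuracy:", accuracy_score(y_te, clf.predict(X_te)))

# Clustering: discover segments without labels (e.g., customer segmentation).
segments = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)
print("cluster sizes:", np.bincount(segments))
```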
Focused on the problem of product assortment,
Bai et al. (2015) present a method for assortment planning and optimization across multiple stores that can identify the optimal product assortment for each store and enable assessments of assortment efficiency in all existing stores. Their methodology offers solutions on product assortment for complements versus substitutes and supports sales efficiency evaluation and assortment optimization. Considering the potential value of online social platforms, such as Google Trends, Twitter assessments, IMDb (Internet Movie Database) reviews, Wikipedia views, and Huffington Post news,
Liu et al. (2016b) conduct massive analyses of nearly 2 billion Tweets and 400 billion Wikipedia pages and conclude that extracting and sorting information from online platforms can reflect consumer intent in a timely manner, which has critical implications for forecasting purchases.
Kopcso and Pachamanova (2018) design an example where predictive analytics determines the input to a customer service specification model. They then illustrate how to calculate business value for business stakeholders. They evaluate an organization’s analytics maturity level by using a predictive and prescriptive analytics model.
Jagabathula et al. (2018) develop a method based on the embedding technique that takes customers’ observations and the probability classes generating those observations as inputs and outputs embedded results for each customer. They show that this method outperforms empirical Bayesian, standard latent class, and demographic-based techniques.
In addition, evaluation methods based on data mining include extracting quantitative attributes via text mining (
Das et al., 2016), positive-versus-negative (pVn) classification (
Lutu and Engelbrecht, 2013), a method for evaluating different platforms (
Roy et al., 2019), and a knowledge discovery tool (
Sato et al., 2019). The relevant evaluation methods in this section are reviewed in Table 1.
Evaluation methods based on statistics
Statistics, as the most fundamental technique for data analysis, exists in almost all subject areas of research. The statistical methods commonly used in management evaluation include regression analysis, maximum likelihood estimation, Bayesian estimation, and Markov stochastic processes. With the maturing of big data analysis technology, evaluation work based on statistical methods is also increasing.
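As a hedged illustration of one such method, the sketch below (our own, on hypothetical simulated portfolio data rather than any cited study’s data) fits a logistic regression risk classifier by maximum likelihood, in the spirit of the simulation-based risk scoring reviewed below.

```python
# Sketch: logistic regression as a statistical evaluation tool, scoring
# hypothetical portfolios by their probability of a loss event.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
# Assumed features: volatility and leverage of 1000 simulated portfolios.
X = rng.normal(size=(1000, 2))
# Assumed ground truth: losses grow more likely with volatility + leverage.
y = (X @ np.array([1.5, 1.0]) + rng.normal(scale=0.5, size=1000) > 0).astype(int)

model = LogisticRegression().fit(X, y)   # fitted by maximum likelihood
risk = model.predict_proba(X[:5])[:, 1]  # estimated P(loss) per portfolio
print("estimated risk of first 5 portfolios:", np.round(risk, 3))
```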
To improve the accuracy of sales predictions in the tire industry,
Sagaert et al. (2018) propose a forecasting method that automates the identification of key leading indicators, helping to generate accurate forecasts. In their case study, the proposed method improves forecast accuracy by 16.1%. Furthermore, the method can also handle external indicators with short-term and long-term dynamics. In portfolio risk analysis,
Jiang et al. (2019) propose a logistic regression model that uses data generated in past simulation experiments to estimate portfolio risk and classify portfolio risk levels in real-time. They show that the simulation analytics idea is viable and promising in the field of financial risk management.
Chehrazi and Weber (2015) construct a dynamic collectability score (DCS) that can be used to estimate the repayment probability of delinquent credit-card accounts. The DCS framework is applied to a large set of account-level repayment data. Compared to standard bank-internal scoring methods, the DCS framework achieves significant improvements in classification and prediction performance. A new stochastic variational Bayesian (SVB) approach is used to estimate movie ratings and semantic tags from a large data set; the approach is very useful in actual recommendation contexts (
Ansari et al., 2018). Other evaluation methods are used in transport risk management (
Shang et al., 2017), ranking and selection (
Salemi et al., 2019), audit quality (
DeFond et al., 2017), project portfolio optimization (
Yang et al., 2015), and stock-keeping unit (SKU)-clustering problem (
Park et al., 2017). The relevant evaluation methods are shown in Table 2.
Evaluation methods based on optimization and simulation
Algorithms are mainly evaluated in terms of time complexity and space complexity. The optimization algorithms commonly used in management evaluation mainly include the gradient descent, simulated annealing, Newton, and quasi-Newton methods (
Simon, 2013).
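The following toy sketch (ours; the objective function and step size are assumptions chosen for exposition) contrasts two of the optimizers named above, gradient descent and Newton's method, on a one-dimensional objective.

```python
# Minimize f(x) = (x - 3)^2 + 1. Gradient descent uses only f'(x);
# Newton's method also uses f''(x) and converges in one step here.
def grad(x):   # f'(x) = 2(x - 3)
    return 2.0 * (x - 3.0)

def hess(x):   # f''(x) = 2
    return 2.0

x_gd, x_nt = 0.0, 0.0
for _ in range(50):
    x_gd -= 0.1 * grad(x_gd)         # gradient descent, step size 0.1
for _ in range(5):
    x_nt -= grad(x_nt) / hess(x_nt)  # Newton step
print(f"gradient descent -> {x_gd:.4f}, Newton -> {x_nt:.4f}")  # both near 3
```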
Considering the problem of travel time estimation and route planning,
Bertsimas et al. (2019a) leverage a simple approach to solve the travel time estimation and route planning problem in a real-world setting. Given travel times for any number of origin-destination pairs, the method can estimate the travel time as well as provide a sensible path associated with it. Their algorithm is robust against high input uncertainty and can successfully exploit noisy data to provide accurate results. Based on flow procedures,
Hochbaum (2018) proposes a combinatorial method that efficiently solves the classification problem as a network flow problem on a graph, achieving higher accuracy and shorter running times in pattern recognition, image segmentation, and general data mining. Focused on the design of mechanisms for a sequencing problem,
Hoeksma and Uetz (2016) combine an exponential-size linear program with a convex decomposition algorithm to find optimal linear programming solutions. Increasing the integration of local generators is a challenge in the planning, design, and operation of distribution systems.
Naghdi et al. (2018) present a quasi-Newton trust-region algorithm to evaluate the planning, design, and operation of the distribution system. Two networks were used for testing, and the obtained results revealed the accuracy and validity of the proposed method.
Huang et al. (2019) develop a novel two-stage data-analytic method that can serve as a template for modeling customer-firm interactions. Applying the method can improve decision making in real time. Their paper is one of the first studies to examine the evolution of player participation based on motivational factors using observational data.
Other evaluation methods based on optimization algorithms and simulation include stochastic annealing (
Ball et al., 2018), reserving relief supplies for earthquake (
Yang et al., 2016b), the routing optimization algorithm (
Bertsimas et al., 2019b), and evaluation of recommender systems (Adomavicius and Zhang, 2016). The relevant evaluation methods are reviewed in Table 3.
Evaluation methods based on deep learning
Deep learning is a class of machine learning algorithms based on representation learning of data. Typical deep learning models include the convolutional neural network (CNN), the deep neural network (DNN), and the long short-term memory (LSTM) network. Deep learning also brings many development opportunities for evaluation work.
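The minimal sketch below (ours, assuming PyTorch is available; the feature count and the demand-scoring interpretation are hypothetical) shows the basic shape of an LSTM-based evaluator of the kind reviewed in this section.

```python
# A tiny LSTM mapping a sequence of daily features to one score
# (e.g., a next-day demand estimate). Purely illustrative.
import torch
import torch.nn as nn

class SeqScorer(nn.Module):
    def __init__(self, n_features=4, hidden=16):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, x):                # x: (batch, time, features)
        out, _ = self.lstm(x)
        return self.head(out[:, -1, :])  # score from the last time step

model = SeqScorer()
x = torch.randn(8, 30, 4)                # 8 sequences, 30 days, 4 features
print(model(x).shape)                    # torch.Size([8, 1])
```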
To explore the nature of intertemporal cross-product patterns,
Xia et al. (2019) propose a conditional restricted Boltzmann machine (CRBM)-based model on an enormous consumer purchase data set. Using the proposed model, retailers can potentially capture and predict each consumer’s complex shopping patterns with greater accuracy for personalized marketing. Focused on the quality assessment of Wikipedia,
Wang and Li (2019) select state-of-the-art deep-learning models to conduct quality evaluation in terms of classification performance and training performance, and validate the effectiveness of the proposed model.
Considering financial risk,
Borovkova and Tsiamas (2019) propose a long short-term memory (LSTM) neural network for intraday stock prediction using many technical analysis indicators. They evaluate the predictive power of their model on several US large-cap stocks and find that the proposed model outperforms the benchmark models and equally weighted ensembles. Accurate prediction of forex rates is an essential element of an effective response to hedging or speculation strategies in the forex market.
Galeshchuk and Mukherjee (2017) explore the ability of deep convolutional neural networks to forecast the direction of forex rate changes, finding that trained deep networks can achieve satisfactory prediction accuracy. In addition, a hybrid architecture based on deep learning (
Amorin et al., 2019), the impact of personality similarity on subsequent purchase (
Adamopoulos et al., 2018), and a deep-learning approach to identify customers’ needs (
Timoshenko and Hauser, 2019) are studied. The relevant evaluation methods are shown in Table 4.
Evaluation methods based on big data characteristics
Big data are huge and complex, and are difficult to process using traditional data-warehousing tools (
Kalbandi and Anuradha, 2015). Organizations often collect data from internal and external sources. Internal sources usually provide data related to an organization’s operations and business processes, while external data come from suppliers, retailers, customers, and market information (
Geczy, 2014). Data from different sources are often interconnected. In this paper, the characteristics of big data we focus on include association features between data, data quality (data loss and data noise), and visualization features.
Evaluation methods based on data association features
Association rule mining is a way to discover hidden relationships between variables in large databases (
Agrawal et al., 1993). Traditionally, input data sets can come from mobile devices, the web, social media, and cloud platforms, and their formats can be text, graphics, images, and videos. The premise of systematic evaluation and improvement in a big data environment is to understand the features of input data sets. If the features of the input data set are obvious, they can be modeled and analyzed by traditional statistical and econometric methods (
Chen and Zhang, 2014). However, it is not obvious which features should be input. There are many methods for data feature selection (
Bennasar et al., 2015;
Abedinia et al., 2017;
Ambusaidi et al., 2016). Association rules are widely used to discover hidden patterns in large databases and to find interesting knowledge and information. This is particularly useful for nominal features and for constructing full-fledged models.
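To ground the terminology, the following self-contained sketch (our own toy example, not taken from the cited works) computes support and confidence, the two basic measures behind association rules.

```python
# Support and confidence computed directly on toy transaction data.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer"},
    {"milk", "diapers", "beer"},
    {"bread", "milk", "diapers"},
    {"bread", "milk", "beer"},
]

def support(itemset):
    """Fraction of transactions containing every item in the itemset."""
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent):
    """How often the consequent appears given the antecedent appears."""
    return support(antecedent | consequent) / support(antecedent)

# Rule {diapers} -> {beer}: does beer tend to co-occur with diapers?
print("support:", support({"diapers", "beer"}))          # 0.4
print("confidence:", confidence({"diapers"}, {"beer"}))  # ~0.667
```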
Cang and Yu (2012) develop a fitness function based on association rules, which has been shown to be effective for input feature selection and systematically improves the generalization ability of the evolved model. Based on data association features, the management field has also carried out a large number of evaluation activities.
Focused on the customer churn problem in retail sales,
Aung et al. (2019) apply the FP (Frequent Pattern)-Growth method to a customer churn data set. They develop a customer churn prediction model to help retail companies make decisions on estimating client loss or planning promotion activities.
Zhang et al. (2019) propose an improved method based on association rules to evaluate energy efficiency, and show that the proposed approach is effective for outlier identification and data transformation. Considering the large amount of associated data in the audit business,
Parkinson et al. (2016) develop a novel method of modeling file system permissions to evaluate auditing efficiency. Their method can correctly identify irregularities with an average accuracy rate of 91%, minimizing the reliance on expert knowledge.
Association rule-based evaluation is also used in many other fields, including risk management (
Bhatia, 2019), electric vehicle data anomaly detection (
Wang and Wu, 2019), and bioinformatics (
Boudellioua et al., 2016). Besides, some studies focus on developing new association rule mining methods (
Feng et al., 2016;
Czibula et al., 2019). The relevant evaluation methods are shown in Table 5.
Evaluation methods considering data loss
Due to technical, human, and user privacy reasons, data are often missing in large quantities. Missing data hinder the subsequent analysis of big data and thus affect decision-making.
Soley-Bori (2013) presents basic concepts and methods for dealing with missing data. After explaining missing data mechanisms and missing patterns, the author reviews the main conventional methods in data analytics, including imputation methods, multiple imputation, maximum likelihood, Bayesian methods, and listwise deletion. Advantages and limitations are listed so that the reader can identify the main trade-offs of each method. Likewise,
Graham et al. (2012) review methods for handling missing data from the perspective of psychology.
Little and Rubin (2019) review methods for handling missing data, including imputation, multiple imputation, and maximum likelihood.
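As a concrete, hedged illustration of two of the conventional strategies named above, the sketch below (ours; toy data, assuming pandas and scikit-learn are installed) contrasts listwise deletion with single mean imputation.

```python
# Two simple treatments of missing data: drop incomplete rows, or
# impute each missing value with the column mean.
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer

df = pd.DataFrame({"age": [25, np.nan, 47, 51],
                   "income": [3.2, 4.1, np.nan, 5.0]})

deleted = df.dropna()  # listwise deletion discards incomplete rows
imputed = pd.DataFrame(
    SimpleImputer(strategy="mean").fit_transform(df),  # single imputation
    columns=df.columns,
)
print(f"rows kept after deletion: {len(deleted)} of {len(df)}")
print(imputed)
```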
Clinical data sets often suffer from high missingness, which seriously impacts the diagnosis and prediction of disease. Imputing missing values provides an opportunity to resolve the issue. Based on imputation methods,
Wu et al. (2019b) propose a machine learning method that can improve the quality of breast cancer data sets. The results reveal that the proposed method retains strong robustness and discriminant power even when the data set has a high missing rate (>50%). Aggregating high-frequency traffic collision data to a lower frequency often loses relevant information.
Li et al. (2019) propose a vector auto-regression (VAR) approach to evaluate traffic collisions and show that the proposed VAR outperforms other missing value imputation techniques. In evaluating the impacts of products and processes,
Moreau et al. (2012) develop a statistical approach to carry out a life cycle assessment. The authors show how to handle missing data on material and energy flows when evaluating hydropower plants.
Jia and Wu (2019) use Monte Carlo simulation to assess five methods for dealing with data loss and show that robust full information maximum likelihood (RFIML) and multiple imputation-latent variable (MI-LV) combined with diagonally weighted least squares (cat-DWLS) perform best. In addition, evaluation activities involving missing data appear in many other fields, including clinical endpoint bioequivalence (
Lou et al., 2019), estimating men’s fertility (Dudel and Klüsener, 2018), and power systems (
Yang et al., 2020). The relevant evaluation methods are shown in Table 6.
Evaluation methods considering data noise
There are often various interference factors in the data collection process, which introduce noise into the original data (
Ilow and Hatzinakos, 1998). Noise affects data analysis to varying degrees. Many big data analysis techniques iterate an algorithm to reach an optimal solution; if the data set contains a lot of noise, this greatly reduces the convergence speed of the algorithm and the accuracy of the analysis.
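To illustrate one common way of mitigating such noise before further analysis, the sketch below (our own illustration, not a method from the surveyed literature) denoises synthetic data by projecting it onto its top principal components and reconstructing.

```python
# PCA-based denoising: keep only the low-rank structure and discard
# the variance spread across the remaining noisy directions.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
signal = rng.normal(size=(200, 2)) @ rng.normal(size=(2, 10))  # rank-2 signal
noisy = signal + 0.3 * rng.normal(size=signal.shape)           # added noise

pca = PCA(n_components=2).fit(noisy)
denoised = pca.inverse_transform(pca.transform(noisy))
print("mean squared error before:", np.mean((noisy - signal) ** 2).round(4))
print("mean squared error after: ", np.mean((denoised - signal) ** 2).round(4))
```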
Recent evaluation work based on noisy data has also made progress. Principal component analysis (PCA) is a powerful dimension reduction technique, but the resulting principal components remain contaminated by noise in the data. Rezghi and Obulkasim (2014) propose a noise-free PCA (NFPCA) method that introduces regularization to mitigate the effect of noise, and show that NFPCA produces more informative results than ordinary PCA at a lower computational cost. Leveraging the encoder-decoder framework for neural machine translation,
Zoph et al. (2016) propose a transfer learning approach to assess the performance of machine translation. The results show that the transfer learning model can improve syntax-based machine translation by an average of 1.3 BLEU (Bilingual Evaluation Understudy) points.
Van Vliet and Salmelin (2020) present a framework that decomposes the weight matrix of a fitted linear model into three subcomponents and develop a post-hoc modification of linear models. They show that the decoding accuracy of two example linear models can be boosted by incorporating this information. Furthermore, relevant evaluation methods include a self-organizing incremental neural network approach (
Wiwatcharakoses and Berrar, 2019), a new modeling and two-dimensional mapping approach (
Ball et al., 2018), a Bayesian approach to online robust parameter design (
Huang et al., 2017), and an evaluation of typical flow cytometry (Cao and Grima, 2019). The relevant evaluation methods are reviewed in Table 7.
Evaluation methods based on visualization (converting big data into small data)
The term “big data” is related to machines, while “small data” is related to people. Small data are data that are accessible and processable in capacity and format, contain useful information, and are understandable by humans (
Kitchin and Lauriault, 2015). A common method to transform big data into small data is to visualize big data, such as histograms, violin plots, heat-maps, and scatter plots (
Gatto et al., 2015).
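As a minimal illustration of turning big data into small data, the sketch below (ours; synthetic data, assuming matplotlib is installed) condenses a million records into two of the plot types named above.

```python
# Summarize 10^6 records as one small image containing two plots.
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(42)
big = rng.lognormal(mean=3.0, sigma=0.5, size=1_000_000)  # e.g., order values

fig, axes = plt.subplots(1, 2, figsize=(8, 3))
axes[0].hist(big, bins=50)        # histogram: overall distribution shape
axes[0].set_title("Histogram")
axes[1].violinplot(big[:10_000])  # violin plot on a subsample
axes[1].set_title("Violin plot")
fig.tight_layout()
plt.savefig("summary.png")        # the resulting "small data" artifact
```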
With the improvement of data availability, using big data to help managers make scientific decisions has become a trend.
France and Ghose (2016) introduce a statistical likelihood method to evaluate visualizations of submarkets in product categories. A series of experiments shows that their method identifies market structure better than competing methods. Likewise,
Ringel and Skiera (2016) also focus on visualizing the competitive structure of large markets. They integrate large-scale data into a new modeling and two-dimensional mapping method, enabling users to visualize asymmetric competition in large markets and identify distinct submarkets. An empirical application to the LED-TV market, with 1124 products and 56 brands, yields valid and useful insights and shows that their method outperforms traditional models. Aiming to establish a new approach for identifying cultivars of Chrysanthemi Flos (CF),
Nie et al. (2019a) develop a multimodal quantitative method combining principal component analysis (PCA) and similarity evaluation system (SES) to identify four cultivars of CF. The results show that the comprehensive method is effective. Furthermore, other evaluation methods based on data visualization can be seen in the literature
Nie et al. (2019b) and
Rajwan et al. (2013). The relevant evaluation methods are shown in Table 8.
Systematic improvement methods in the big data environment
With the rapid development of big data technology, massive data information can be tracked, collected, and utilized. The analytics techniques can provide new knowledge on their own without human intervention, helping decision-makers understand and predict consumer behavior (
Dhar, 2013). Studying the improvement methods of different application systems is of great significance for promoting system evaluation and optimization. In this section, we focus on systematic improvement methods based on big data processing techniques and information characteristics, and review some hot operations management problems.
Systematic improvement methods based on big data processing
With the widespread application of smart service systems in the home, transportation, energy, and healthcare sectors,
Lim and Maglio (2018) combine metrics and machine learning algorithms to preprocess and analyze text data from smart service systems. Based on an analysis of 5378 scientific articles and 1234 news articles, they establish a common evaluation ground for understanding modern service systems. Considering that current ranking algorithms on social media platforms ignore consumers’ multidimensional preferences for products,
Ghose et al. (2012) propose a new method to improve hotel ranking systems. They use a real data set from a website and show that qualitative comments are a valuable first step for text mining.
Liu et al. (2016b) conduct an extensive analysis of nearly 2 billion Tweets and 400 billion Wikipedia pages, concluding that extracting and sorting information from online platforms can provide a timely representation of consumer intentions, which has important implications for forecasting purchases.
Statistics is a mature area whose purpose is to provide a scientific framework for collecting data, analyzing them, and drawing conclusions (
Choi et al., 2018).
Distelhorst et al. (2017) analyze an intervention by Nike to promote the adoption of lean manufacturing in its apparel supply chain across 11 developing countries. Estimating from a panel of more than 300 factories, they find that lean manufacturing and high-involvement work practices can improve social performance.
Ramasubbu and Kemerer (2016) analyze the impact of technical debt on system reliability by utilizing a large-scale longitudinal data set. Their empirical results illustrate how firms could evaluate business risk exposure due to technical debt accumulation, and they also assess the estimated net effects.
Bai et al. (2012) provide an effective and viable means for managing the risk associated with data quality in accounting information systems. Compared to previous approaches to data quality risk, their methodology is more cost-effective and easier to implement.
As standard analytical approaches, optimization and simulation can find optimal (or near-optimal) solutions to quantitative decision-making problems.
Naghdi et al. (2018) present a quasi-Newton trust-region algorithm to evaluate the planning, design, and operation of the distribution system. Two networks were used for testing, and the obtained results revealed the accuracy and validity of the proposed method.
Ansari et al. (2018) develop a new stochastic variational Bayesian (SVB) approach for scalable estimation and use it to estimate movie ratings and semantic tags from a large data set. Their approach is very useful in actual recommendation contexts. Focused on the transportation system,
Buijs et al. (2016) propose an evaluation method to structure and improve Fritom’s existing collaborative transport planning process.
Deep learning models are mainly built on neural networks.
Sun and Vasarhelyi (2018) develop a deep neural network (DNN) that can assess the risk of credit card default based on personality characteristics and spending behaviors. Compared with other machine-learning algorithms, the DNN has a higher F-score and better overall predictive performance. Flight delays are another common problem in the transportation field, seriously affecting the travel experience.
Chung et al. (2017) use a large data set from a major Hong Kong airline to analyze flight delays at 112 airports around the world. They leverage a cascading neural network to improve flight schedule prediction and then apply it to the crew optimization problem. The new method improves the accuracy of flight delay prediction, which greatly improves crew matching performance.
Systematic improvement based on data processing technology usually involves secondary processing of a large amount of data, which yields management insights. The improvement of systems based on big data processing technology requires more in-depth discussion to dig out more valuable information. The relevant systematic improvement research is shown in Table 9.
Systematic improvement methods based on information characteristics
As an abstract and intangible resource, information has use-value and can meet people’s needs in various respects. Reasonable extraction of information can effectively improve decision-making (
Adnan and Akbar, 2019). In a big data environment, information sharing, disclosure, and security issues are particularly important. In this part, we focus on systematic improvement research under the characteristics of information sharing, information disclosure, and information privacy.
Li and Gu (2019) propose an integrated approach for support systems that allows users to query data simultaneously from both relational Structured Query Language (SQL) systems and NoSQL (not only SQL) systems. The proposed approach can effectively reduce development complexity and improve development efficiency. Kishore et al. (2020) link call detail records (CDR) with an influenza-like illness (ILI) registry and evaluate the role that international travelers played in the introduction of epidemics (A/H1N1). Their method also carries out a similar assessment of domestic airports, and system efficiency is improved significantly. To explore how individuals’ valuations change in the presence of multiple privacy factors,
Buckman et al. (2019) use an incentive-compatible mechanism to capture individuals’ willingness to accept disclosure. The results show participants’ privacy valuations are largely unaffected by requiring the disclosure of personal identifying information, the information context, and the intended secondary use of the disclosed information.
In addition, to examine trends in academic research on personal information privacy,
Choi et al. (2017a) extract 2356 documents published between 1972 and 2015 (through August) from the Scopus database to carry out topic modeling. They show that the topics of algorithms, online social networks, and Facebook privacy have become promising, and that the top journals pay more attention to both e-business and healthcare. The system operations based on information characteristics are shown in Table 10.
Supply chain management and operations
With the improvement of enterprise resource planning (ERP) software, capturing and storing data at different levels of operations has become easier. Companies also hope to make their processes more efficient by analyzing these data. Big data analytics techniques are widely used in supply chain management and operations to make smarter decisions (
Wamba et al., 2015). Key areas of operational management in a big data environment include supply chain and logistics management, inventory management, retail forecasting, and risk assessment.
Big data has a significant impact on supply chain management. Researchers have studied capacity sharing contracts where the demand is uncertain (
Yang et al., 2017a).
Wu et al. (2019a) examine the relationship between data analytics capabilities and innovation. The results suggest that data analytics capabilities are most strongly associated with innovation, and that firms may receive the most benefit from using data analytics if they have historically focused on specific types of innovation.
Newman et al. (2014) propose a parameter estimation routine for multinomial logit discrete choice models. Their method is computationally efficient and can easily incorporate price and other product attributes. Using simulated hotel industry data, they demonstrate that their method has superior computational performance over alternative estimation methods capable of estimating price effects. Focused on the sustainability of supply chain systems,
Badiezadeh et al. (2018) propose a “double frontier network data envelopment analysis (DEA)” to assess the sustainability of supply chain systems in a big data environment. Their method can rank the sustainability scores of supply chains. Considering strategic customers,
Yang et al. (2019c) investigate the impact of selling effort on the pricing decisions of the supply chain.
Inventory management is an important topic in operations management (OM). Considering the inventory control problem,
Huang and van Mieghem (2014) develop a decision support model that can reduce inventory holding costs by 3% and back-ordering costs by 5%. Likewise,
Bertsimas et al. (2016) explore the inventory control problem using a conditional stochastic optimization method in a big data environment. They analyze four years of inventory data from a retail network, obtaining an optimal inventory management scheme for a retail company with multiple inventory and retail locations.
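A hedged sketch of the data-driven spirit of such inventory methods (our own toy illustration, not the cited papers’ algorithms; the demand data and cost parameters are assumed): under classic newsvendor costs, the cost-minimizing order level is the critical-fractile quantile of demand, estimated here directly from history.

```python
# Data-driven newsvendor: with unit holding cost h and back-ordering
# cost b, the optimal order level is the b/(b+h) quantile of demand.
import numpy as np

rng = np.random.default_rng(7)
demand_history = rng.gamma(shape=5.0, scale=20.0, size=1000)  # assumed data

h, b = 1.0, 4.0                               # assumed cost parameters
q = np.quantile(demand_history, b / (b + h))  # critical-fractile quantile
print(f"order up to {q:.1f} units (the {b / (b + h):.0%} demand quantile)")
```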
Retail forecasting has been another key research area in OM, especially in multi-channel retail (
Mehra et al., 2018). In traditional retail research, forecasts depend more on historical data, expert advice, and market information. However, in a big data environment, the increasing number of available information sources can potentially enhance prediction performance. In recommendation systems, some firms reduce consumer search workloads by using big data technologies (
Dutta et al., 2017).
Cui et al. (2018) propose a comprehensive system that predicts the total daily sales of an online men’s exclusive retailer through feature extraction and machine learning, finding that the system can improve forecast accuracy by utilizing social media inputs. They analyze data posted on Facebook (recording the company’s interactions with customers) to supplement operational data in generating sales forecasts. Furthermore, the service optimization of after-sales operations is also influenced by big data analytics.
Boone et al. (2018) add customer search data to time series models to reduce out-of-sample forecast errors, finding that models augmented with Google Trends data can improve sales forecast accuracy for an online specialty food and cookware retailer, especially across multiple products.
Yang et al. (2014) use a DEA method to forecast the production abilities of recycling systems and show that the level of water reuse in China is still low.
Risk assessment, for both business operations and non-profit organizations, benefits from advancements in big data techniques (
Choi et al., 2017b). To assess the risk of rail failures,
Jamshidi et al. (2017) develop a novel method to explore rail surface defects. They collect big data on track surface defects through intelligent image processing, which includes measurable lengths of these defects. Finally, they conduct a practical case study on Dutch railways and find that the railway fault assessment system they proposed performs well. Considering the risk assessment of procuring infrastructure mega-projects,
Chan et al. (2018) develop a fuzzy evaluation model and demonstrate its practicality by analyzing the Hong Kong–Zhuhai–Macao Bridge project. Supply disruption affects the efficiency of the supply chain.
Yang et al. (2019b) find that licensing strategies can effectively mitigate the negative effects of disruption risk, improving the performance of the supply chain. The evaluation of environmental impact indicators for sustainable maritime transportation systems has also been studied (
Lizzette et al., 2019). The relevant supply chain management and operations studies are shown in Table 11.
Application research of system evaluation in the big data environment
In this section, we discuss the applications of system evaluation in the big data environment in the medical industry, finance, business, information systems, transportation, and other areas. Figure 1 summarizes the key areas and future research directions.
Medical industry
Since big data techniques can handle massive data volume and variety at high velocity, they have the potential to create significant value in healthcare by improving outcomes while lowering costs. Data analytics plays an important role in improving the quality of care, increasing the efficiency of operational processes, predicting and planning responses to disease epidemics, and optimizing healthcare spending at all levels (Nambisan et al., 2015). Hence, we explore the applications of system evaluation in the medical industry from the perspectives of medical image informatics, healthcare management, and privacy concerns.
Medical image informatics can help doctors better diagnose disease. Insights derived from the analysis of patient data can help healthcare professionals better identify disease symptoms and predict the cause and incidence of disease, ultimately improving the overall quality of care (
Genta and Sonnenberg, 2014).
Nambisan et al. (2015) focus on social media communications to identify individuals suffering from depression. In the field of healthcare management,
Hydari et al. (2018) analyze a new patient data set from the Pennsylvania Patient Safety Authority (PSA), finding that advanced electronic medical records (EMRs) can reduce patient safety events by 17.5%, driven by reductions in medication errors, falls, and complication errors.
Senot et al. (2016) examine the impact of combining conformance quality and experiential quality on hospitals’ readmissions and cost performance. They analyze six years of data from 3474 US acute care hospitals and show that combining conformance quality and experiential quality decreases the likelihood of readmission for a patient.
A major problem facing information systems today is the protection of consumer privacy. Building Health Information Exchange (HIE) frameworks that incorporate security and privacy principles is necessary in the big data environment. Considering the privacy issues in the smart healthcare framework,
Adjerid et al. (2016) explore the impact of different forms of privacy regulations and policies on HIE efforts. They find that privacy regulation alone negatively affects HIE efforts, while privacy regulations paired with incentives have positive impacts on HIE efforts. Some researchers investigate privacy-preserving data mining (PPDM). PPDM is a data mining approach that does not leak any of the client’s sensitive information (
Xu et al., 2014).
Finance
Finance research in a big data environment has mainly focused on risk analysis. Major financial institutions have millions of credit card accounts. Because of the large scale of the data, these loan pools and the securities backed by them are computationally difficult to analyze.
Allodi and Massacci (2017) develop a quantitative scheme to assess cybersecurity risk. Their scheme can give quantitative probability estimates to help deal with untargeted cyber-attacks against the organization. They use a real data set from a financial institution to verify the new scheme, finding that the new risk assessment scheme is effective. Focusing on a broad class of dynamic, discrete-time models of loan-level risk,
Sirignano and Giesecke (2018) develop a numerical method for the analysis of large pools of loans as well as asset-backed securities backed by such pools. Applying their approach to a data set of over 25 million mortgages, they demonstrate the accuracy and speed of the approximation for a variety of pools with different risk profiles. Furthermore, big data analytics is also employed in various other topics such as the relationship between ratings of new issuances and the number of rating analysts (
Jiang et al., 2018) and predictor of US real economic activity (
Faccini et al., 2018).
Business
Akter and Wamba (2016) conduct a systematic review of e-commerce in a big data environment. They show that data analytics can transform data into insights through dynamic processes and technology, providing more value for robust decision-making and business problem solutions for e-commerce companies. Growth areas of e-commerce research include advertising strategies and recommendation systems for online firms (
Ghoshal et al., 2015). Web and mobile advertising have been other interesting areas of research.
Mookerjee et al. (2017) set up a model to predict visitors’ clicks on online advertisements and propose an approach to managing online advertisements. The results show that both the click rate and the firm’s revenue increase.
Yang et al. (2019a) develop a winner-take-all model to improve the total value of corporations. Their approach assesses a firm’s competitiveness and development potential, helping managers effectively avoid overcapacity.
Information systems (IS)
Information systems (IS) has become an interdisciplinary research area that combines computing techniques with big data from business practice (
Agarwal and Dhar, 2014). Most research in the IS areas focuses on improving the efficiency of business operations. Focused on IS-Marketing research,
Ruths and Pfeffer (2014) analyze data from online social media to explore consumer behaviors and forecast events. In this direction,
Qiu and Kumar (2017) design a randomized field experiment to examine the performance of prediction markets, finding that larger audiences and online endorsements often produce more precise predictions. The results suggest that predictions are most accurate when targeting people with intermediate abilities. Considering the pitfalls of social media,
Kumar et al. (2018) summarize several collective behaviors of users on social media and describe them uniformly. They then propose a new hierarchical supervised-learning approach that can assess the likelihood of finding anomalies in online reviews. The results suggest that it is difficult to detect dishonest online reviews owing to complex interactions.
Transportation
With the development of intelligent mobility, the transportation industry has also entered the era of big data. The analysis of big data can significantly improve traffic conditions.
Liu et al. (2016a) use big data containing truck driver behavior information before accidents together with geographic location information to study the direct and indirect relationships between truck traffic safety and related factors. They propose a methodology to improve truck safety at railway crossings. Considering pedestrian crashes,
Xie et al. (2017) explore the likelihood of accidents in the transportation network via big data analytics. They build a new model based on a grid-based unit structure framework that can simultaneously utilize large data sets of transit turnstiles, taxis, and even social media. They believe that big data analysis can more accurately estimate the relevant risk factors, helping identify hot spots of traffic accidents so that proactive measures can be taken. Focused on traffic flow predictions,
Lv et al. (2015) derive a novel method considering both temporal and spatial correlations and then develop a stacked auto-encoder that is trained by a greedy layer-by-layer method to learn traffic flow characteristics.
Shang et al. (2017) use a Bayesian statistical method to explore cargo logistics risk (CLR). The authors flexibly estimate the conditional density function of CLR from a large air cargo data set. Their findings help logistics companies distinguish whether the source of CLR is recurrent or disruptive.
Other areas
Focused on the assessment of weather risk,
Biffis and Chavez (2017) develop a data-driven risk transfer scheme and demonstrate how to use weather data on rainfall and temperature to create a risk profile. They conduct a real case study of Mozambique’s maize production. The results show that their proposed framework can reduce insurance costs by 30%. Considering the management of e-government systems,
Joseph and Johnson (2013) demonstrate how big data improves operational efficiency and process effectiveness in the US government. Furthermore, systematic evaluation in a big data environment is also employed in other fields such as environmental systems, recycling economy evaluation index systems, and logistics systems. Interested readers can refer to
Bi et al. (2014),
Yang et al. (2016a), and
Zhou et al. (2016) for more discussions.
Way ahead: Potential applications and challenges
Potential applications of evaluation methods
The Internet of Things has created a world of interconnected sensing devices that can collect and store information from their respective real environments (
Hashem et al., 2016). The combination of the Internet of Things and the application of big data analysis may bring breakthrough changes to various industries and academic research. For example, smart healthcare is a promising area for future development. The Internet of Things and cloud-based big data analytics can more accurately detect and treat diseases while requiring lower healthcare costs (
Varshney and Chang, 2016). The Internet of Things and blockchain ensure the authenticity and traceability of all processes, reducing the transaction risk of various entities. Risk assessment combined with blockchain technology is also worth studying. In addition, the IoT and related big data applications can play key roles in advancing environmental sustainability (
Bibri, 2018). Making a reasonable assessment of the sustainable economy and proposing improvement methods is also of great significance today in response to environmental degradation.
Logistics services are becoming more intelligent and can be tracked in real-time (
Yang et al., 2017b). The development of the Internet of Things and big data has also promoted the development of smart logistics. According to the China Smart Logistics Development Report released in 2018, the industry value of smart logistics will exceed one trillion yuan by 2025. Evaluation of smart logistics, built on the Internet of Things and big data analysis technology, will therefore be a promising topic. Furthermore, with the rapid development of social platforms and short video apps, evaluating consumers’ online behavior is also a trend (
Zheng et al., 2019).
The development of new evaluation methods
Big data puts forward new requirements on the behavioral cognition of evaluation, evaluation modes, and evaluation functions. First, the behavioral cognition of evaluation. Evaluators show a series of new behavioral characteristics in the context of big data. For example, complete evaluation accuracy is impossible to achieve in the context of big data, so evaluators’ requirements for accuracy may need to be relaxed. Specifically, the accuracy of evaluation depends on the cost of extracting information; after all, acquiring big data entails huge economic costs. Word-of-mouth service on the Internet is a representative type of public comment, and evaluation in the context of public comment may cause behavioral deviations (such as group polarization) that also need to be understood. Second, changes in evaluation modes. In the context of big data, online evaluation models will emerge, characterized by high-speed computing and instant response. In addition, the big data context emphasizes dependence on distributed human-computer interaction, which current evaluation models have not deeply incorporated. Third, new expansions of the evaluation function. Collecting and processing big data through large numbers of mobile, distributed terminals and processors serves decision-makers’ need to mine the value of data, which creates opportunities for new evaluation functions to emerge. For example, public safety evaluation based on mobile social network data, energy efficiency evaluation based on smart meter data, residential credit evaluation based on smart payment data, and building security evaluation based on IoT data are all new evaluations in the context of big data.
Furthermore, the age of big data brings new opportunities to evaluation methods, mainly manifested as follows: (1) Evaluation of data quality. Although big data contains abundant information, it also contains much garbage (such as repeated, redundant, disturbing, and distorted information), which undermines the reliability of evaluation results. Therefore, evaluating data quality has become a top priority. (2) New data analysis techniques to reduce computational difficulty. Evaluation methods based on statistical models are difficult to implement at big data scale, while evaluation methods based on optimization models often perform poorly on large data sets. Reducing computational difficulty while ensuring the completeness and rationality of the data information is a basic requirement for new evaluation methods and their implementation technologies. (3) Robust evaluation methods. Most current evaluation methods deal with one or several discrete static data sets and cannot perform continuous dynamic evaluation throughout a process. In the big data environment, the status of the evaluation object is updated in real time, yet decision-makers want robust assessment results. Therefore, it is necessary and valuable to develop robust evaluation methods that adapt to dynamically changing data. (4) Evaluation methods for unstructured data. In the big data environment, much data is unstructured or semi-structured, while most current evaluation methods can only deal with structured data. Due to the huge amount of data, the diversity of information dimensions, the personalization of expressions, and the complex relationships between data, transforming unstructured data into structured data is a very difficult task. Developing novel evaluation methods for unstructured data will greatly enrich evaluation theory. (5) Samples and populations contain different amounts of information. For large amounts of data in the traditional sense, it is assumed that a sample and the population carry the same information, so the behavior of the overall data can be measured indirectly by random sampling. In a real big data environment, a sample cannot contain the total amount of information in the population. Hence, the big data environment poses severe challenges to traditional random sampling methods. (6) Evaluation methods for new features. The use of big data is a conscious behavior of decision-makers. Collecting and processing big data through large numbers of mobile and distributed terminals and processors serves decision-makers’ need to mine the value of data, making new evaluation features possible, such as mobile social network data used for public safety assessment, smart meter data for energy efficiency assessment, and smart payment data for residents’ credit assessment.
Conclusions
This article reviews systematic evaluation and improvement research in the big data environment. We first focus on the development of evaluation methods under different big data analytics techniques and data characteristics. Systematic improvement research under different data analysis techniques and different information characteristics is also reviewed. We then summarize the applied research of system evaluation from six aspects: the medical industry, finance, business, information systems, transportation, and other areas, and discuss the future development of evaluation methods in the context of big data. Research on evaluation methods and systematic improvement in the big data environment is a major innovation over traditional evaluation theories and methods, and will help promote the scientific development of evaluation. The study of evaluation theory and methods in the context of big data is a new field of big data research, and its results can serve as an important module of management decision-making. Systematic evaluation and improvement also have broad research and application prospects.