Analysis of the prediction capability of web search data based on the HE-TDC method ‒ prediction of the volume of daily tourism visitors

Geng Peng , Ying Liu , Jiyuan Wang , Jifa Gu

Journal of Systems Science and Systems Engineering ›› 2017, Vol. 26 ›› Issue (2) : 163 -182.

PDF
Journal of Systems Science and Systems Engineering ›› 2017, Vol. 26 ›› Issue (2) : 163 -182. DOI: 10.1007/s11518-016-5311-7
Article

Analysis of the prediction capability of web search data based on the HE-TDC method ‒ prediction of the volume of daily tourism visitors

Author information +
History +
PDF

Abstract

Web search query data are obtained to reflect social spots and serve as novel economic indicators. When faced with high-dimensional query data, selecting keywords that have plausible predictive ability and can reduce dimensionality is critical. This paper presents a new integrative method that combines Hurst Exponent (HE) and Time Difference Correlation (TDC) analysis to select keywords with powerful predictive ability. The method is called the HE-TDC screening method and requires keywords with predictive ability to satisfy two characteristics, namely, high correlation and fluctuation memorability similar to the predicting target series. An empirical study is employed to predict the volume of tourism visitors in the Jiuzhai Valley scenic area. The study shows that keywords selected using HE-TDC method produce a model with better robustness and predictive ability.

Keywords

Tourism visitor volume prediction / web-search data / HE-TDC method / Jiuzhai Valley / time series / Hurst exponent

Cite this article

Download citation ▾
Geng Peng, Ying Liu, Jiyuan Wang, Jifa Gu. Analysis of the prediction capability of web search data based on the HE-TDC method ‒ prediction of the volume of daily tourism visitors. Journal of Systems Science and Systems Engineering, 2017, 26(2): 163-182 DOI:10.1007/s11518-016-5311-7

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Bangwayo-Skeete P. F., Skeete R. W.. Can Google data improve the forecasting performance of tourist arrivals? mixed-data sampling approach. Tourism Management, 2015, 46: 454-464.

[2]

Brynjolfsson E., Geva T., Reichman S.. Crowd-squared: amplifying the predictive power of search trend data, 2015

[3]

CNNIC. Statistical Report on the Development of China Internet Network in the Thirty-Fifth Time, 2014

[4]

Butler D.. When Google got flu wrong. Nature, 2013, 494(7436): 155.

[5]

Du J., Xu H., Huang X.. Box office prediction based on microblog. Expert Systems with Applications, 2014, 41(4): 1680-1689.

[6]

Ginsberg J., Mohebbi M. H., Patel R. S., Brammer L., Smolinski M. S., Brilliant L.. Detecting influenza epidemics using search engine query data. Nature, 2009, 457(7232): 1012-1014.

[7]

Lazer D., Kennedy R., King G., Vespignani A.. Big data. The parable of Google flu: traps in big data analysis. Science (NY), 2014, 343(6176): 1203

[8]

Liu Y., Chen Y., Wu S., Peng G., Lv B.. Composite leading search index: a preprocessing method of internet search data for stock trends prediction. Annals of Operations Research, 2015, 234(1): 77-94.

[9]

Peng G., Wang J.Y.. Detecting syphilis amount in China based on Baidu query data. International Conference on Soft Computing in Information Communication Technology, 2014

[10]

Preis T., Moat H.S., Stanley H.E.. Quantifying trading behavior in financial markets using google trends. Scientific Reports, 2013, 3: 1684.

[11]

Scott S. L., Varian H. R.. Bayesian variable selection for nowcasting economic time series. National Bureau of Economic Research, 2013.

[12]

Vaughan L., Romero-Frías E.. Web search volume as a predictor of academic fame: an exploration of Google Trends. Journal of the Association for Information Science and Technology, 2014, 65(4): 707-720.

[13]

Wang J.Y., Peng G., Dai W.. Prediction of online trade growth using search-ANFIS: transactions on Taobao as examples. 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), July 6-11, 2014, Beijing, China, 2014

[14]

Wu L., Brynjolfsson E.. The future of prediction: how Google searches foreshadow housing prices and sales, 2014

[15]

Yang X., Pan B., Evans J. A., Lv B.. Forecasting Chinese tourist volume with search engine data. Tourism Management, 2015, 46: 386-397.

[16]

Yang Y., Pan B., Song H.. Predicting hotel demand using destination marketing organization’s WEB traffic data. Journal of Travel Research, 2014, 53(4): 433-447.

[17]

Yuan Q., Nsoesie E. O., Lv B., Peng G., Chunara R., Brownstein J. S.. Monitoring influenza epidemics in china with search query from Baidu. PloS one, 2013, 8(5): e64323.

AI Summary AI Mindmap
PDF

132

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/