计算机科学
旅游
过度拟合
特征选择
北京
选择(遗传算法)
水准点(测量)
Web搜索查询
机器学习
特征(语言学)
数据挖掘
互联网
人工智能
情报检索
搜索引擎
中国
地理
万维网
人工神经网络
考古
哲学
语言学
大地测量学
作者
Xin Li,Hengyun Li,Bing Pan,Rob Law
标识
DOI:10.1177/0047287520934871
摘要
Prior studies have shown that Internet search query data have great potential to improve tourism forecasting. As such, selecting the most relevant information from large amounts of search query data is crucial to enhancing forecasting accuracy and reducing overfitting; however, such feature selection methods have not been considered in the tourism forecasting literature. This study employs four machine learning–based feature selection methods to extract useful search query data and construct relevant econometric models. We examined the proposed methods based on monthly forecasting of tourist arrivals in Beijing, China, along with weekly forecasting of hotel occupancy in the city of Charleston, South Carolina, USA. Our findings indicate that the forecasting model with the selected search keywords outperformed the benchmark ARMAX model without feature selection in forecasting tourism demand and hotel occupancy. Therefore, machine learning methods can identify the most useful search query data to significantly improve forecasting accuracy in tourism and hospitality.
科研通智能强力驱动
Strongly Powered by AbleSci AI