计算机科学
情绪分析
自然语言处理
股票市场
人工智能
数据科学
特征选择
选择(遗传算法)
库存(枪支)
背景(考古学)
机器学习
情报检索
机械工程
生物
工程类
古生物学
作者
Andrew Chin,Yuyu Fan,Chang Ge,Haobo Zhang
出处
期刊:The journal of financial data science
[Pageant Media US]
日期:2023-12-13
卷期号:6 (1): 116-137
标识
DOI:10.3905/jfds.2023.1.142
摘要
With the explosive growth of text documents and the huge interest in text-mining techniques, investors are increasingly looking for natural language processing (NLP)-based features to provide an edge in stock selection. Although there has been an abundance of research on English documents, other languages have not seen the same attention. The authors apply these techniques to Chinese documents to generate features for stock selection strategies in the China A-shares market. They first introduce a novel dataset of Chinese broker research reports. The authors then provide an overview of the challenges in processing Chinese text data and detail the text-mining techniques they use to extract meaningful insights to separate outperformers from underperformers. They find that NLP models show significant promise in the Chinese market, with various types of features carrying impactful information. Indeed, this article suggests that sentiment and complexity features are predictive of future stock performance. The authors also find that context-aware approaches using BERT-based models are superior to bag-of-words approaches. Finally, they show that combinations of complexity and sentiment features can yield even stronger results.
科研通智能强力驱动
Strongly Powered by AbleSci AI