计算机科学
代表(政治)
词(群论)
自然语言处理
基线(sea)
人工智能
自然语言
自然语言理解
语言模型
情报检索
语言学
地质学
法学
哲学
海洋学
政治
政治学
作者
Narendra Babu Unnam,P. Krishna Reddy,Amit Pandey,Naresh Manwani
标识
DOI:10.1145/3632410.3632500
摘要
In the current digital era, about 80% of the digital data which is being generated is unstructured and unlabeled natural language text. In the development cycle of information retrieval and text mining applications, text representation is the most fundamental and critical step, as its effectiveness directly impacts the application's performance. The existing traditional text representation frameworks are mostly frequency distribution-based. In this work, we explored the spatial distribution of word embeddings and proposed two text representational models. The experimental demonstrated that proposed models perform consistently better at text mining tasks compared to baseline methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI