计算机科学
依赖关系(UML)
命名实体识别
人工智能
对象(语法)
情报检索
基线(sea)
信息抽取
目标检测
自然语言处理
数据挖掘
机器学习
模式识别(心理学)
任务(项目管理)
海洋学
管理
经济
地质学
作者
Hsiu-Wei Yang,Abhinav Agrawal
标识
DOI:10.1145/3539618.3591852
摘要
Accurate Named Entity Recognition (NER) is crucial for various information retrieval tasks in industry. However, despite significant progress in traditional NER methods, the extraction of Complex Named Entities remains a relatively unexplored area. In this paper, we propose a novel system that combines object detection for Document Layout Analysis (DLA) with weakly supervised learning to address the challenge of extracting discontinuous complex named entities in legal documents. Notably, to the best of our knowledge, this is the first work to apply weak supervision to DLA. Our experimental results show that the model trained solely on pseudo labels outperforms the supervised baseline when gold-standard data is limited, highlighting the effectiveness of our proposed approach in reducing the dependency on annotated data.
科研通智能强力驱动
Strongly Powered by AbleSci AI