计算机科学
地铁列车时刻表
数据科学
样品(材料)
领域(数学分析)
基线(sea)
人工智能
自然语言处理
情报检索
操作系统
数学分析
化学
海洋学
数学
色谱法
地质学
作者
Nan Gao,Ali Touran,Qi Wang
标识
DOI:10.1061/9780784483961.034
摘要
Cost overruns and schedule delays in US transit projects have been of growing concern for years. Nevertheless, the data availability and sample size have restricted quantitative analysis toward investigating the risks leading to overruns. Innovative data sources and collection methods need to be identified in addition to traditional surveys and case studies. News articles report on issues and risk events leading to overruns as projects progress but have not yet been explored in the construction domain. The difficulty lies in data compilation and analysis. To fill this gap, the paper tested combinations of different natural language processing (NLP) and machine learning methods to automatically identify risk narratives from news articles. The risk sentences are classified into 5 categories and 26 subcategories through a content analysis approach. Then the risks are ranked and analyzed using an appropriate co-occurrence network. The research demonstrates the possibility of integrating NLP and network analysis for exploring publicly available textual documents to explain project performance issues. The approach serves as a baseline for future studies to develop more intelligent models to examine a wide range of media data and other textual reports in the construction domain.
科研通智能强力驱动
Strongly Powered by AbleSci AI