计算机科学
级联
变压器
人工智能
机器学习
数据挖掘
安全监测
管道(软件)
实时计算
工程类
化学工程
电气工程
生物技术
电压
生物
程序设计语言
作者
Bubryur Kim,E. An,Sung-Ho Kim,K. R. Sri Preethaa,Dong‐Eun Lee,R. R. Lukacs
标识
DOI:10.1007/s10462-024-10839-7
摘要
Abstract In the inherently hazardous construction industry, where injuries are frequent, the unsafe operation of heavy construction machinery significantly contributes to the injury and accident rates. To reduce these risks, this study introduces a novel framework for detecting and classifying these unsafe operations for five types of construction machinery. Utilizing a cascade learning architecture, the approach employs a Super-Resolution Generative Adversarial Network (SRGAN), Real-Time Detection Transformers (RT-DETR), self-DIstillation with NO labels (DINOv2), and Dilated Neighborhood Attention Transformer (DiNAT) models. The study focuses on enhancing the detection and classification of unsafe operations in construction machinery through upscaling low-resolution surveillance footage and creating detailed high-resolution inputs for the RT-DETR model. This enhancement, by leveraging temporal information, significantly improves object detection and classification accuracy. The performance of the cascaded pipeline yielded an average detection and first-level classification precision of 96%, a second-level classification accuracy of 98.83%, and a third-level classification accuracy of 98.25%, among other metrics. The cascaded integration of these models presents a well-rounded solution for near-real-time surveillance in dynamic construction environments, advancing surveillance technologies and significantly contributing to safety management within the industry.
科研通智能强力驱动
Strongly Powered by AbleSci AI