Xiaoyan Lu, Xinde Li, Chuanfei Hu, Fir Dunkin, Heqing Li, Shuzhi Sam Ge
Source
Journal: IEEE Transactions on Instrumentation and Measurement [Institute of Electrical and Electronics Engineers] Date: 2023-11-06 Volume/Issue: 73: 1-10
Identifier
DOI:10.1109/tim.2023.3330216
Abstract
Group Re-identification (GReID) is crucial in intelligent video surveillance for retrieving human groups across cameras. However, existing works mainly focus on group variation challenges, such as membership and layout changes, and neglect occluded groups. To address this, we propose a novel Siamese Transformer for GReID that integrates a multi-scale feature transform with joint learning. Specifically, the multi-scale feature transform comprises global feature mapping and local feature encoding. To enhance robustness against occlusions, local feature encoding uses a random patch regrouping module (RPRM) and dynamical alignment of local features (DALF). RPRM rearranges and transforms member patch embeddings, generating local features with diversified coverage for handling occluded groups. DALF dynamically aligns the local features to handle misalignment caused by occlusions. Additionally, we employ joint learning of identification and verification to extract robust and discriminative group representations. Experimental results on three benchmark datasets confirm the effectiveness and superiority of our proposed method.
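The idea of rearranging patch embeddings into randomly composed groups can be illustrated with a minimal sketch. It is not the paper's RPRM: the abstract does not specify how patches are regrouped or transformed, so the function name `regroup_patches`, the round-robin split, and the use of mean pooling in place of a learned transform are all assumptions made here for illustration.

```python
import random


def regroup_patches(patches, num_groups, seed=None):
    """Randomly regroup patch embeddings and pool each group.

    Hypothetical helper: mean pooling stands in for whatever learned
    transform the actual module applies to each regrouped set.
    """
    if not 0 < num_groups <= len(patches):
        raise ValueError("num_groups must be between 1 and len(patches)")
    rng = random.Random(seed)
    order = list(range(len(patches)))
    rng.shuffle(order)
    # Deal shuffled indices round-robin so group sizes differ by at most one.
    groups = [order[g::num_groups] for g in range(num_groups)]
    dim = len(patches[0])
    local_features = []
    for idx in groups:
        # Average the embeddings in this group, one dimension at a time.
        mean = [sum(patches[i][d] for i in idx) / len(idx) for d in range(dim)]
        local_features.append(mean)
    return groups, local_features


# Six toy 2-D patch embeddings, regrouped into three local features.
patches = [[float(i), float(i) * 2.0] for i in range(6)]
groups, feats = regroup_patches(patches, num_groups=3, seed=0)
```

Because each local feature pools a different random subset of patches, an occlusion that hides some patches affects only part of the local features, which is one plausible reading of "diversified coverage".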