Abstract In practical engineering, the features of rolling bearing vibration signals often vary in distribution under different working conditions, and obtaining sample labels for target working conditions can be challenging. Therefore, a multi-scale depth subdomain adaptive convolutional neural network (MS-DSACNN) fault diagnosis method is proposed. The MS-DSACNN method is based on a multi-scale feature extraction method combined with an improved 1D-ConvNeXt architecture, which fuses low- and high-level features into multi-scale fault features. A channel attention mechanism is also introduced to adaptively assign weights to the fused multi-scale features. The local maximum mean discrepancy is constructed by extracting features and their corresponding source and pseudo-label inputs for the source and target working conditions, respectively, to construct the loss function for backpropagation. The MS-DSACNN model was evaluated in experiments with two different loads and one variable speed dataset. The experimental results indicate that MS-DSACNN has higher feature extraction capability and diagnostic accuracy than other unsupervised cross-condition fault diagnosis techniques. Additionally, the scenario where the signal captured by the sensor contains a considerable amount of noise in actual working conditions is considered.