高级检索

一种Transformer与多维特征融合的主动回波目标检测方法

An Active Echo Target Detection Method Combining Transformer and Multi-dimensional Feature Fusion

  • 摘要: 为解决复杂海洋环境下主动声呐目标检测中异构特征难以统一建模、低虚警约束下检测稳定性不足的问题,提出一种多维特征提取与Transformer融合的窗口级检测方法。该方法从波束形成回波中构建原始波形、小波变换匹配滤波、恒虚警检测、短时傅里叶变换、平滑伪Wigner-Ville分布、离散小波分解和梅尔频率倒谱系数等7种互补特征,经一维卷积与线性投影对齐后输入Transformer,实现跨时间、跨特征联合建模。基于10 kHz采样、1.2 s分析窗的海试数据,Transformer-small和Transformer-base在测试集上的F1值分别为0.95750.9606;当窗口级虚警率不高于0.001时,召回率值分别为0.93920.9453,期望校准误差值分别为0.00550.0056。消融实验进一步表明,多维特征融合与Transformer结合能够有效整合MFCC、DWT、SPWVD和RAW等优势模态的信息,无Transformer的深度基线整体低于融合Transformer模型。结果表明,该方法能够兼顾判别能力、低虚警检出能力和概率可靠性,适用于复杂背景下的主动声呐目标检测。

     

    Abstract: To address the challenges of unified modeling for heterogeneous features and unstable detection performance under strict false-alarm constraints in active sonar target detection, this paper proposes a window-level method that integrates multidimensional feature extraction with Transformer-based fusion. Seven complementary features—namely, raw waveform (RAW), wavelet-modulated frequency (Wavelet-MF), CFAR output, short-time Fourier transform (STFT), smoothed pseudo-Wigner–Ville distribution (SPWVD), discrete wavelet decomposition (DWT), and mel-frequency cepstral coefficients (MFCC)—are extracted from beamformed echoes and temporally aligned via one-dimensional convolution and linear projection prior to joint temporal–feature modeling by the Transformer. Experiments on sea-trial data (10 kHz sampling rate, 1.2 s analysis window) show that Transformer-Small and Transformer-Base achieve F1 scores of 0.9575 and 0.9606, respectively. When the window-level false alarm rate is constrained to below 0.001, their recall values remain at 0.9392 and 0.9453, and their expected calibration errors (ECE) are 0.0055 and 0.0056, respectively. Ablation studies further demonstrate that the proposed fusion scheme effectively aggregates the most informative modalities—particularly MFCC, DWT, SPWVD, and RAW—whereas non-Transformer deep baselines consistently underperform the fused Transformer model. Thus, the proposed method provides a robust and reliable solution for active sonar target detection in complex acoustic backgrounds.

     

/

返回文章
返回