187 / 2025-06-13 15:04:50
CDADCLIP: Learning Prompts with Hybrid Semantic Fusion for Few-Shot Anomaly Detection under Domain Shift
Few-shot, Visual-Language, Domain shift
Full paper under review
Ran An / Xi’an Jiaotong University; Xi’an; PR China; 710049; School of Mechanical Engineering
Jiafeng Tang / Xi’an Jiaotong University; School of Mechanical Engineering
Zhibin Zhao / Xi’an Jiaotong University; School of Mechanical Engineering
Xuefeng Chen / State Key Laboratory for Manufacturing Systems Engineering, Xi’an Jiaotong University
Few-shot anomaly detection (FSAD) aims to identify anomalies using models trained on only a few samples. The task is particularly challenging in real-world scenarios because of domain shifts caused by variations in lighting conditions, object pose, and other environmental factors. Recently, large pre-trained vision-language models such as CLIP have shown promise in FSAD. However, most existing approaches rely on manually designed prompts to capture anomaly semantics, which are susceptible to environmental interference and labor-intensive to design. To address this, we propose a cross-domain CLIP for anomaly detection (CDADCLIP), which adapts CLIP to FSAD under domain shift. CDADCLIP incorporates domain-invariant learnable prompts into CLIP to model normal and abnormal semantics. Furthermore, a Hybrid Semantic Fusion (HSF) module enhances anomaly detection performance by integrating region-level information with global features. Experimental results on the AeBAD-S dataset, which exhibits domain shift, demonstrate that our method outperforms existing state-of-the-art methods.
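To make the idea of scoring with normal and abnormal text semantics concrete, the following is a minimal sketch of generic CLIP-style anomaly scoring. It is not the authors' implementation. The abstract does not specify how the learnable prompts are trained or how HSF combines features, so everything here is an assumption: `text_embeds` stands in for the text-encoder outputs of the two prompts, and the weighted sum with weight `alpha` is a hypothetical placeholder for the fusion of global and region-level evidence.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def anomaly_scores(global_feat, patch_feats, text_embeds,
                   temperature=0.07, alpha=0.5):
    """Score an image against 'normal' and 'abnormal' text embeddings.

    global_feat: (D,) image-level feature.
    patch_feats: (N, D) region-level features.
    text_embeds: (2, D); row 0 = normal prompt, row 1 = abnormal prompt.
    alpha: hypothetical fusion weight (an assumption, not from the paper).
    Returns (image_score, patch_map) with patch_map of shape (N,).
    """
    text = l2_normalize(np.asarray(text_embeds, dtype=float))
    feats = np.vstack([np.asarray(global_feat, dtype=float)[None, :],
                       np.asarray(patch_feats, dtype=float)])
    feats = l2_normalize(feats)

    # Cosine similarity to each prompt, sharpened by the temperature.
    logits = feats @ text.T / temperature            # shape (N + 1, 2)
    logits = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(logits)
    probs = probs / probs.sum(axis=1, keepdims=True)

    p_abnormal = probs[:, 1]
    global_score = p_abnormal[0]
    patch_map = p_abnormal[1:]

    # Placeholder fusion: blend the global score with the most anomalous region.
    image_score = alpha * global_score + (1.0 - alpha) * patch_map.max()
    return image_score, patch_map
```

In this sketch, a single strongly abnormal region raises the image-level score even when the global feature looks normal, which is the motivation for combining region-level and global information.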
Important dates
  • Conference dates: August 1, 2025 to August 4, 2025
  • June 23, 2025: Draft submission deadline

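Organized by
Equipment Intelligent Operation and Maintenance Branch, Chinese Mechanical Engineering Society
Hosted by
Xinjiang University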