VLM-guided training

Outline

Object Detection

  1. ViLD (two-stage, open vocabulary)
  2. DetPro (ViLD + prompt learning)
  3. HierKD (one-stage, open vocabulary)
  4. Pix2Seq (language modeling framework)

Image Segmentation

  1. LSeg (open vocabulary)
  2. CLIMS (weakly supervised segmentation)

Other applications

  1. GALS (attention)
  2. PLG (deep metric learning)

Talk recording

https://meeting.tencent.com/v2/cloud-record/share?id=a44ef18e-b57f-4c0d-90a9-14f9ffe20415&from=3

Slides

Link: https://pan.baidu.com/s/1vJ9S2Cv-YLpN_w4aefq6tg (extraction code: o1f6)