大纲

Task Definition

Existing Methods

  • Chen, Dave Zhenyu, et al. “Scanrefer: 3d object localization in rgb-d scans using natural language.” ECCV, 2020.
  • Achlioptas, Panos, et al. “Referit3d: Neural listeners for fine-grained 3d object identification in real-world scenes.” ECCV, 2020.
  • Yuan, Zhihao, et al. “Instancerefer: Cooperative holistic understanding for visual grounding on point clouds through instance multi-level contextual referring.” ICCV. 2021.
  • Yang, Zhengyuan, et al. “SAT: 2d semantics assisted training for 3d visual grounding.” ICCV. 2021.
  • Zhao, Lichen, et al. “3DVG-Transformer: Relation modeling for visual grounding on point clouds.” ICCV. 2021.
  • Huang, Shijia, et al. “Multi-View Transformer for 3D Visual Grounding.” CVPR. 2022.
  • Luo, Junyu, et al. “3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection.” CVPR. 2022.

视频 & PPT

  • 会议录制视频&PPT链接: