"transformer decoder" Papers
11 papers found
ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points
Qirui Huang, Runze Zhang, Kangjun Liu et al.
CVPR 2025 (highlight) | arXiv:2503.02745
3 citations
Bridging Training and Execution via Dynamic Directed Graph-Based Communication in Cooperative Multi-Agent Systems
Zhuohui Zhang, Bin He, Bin Cheng et al.
AAAI 2025 | arXiv:2408.07397
6 citations
Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models
Jiaqi Cao, Jiarui Wang, Rubin Wei et al.
NeurIPS 2025 | arXiv:2508.09874
3 citations
See What You Are Told: Visual Attention Sink in Large Multimodal Models
Seil Kang, Jinyeong Kim, Junhyeok Kim et al.
ICLR 2025 | arXiv:2503.03321
61 citations
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
Jiajin Tang, Zhengxuan Wei, Yuchen Zhu et al.
ICCV 2025 | arXiv:2509.23867
2 citations
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
Wei Chen, Long Chen, Yu Wu
ECCV 2024 | arXiv:2408.01120
17 citations
DiffSED: Sound Event Detection with Denoising Diffusion
Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia et al.
AAAI 2024 | arXiv:2308.07293
13 citations
OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection
Jinghua Hou, Tong Wang, Xiaoqing Ye et al.
ECCV 2024 | arXiv:2407.10753
12 citations
RoadPainter: Points Are Ideal Navigators for Topology transformER
Zhongxing Ma, Liang Shuang, Yongkun Wen et al.
ECCV 2024 | arXiv:2407.15349
11 citations
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather
Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.
ECCV 2024 | arXiv:2508.16408
13 citations
Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Mir Rayat Imtiaz Hossain, Mennatullah Siam, Leonid Sigal et al.
CVPR 2024 | arXiv:2404.11732
21 citations