"transformer-based encoder" Papers
2 papers found
Conference
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance Head-pose and Facial Expression Features
Andre Rochow, Max Schwarz, Sven Behnke
CVPR 2024arXiv:2404.09736
23
citations
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
Tim Salzmann, Markus Ryll, Alex Bewley et al.
ECCV 2024arXiv:2403.14270
8
citations