Paper "vision transformers" Papers
13 papers found
Conference
A Stochastic Approach to Bi-Level Optimization for Hyperparameter Optimization and Meta Learning
Minyoung Kim, Timothy Hospedales
AAAI 2025paperarXiv:2410.10417
2
citations
Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification
Yucong Meng, Zhiwei Yang, Yonghong Shi et al.
AAAI 2025paperarXiv:2412.10776
6
citations
Configuring Data Augmentations to Reduce Variance Shift in Positional Embedding of Vision Transformers
Bum Jun Kim, Sang Woo Kim
AAAI 2025paperarXiv:2405.14115
2
citations
Generative Medical Segmentation
Jiayu Huo, Xi Ouyang, Sébastien Ourselin et al.
AAAI 2025paperarXiv:2403.18198
5
citations
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi, Hyeyoon Lee, Dain Kwon et al.
AAAI 2025paperarXiv:2407.20021
7
citations
Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing
Xiongfei Su, Siyuan Li, Yuning Cui et al.
AAAI 2025paperarXiv:2503.01136
16
citations
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection
Qiang Qi, Xiao Wang
AAAI 2025paperarXiv:2503.13903
5
citations
A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis
Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee
AAAI 2024paperarXiv:2308.07301
9
citations
Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget
Johannes Lehner, Benedikt Alkin, Andreas Fürst et al.
AAAI 2024paperarXiv:2304.10520
22
citations
LION: Implicit Vision Prompt Tuning
Haixin Wang, Jianlong Chang, Yihang Zhai et al.
AAAI 2024paperarXiv:2303.09992
36
citations
Spatial Transform Decoupling for Oriented Object Detection
Hongtian Yu, Yunjie Tian, Qixiang Ye et al.
AAAI 2024paperarXiv:2308.10561
52
citations
TOP-ReID: Multi-Spectral Object Re-identification with Token Permutation
Yuhao Wang, Xuehu Liu, Pingping Zhang et al.
AAAI 2024paperarXiv:2312.09612
45
citations
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
Dezhi Peng, Chongyu Liu, Yuliang Liu et al.
AAAI 2024paperarXiv:2306.12106
18
citations