Paper Papers

5,964 papers found • Page 72 of 120

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning

Taewhan Kim, Soeun Lee, Si-Woo Kim et al.

AAAI 2025paperarXiv:2412.19289

ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction

Yi Feng, Yu Han, Xijing Zhang et al.

AAAI 2025paperarXiv:2412.11210
7
citations

Virtual Museum Tour Agent: Effects of Responsiveness and Awareness

Anant Upadhyay, Fu-Chia Yang, Christos Mousas

ISMAR 2025paper

Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning

Xingbo Fu, Zihan Chen, Yinhan He et al.

AAAI 2025paperarXiv:2412.19229
5
citations

Virtual Pass-through: Evaluating 3D Gaussian Splatting as an Alternative to Conventional Video Pass-through in Static Environments

Andy Schleising, Christian Kunert, Tobias Schwandt et al.

ISMAR 2025paper
1
citations

Virtual Roomie: Immersive Layout Co-design with a Virtual Agent

Angela L. Jimenez, Pedro Acevedo, Christos Mousas

ISMAR 2025paper

Visceral Notices and Privacy Mechanisms for Eye Tracking in Augmented Reality

Nissi Otoo, Kailon Blue, G. Nikki Ramirez et al.

ISMAR 2025paper
1
citations

Vision-aware Multimodal Prompt Tuning for Uploadable Multi-source Few-shot Domain Adaptation

Kuanghong Liu, Jin Wang, Kangjian He et al.

AAAI 2025paperarXiv:2503.06106
2
citations

Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning

Hao Ma, Shijie Wang, Zhiqiang Pu et al.

AAAI 2025paperarXiv:2502.13430

Vision-guided Text Mining for Unsupervised Cross-modal Hashing with Community Similarity Quantization

Haozhi Fan, Yuan Cao

AAAI 2025paper

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Ziqiao Ma, Jing Ding, Xuejun Zhang et al.

COLM 2025paperarXiv:2504.16060
3
citations

Vision Transformers Beat WideResNets on Small Scale Datasets Adversarial Robustness

Juntao Wu, Ziyu Song, Xiaoyu Zhang et al.

AAAI 2025paper

VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information

Ryo Kamoi, Yusen Zhang, Sarkar Snigdha Sarathi Das et al.

COLM 2025paperarXiv:2412.00947
22
citations

VisRec: A Semi-Supervised Approach to Visibility Data Reconstruction in Radio Astronomy

Ruoqi Wang, Haitao Wang, Qiong Luo et al.

AAAI 2025paper

Visual and Auditory Feedback of Vibration, and Particle Effects for Enhancing Pseudo-Haptic Button Interaction in VR

Myeongji Ko, Woojoo Kim

ISMAR 2025paper

Visual Perturbation for Text-Based Person Search

Pengcheng Zhang, Xiaohan Yu, Xiao Bai et al.

AAAI 2025paper

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

Can Jin, Tianjin Huang, Yihua Zhang et al.

AAAI 2025paperarXiv:2312.01397
30
citations

Visual Reinforcement Learning with Residual Action

Zhenxian Liu, Peixi Peng, Yonghong Tian

AAAI 2025paper
4
citations

Visual Representations inside the Language Model

Benlin Liu, Amita Kamath, Madeleine Grunde-McLaughlin et al.

COLM 2025paper
2
citations

VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation

Ziang Ye, Yang Zhang, Wentao Shi et al.

COLM 2025paperarXiv:2507.06899
3
citations

Visuo-Tactile Feedback with Hand Outline Styles for Modulating Affective Roughness Perception

Minju Baeck, Yoonseok Shin, Dooyoung Kim et al.

ISMAR 2025paperarXiv:2508.13504

VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion

Meng Wang, Huilong Pi, Ruihui Li et al.

AAAI 2025paperarXiv:2503.06219
9
citations

VOILA: Complexity-Aware Universal Segmentation of CT Images by Voxel Interacting with Language

Zishuo Wan, Yu Gao, Wanyuan Pang et al.

AAAI 2025paperarXiv:2501.03482

Voter Priming Campaigns: Strategies, Equilibria, and Algorithms

Jonathan Shaki, Yonatan Aumann, Sarit Kraus

AAAI 2025paperarXiv:2412.13380

Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo-Labeling

Haoran Li, Xingjian Li, Jiahua Shi et al.

AAAI 2025paperarXiv:2406.18610
3
citations

VProChart: Answering Chart Question Through Visual Perception Alignment Agent and Programmatic Solution Reasoning

Muye Huang, Lingling Zhang, Han Lai et al.

AAAI 2025paperarXiv:2409.01667
5
citations

VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers

Juncan Deng, Shuaiting Li, Zeyu Wang et al.

AAAI 2025paperarXiv:2408.17131
11
citations

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering

Chun-Mei Feng, Yang Bai, Tao Luo et al.

AAAI 2025paperarXiv:2312.12273
10
citations

VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization

Tao Liu, Ziyang Ma, Qi Chen et al.

AAAI 2025paperarXiv:2412.09892
8
citations

VR as a ``Drop-In'' Well-being Tool for Knowledge Workers

Sophia Ppali, Haris Psallidopoulos, Marios Constantinides et al.

ISMAR 2025paperarXiv:2510.02836

VR Onboarding Procedures for Multiple Collocated Users: See-Through Tutorials and Group Transitions

Ephraim Schott, Tony Jan Zoeppig, Pramoch Viriyathomrongul et al.

ISMAR 2025paper

VRtalk: Real-time Interactive Intelligent Anime Avatars in Virtual Reality

Yuan Yu, Chunlei Xu, Shirao Yang et al.

ISMAR 2025paper

VRTennis: Forehand Training in Virtual Reality with Rule-Based Motion Analysis and Multimodal Feedback

Anna Sebernegg, Peter Kán

ISMAR 2025paper

VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

Qiang Hu, Houqiang Zhong, Zihan Zheng et al.

AAAI 2025paperarXiv:2412.11362
9
citations

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Yongxin Guo, Jingyu Liu, Mingda Li et al.

AAAI 2025paperarXiv:2405.13382
57
citations

VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution

Rui Lu, Bihai Zhang, Dan Wang

AAAI 2025paperarXiv:2502.17880

Walking the Web of Concept-Class Relationships in Incrementally Trained Interpretable Models

Susmit Agrawal, Deepika Vemuri, Sri Siddarth Chakaravarthy P et al.

AAAI 2025paperarXiv:2502.20393
1
citations

Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration

Zijian Wang, Bin Wang, Haifeng Jing et al.

AAAI 2025paperarXiv:2408.01880
2
citations

WarpVision: Using Spatial Curvature to Guide Attention in Virtual Reality

Jérôme Kudnick, Martin Weier, Colin Groth et al.

ISMAR 2025paper
1
citations

Wasserstein Distance Constraint and Parameter Sparsification for Batched and Iterative Knowledge Editing

Shanbao Qiao, Xuebing Liu, Seung-Hoon Na

AAAI 2025paper

Watch Out for Your Guidance on Generation! Exploring Conditional Backdoor Attacks against Large Language Models

Jiaming He, Wenbo Jiang, Guanyu Hou et al.

AAAI 2025paperarXiv:2404.14795

Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection

Sung Jin Um, Dongjin Kim, Sangmin Lee et al.

AAAI 2025paperarXiv:2501.02504
4
citations

WatE: A Wasserstein t-distributed Embedding Method for Information-enriched Graph Visualization

Minjie Cheng, Dixin Luo, Hongteng Xu

AAAI 2025paper

WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration

Laibin Chang, Yunke Wang, Longxiang Deng et al.

AAAI 2025paper
6
citations

Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening

Jie Huang, Rui Huang, Jinghao Xu et al.

AAAI 2025paperarXiv:2502.04903
20
citations

Wavelet-Driven Masked Image Modeling: A Path to Efficient Visual Representation

Wenzhao Xiang, Chang Liu, Hongyang Yu et al.

AAAI 2025paperarXiv:2503.00782
1
citations

WaveletMixer: A Multi-Resolution Wavelets Based MLP-Mixer for Multivariate Long-Term Time Series Forecasting

Zichi Zhang, Tuan Dung Pham, Yimeng An et al.

AAAI 2025paper
2
citations

WaveLoss: An Adaptive Dynamic Loss for Deep Gait Recognition

Zicheng Wang, Qiuxia Wu

AAAI 2025paper
1
citations

Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors

Fan Nie, Lan Feng, Haotian Ye et al.

COLM 2025paperarXiv:2504.04785
11
citations

Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration

Siyang Feng, Huadeng Wang, Chu Han et al.

AAAI 2025paper