Most Cited 2025 "encoder-decoder adapters" Papers

22,274 papers found • Page 35 of 112

#6801

From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision

Chuang Yu, Jinmiao Zhao, Yunpeng Liu et al.

ICCV 2025arXiv:2412.11154
6
citations
#6802

Improved Representation Steering for Language Models

Zhengxuan Wu, Qinan Yu, Aryaman Arora et al.

NEURIPS 2025spotlightarXiv:2505.20809
6
citations
#6803

FluxSpace: Disentangled Semantic Editing in Rectified Flow Models

Yusuf Dalva, Kavana Venkatesh, Pinar Yanardag

CVPR 2025
6
citations
#6804

How to Train Your LLM Web Agent: A Statistical Diagnosis

Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza et al.

NEURIPS 2025arXiv:2507.04103
6
citations
#6805

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Tomas Soucek, Prajwal Gatti, Michael Wray et al.

CVPR 2025arXiv:2412.01987
6
citations
#6806

Multi-View Pose-Agnostic Change Localization with Zero Labels

Chamuditha Jayanga Galappaththige, Jason Lai, Lloyd Windrim et al.

CVPR 2025arXiv:2412.03911
6
citations
#6807

NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

Lingen Li, Zhaoyang Zhang, Yaowei Li et al.

CVPR 2025arXiv:2412.03517
6
citations
#6808

Bootstrapped Model Predictive Control

Yuhang Wang, Hanwei Guo, Sizhe Wang et al.

ICLR 2025arXiv:2503.18871
6
citations
#6809

4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians

Hidenobu Matsuki, Gwangbin Bae, Andrew J. Davison

CVPR 2025arXiv:2505.22859
6
citations
#6810

High-Dimensional Calibration from Swap Regret

Maxwell Fishelson, Noah Golowich, Mehryar Mohri et al.

NEURIPS 2025oralarXiv:2505.21460
6
citations
#6811

A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation

Andrew Z Wang, Songwei Ge, Tero Karras et al.

CVPR 2025arXiv:2506.08210
6
citations
#6812

DynamicFace: High-Quality and Consistent Face Swapping for Image and Video using Composable 3D Facial Priors

Runqi Wang, Yang Chen, Sijie Xu et al.

ICCV 2025arXiv:2501.08553
6
citations
#6813

Functionality Understanding and Segmentation in 3D Scenes

Jaime Corsetti, Francesco Giuliari, Alice Fasoli et al.

CVPR 2025highlightarXiv:2411.16310
6
citations
#6814

Strategyproof Reinforcement Learning from Human Feedback

Thomas Kleine Buening, Jiarui Gan, Debmalya Mandal et al.

NEURIPS 2025arXiv:2503.09561
6
citations
#6815

Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning

Yaorui Shi, Sihang Li, Chang Wu et al.

NEURIPS 2025arXiv:2505.11277
6
citations
#6816

GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting

Andrew Bond, Jui-Hsien Wang, Long Mai et al.

ICCV 2025arXiv:2501.04782
6
citations
#6817

SVIP: Semantically Contextualized Visual Patches for Zero-Shot Learning

Zhi Chen, Zecheng Zhao, Jingcai Guo et al.

ICCV 2025arXiv:2503.10252
6
citations
#6818

Skip-Vision: Efficient and Scalable Acceleration of Vision-Language Models via Adaptive Token Skipping

Weili Zeng, Ziyuan Huang, Kaixiang Ji et al.

ICCV 2025arXiv:2503.21817
6
citations
#6819

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Han Lin, Jaemin Cho, Amir Zadeh et al.

NEURIPS 2025arXiv:2508.05954
6
citations
#6820

Towards Resilient Safety-driven Unlearning for Diffusion Models against Downstream Fine-tuning

Boheng Li, Renjie Gu, Junjie Wang et al.

NEURIPS 2025arXiv:2507.16302
6
citations
#6821

IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement

Zhihao Shi, Dong Huo, Yuhongze Zhou et al.

CVPR 2025arXiv:2503.04501
6
citations
#6822

SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification

Zhenglin Lai, Mengyao Liao, Bingzhe Wu et al.

NEURIPS 2025arXiv:2506.17368
6
citations
#6823

ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS

Weijie Wang, Donny Y. Chen, Zeyu Zhang et al.

NEURIPS 2025arXiv:2505.23734
6
citations
#6824

Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers

Zhengliang Shi, Lingyong Yan, Dawei Yin et al.

NEURIPS 2025arXiv:2505.20128
6
citations
#6825

New Perspectives on the Polyak Stepsize: Surrogate Functions and Negative Results

Francesco Orabona, Ryan D'Orazio

NEURIPS 2025arXiv:2505.20219
6
citations
#6826

TokenUnify: Scaling Up Autoregressive Pretraining for Neuron Segmentation

Yinda Chen, Haoyuan Shi, Xiaoyu Liu et al.

ICCV 2025arXiv:2405.16847
6
citations
#6827

GIFStream: 4D Gaussian-based Immersive Video with Feature Stream

Hao Li, Sicheng Li, Xiang Gao et al.

CVPR 2025arXiv:2505.07539
6
citations
#6828

Edit360: 2D Image Edits to 3D Assets from Any Angle

Junchao Huang, Xinting Hu, Shaoshuai Shi et al.

ICCV 2025highlightarXiv:2506.10507
6
citations
#6829

PEER Pressure: Model-to-Model Regularization for Single Source Domain Generalization

Dongkyu Cho, Inwoo Hwang, Sanghack Lee

CVPR 2025arXiv:2505.12745
6
citations
#6830

Unified Dense Prediction of Video Diffusion

Lehan Yang, Lu Qi, Xiangtai Li et al.

CVPR 2025arXiv:2503.09344
6
citations
#6831

DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy

Yuran Wang, Ruihai Wu, Yue Chen et al.

NEURIPS 2025spotlightarXiv:2505.11032
6
citations
#6832

Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification

S P Sharan, Minkyu Choi, Sahil Shah et al.

CVPR 2025arXiv:2411.16718
6
citations
#6833

Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution

Siwei Tu, Ben Fei, Weidong Yang et al.

CVPR 2025highlightarXiv:2502.07814
6
citations
#6834

Normalized Attention Guidance: Universal Negative Guidance for Diffusion Models

Dar-Yen Chen, Hmrishav Bandyopadhyay, Kai Zou et al.

NEURIPS 2025arXiv:2505.21179
6
citations
#6835

Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction

Yunheng Li, Yuxuan Li, Quan-Sheng Zeng et al.

ICCV 2025arXiv:2412.06244
6
citations
#6836

AutoURDF: Unsupervised Robot Modeling from Point Cloud Frames Using Cluster Registration

Jiong Lin, Lechen Zhang, Kwansoo Lee et al.

CVPR 2025arXiv:2412.05507
6
citations
#6837

Augmented Deep Contexts for Spatially Embedded Video Coding

Yifan Bian, Chuanbo Tang, Li Li et al.

CVPR 2025highlightarXiv:2505.05309
6
citations
#6838

State Entropy Regularization for Robust Reinforcement Learning

Yonatan Ashlag, Uri Koren, Mirco Mutti et al.

NEURIPS 2025oralarXiv:2506.07085
6
citations
#6839

Blurred LiDAR for Sharper 3D: Robust Handheld 3D Scanning with Diffuse LiDAR and RGB

Nikhil Behari, Aaron Young, Siddharth Somasundaram et al.

CVPR 2025highlightarXiv:2411.19474
6
citations
#6840

From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting

Zhiwei Huang, Hailin Yu, Yichun Shentu et al.

CVPR 2025arXiv:2503.19358
6
citations
#6841

Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization

Xu Zheng, Yuanhuiyi Lyu, Lutao Jiang et al.

ICCV 2025arXiv:2505.06635
6
citations
#6842

PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation

Ao Wang, Hui Chen, Jianchao Tan et al.

NEURIPS 2025arXiv:2412.03409
6
citations
#6843

One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling

Nimrod Berman, Ilan Naiman, Moshe Eliasof et al.

NEURIPS 2025arXiv:2505.13358
6
citations
#6844

D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation

Weinan Jia, Mengqi Huang, Nan Chen et al.

CVPR 2025
6
citations
#6845

Time Series Generation Under Data Scarcity: A Unified Generative Modeling Approach

Tal Gonen, Itai Pemper, Ilan Naiman et al.

NEURIPS 2025oralarXiv:2505.20446
6
citations
#6846

Asymptotics of SGD in Sequence-Single Index Models and Single-Layer Attention Networks

Luca Arnaboldi, Bruno Loureiro, Ludovic Stephan et al.

NEURIPS 2025arXiv:2506.02651
6
citations
#6847

Geometric Learning with Positively Decomposable Kernels

Nathael Da Costa, Cyrus Mostajeran, Juan-Pablo Ortega et al.

NEURIPS 2025arXiv:2310.13821
6
citations
#6848

CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization

Jan Ackermann, Jonas Kulhanek, Shengqu Cai et al.

ICCV 2025arXiv:2506.21117
6
citations
#6849

Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets

Benjamin Dupuis, Paul Viallard, George Deligiannidis et al.

NEURIPS 2025arXiv:2404.17442
6
citations
#6850

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting

Bohao Liao, Wei Zhai, Zengyu Wan et al.

NEURIPS 2025oralarXiv:2410.15392
6
citations
#6851

Test-time Adaptation for Regression by Subspace Alignment

Kazuki Adachi, Shin'ya Yamaguchi, Atsutoshi Kumagai et al.

ICLR 2025arXiv:2410.03263
6
citations
#6852

DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation

Jiashuo Sun, Xianrui Zhong, Sizhe Zhou et al.

NEURIPS 2025arXiv:2505.07233
6
citations
#6853

OpenMIBOOD: Open Medical Imaging Benchmarks for Out-Of-Distribution Detection

Max Gutbrod, David Rauber, Danilo Weber Nunes et al.

CVPR 2025arXiv:2503.16247
6
citations
#6854

Mask Image Watermarking

Runyi Hu, Jie Zhang, Shiqian Zhao et al.

NEURIPS 2025arXiv:2504.12739
6
citations
#6855

Estimating Model Performance Under Covariate Shift Without Labels

Jakub Białek, Juhani Kivimäki, Wojciech Kuberski et al.

NEURIPS 2025arXiv:2401.08348
6
citations
#6856

GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction

Jiahe Li, Jiawei Zhang, Youmin Zhang et al.

NEURIPS 2025spotlightarXiv:2509.18090
6
citations
#6857

MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking

Xinqi Liu, Li Zhou, Zikun Zhou et al.

CVPR 2025highlightarXiv:2411.15459
6
citations
#6858

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Ali Taghibakhshi, Sharath Turuvekere Sreenivas, Saurav Muralidharan et al.

NEURIPS 2025arXiv:2504.11409
6
citations
#6859

Memories of Forgotten Concepts

Matan Rusanovsky, Shimon Malnick, Amir Jevnisek et al.

CVPR 2025highlightarXiv:2412.00782
6
citations
#6860

YOLO-Count: Differentiable Object Counting for Text-to-Image Generation

Guanning Zeng, Xiang Zhang, Zirui Wang et al.

ICCV 2025arXiv:2508.00728
6
citations
#6861

StreamForest: Efficient Online Video Understanding with Persistent Event Memory

Xiangyu Zeng, Kefan Qiu, Qingyu Zhang et al.

NEURIPS 2025oralarXiv:2509.24871
6
citations
#6862

COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection

Jinqi Xiao, Shen Sang, Tiancheng Zhi et al.

CVPR 2025arXiv:2412.00071
6
citations
#6863

AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration

Javier Tirado-Garín, Javier Civera

ICCV 2025arXiv:2503.12701
6
citations
#6864

Scaffolding Dexterous Manipulation with Vision-Language Models

Vincent de Bakker, Joey Hejna, Tyler Lum et al.

NEURIPS 2025arXiv:2506.19212
6
citations
#6865

UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models

Yuning Han, Bingyin Zhao, Rui Chu et al.

CVPR 2025highlightarXiv:2412.11441
6
citations
#6866

UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting

Ziyi Wang, Yanran Zhang, Jie Zhou et al.

CVPR 2025arXiv:2506.09952
6
citations
#6867

Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations

Lorenzo Basile, Santiago Acevedo, Luca Bortolussi et al.

ICLR 2025arXiv:2406.15812
6
citations
#6868

Shape it Up! Restoring LLM Safety during Finetuning

ShengYun Peng, Pin-Yu Chen, Jianfeng Chi et al.

NEURIPS 2025arXiv:2505.17196
6
citations
#6869

Differentially Private Fine-Tuning of Diffusion Models

Yu-Lin Tsai, Yizhe Li, Zekai Chen et al.

ICCV 2025arXiv:2406.01355
6
citations
#6870

AdsQA: Towards Advertisement Video Understanding

Xinwei Long, Kai Tian, Peng Xu et al.

ICCV 2025arXiv:2509.08621
6
citations
#6871

BF-STVSR: B-Splines and Fourier---Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution

Eunjin Kim, HYEONJIN KIM, Kyong Hwan Jin et al.

CVPR 2025arXiv:2501.11043
6
citations
#6872

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Yan Shu, Hangui Lin, Yexin Liu et al.

NEURIPS 2025arXiv:2506.05551
6
citations
#6873

NavBench: Probing Multimodal Large Language Models for Embodied Navigation

Yanyuan Qiao, Haodong Hong, Wenqi Lyu et al.

NEURIPS 2025oralarXiv:2506.01031
6
citations
#6874

HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs

Saleh Ashkboos, Mahdi Nikdan, Rush Tabesh et al.

NEURIPS 2025arXiv:2501.02625
6
citations
#6875

Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models

Beier Zhu, Ruoyu Wang, Tong Zhao et al.

ICCV 2025arXiv:2507.14797
6
citations
#6876

GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation

Ruihai Wu, Ziyu Zhu, Yuran Wang et al.

CVPR 2025arXiv:2503.09243
6
citations
#6877

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

Xuanming Zhang, Yuxuan Chen, Samuel (Min-Hsuan) Yeh et al.

NEURIPS 2025oralarXiv:2505.18943
6
citations
#6878

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

Yukang Cao, Chenyang Si, Jinghao Wang et al.

ICCV 2025arXiv:2507.01953
6
citations
#6879

TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation

Ruineng Li, Daitao Xing, Huiming Sun et al.

CVPR 2025arXiv:2504.08181
6
citations
#6880

StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces

Kyeongmin Yeo, Jaihoon Kim, Minhyuk Sung

ICLR 2025arXiv:2501.15445
6
citations
#6881

Decoupled Graph Energy-based Model for Node Out-of-Distribution Detection on Heterophilic Graphs

Yuhan Chen, Yihong Luo, Yifan Song et al.

ICLR 2025arXiv:2502.17912
6
citations
#6882

DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models

Yongqi Huang, Peng Ye, Chenyu Huang et al.

CVPR 2025arXiv:2503.01359
6
citations
#6883

HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Zhi Jing, Siyuan Yang, Jicong Ao et al.

NEURIPS 2025arXiv:2507.00833
6
citations
#6884

Emulating Self-attention with Convolution for Efficient Image Super-Resolution

Dongheon Lee, Seokju Yun, Youngmin Ro

ICCV 2025highlightarXiv:2503.06671
6
citations
#6885

B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens

Zhuqiang Lu, Zhenfei Yin, Mengwei He et al.

ICCV 2025arXiv:2412.09919
6
citations
#6886

Towards RAW Object Detection in Diverse Conditions

Zhong-Yu Li, Xin Jin, Bo-Yuan Sun et al.

CVPR 2025highlightarXiv:2411.15678
6
citations
#6887

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.

ICLR 2025arXiv:2410.02275
6
citations
#6888

Attention Mechanism, Max-Affine Partition, and Universal Approximation

Hude Liu, Jerry Yao-Chieh Hu, Zhao Song et al.

NEURIPS 2025arXiv:2504.19901
6
citations
#6889

AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving

Jiawei Xu, Kai Deng, Zexin Fan et al.

ICCV 2025arXiv:2507.12137
6
citations
#6890

Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception

ruotian peng, Haiying He, Yake Wei et al.

CVPR 2025arXiv:2504.06666
6
citations
#6891

FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation

Zhuguanyu Wu, Shihe Wang, Jiayi Zhang et al.

CVPR 2025highlightarXiv:2506.11543
6
citations
#6892

Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution

Huan Zheng, Wencheng Han, Jianbing Shen

CVPR 2025arXiv:2411.03239
6
citations
#6893

Audio Super-Resolution with Latent Bridge Models

Chang Li, Zehua Chen, Liyuan Wang et al.

NEURIPS 2025arXiv:2509.17609
6
citations
#6894

BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models

Dingqiang Ye, Chao Fan, Zhanbo Huang et al.

NEURIPS 2025arXiv:2505.18132
6
citations
#6895

DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy

Ming Dai, Wenxuan Cheng, Jiang-Jiang Liu et al.

ICCV 2025arXiv:2507.01738
6
citations
#6896

Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration

Haipeng Fang, Sheng Tang, Juan Cao et al.

CVPR 2025arXiv:2505.11707
6
citations
#6897

Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis

Zixuan Wang, DUO PENG, Feng Chen et al.

CVPR 2025arXiv:2504.01515
6
citations
#6898

Reverse Diffusion Sequential Monte Carlo Samplers

Luhuan Wu, Yi Han, Christian Andersson Naesseth et al.

NEURIPS 2025arXiv:2508.05926
6
citations
#6899

Neuron-based Multifractal Analysis of Neuron Interaction Dynamics in Large Models

Xiongye Xiao, Heng Ping, Chenyu Zhou et al.

ICLR 2025arXiv:2402.09099
6
citations
#6900

HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting

Jingyu Lin, Jiaqi Gu, Lubin Fan et al.

CVPR 2025arXiv:2412.03844
6
citations
#6901

Language-Guided Audio-Visual Learning for Long-Term Sports Assessment

Huangbiao Xu, Xiao Ke, Huanqi Wu et al.

CVPR 2025
6
citations
#6902

Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models

Itay Benou, Tammy Riklin Raviv

CVPR 2025highlightarXiv:2502.20134
6
citations
#6903

Token Perturbation Guidance for Diffusion Models

Javad Rajabi, Soroush Mehraban, Seyedmorteza Sadat et al.

NEURIPS 2025arXiv:2506.10036
6
citations
#6904

Denoising Functional Maps: Diffusion Models for Shape Correspondence

Aleksei Zhuravlev, Zorah Lähner, Vladislav Golyanik

CVPR 2025arXiv:2503.01845
6
citations
#6905

Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers

Efstathios Karypidis, Ioannis Kakogeorgiou, Spyros Gidaris et al.

CVPR 2025arXiv:2501.08303
6
citations
#6906

Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion

Minkyoung Cho, Yulong Cao, Jiachen Sun et al.

ICLR 2025arXiv:2410.12592
6
citations
#6907

Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining

Zhiqi Ge, Juncheng Li, Xinglei Pang et al.

ICCV 2025arXiv:2412.10342
6
citations
#6908

Týr-the-Pruner: Structural Pruning LLMs via Global Sparsity Distribution Optimization

Guanchen Li, Yixing Xu, Zeping Li et al.

NEURIPS 2025arXiv:2503.09657
6
citations
#6909

The Hawthorne Effect in Reasoning Models: Evaluating and Steering Test Awareness

Sahar Abdelnabi, Ahmed Salem

NEURIPS 2025spotlightarXiv:2505.14617
6
citations
#6910

SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer

Hongda Liu, Longguang Wang, Ye Zhang et al.

CVPR 2025highlightarXiv:2503.15934
6
citations
#6911

Manipulating Feature Visualizations with Gradient Slingshots

Dilyara Bareeva, Marina Höhne, Alexander Warnecke et al.

NEURIPS 2025arXiv:2401.06122
6
citations
#6912

One is Plenty: A Polymorphic Feature Interpreter for Immutable Heterogeneous Collaborative Perception

Yuchen Xia, Quan Yuan, Guiyang Luo et al.

CVPR 2025arXiv:2411.16799
6
citations
#6913

Gaussian Splatting with Discretized SDF for Relightable Assets

Zuo-Liang Zhu, jian Yang, Beibei Wang

ICCV 2025arXiv:2507.15629
6
citations
#6914

FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging

Zichen Tang, Haihong E, Jiacheng Liu et al.

ICCV 2025arXiv:2508.04625
6
citations
#6915

Forte : Finding Outliers with Representation Typicality Estimation

Debargha Ganguly, Warren Morningstar, Andrew Yu et al.

ICLR 2025arXiv:2410.01322
6
citations
#6916

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models

Kim Sung-Bin, Jeongsoo Choi, Puyuan Peng et al.

ICCV 2025arXiv:2504.02386
6
citations
#6917

SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction

Zhengyuan Li, Kai Cheng, Anindita Ghosh et al.

CVPR 2025arXiv:2503.18211
6
citations
#6918

Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models

Shuyang Hao, Bryan Hooi, Jun Liu et al.

CVPR 2025arXiv:2411.18000
6
citations
#6919

Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection

Ruiyang Zhang, Hu Zhang, Zhedong Zheng

ICCV 2025arXiv:2408.00619
6
citations
#6920

Spreading Out-of-Distribution Detection on Graphs

Daeho Um, Jongin Lim, Sunoh Kim et al.

ICLR 2025
6
citations
#6921

ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation

Qizhen Lan, Qing Tian

ICCV 2025arXiv:2503.06307
6
citations
#6922

Constant Bit-size Transformers Are Turing Complete

Qian Li, Yuyi Wang

NEURIPS 2025oralarXiv:2506.12027
6
citations
#6923

Zeroth-Order Fine-Tuning of LLMs in Random Subspaces

Ziming Yu, Pan Zhou, Sike Wang et al.

ICCV 2025arXiv:2410.08989
6
citations
#6924

Towards foundational LiDAR world models with efficient latent flow matching

Tianran Liu, Shengwen Zhao, Nicholas Rhinehart

NEURIPS 2025arXiv:2506.23434
6
citations
#6925

7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting

Zhongpai Gao, Benjamin Planche, Meng Zheng et al.

ICCV 2025arXiv:2503.07946
6
citations
#6926

Exploring Simple Open-Vocabulary Semantic Segmentation

Zihang Lai

CVPR 2025arXiv:2401.12217
6
citations
#6927

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Ziqiao Ma, Xuweiyi Chen, Shoubin Yu et al.

NEURIPS 2025oralarXiv:2506.18890
6
citations
#6928

Parametric Point Cloud Completion for Polygonal Surface Reconstruction

Zhaiyu Chen, Yuqing Wang, Liangliang Nan et al.

CVPR 2025arXiv:2503.08363
6
citations
#6929

Prediction-Powered Causal Inferences

Riccardo Cadei, Ilker Demirel, Piersilvio De Bartolomeis et al.

NEURIPS 2025arXiv:2502.06343
6
citations
#6930

Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors

Peiran Xu, Yadong MU

ICLR 2025arXiv:2505.24103
6
citations
#6931

Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation

Yueru Jia, Jiaming Liu, Sixiang Chen et al.

CVPR 2025
6
citations
#6932

Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models

Zhen Zeng, Leijiang Gu, Xun Yang et al.

ICCV 2025arXiv:2411.12790
6
citations
#6933

Procedural Synthesis of Synthesizable Molecules

Michael Sun, Alston Lo, Minghao Guo et al.

ICLR 2025arXiv:2409.05873
6
citations
#6934

GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping

Jinfeng Liu, Lingtong Kong, Bo Li et al.

CVPR 2025arXiv:2503.10143
6
citations
#6935

Optimized Minimal 3D Gaussian Splatting

Joo Chan Lee, Jong Hwan Ko, Eunbyung Park

NEURIPS 2025arXiv:2503.16924
6
citations
#6936

Self-Learning Hyperspectral and Multispectral Image Fusion via Adaptive Residual Guided Subspace Diffusion Model

Jian Zhu, He Wang, Yang Xu et al.

CVPR 2025arXiv:2505.11800
6
citations
#6937

On Disentangled Training for Nonlinear Transform in Learned Image Compression

Han Li, Shaohui Li, Wenrui Dai et al.

ICLR 2025arXiv:2501.13751
6
citations
#6938

Provable Scaling Laws for the Test-Time Compute of Large Language Models

Yanxi Chen, Xuchen Pan, Yaliang Li et al.

NEURIPS 2025arXiv:2411.19477
6
citations
#6939

Doubly Robust Alignment for Large Language Models

Erhan Xu, Kai Ye, Hongyi Zhou et al.

NEURIPS 2025arXiv:2506.01183
6
citations
#6940

SpectralAR: Spectral Autoregressive Visual Generation

Yuanhui Huang, Weiliang Chen, Wenzhao Zheng et al.

ICCV 2025arXiv:2506.10962
6
citations
#6941

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping

Guannan Lai, Yujie Li, Xiangkun Wang et al.

CVPR 2025arXiv:2502.20032
6
citations
#6942

FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution

Gene Chou, Wenqi Xian, Guandao Yang et al.

ICCV 2025highlightarXiv:2504.07093
6
citations
#6943

Unsupervised Model Tree Heritage Recovery

Eliahu Horwitz, Asaf Shul, Yedid Hoshen

ICLR 2025arXiv:2405.18432
6
citations
#6944

Time-to-Event Pretraining for 3D Medical Imaging

Zepeng Frazier Huo, Jason Fries, Alejandro Lozano et al.

ICLR 2025oralarXiv:2411.09361
6
citations
#6945

TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation

Wenhao Wang, Yi Yang

ICCV 2025arXiv:2411.04709
6
citations
#6946

Multi-Task Dense Predictions via Unleashing the Power of Diffusion

Yuqi Yang, Peng-Tao Jiang, Qibin Hou et al.

ICLR 2025
6
citations
#6947

Contradicted in Reliable, Replicated in Unreliable: Dual-Source Reference for Fake News Early Detection

Yifan Feng, Weimin Li, Yue Wang et al.

AAAI 2025paper
6
citations
#6948

Bridging Molecular Graphs and Large Language Models

Runze Wang, Mingqi Yang, Yanming Shen

AAAI 2025paperarXiv:2503.03135
6
citations
#6949

A Critical Look At Tokenwise Reward-Guided Text Generation

Ahmad Rashid, Ruotian Wu, Julia Grosse et al.

COLM 2025paperarXiv:2406.07780
6
citations
#6950

CAMEx: Curvature-aware Merging of Experts

Dung Viet Nguyen, Minh Nguyen, Luc Nguyen et al.

ICLR 2025arXiv:2502.18821
6
citations
#6951

DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype

Qiang Wang, Yuhang He, Songlin Dong et al.

AAAI 2025paperarXiv:2503.18042
6
citations
#6952

Tight Clusters Make Specialized Experts

Stefan Nielsen, Rachel Teo, Laziz Abdullaev et al.

ICLR 2025arXiv:2502.15315
6
citations
#6953

Beyond Entropy: Region Confidence Proxy for Wild Test-Time Adaptation

Zixuan Hu, Yichun Hu, Xiaotong Li et al.

ICML 2025arXiv:2505.20704
6
citations
#6954

Generating Counterfactual Explanations Under Temporal Constraints

Andrei Buliga, Chiara Di Francescomarino, Chiara Ghidini et al.

AAAI 2025paperarXiv:2503.01792
6
citations
#6955

PINNsAgent: Automated PDE Surrogation with Large Language Models

Qingpo Wuwu, Chonghan Gao, Tianyu Chen et al.

ICML 2025arXiv:2501.12053
6
citations
#6956

Algorithms with Calibrated Machine Learning Predictions

Judy Hanwen Shen, Ellen Vitercik, Anders Wikum

ICML 2025spotlightarXiv:2502.02861
6
citations
#6957

LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models

Dachuan Shi, Yonggan Fu, Xiangchi Yuan et al.

ICML 2025arXiv:2507.14204
6
citations
#6958

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

Yu Chen, Jiatai Huang, Yan Dai et al.

ICLR 2025arXiv:2410.03284
6
citations
#6959

Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models

Jianqun Zhou, Yuanlei Zheng, Wei Chen et al.

ICLR 2025arXiv:2410.23841
6
citations
#6960

KIND: Knowledge Integration and Diversion for Training Decomposable Models

Yucheng Xie, Fu Feng, Ruixiao Shi et al.

ICML 2025arXiv:2408.07337
6
citations
#6961

Augmenting Sequential Recommendation with Balanced Relevance and Diversity

Yizhou Dang, Jiahui Zhang, Yuting Liu et al.

AAAI 2025paperarXiv:2412.08300
6
citations
#6962

Inverse Reinforcement Learning by Estimating Expertise of Demonstrators

Mark Beliaev, Ramtin Pedarsani

AAAI 2025paperarXiv:2402.01886
6
citations
#6963

Runtime Analysis for Multi-Objective Evolutionary Algorithms in Unbounded Integer Spaces

Benjamin Doerr, Martin S. Krejca, Günter Rudolph

AAAI 2025paperarXiv:2412.11684
6
citations
#6964

When Bad Data Leads to Good Models

Kenneth Li, Yida Chen, Fernanda Viégas et al.

ICML 2025arXiv:2505.04741
6
citations
#6965

CSformer: Combining Channel Independence and Mixing for Robust Multivariate Time Series Forecasting

Haoxin Wang, Yipeng Mo, Kunlan Xiang et al.

AAAI 2025paperarXiv:2312.06220
6
citations
#6966

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025spotlightarXiv:2502.18147
6
citations
#6967

Are Expressive Models Truly Necessary for Offline RL?

Guan Wang, Haoyi Niu, Jianxiong Li et al.

AAAI 2025paperarXiv:2412.11253
6
citations
#6968

GlycanML: A Multi-Task and Multi-Structure Benchmark for Glycan Machine Learning

Minghao Xu, Yunteng Geng, Yihang Zhang et al.

ICLR 2025arXiv:2405.16206
6
citations
#6969

Aligning Multimodal Representations through an Information Bottleneck

Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.

ICML 2025arXiv:2506.04870
6
citations
#6970

An Information Criterion for Controlled Disentanglement of Multimodal Data

Chenyu Wang, Sharut Gupta, Xinyi Zhang et al.

ICLR 2025arXiv:2410.23996
6
citations
#6971

Flexible Tails for Normalizing Flows

Tennessee Hickling, Dennis Prangle

ICML 2025arXiv:2406.16971
6
citations
#6972

Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging

Ryo Bertolissi, Jonas Hübotter, Ido Hakimi et al.

COLM 2025paperarXiv:2505.14136
6
citations
#6973

Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity

Alessandro Pierro, Steven Abreu, Jonathan Timcheck et al.

ICML 2025arXiv:2502.01330
6
citations
#6974

RouterRetriever: Routing over a Mixture of Expert Embedding Models

Hyunji Lee, Luca Soldaini, Arman Cohan et al.

AAAI 2025paperarXiv:2409.02685
6
citations
#6975

IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities

Bin Wang, Chunyu Xie, Dawei Leng et al.

AAAI 2025paperarXiv:2408.12902
6
citations
#6976

LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage

Yuzhou Nie, Zhun Wang, Ye Yu et al.

COLM 2025paperarXiv:2412.05734
6
citations
#6977

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Matt Riemer, Gopeshh Raaj Subbaraj, Glen Berseth et al.

ICLR 2025arXiv:2412.14355
6
citations
#6978

All You Need in Knowledge Distillation Is a Tailored Coordinate System

Junjie Zhou, Ke Zhu, Jianxin Wu

AAAI 2025paperarXiv:2412.09388
6
citations
#6979

Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation

Julia Kreutzer, Eleftheria Briakou, Sweta Agrawal et al.

COLM 2025paperarXiv:2504.11829
6
citations
#6980

VAE-Var: Variational Autoencoder-Enhanced Variational Methods for Data Assimilation in Meteorology

Yi Xiao, Qilong Jia, Kun Chen et al.

ICLR 2025
6
citations
#6981

Self-Steering Language Models

Gabriel Grand, Joshua B. Tenenbaum, Vikash Mansinghka et al.

COLM 2025paperarXiv:2504.07081
6
citations
#6982

Bridging Compressed Image Latents and Multimodal Large Language Models

Chia-Hao Kao, Cheng Chien, Yu-Jen Tseng et al.

ICLR 2025arXiv:2407.19651
6
citations
#6983

Self-Evolving Critique Abilities in Large Language Models

Zhengyang Tang, Ziniu Li, Zhenyang Xiao et al.

COLM 2025paperarXiv:2501.05727
6
citations
#6984

Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing

Keltin Grimes, Marco Christiani, David Shriver et al.

ICLR 2025arXiv:2412.13341
6
citations
#6985

Stiefel Flow Matching for Moment-Constrained Structure Elucidation

Austin H Cheng, Alston Lo, Kin Long Kelvin Lee et al.

ICLR 2025arXiv:2412.12540
6
citations
#6986

Cooperation of Experts: Fusing Heterogeneous Information with Large Margin

Shuo Wang, Shunyang Huang, Jinghui Yuan et al.

ICML 2025arXiv:2505.20853
6
citations
#6987

LS-TGNN: Long and Short-Term Temporal Graph Neural Network for Session-Based Recommendation

Zhonghong Ou, Xiao Zhang, Yifan Zhu et al.

AAAI 2025paper
6
citations
#6988

Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation

Mohit Prashant, Arvind Easwaran, Suman Das et al.

AAAI 2025paperarXiv:2503.05238
6
citations
#6989

CTD4 – a Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics

David Valencia, Henry Williams, Yuning Xing et al.

AAAI 2025paperarXiv:2405.02576
6
citations
#6990

ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression

Kai Yao, Zhaorui Tan, Tiandi Ye et al.

AAAI 2025paperarXiv:2412.09812
6
citations
#6991

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets

Wei Liu, Zhongyu Niu, Lang Gao et al.

ICML 2025arXiv:2505.02118
6
citations
#6992

Optimizing Posterior Samples for Bayesian Optimization via Rootfinding

Taiwo Adebiyi, Bach Do, Ruda Zhang

ICLR 2025arXiv:2410.22322
6
citations
#6993

Lawma: The Power of Specialization for Legal Annotation

Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe et al.

ICLR 2025arXiv:2407.16615
6
citations
#6994

Zero-shot Meta-learning for Tabular Prediction Tasks with Adversarially Pre-trained Transformer

Yulun Wu, Doron Bergman

ICML 2025arXiv:2502.04573
6
citations
#6995

Prompt-based Unifying Inference Attack on Graph Neural Networks

Yuecen Wei, Xingcheng Fu, Lingyun Liu et al.

AAAI 2025paperarXiv:2412.15735
6
citations
#6996

In-Context Denoising with One-Layer Transformers: Connections between Attention and Associative Memory Retrieval

Matthew Smart, Alberto Bietti, Anirvan Sengupta

ICML 2025oralarXiv:2502.05164
6
citations
#6997

Conditional Latent Coding with Learnable Synthesized Reference for Deep Image Compression

Siqi Wu, Yinda Chen, Dong Liu et al.

AAAI 2025paperarXiv:2502.09971
6
citations
#6998

Graph Coarsening via Supervised Granular-Ball for Scalable Graph Neural Network Training

Shuyin Xia, Xinjun Ma, Zhiyuan Liu et al.

AAAI 2025paperarXiv:2412.13842
6
citations
#6999

MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues

Kuluhan Binici, Abhinav Ramesh Kashyap, Viktor Schlegel et al.

AAAI 2025paperarXiv:2408.14418
6
citations
#7000

Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Cheng Tang, Zhishuai Liu, Pan Xu

ICML 2025arXiv:2411.18612
6
citations