Most Cited 2025 "weakly submodular function" Papers

22,274 papers found • Page 32 of 112

#6201

Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences

Yunhong Lu, Qichao Wang, Hengyuan Cao et al.

ICML 2025arXiv:2506.02698
7
citations
#6202

FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction

Yitong Duan, Weiran Wang, Jian Li

AAAI 2025paperarXiv:2502.05218
7
citations
#6203

MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval

Reno Kriz, Kate Sanders, David Etter et al.

CVPR 2025arXiv:2410.11619
7
citations
#6204

MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent

Xinyao Liao, Xianfang Zeng, Liao Wang et al.

ICCV 2025arXiv:2502.03207
7
citations
#6205

Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models

Junyi Li, Hwee Tou Ng

NEURIPS 2025arXiv:2505.24630
7
citations
#6206

OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images

Ziyue Huang, Yongchao Feng, Ziqi Liu et al.

ICCV 2025arXiv:2503.06146
7
citations
#6207

Variational Regularized Unbalanced Optimal Transport: Single Network, Least Action

Yuhao Sun, Zhenyi Zhang, Zihan Wang et al.

NEURIPS 2025arXiv:2505.11823
7
citations
#6208

Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models

Yan Xie, Zequn Zeng, Hao Zhang et al.

CVPR 2025arXiv:2505.07209
7
citations
#6209

Is Complex Query Answering Really Complex?

Cosimo Gregucci, Bo Xiong, Daniel Hernández et al.

ICML 2025spotlightarXiv:2410.12537
7
citations
#6210

AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations

Junli Liu, Qizhi Chen, Zhigang Wang et al.

ICCV 2025arXiv:2504.07836
7
citations
#6211

TVNet: A Novel Time Series Analysis Method Based on Dynamic Convolution and 3D-Variation

Chenghan Li, Mingchen LI, Ruisheng Diao

ICLR 2025arXiv:2503.07674
7
citations
#6212

Physics-Informed Generative Modeling of Wireless Channels

Benedikt Böck, Andreas Oeldemann, Timo Mayer et al.

ICML 2025arXiv:2502.10137
7
citations
#6213

Provable and Practical Online Learning Rate Adaptation with Hypergradient Descent

Ya-Chi Chu, Wenzhi Gao, Yinyu Ye et al.

ICML 2025arXiv:2502.11229
7
citations
#6214

CWNet: Causal Wavelet Network for Low-Light Image Enhancement

Tongshun Zhang, Pingping Liu, Yubing Lu et al.

ICCV 2025arXiv:2507.10689
7
citations
#6215

SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model

Shuhan Tan, John Wheatley Lambert, Hong Jeon et al.

CVPR 2025arXiv:2506.21976
7
citations
#6216

Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities

Liuyi Wang, Xinyuan Xia, Hui Zhao et al.

ICCV 2025arXiv:2507.13019
7
citations
#6217

Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection

Kedi Chen, Qin Chen, Jie Zhou et al.

AAAI 2025paperarXiv:2501.02020
7
citations
#6218

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn

Hongyao Tang, Johan Obando-Ceron, Pablo Samuel Castro et al.

ICML 2025oralarXiv:2506.00592
7
citations
#6219

GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill

Jieming Cui, Tengyu Liu, Ziyu Meng et al.

CVPR 2025arXiv:2504.04191
7
citations
#6220

Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models

Chen Chen, Daochang Liu, Mubarak Shah et al.

CVPR 2025arXiv:2504.18032
7
citations
#6221

Efficient Active Imitation Learning with Random Network Distillation

Emilien Biré, Anthony Kobanda, Ludovic Denoyer et al.

ICLR 2025arXiv:2411.01894
7
citations
#6222

Attributing Culture-Conditioned Generations to Pretraining Corpora

Huihan Li, Arnav Goel, Keyu He et al.

ICLR 2025arXiv:2412.20760
7
citations
#6223

ROPO: Robust Preference Optimization for Large Language Models

Xize Liang, Chao Chen, Shuang Qiu et al.

ICML 2025arXiv:2404.04102
7
citations
#6224

Privacy Attacks on Image AutoRegressive Models

Antoni Kowalczuk, Jan Dubiński, Franziska Boenisch et al.

ICML 2025arXiv:2502.02514
7
citations
#6225

Conformalized Interactive Imitation Learning: Handling Expert Shift and Intermittent Feedback

Michelle Zhao, Henny Admoni, Reid Simmons et al.

ICLR 2025arXiv:2410.08852
7
citations
#6226

The Bandit Whisperer: Communication Learning for Restless Bandits

Yunfan Zhao, Tonghan Wang, Dheeraj Mysore Nagaraj et al.

AAAI 2025paperarXiv:2408.05686
7
citations
#6227

TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Jinhao Duan, Fei Kong, Hao Cheng et al.

ICCV 2025
7
citations
#6228

Multi-Agent Motion Planning for Differential Drive Robots Through Stationary State Search

Jingtian Yan, Jiaoyang Li

AAAI 2025paperarXiv:2412.13359
7
citations
#6229

RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation

Kaidong Zhang, Rongtao Xu, Ren Pengzhen et al.

ICCV 2025arXiv:2505.01709
7
citations
#6230

Fully Test-time Adaptation for Tabular Data

Zhi Zhou, Kun-Yang Yu, Lan-Zhe Guo et al.

AAAI 2025paperarXiv:2412.10871
7
citations
#6231

Factor Augmented Tensor-on-Tensor Neural Networks

Guanhao Zhou, Yuefeng Han, Xiufan Yu

AAAI 2025paperarXiv:2405.19610
7
citations
#6232

MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs

Andreas Opedal, Haruki Shirakami, Bernhard Schölkopf et al.

ICLR 2025arXiv:2410.13502
7
citations
#6233

Detecting Visual Information Manipulation Attacks in Augmented Reality: A Multimodal Semantic Reasoning Approach

Yanming Xiu, Maria Gorlatova

ISMAR 2025paperarXiv:2507.20356
7
citations
#6234

CO-MOT: Boosting End-to-end Transformer-based Multi-Object Tracking via Coopetition Label Assignment and Shadow Sets

feng yan, Weixin Luo, Yujie Zhong et al.

ICLR 2025
7
citations
#6235

On the Expressiveness and Length Generalization of Selective State Space Models on Regular Languages

Aleksandar Terzic, Michael Hersche, Giacomo Camposampiero et al.

AAAI 2025paper
7
citations
#6236

CLIPDrag: Combining Text-based and Drag-based Instructions for Image Editing

Ziqi Jiang, Zhen Wang, Long Chen

ICLR 2025arXiv:2410.03097
7
citations
#6237

Scene Map-based Prompt Tuning for Navigation Instruction Generation

Sheng Fan, Rui Liu, Wenguan Wang et al.

CVPR 2025
7
citations
#6238

AgentRecBench: Benchmarking LLM Agent-based Personalized Recommender Systems

Yu Shang, Peijie Liu, Yuwei Yan et al.

NEURIPS 2025spotlightarXiv:2505.19623
7
citations
#6239

Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

Jun Zhang, Jue Wang, Huan Li et al.

ICLR 2025arXiv:2502.13533
7
citations
#6240

Depth-Bounds for Neural Networks via the Braid Arrangement

Moritz Grillo, Christoph Hertrich, Georg Loho

NEURIPS 2025oralarXiv:2502.09324
7
citations
#6241

Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting

Zhining Liu, Ze Yang, Xiao Lin et al.

ICML 2025oralarXiv:2505.18442
7
citations
#6242

GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation

Weihang Li, Hongli XU, Junwen Huang et al.

CVPR 2025arXiv:2502.04293
7
citations
#6243

TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference

Jack Min Ong, Matthew Di Ferrante, Aaron Pazdera et al.

ICML 2025arXiv:2501.16007
7
citations
#6244

Error Bounds for Gaussian Process Regression Under Bounded Support Noise with Applications to Safety Certification

Robert Reed, Luca Laurenti, Morteza Lahijanian

AAAI 2025paperarXiv:2408.09033
7
citations
#6245

Improving Complex Reasoning with Dynamic Prompt Corruption: A Soft Prompt Optimization Approach

Sinan Fan, Liang Xie, Chen Shen et al.

ICLR 2025arXiv:2503.13208
7
citations
#6246

Doubly Robust Conformalized Survival Analysis with Right-Censored Data

Matteo Sesia, vladimir svetnik

ICML 2025spotlightarXiv:2412.09729
7
citations
#6247

Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections

Xiaomeng Xu, Yifan Hou, Zeyi Liu et al.

NEURIPS 2025arXiv:2506.16685
7
citations
#6248

BrainOOD: Out-of-distribution Generalizable Brain Network Analysis

Jiaxing Xu, Yongqiang Chen, Xia Dong et al.

ICLR 2025arXiv:2502.01688
7
citations
#6249

DebGCD: Debiased Learning with Distribution Guidance for Generalized Category Discovery

Yuanpei Liu, Kai Han

ICLR 2025arXiv:2504.04804
7
citations
#6250

Salvaging the Overlooked: Leveraging Class-Aware Contrastive Learning for Multi-Class Anomaly Detection

Lei Fan, Junjie Huang, Donglin Di et al.

ICCV 2025arXiv:2412.04769
7
citations
#6251

GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting

Yusen XIE, Zhenmin Huang, Jin Wu et al.

ICCV 2025arXiv:2410.17084
7
citations
#6252

HyGen: Efficient LLM Serving via Elastic Online-Offline Request Co-location

Ting Sun, Penghan Wang, Fan Lai

NEURIPS 2025arXiv:2501.14808
7
citations
#6253

Can Students Beyond the Teacher? Distilling Knowledge from Teacher’s Bias

Jianhua Zhang, Yi Gao, Ruyu Liu et al.

AAAI 2025paperarXiv:2412.09874
7
citations
#6254

Adaptive Gradient Clipping for Robust Federated Learning

Youssef Allouah, Rachid Guerraoui, Nirupam Gupta et al.

ICLR 2025arXiv:2405.14432
7
citations
#6255

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Bingjie Gao, Xinyu Gao, Xiaoxue Wu et al.

CVPR 2025arXiv:2504.11739
7
citations
#6256

BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models

Xingyu Zheng, Xianglong Liu, Haotong Qin et al.

ICLR 2025arXiv:2404.05662
7
citations
#6257

Question-Aware Gaussian Experts for Audio-Visual Question Answering

Hongyeob Kim, Inyoung Jung, Dayoon Suh et al.

CVPR 2025highlightarXiv:2503.04459
7
citations
#6258

Expressivity of Neural Networks with Random Weights and Learned Biases

Ezekiel Williams, Alexandre Payeur, Avery Ryoo et al.

ICLR 2025arXiv:2407.00957
7
citations
#6259

Continual Learning Using a Kernel-Based Method Over Foundation Models

Saleh Momeni, Sahisnu Mazumder, Bing Liu

AAAI 2025paperarXiv:2412.15571
7
citations
#6260

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Jinhong Ni, Chang-Bin Zhang, Qiang Zhang et al.

ICCV 2025arXiv:2505.22129
7
citations
#6261

AutoSGNN: Automatic Propagation Mechanism Discovery for Spectral Graph Neural Networks

Shibing Mo, Kai Wu, Qixuan Gao et al.

AAAI 2025paperarXiv:2412.12483
7
citations
#6262

Sequential Conditional Transport on Probabilistic Graphs for Interpretable Counterfactual Fairness

Agathe Fernandes Machado, Arthur Charpentier, Ewen Gallic

AAAI 2025paperarXiv:2408.03425
7
citations
#6263

LEDiff: Latent Exposure Diffusion for HDR Generation

Chao Wang, Zhihao Xia, Thomas Leimkuehler et al.

CVPR 2025arXiv:2412.14456
7
citations
#6264

UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation

Huimin LU, Masaru Isonuma, Junichiro Mori et al.

ICLR 2025arXiv:2504.20500
7
citations
#6265

Kinetic Langevin Diffusion for Crystalline Materials Generation

François Cornet, Federico Bergamin, Arghya Bhowmik et al.

ICML 2025arXiv:2507.03602
7
citations
#6266

Training Consistent Mixture-of-Experts-Based Prompt Generator for Continual Learning

Yue Lu, Shizhou Zhang, De Cheng et al.

AAAI 2025paper
7
citations
#6267

Distance-Based Tree-Sliced Wasserstein Distance

Viet-Hoang Tran, Minh-Khoi Nguyen-Nhat, Trang Pham et al.

ICLR 2025arXiv:2503.11050
7
citations
#6268

Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation

Itamar Zimerman, ameen ali ali, Lior Wolf

ICLR 2025arXiv:2405.16504
7
citations
#6269

MobileIE: An Extremely Lightweight and Effective ConvNet for Real-Time Image Enhancement on Mobile Devices

HAILONG YAN, Ao Li, Xiangtao Zhang et al.

ICCV 2025arXiv:2507.01838
7
citations
#6270

RelationField: Relate Anything in Radiance Fields

Sebastian Koch, Johanna Wald, Mirco Colosi et al.

CVPR 2025arXiv:2412.13652
7
citations
#6271

AnoLLM: Large Language Models for Tabular Anomaly Detection

Che-Ping Tsai, Ganyu Teng, Phillip Wallis et al.

ICLR 2025
7
citations
#6272

Causal Representation Learning from Multimodal Biomedical Observations

Yuewen Sun, Lingjing Kong, Guangyi Chen et al.

ICLR 2025arXiv:2411.06518
7
citations
#6273

HotSpot: Signed Distance Function Optimization with an Asymptotically Sufficient Condition

Zimo Wang, Cheng Wang, Taiki Yoshino et al.

CVPR 2025highlightarXiv:2411.14628
7
citations
#6274

The emergence of sparse attention: impact of data distribution and benefits of repetition

Nicolas Zucchet, Francesco D'Angelo, Andrew Lampinen et al.

NEURIPS 2025oralarXiv:2505.17863
7
citations
#6275

Assessing Pre-Trained Models for Transfer Learning Through Distribution of Spectral Components

Tengxue Zhang, Yang Shu, Xinyang Chen et al.

AAAI 2025paperarXiv:2412.19085
7
citations
#6276

Feature Responsiveness Scores: Model-Agnostic Explanations for Recourse

Seung Hyun Cheon, Anneke Wernerfelt, Sorelle Friedler et al.

ICLR 2025arXiv:2410.22598
7
citations
#6277

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Yi Ding, Ruqi Zhang

NEURIPS 2025arXiv:2505.22651
7
citations
#6278

Balancing Multimodal Training Through Game-Theoretic Regularization

Konstantinos Kontras, Thomas Strypsteen, Christos Chatzichristos et al.

NEURIPS 2025spotlightarXiv:2411.07335
7
citations
#6279

AtomSurf: Surface Representation for Learning on Protein Structures

Vincent Mallet, Yangyang Miao, Souhaib Attaiki et al.

ICLR 2025arXiv:2309.16519
7
citations
#6280

CREIMBO: Cross-Regional Ensemble Interactions in Multi-view Brain Observations

Noga Mudrik, Ryan Ly, Oliver Ruebel et al.

ICLR 2025oralarXiv:2405.17395
7
citations
#6281

Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization

Hao Ju, Shaofei Huang, Si Liu et al.

ICCV 2025arXiv:2411.13610
7
citations
#6282

ARIG: Autoregressive Interactive Head Generation for Real-time Conversations

Ying Guo, Xi Liu, Cheng Zhen et al.

ICCV 2025arXiv:2507.00472
7
citations
#6283

LIRM: Large Inverse Rendering Model for Progressive Reconstruction of Shape, Materials and View-dependent Radiance Fields

Zhengqin Li, Dilin Wang, Ka chen et al.

CVPR 2025arXiv:2504.20026
7
citations
#6284

Object-centric binding in Contrastive Language-Image Pretraining

Rim Assouel, Pietro Astolfi, Florian Bordes et al.

NEURIPS 2025arXiv:2502.14113
7
citations
#6285

Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control

Xianghui Ze, Zhenbo Song, Qiwei Wang et al.

ICLR 2025arXiv:2502.03498
7
citations
#6286

Modeling Cell Dynamics and Interactions with Unbalanced Mean Field Schrödinger Bridge

Zhenyi Zhang, Zihan Wang, Yuhao Sun et al.

NEURIPS 2025arXiv:2505.11197
7
citations
#6287

Emergent Response Planning in LLMs

Zhichen Dong, Zhanhui Zhou, Zhixuan Liu et al.

ICML 2025arXiv:2502.06258
7
citations
#6288

EEE-Bench: A Comprehensive Multimodal Electrical And Electronics Engineering Benchmark

Ming Li, Jike Zhong, Tianle Chen et al.

CVPR 2025arXiv:2411.01492
7
citations
#6289

Small Singular Values Matter: A Random Matrix Analysis of Transformer Models

Max Staats, Matthias Thamm, Bernd Rosenow

NEURIPS 2025arXiv:2410.17770
7
citations
#6290

Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment

Jinhao Jiang, Junyi Li, Xin Zhao et al.

ICLR 2025arXiv:2407.10804
7
citations
#6291

VCT: Training Consistency Models with Variational Noise Coupling

Gianluigi Silvestri, Luca Ambrogioni, Chieh-Hsin Lai et al.

ICML 2025arXiv:2502.18197
7
citations
#6292

Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing

Yudong Liu, Jingwei Sun, Yueqian Lin et al.

ICCV 2025arXiv:2503.10742
7
citations
#6293

Enhancing 3D Reconstruction for Dynamic Scenes

Jisang Han, Honggyu An, Jaewoo Jung et al.

NEURIPS 2025oralarXiv:2504.06264
7
citations
#6294

Adaptive Part Learning for Fine-Grained Generalized Category Discovery: A Plug-and-Play Enhancement

Qiyuan Dai, Hanzhuo Huang, Yu Wu et al.

CVPR 2025arXiv:2507.06928
7
citations
#6295

On the Transfer of Object-Centric Representation Learning

Aniket Rajiv Didolkar, Andrii Zadaianchuk, Anirudh Goyal et al.

ICLR 2025
7
citations
#6296

Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors

Tianchun Wang, Yuanzhou Chen, Zichuan Liu et al.

ICLR 2025arXiv:2410.19230
7
citations
#6297

Improving Language Model Distillation through Hidden State Matching

Sayantan Dasgupta, Trevor Cohn

ICLR 2025
7
citations
#6298

On the Completeness of Invariant Geometric Deep Learning Models

Zian Li, Xiyuan Wang, Shijia Kang et al.

ICLR 2025arXiv:2402.04836
7
citations
#6299

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

Liang CHEN, Xueting Han, Li Shen et al.

ICML 2025arXiv:2506.03850
7
citations
#6300

Provable Maximum Entropy Manifold Exploration via Diffusion Models

Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh et al.

ICML 2025arXiv:2506.15385
7
citations
#6301

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Yuze He, Yanning Zhou, Wang Zhao et al.

CVPR 2025arXiv:2411.05738
7
citations
#6302

It’s a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data

Dominik Schnaus, Nikita Araslanov, Daniel Cremers

CVPR 2025arXiv:2503.24129
7
citations
#6303

Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models

Hao Cheng, Erjia Xiao, Jiayan Yang et al.

CVPR 2025arXiv:2412.05538
7
citations
#6304

Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL

Ghada Sokar, Johan S Obando Ceron, Aaron Courville et al.

ICLR 2025arXiv:2410.01930
7
citations
#6305

FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations

Hmrishav Bandyopadhyay, Yi-Zhe Song

CVPR 2025arXiv:2411.10818
7
citations
#6306

Aligning Protein Conformation Ensemble Generation with Physical Feedback

Jiarui Lu, Xiaoyin Chen, Stephen Lu et al.

ICML 2025arXiv:2505.24203
7
citations
#6307

Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation

Chen Dun, Mirian Del Carmen Hipolito Garcia, Guoqing Zheng et al.

AAAI 2025paperarXiv:2310.02842
7
citations
#6308

Do Computer Vision Foundation Models Learn the Low-level Characteristics of the Human Visual System?

Yancheng Cai, Fei Yin, Dounia Hammou et al.

CVPR 2025highlightarXiv:2502.20256
7
citations
#6309

Image Quality Assessment: From Human to Machine Preference

Chunyi Li, Yuan Tian, Xiaoyue Ling et al.

CVPR 2025highlightarXiv:2503.10078
7
citations
#6310

Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models

Zekai Zhao, Qi Liu, Kun Zhou et al.

NEURIPS 2025spotlightarXiv:2505.17697
7
citations
#6311

Classifier-Free Guidance Inside the Attraction Basin May Cause Memorization

Anubhav Jain, Yuya Kobayashi, Takashi Shibuya et al.

CVPR 2025arXiv:2411.16738
7
citations
#6312

Training Language Models to Generate Quality Code with Program Analysis Feedback

Feng Yao, Zilong Wang, Liyuan Liu et al.

NEURIPS 2025arXiv:2505.22704
7
citations
#6313

Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity

Sung Ju Lee, Nam Ik Cho

ICCV 2025arXiv:2509.07647
7
citations
#6314

Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction

Seungtae Nam, Xiangyu Sun, Gyeongjin Kang et al.

CVPR 2025highlightarXiv:2412.06234
7
citations
#6315

Pose Priors from Language Models

Sanjay Subramanian, Evonne Ng, Lea Müller et al.

CVPR 2025arXiv:2405.03689
7
citations
#6316

InstaSHAP: Interpretable Additive Models Explain Shapley Values Instantly

James Enouen, Yan Liu

ICLR 2025arXiv:2502.14177
7
citations
#6317

A Simple yet Effective Layout Token in Large Language Models for Document Understanding

Zhaoqing Zhu, Chuwei Luo, Zirui Shao et al.

CVPR 2025arXiv:2503.18434
7
citations
#6318

VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning

Han Lin, Tushar Nagarajan, Nicolas Ballas et al.

ICLR 2025arXiv:2410.03478
7
citations
#6319

Stochastic Online Instrumental Variable Regression: Regrets for Endogeneity and Bandit Feedback

Riccardo Della Vecchia, Debabrota Basu

AAAI 2025paperarXiv:2302.09357
7
citations
#6320

Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

Yuejiang Liu, Jubayer Hamid, Annie Xie et al.

ICLR 2025oralarXiv:2408.17355
7
citations
#6321

Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos

Rundong Luo, Matthew Wallingford, Ali Farhadi et al.

ICCV 2025arXiv:2504.07940
7
citations
#6322

Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning

Jaehyeon Son, Soochan Lee, Gunhee Kim

ICLR 2025arXiv:2502.19009
7
citations
#6323

MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity

Kanghyun Choi, Hyeyoon Lee, Dain Kwon et al.

AAAI 2025paperarXiv:2407.20021
7
citations
#6324

Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification

Yanghao Wang, Long Chen

CVPR 2025arXiv:2408.16266
7
citations
#6325

Enhancing Target-unspecific Tasks through a Features Matrix

Fangming Cui, Yonggang Zhang, Xuan Wang et al.

ICML 2025arXiv:2505.03414
7
citations
#6326

Graph Neural Ricci Flow: Evolving Feature from a Curvature Perspective

Jialong Chen, Bowen Deng, Zhen WANG et al.

ICLR 2025
7
citations
#6327

Generalized Dimension Reduction Using Semi-Relaxed Gromov-Wasserstein Distance

Ranthony A. Clark, Tom Needham, Thomas Weighill

AAAI 2025paperarXiv:2405.15959
7
citations
#6328

DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows

Mashrur M. Morshed, Vishnu Naresh Boddeti

CVPR 2025arXiv:2504.07894
7
citations
#6329

Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

Yang Cai, Gabriele Farina, Julien Grand-Clément et al.

ICLR 2025arXiv:2311.00676
7
citations
#6330

PWM: Policy Learning with Multi-Task World Models

Ignat Georgiev, Varun Giridhar, Nick Hansen et al.

ICLR 2025arXiv:2407.02466
7
citations
#6331

CAX: Cellular Automata Accelerated in JAX

Maxence Faldor, Antoine Cully

ICLR 2025arXiv:2410.02651
7
citations
#6332

SVasP: Self-Versatility Adversarial Style Perturbation for Cross-Domain Few-Shot Learning

Wenqian Li, Pengfei Fang, Hui Xue

AAAI 2025paperarXiv:2412.09073
7
citations
#6333

REDUCIO! Generating 1K Video within 16 Seconds using Extremely Compressed Motion Latents

Rui Tian, Qi Dai, Jianmin Bao et al.

ICCV 2025arXiv:2411.13552
7
citations
#6334

HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation

Hongwei Zheng, Han Li, Wenrui Dai et al.

CVPR 2025arXiv:2503.23331
7
citations
#6335

Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection

Romain Thoreau, Valerio Marsocci, Dawa Derksen

ICCV 2025arXiv:2503.09493
7
citations
#6336

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Hongbo Liu, Jingwen He, Yi Jin et al.

NEURIPS 2025arXiv:2506.21356
7
citations
#6337

Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment

Haoyuan Wu, Haisheng Zheng, Yuan Pu et al.

ICLR 2025arXiv:2502.12732
7
citations
#6338

Loosely Synchronized Rule-Based Planning for Multi-Agent Path Finding with Asynchronous Actions

Shuai Zhou, Shizhe Zhao, Zhongqiang Ren

AAAI 2025paperarXiv:2412.11678
7
citations
#6339

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics

Lee Chae-Yeon, Oh Hyun-Bin, Han EunGi et al.

CVPR 2025highlightarXiv:2503.20308
7
citations
#6340

Visual Lexicon: Rich Image Features in Language Space

XuDong Wang, Xingyi Zhou, Alireza Fathi et al.

CVPR 2025arXiv:2412.06774
7
citations
#6341

BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments

Xinghao Wang, Pengyu Wang, Bo Wang et al.

ICLR 2025arXiv:2410.23918
7
citations
#6342

Adjoint Schrödinger Bridge Sampler

Guan-Horng Liu, Jaemoo Choi, Yongxin Chen et al.

NEURIPS 2025oralarXiv:2506.22565
7
citations
#6343

Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models

Qiong Wu, Zhaoxi Ke, Yiyi Zhou et al.

ICLR 2025
7
citations
#6344

DoF: A Diffusion Factorization Framework for Offline Multi-Agent Reinforcement Learning

Chao Li, Ziwei Deng, Chenxing Lin et al.

ICLR 2025
7
citations
#6345

Uncertainty Quantification with the Empirical Neural Tangent Kernel

Joseph Wilson, Chris van der Heide, Liam Hodgkinson et al.

NEURIPS 2025arXiv:2502.02870
7
citations
#6346

Time-o1: Time-Series Forecasting Needs Transformed Label Alignment

Hao Wang, Licheng Pan, Zhichao Chen et al.

NEURIPS 2025oralarXiv:2505.17847
7
citations
#6347

Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation

Fangyuan Wang, Shipeng Lyu, Peng Zhou et al.

AAAI 2025paperarXiv:2503.08084
7
citations
#6348

Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning

Siyuan Li, Feifan Liu, Lingfei Cui et al.

AAAI 2025paperarXiv:2411.06920
7
citations
#6349

Can Generative Video Models Help Pose Estimation?

Ruojin Cai, Jason Y. Zhang, Philipp Henzler et al.

CVPR 2025highlightarXiv:2412.16155
7
citations
#6350

Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning

Jianming Chen, Yawen Wang, Junjie Wang et al.

AAAI 2025paperarXiv:2412.15619
7
citations
#6351

Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning

Yu Zhang, Jialei Zhou, Xinchen Li et al.

NEURIPS 2025arXiv:2505.19261
7
citations
#6352

Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model

Shengjun Zhang, Jinzhao Li, Xin Fei et al.

CVPR 2025arXiv:2504.02764
7
citations
#6353

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Ruichen Shao, Bei Li, Gangao Liu et al.

ICLR 2025oralarXiv:2502.14340
7
citations
#6354

Dense Video Object Captioning from Disjoint Supervision

Xingyi Zhou, Anurag Arnab, Chen Sun et al.

ICLR 2025oralarXiv:2306.11729
7
citations
#6355

Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning

Xiaolei Chen, Junchi Yan, Wenlong Liao et al.

AAAI 2025paperarXiv:2501.12799
7
citations
#6356

Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning

Bardia Safaei, Faizan Siddiqui, Jiacong Xu et al.

CVPR 2025highlightarXiv:2503.07591
7
citations
#6357

RealEdit: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations

Peter Sushko, Ayana Bharadwaj, Zhi Yang Lim et al.

CVPR 2025arXiv:2502.03629
7
citations
#6358

ChatHuman: Chatting about 3D Humans with Tools

Jing Lin, Yao Feng, Weiyang Liu et al.

CVPR 2025arXiv:2405.04533
7
citations
#6359

ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting

Chengyou Jia, Changliang Xia, Zhuohang Dang et al.

CVPR 2025arXiv:2411.17176
7
citations
#6360

Equivariant Symmetry Breaking Sets

YuQing Xie, Tess Smidt

ICLR 2025arXiv:2402.02681
7
citations
#6361

Long-Term EEG Partitioning for Seizure Onset Detection

Zheng Chen, Yasuko Matsubara, Yasushi Sakurai et al.

AAAI 2025paperarXiv:2412.15598
7
citations
#6362

Relative Pose Estimation through Affine Corrections of Monocular Depth Priors

Yifan Yu, Shaohui Liu, Rémi Pautrat et al.

CVPR 2025highlightarXiv:2501.05446
7
citations
#6363

GPS: A Probabilistic Distributional Similarity with Gumbel Priors for Set-to-Set Matching

Ziming Zhang, Fangzhou Lin, Haotian Liu et al.

ICLR 2025oral
7
citations
#6364

Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map

Xinyuan Chang, Maixuan Xue, Xinran Liu et al.

CVPR 2025highlightarXiv:2410.23780
7
citations
#6365

SITE: towards Spatial Intelligence Thorough Evaluation

Wenqi Wang, Reuben Tan, Pengyue Zhu et al.

ICCV 2025arXiv:2505.05456
7
citations
#6366

Enhancing Language Model Agents using Diversity of Thoughts

Vijay Chandra Lingam, Behrooz Tehrani, sujay sanghavi et al.

ICLR 2025
7
citations
#6367

Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes

Haotian Wu, Gongpu Chen, Deniz Gunduz

ICLR 2025arXiv:2502.03335
7
citations
#6368

Learned Image Transmission with Hierarchical Variational Autoencoder

Guangyi Zhang, Hanlei Li, Yunlong Cai et al.

AAAI 2025paperarXiv:2408.16340
7
citations
#6369

Learning Distances from Data with Normalizing Flows and Score Matching

Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr et al.

ICML 2025arXiv:2407.09297
7
citations
#6370

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning

Siqi Luo, Haoran Yang, Yi Xin et al.

ICCV 2025arXiv:2507.22872
7
citations
#6371

Better autoregressive regression with LLMs via regression-aware fine-tuning

Michal Lukasik, Zhao Meng, Harikrishna Narasimhan et al.

ICLR 2025
7
citations
#6372

SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing

Yingying Zhang, Lixiang Ru, Kang Wu et al.

ICCV 2025arXiv:2507.13812
7
citations
#6373

TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting

Liangbin Xie, Daniil Pakhomov, Zhonghao Wang et al.

CVPR 2025arXiv:2504.00996
7
citations
#6374

Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation

Chenyu Zhang, Xu Chen, Xuan Di

ICLR 2025arXiv:2408.08192
7
citations
#6375

AdaDPCC: Adaptive Rate Control and Rate-Distortion-Complexity Optimization for Dynamic Point Cloud Compression

Chenhao Zhang, Wei Gao

AAAI 2025paperarXiv:2508.20741
7
citations
#6376

MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines

Dongzhi Jiang, Renrui Zhang, Ziyu Guo et al.

ICLR 2025
7
citations
#6377

Event-based Tiny Object Detection: A Benchmark Dataset and Baselines

Nuo Chen, Chao Xiao, Yimian Dai et al.

ICCV 2025arXiv:2506.23575
7
citations
#6378

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

Vitor Guizilini, Muhammad Zubair Irshad, Dian Chen et al.

CVPR 2025arXiv:2501.18804
7
citations
#6379

Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild

Damien Teney, Liangze Jiang, Florin Gogianu et al.

CVPR 2025arXiv:2503.10065
7
citations
#6380

Stochastic Process Learning via Operator Flow Matching

Yaozhong Shi, Zachary Ross, Domniki Asimaki et al.

NEURIPS 2025spotlightarXiv:2501.04126
7
citations
#6381

Agentic Plan Caching: Test-Time Memory for Fast and Cost-Efficient LLM Agents

Qizheng Zhang, Michael Wornow, Kunle Olukotun

NEURIPS 2025arXiv:2506.14852
7
citations
#6382

Exact Expressive Power of Transformers with Padding

Will Merrill, Ashish Sabharwal

NEURIPS 2025arXiv:2505.18948
7
citations
#6383

Object-centric Video Question Answering with Visual Grounding and Referring

Haochen Wang, Qirui Chen, Cilin Yan et al.

ICCV 2025arXiv:2507.19599
7
citations
#6384

StableCodec: Taming One-Step Diffusion for Extreme Image Compression

Tianyu Zhang, Xin Luo, Li Li et al.

ICCV 2025arXiv:2506.21977
7
citations
#6385

Implicit Neural Surface Deformation with Explicit Velocity Fields

Lu Sang, Zehranaz Canfes, Dongliang Cao et al.

ICLR 2025arXiv:2501.14038
7
citations
#6386

Reasoning Elicitation in Language Models via Counterfactual Feedback

Alihan Hüyük, Xinnuo Xu, Jacqueline Maasch et al.

ICLR 2025arXiv:2410.03767
7
citations
#6387

Discrete GCBF Proximal Policy Optimization for Multi-agent Safe Optimal Control

Songyuan Zhang, Oswin So, Mitchell Black et al.

ICLR 2025arXiv:2502.03640
7
citations
#6388

On the Robustness of Reward Models for Language Model Alignment

Jiwoo Hong, Noah Lee, Eunki Kim et al.

ICML 2025arXiv:2505.07271
7
citations
#6389

Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

Michael Kirchhof, James Thornton, Louis Béthune et al.

ICML 2025arXiv:2410.06025
7
citations
#6390

CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching

Leying Zhang, Yao Qian, Xiaofei Wang et al.

NEURIPS 2025arXiv:2506.00885
7
citations
#6391

Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding

Ziyao Wang, Muneeza Azmat, Ang Li et al.

ICML 2025arXiv:2502.08020
7
citations
#6392

Rethinking Fair Representation Learning for Performance-Sensitive Tasks

Charles Jones, Fabio De Sousa Ribeiro, Mélanie Roschewitz et al.

ICLR 2025arXiv:2410.04120
7
citations
#6393

GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion Generation

Shengyin Sun, Wenhao Yu, Yuxiang Ren et al.

AAAI 2025paperarXiv:2501.08001
7
citations
#6394

Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

Zichen Liu, Yihao Meng, Hao Ouyang et al.

ICCV 2025arXiv:2404.11614
7
citations
#6395

Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting

Yilun Zheng, Xiang Li, Sitao Luan et al.

ICLR 2025
7
citations
#6396

Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs

Xiaqiang Tang, Jian Li, Nan Du et al.

AAAI 2025paperarXiv:2412.07618
7
citations
#6397

SEAL: Semantic Attention Learning for Long Video Representation

Lan Wang, Yujia Chen, Wen-Sheng Chu et al.

CVPR 2025arXiv:2412.01798
7
citations
#6398

A Theory for Token-Level Harmonization in Retrieval-Augmented Generation

Shicheng Xu, Liang Pang, Huawei Shen et al.

ICLR 2025arXiv:2406.00944
7
citations
#6399

MotiF: Making Text Count in Image Animation with Motion Focal Loss

Shijie Wang, Samaneh Azadi, Rohit Girdhar et al.

CVPR 2025arXiv:2412.16153
7
citations
#6400

Scaling Laws for Task-Optimized Models of the Primate Visual Ventral Stream

Abdulkadir Gokce, Martin Schrimpf

ICML 2025oralarXiv:2411.05712
7
citations