Oral Papers

ICLR 2025oralarXiv:2507.03393

Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos

Yufan Zhou, Zhaobo Qi, Lingshuai Lin et al.

MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation

Meilong Xu, Xiaoling Hu, Shahira Abousamra et al.

NEURIPS 2025oralarXiv:2510.01532

NEURIPS 2025oralarXiv:2502.15798

MaxSup: Overcoming Representation Collapse in Label Smoothing

Yuxuan Zhou, Heng Li, Zhi-Qi Cheng et al.

MCNC: Manifold-Constrained Reparameterization for Neural Compression

Chayne Thrash, Reed Andreas, Ali Abbasi et al.

ICLR 2025oralarXiv:2406.19301

ICLR 2025oralarXiv:2410.02130

MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation

Trung X. Pham, Tri Ton, Chang Yoo

NEURIPS 2025oralarXiv:2505.13447

Mean Flows for One-step Generative Modeling

Zhengyang Geng, Mingyang Deng, Xingjian Bai et al.

185

ICML 2025oralarXiv:2502.18377

Mechanistic PDE Networks for Discovery of Governing Equations

Adeel Pervez, Efstratios Gavves, Francesco Locatello

NEURIPS 2025oralarXiv:2501.04184

MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives

Wisdom Ikezogwo, Kevin M. Zhang, Saygin Seyfioglu

NEURIPS 2025oralarXiv:2505.11852

MedSG-Bench: A Benchmark for Medical Image Sequences Grounding

Jingkun Yue, Siqi Zhang, Zinan Jia et al.

NEURIPS 2025oralarXiv:2505.16602

MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation

Bohan Zhou, Yi Zhan, Zhongbin Zhang et al.

MemFreezing: A Novel Adversarial Attack on Temporal Graph Neural Networks under Limited Future Knowledge

Yue Dai, Liang Liu, Xulong Tang et al.

NEURIPS 2025oralarXiv:2507.03285

Memory Mosaics at scale

Jianyu Zhang, Leon Bottou

NEURIPS 2025oralarXiv:2506.03144

MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query

Wei Chow, Yuan Gao, Linfeng Li et al.

Meta-D2AG: Causal Graph Learning with Interventional Dynamic Data

Tian Gao, Songtao Lu, Junkyu Lee et al.

ICLR 2025oralarXiv:2408.14608

Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold

Lazar Atanackovic, Xi (Nicole) Zhang, Brandon Amos et al.

Meta Guidance: Incorporating Inductive Biases into Deep Time Series Imputers

Jiacheng You, Xinyang Chen, Yu Sun et al.

NEURIPS 2025oralarXiv:2506.13690

Meta-learning how to Share Credit among Macro-Actions

Ionel-Alexandru Hosu, Traian Rebedea, Razvan Pascanu

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

Xuanming Zhang, Yuxuan Chen, Samuel (Min-Hsuan) Yeh et al.

NEURIPS 2025oralarXiv:2505.18943

MGD$^3$ : Mode-Guided Dataset Distillation using Diffusion Models

Jeffrey A. Chan-Santiago, praveen tirupattur, Gaurav Kumar Nayak et al.

ICML 2025oralarXiv:2406.19680

MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Yuang Zhang, Jiaxi Gu, Li-Wen Wang et al.

161

MiNT: Multi-Network Transfer Benchmark for Temporal Graph Learning

Kiarash Shamsi, Tran Gia Bao Ngo, Razieh Shirzadkhani et al.

NEURIPS 2025oralarXiv:2506.07584

MIRA: Medical Time Series Foundation Model for Real-World Health Data

Hao Li, Bowen Deng, Chang Xu et al.

NEURIPS 2025oralarXiv:2505.12826

Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering

JIANFENG CAI, Jiale Hong, Zongmeng Zhang et al.

ICML 2025oralarXiv:2506.00592

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn

Hongyao Tang, Johan Obando-Ceron, Pablo Samuel Castro et al.

NEURIPS 2025oralarXiv:2510.27432

Mitigating Semantic Collapse in Partially Relevant Video Retrieval

WonJun Moon, MinSeok Jung, Gilhan Park et al.

MI-TRQR: Mutual Information-Based Temporal Redundancy Quantification and Reduction for Energy-Efficient Spiking Neural Networks

Dengfeng Xue, Wenjuan Li, Yifan Lu et al.

MixSignGraph: A Sign Sequence is Worth Mixed Graphs of Nodes

Shiwei Gan, Yafeng Yin, Zhiwei Jiang et al.

ICML 2025oralarXiv:2503.15798

Mixture of Lookup Experts

Shibo Jie, Yehui Tang, Kai Han et al.

NEURIPS 2025oralarXiv:2505.21333

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Yang Shi, Huanqian Wang, Xie et al.

MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention

Yucheng Li, Huiqiang Jiang, Chengruidong Zhang et al.

MobileUse: A Hierarchical Reflection-Driven GUI Agent for Autonomous Mobile Operation

Ning Li, Xiangmou Qu, Jiamu Zhou et al.

Model–Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn’t the Right One

Itamar Avitan, Tal Golan

ICML 2025oralarXiv:2505.23760

Model Immunization from a Condition Number Perspective

Amber Yijia Zheng, Cedar Site Bai, Brian Bullins et al.

Modeling Complex System Dynamics with Flow Matching Across Time and Conditions

Martin Rohbeck, Edward De Brouwer, Charlotte Bunne et al.

ICLR 2025oral

NEURIPS 2025oralarXiv:2410.16136

Modeling Dynamic Neural Activity by combining Naturalistic Video Stimuli and Stimulus-independent Latent Factors

Finn Schmidt, Polina Turishcheva, Suhas Shrinivasan et al.

NEURIPS 2025oralarXiv:2511.00977

Modeling Microenvironment Trajectories on Spatial Transcriptomics with NicheFlow

Kristiyan Sakalyan, Alessandro Palma, Filippo Guerranti et al.

NEURIPS 2025oralarXiv:2502.18347

Modeling Neural Activity with Conditionally Linear Dynamical Systems

Victor Geadah, Amin Nejatbakhsh, David Lipshutz et al.

Modularized Self-Reflected Video Reasoner for Multimodal LLM with Application to Video Question Answering

Zihan Song, Xin Wang, Zi Qian et al.

NEURIPS 2025oralarXiv:2506.05191

MokA: Multimodal Low-Rank Adaptation for MLLMs

Yake Wei, Yu Miao, Dongzhan Zhou et al.

ICML 2025oralarXiv:2506.23283

MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition

Yuhuan Yang, Chaofan Ma, Zhenjie Mao et al.

NEURIPS 2025oralarXiv:2506.10168

Momentum Multi-Marginal Schrödinger Bridge Matching

Panagiotis Theodoropoulos, Augustinos Saravanos, Evangelos Theodorou et al.

NEURIPS 2025oralarXiv:2510.21449

MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection

shengtian yang, Yue Feng, Yingshi Liu et al.

MONITRS: Multimodal Observations of Natural Incidents Through Remote Sensing

Shreelekha Revankar, Utkarsh Mall, Cheng Perng Phoo et al.

NEURIPS 2025oralarXiv:2507.16228

MonoLift: Learning 3D Manipulation Policies from Monocular RGB via Distillation

Ziru Wang, Mengmeng Wang, Guang Dai et al.

NEURIPS 2025oralarXiv:2503.14345

MoonCast: High-Quality Zero-Shot Podcast Generation

Zeqian Ju, Dongchao Yang, Shen Kai et al.

NEURIPS 2025oralarXiv:2505.20744

MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition

Hao Zhang, Zhan Zhuang, Xuehao Wang et al.

ICLR 2025oralarXiv:2410.01639

Moral Alignment for LLM Agents

Elizaveta Tennant, Stephen Hailes, Mirco Musolesi

More effort is needed to protect pedestrian privacy in the era of AI

Xingchen Zhang, Zixian Zhao