Oral Papers
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Yekun Chai, Haoran Sun, Huang Fang et al.
Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos
Yufan Zhou, Zhaobo Qi, Lingshuai Lin et al.
MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation
Meilong Xu, Xiaoling Hu, Shahira Abousamra et al.
MaxSup: Overcoming Representation Collapse in Label Smoothing
Yuxuan Zhou, Heng Li, Zhi-Qi Cheng et al.
MCNC: Manifold-Constrained Reparameterization for Neural Compression
Chayne Thrash, Reed Andreas, Ali Abbasi et al.
MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation
Trung X. Pham, Tri Ton, Chang Yoo
Mean Flows for One-step Generative Modeling
Zhengyang Geng, Mingyang Deng, Xingjian Bai et al.
Mechanistic PDE Networks for Discovery of Governing Equations
Adeel Pervez, Efstratios Gavves, Francesco Locatello
MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives
Wisdom Ikezogwo, Kevin M. Zhang, Saygin Seyfioglu
MedSG-Bench: A Benchmark for Medical Image Sequences Grounding
Jingkun Yue, Siqi Zhang, Zinan Jia et al.
MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation
Bohan Zhou, Yi Zhan, Zhongbin Zhang et al.
MemFreezing: A Novel Adversarial Attack on Temporal Graph Neural Networks under Limited Future Knowledge
Yue Dai, Liang Liu, Xulong Tang et al.
Memory Mosaics at scale
Jianyu Zhang, Leon Bottou
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query
Wei Chow, Yuan Gao, Linfeng Li et al.
Meta-D2AG: Causal Graph Learning with Interventional Dynamic Data
Tian Gao, Songtao Lu, Junkyu Lee et al.
Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold
Lazar Atanackovic, Xi (Nicole) Zhang, Brandon Amos et al.
Meta Guidance: Incorporating Inductive Biases into Deep Time Series Imputers
Jiacheng You, Xinyang Chen, Yu Sun et al.
Meta-learning how to Share Credit among Macro-Actions
Ionel-Alexandru Hosu, Traian Rebedea, Razvan Pascanu
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems
Xuanming Zhang, Yuxuan Chen, Samuel (Min-Hsuan) Yeh et al.
MGD$^3$: Mode-Guided Dataset Distillation using Diffusion Models
Jeffrey A. Chan-Santiago, Praveen Tirupattur, Gaurav Kumar Nayak et al.
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Yuang Zhang, Jiaxi Gu, Li-Wen Wang et al.
MiNT: Multi-Network Transfer Benchmark for Temporal Graph Learning
Kiarash Shamsi, Tran Gia Bao Ngo, Razieh Shirzadkhani et al.
MIRA: Medical Time Series Foundation Model for Real-World Health Data
Hao Li, Bowen Deng, Chang Xu et al.
Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering
Jianfeng Cai, Jiale Hong, Zongmeng Zhang et al.
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Hongyao Tang, Johan Obando-Ceron, Pablo Samuel Castro et al.
Mitigating Semantic Collapse in Partially Relevant Video Retrieval
WonJun Moon, MinSeok Jung, Gilhan Park et al.
MI-TRQR: Mutual Information-Based Temporal Redundancy Quantification and Reduction for Energy-Efficient Spiking Neural Networks
Dengfeng Xue, Wenjuan Li, Yifan Lu et al.
MixSignGraph: A Sign Sequence is Worth Mixed Graphs of Nodes
Shiwei Gan, Yafeng Yin, Zhiwei Jiang et al.
Mixture of Lookup Experts
Shibo Jie, Yehui Tang, Kai Han et al.
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios
Yang Shi, Huanqian Wang, Xie et al.
MMInference: Accelerating Pre-filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention
Yucheng Li, Huiqiang Jiang, Chengruidong Zhang et al.
MobileUse: A Hierarchical Reflection-Driven GUI Agent for Autonomous Mobile Operation
Ning Li, Xiangmou Qu, Jiamu Zhou et al.
Model–Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn’t the Right One
Itamar Avitan, Tal Golan
Model Immunization from a Condition Number Perspective
Amber Yijia Zheng, Cedar Site Bai, Brian Bullins et al.
Modeling Complex System Dynamics with Flow Matching Across Time and Conditions
Martin Rohbeck, Edward De Brouwer, Charlotte Bunne et al.
Modeling Dynamic Neural Activity by combining Naturalistic Video Stimuli and Stimulus-independent Latent Factors
Finn Schmidt, Polina Turishcheva, Suhas Shrinivasan et al.
Modeling Microenvironment Trajectories on Spatial Transcriptomics with NicheFlow
Kristiyan Sakalyan, Alessandro Palma, Filippo Guerranti et al.
Modeling Neural Activity with Conditionally Linear Dynamical Systems
Victor Geadah, Amin Nejatbakhsh, David Lipshutz et al.
Modularized Self-Reflected Video Reasoner for Multimodal LLM with Application to Video Question Answering
Zihan Song, Xin Wang, Zi Qian et al.
MokA: Multimodal Low-Rank Adaptation for MLLMs
Yake Wei, Yu Miao, Dongzhan Zhou et al.
MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition
Yuhuan Yang, Chaofan Ma, Zhenjie Mao et al.
Momentum Multi-Marginal Schrödinger Bridge Matching
Panagiotis Theodoropoulos, Augustinos Saravanos, Evangelos Theodorou et al.
MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection
Shengtian Yang, Yue Feng, Yingshi Liu et al.
MONITRS: Multimodal Observations of Natural Incidents Through Remote Sensing
Shreelekha Revankar, Utkarsh Mall, Cheng Perng Phoo et al.
MonoLift: Learning 3D Manipulation Policies from Monocular RGB via Distillation
Ziru Wang, Mengmeng Wang, Guang Dai et al.
MoonCast: High-Quality Zero-Shot Podcast Generation
Zeqian Ju, Dongchao Yang, Shen Kai et al.
MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition
Hao Zhang, Zhan Zhuang, Xuehao Wang et al.
Moral Alignment for LLM Agents
Elizaveta Tennant, Stephen Hailes, Mirco Musolesi
More effort is needed to protect pedestrian privacy in the era of AI
Xingchen Zhang, Zixian Zhao
Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding
Haoran Zhou, Gim Hee Lee