Oral Papers
1,594 papers found • Page 11 of 32
Conference
HyperIMTS: Hypergraph Neural Network for Irregular Multivariate Time Series Forecasting
Boyuan Li, Yicheng Luo, Zhen Liu et al.
Identifiability of Deep Polynomial Neural Networks
Konstantin Usevich, Ricardo Borsoi, Clara Dérand et al.
Identification of Intermittent Temporal Latent Process
Yuke Li, Yujia Zheng, Guangyi Chen et al.
IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation
Yuanze Lin, Yi-Wen Chen, Yi-Hsuan Tsai et al.
Image as a World: Generating Interactive World from Single Image via Panoramic Video Generation
Dongnan Gui, Xun Guo, Wengang Zhou et al.
ImageNet-trained CNNs are not biased towards texture: Revisiting feature reliance through controlled suppression
Tom Burgert, Oliver Stoll, Paolo Rota et al.
Imitation Learning from a Single Temporally Misaligned Video
William Huey, Yuki (Huaxiaoyue) Wang, Anne Wu et al.
Imitation Learning with Temporal Logic Constraints
Zining Fan, He Zhu
Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent
Santhosh Karnik, Anna Veselovska, Mark Iwen et al.
Impossible Videos
Zechen Bai, Hai Ci, Mike Zheng Shou
Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance
Shogo Iwazaki, Shion Takeno
Improved Regret Bounds for Gaussian Process Upper Confidence Bound in Bayesian Optimization
Shogo Iwazaki
Improved Sampling Of Diffusion Models In Fluid Dynamics With Tweedie's Formula
Youssef Shehata, Benjamin Holzschuh, Nils Thuerey
Improve Temporal Reasoning in Multimodal Large Language Models via Video Contrastive Decoding
Daiqing Qi, Dongliang Guo, Hanzhang Yuan et al.
Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking
Junhyuk So, Chiwoong Lee, Shinyoung Lee et al.
Improving LLM Video Understanding with 16 Frames Per Second
Yixuan Li, Changli Tang, Jimin Zhuang et al.
Improving planning and MBRL with temporally-extended actions
Palash Chatterjee, Roni Khardon
Improving Target Sound Extraction via Disentangled Codec Representations with Privileged Knowledge Distillation
Dail Kim, Joon-Hyuk Chang
Improving the Scaling Laws of Synthetic Data with Deliberate Practice
Reyhane Askari Hemmat, Mohammad Pezeshki, Elvis Dohmatob et al.
IMTS is Worth Time $\times$ Channel Patches: Visual Masked Autoencoders for Irregular Multivariate Time Series Prediction
Zhangyi Hu, Jiemin Wu, Hua XU et al.
In-Context Denoising with One-Layer Transformers: Connections between Attention and Associative Memory Retrieval
Matthew Smart, Alberto Bietti, Anirvan Sengupta
In-Context Reinforcement Learning From Suboptimal Historical Data
Juncheng Dong, Moyang Guo, Ethan Fang et al.
Incremental Sequence Classification with Temporal Consistency
Lucas Maystre, Gabriel Barello, Tudor Berariu et al.
Inductive Moment Matching
Linqi (Alex) Zhou, Stefano Ermon, Jiaming Song
IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios
Yifan Li, Yuhang Chen, Anh Dao et al.
INFER: A Neural-symbolic Model For Extrapolation Reasoning on Temporal Knowledge Graph
Ningyuan Li, Haihong E, Tianyu Yao et al.
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding
Minsoo Kim, Kyuhong Shim, Jungwook Choi et al.
Infinite-Resolution Integral Noise Warping for Diffusion Models
Yitong Deng, Winnie Lin, Lingxiao Li et al.
InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation
Jinlai Liu, Jian Han, Bin Yan et al.
Information Bottleneck-guided MLPs for Robust Spatial-temporal Forecasting
Min Chen, Guansong Pang, Wenjun Wang et al.
Inner Speech as Behavior Guides: Steerable Imitation of Diverse Behaviors for Human-AI coordination
Rakshit Trivedi, Kartik Sharma, David Parkes
In Search of Adam’s Secret Sauce
Antonio Orvieto, Robert Gower
Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct
Christopher Ackerman, Nina Panickssery
Instant4D: 4D Gaussian Splatting in Minutes
Zhanpeng Luo, Haoxi Ran, Li Lu
Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks
Matthew Dutson, Nathan Labiosa, Yin Li et al.
INST-IT: Boosting Instance Understanding via Explicit Visual Prompt Instruction Tuning
Wujian Peng, Lingchen Meng, Yitong Chen et al.
InstructFlow: Adaptive Symbolic Constraint-Guided Code Generation for Long-Horizon Planning
Haotian Chi, Zeyu Feng, Yueming LYU et al.
Interactive Cross-modal Learning for Text-3D Scene Retrieval
Yanglin Feng, Yongxiang Li, Yuan Sun et al.
Interchangeable Token Embeddings for Extendable Vocabulary and Alpha-Equivalence
İlker Işık, Ramazan Gokberk Cinbis, Ebru Gol
InterMask: 3D Human Interaction Generation via Collaborative Masked Modeling
Muhammad Gohar Javed, chuan guo, Li Cheng et al.
In the Eye of MLLM: Benchmarking Egocentric Video Intent Understanding with Gaze-Guided Prompting
Taiying Peng, Jiacheng Hua, Miao Liu et al.
Intrinsic Goals for Autonomous Agents: Model-Based Exploration in Virtual Zebrafish Predicts Ethological Behavior and Whole-Brain Dynamics
Reece Keller, Alyn Kirsch, Felix Pei et al.
Introducing 3D Representation for Dense Volume-to-Volume Translation via Score Fusion
Xiyue Zhu, Dou Kwark, Ruike Zhu et al.
Inverse decision-making using neural amortized Bayesian actors
Dominik Straub, Tobias Fabian Niehues, Jan Peters et al.
Inverse Reinforcement Learning with Switching Rewards and History Dependency for Characterizing Animal Behaviors
Jingyang Ke, Feiyang Wu, Jiyi Wang et al.
InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma
Xiaoxuan Hou, Jiayi Yuan, Joel Z Leibo et al.
ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks
Saurabh Jha, Rohan Arora, Yuji Watanabe et al.
ITFormer: Bridging Time Series and Natural Language for Multi-Modal QA with Large-Scale Multitask Dataset
Yilin Wang, Peixuan Lei, Jie Song et al.
IV-mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
Shitong Shao, zikai zhou, Lichen Bai et al.
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
Kai Liu, Jungang Li, Yuchong Sun et al.