Most Cited 2025 "camera image processing" Papers
Conformalized Survival Analysis for General Right-Censored Data
Hen Davidov, Shai Feldman, Gil Shamai et al.
SaFiRe: Saccade-Fixation Reiteration with Mamba for Referring Image Segmentation
Zhenjie Mao, Yang Yuhuan, Chaofan Ma et al.
Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
Qixin Zhang, Yan Sun, Can Jin et al.
CIDD: Collaborative Intelligence for Structure-Based Drug Design Empowered by LLMs
Bowen Gao, Yanwen Huang, Yiqiao Liu et al.
AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation
Wenyu Zhu, Jianhui Wang, Bowen Gao et al.
BO4Mob: Bayesian Optimization Benchmarks for High-Dimensional Urban Mobility Problem
Seunghee Ryu, Donghoon Kwon, Seongjin Choi et al.
Towards Realistic Earth-Observation Constellation Scheduling: Benchmark and Methodology
Luting Wang, Yinghao Xiang, Hongliang Huang et al.
Stability and Oracle Inequalities for Optimal Transport Maps between General Distributions
Shubo Li, Yizhe Ding, Lingzhou Xue et al.
$\texttt{AVROBUSTBENCH}$: Benchmarking the Robustness of Audio-Visual Recognition Models at Test-Time
Sarthak Kumar Maharana, Saksham Singh Kushwaha, Baoming Zhang et al.
EPFL-Smart-Kitchen: An Ego-Exo Multi-Modal Dataset for Challenging Action and Motion Understanding in Video-Language Models
Andy Bonnetto, Haozhe Qi, Franklin Leong et al.
OCTDiff: Bridged Diffusion Model for Portable OCT Super-Resolution and Enhancement
Ye Tian, Angela McCarthy, Gabriel Gomide et al.
Can Knowledge be Transferred from Unimodal to Multimodal? Investigating the Transitivity of Multimodal Knowledge Editing
Lingyong Fang, Xinzhong Wang, Depeng Wang et al.
Learning to Zoom with Anatomical Relations for Medical Structure Detection
Bin Pu, Liwen Wang, Xingbo Dong et al.
Region-Level Data Attribution for Text-to-Image Generative Models
Trong Bang Nguyen, Phi Le Nguyen, Simon Lucey et al.
Abstain Mask Retain Core: Time Series Prediction by Adaptive Masking Loss with Representation Consistency
Renzhao Liang, Sizhe Xu, Chenggang Xie et al.
PSMBench: A Benchmark and Dataset for Evaluating LLMs Extraction of Protocol State Machines from RFC Specifications
Zilin Shen, Xinyu Luo, Imtiaz Karim et al.
Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement
Xinjie Li, Ziyi Chen, Xinlu Yu et al.
MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction
Yaopeng Lou, Liao Shen, Tianqi Liu et al.
Ridge Boosting is Both Robust and Efficient
David Bruns-Smith, Zhongming Xie, Avi Feller
Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View
Zitong Zhang, Suranjan Gautam, Rui Yu
Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL
Matthew Zurek, Guy Zamir, Yudong Chen
Forensics Adapter: Adapting CLIP for Generalizable Face Forgery Detection
Xinjie Cui, Yuezun Li, Ao Luo et al.
Towards Generalizable Detector for Generated Image
Qianshu Cai, Chao Wu, Yonggang Zhang et al.
MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation
Xinyu Liu, Guolei Sun, Cheng Wang et al.
Uncertain Knowledge Graph Completion via Semi-Supervised Confidence Distribution Learning
Tianxing Wu, Shutong Zhu, Jingting Wang et al.
PPMStereo: Pick-and-Play Memory Construction for Consistent Dynamic Stereo Matching
WANG Yun, Qiaole Dong, Yongjian Zhang et al.
DSAS: A Universal Plug-and-Play Framework for Attention Optimization in Multi-Document Question Answering
Jiakai Li, Rongzheng Wang, Yizhuo Ma et al.
Diversifying Parallel Ergodic Search: A Signature Kernel Evolution Strategy
Sreevardhan Sirigiri, Christian Hughes, Ian Abraham et al.
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Yujie Zhu, Charles Hepburn, Matthew Thorpe et al.
RobAVA: A Large-scale Dataset and Baseline Towards Video based Robotic Arm Action Understanding
Baoli Sun, Ning Wang, Xinzhu Ma et al.
Predictive Preference Learning from Human Interventions
Haoyuan Cai, Zhenghao (Mark) Peng, Bolei Zhou
A Difference-of-Convex Functions Approach to Energy-Based Iterative Reasoning
Daniel Tschernutter, David Diego Castro, Maciej Kasiński
Under the Shadow: Exploiting Opacity Variation for Fine-grained Shadow Detection
Xiaotian Qiao, Ke Xu, Xianglong Yang et al.
HouseLayout3D: A Benchmark and Training-free Baseline for 3D Layout Estimation in the Wild
Valentin Bieri, Marie-Julie Rakotosaona, Keisuke Tateno et al.
Hypergraph Clustering Network with Partial Attribute Imputation
Qianqian Wang, Bowen Zhao, Zhengming Ding et al.
Understanding Fairness and Prediction Error through Subspace Decomposition and Influence Analysis
Enze Shi, Pankaj Bhagwat, Zhixian Yang et al.
Generic Event Boundary Detection via Denoising Diffusion
Jaejun Hwang, Dayoung Gong, Manjin Kim et al.
On Learning Verifiers and Implications to Chain-of-Thought Reasoning
Maria-Florina Balcan, Avrim Blum, Zhiyuan Li et al.
On Hierarchies of Fairness Notions in Cake Cutting: From Proportionality to Super Envy-Freeness
Arnav Mehra, Alexandros Psomas
Consistency of the $k_n$-nearest neighbor rule under adaptive sampling
Robi Bhattacharjee, Geelon So, Sanjoy Dasgupta
DeepDiver: Adaptive Web-Search Intensity Scaling via Reinforcement Learning
Wenxuan Shi, Haochen Tan, Chuqiao Kuang et al.
Beyond Expectations: Quantile-Guided Alignment for Risk-Calibrated Language Models
Xinran Wang, Jin Du, Azal Khan et al.
Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision
Chenshuang Zhang, Kang Zhang, Joon Son Chung et al.
DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images
Ozgur Kara, Harris Nisar, James Rehg
Variational Polya Tree
Lu Xu, Tsai Hor Chan, Lequan Yu et al.
PhysDiff: A Physically-Guided Diffusion Model for Multivariate Time Series Anomaly Detection
Long Li, Wanghu Chen, Wencheng Zhang et al.
Situat3DChange: Situated 3D Change Understanding Dataset for Multimodal Large Language Model
Ruiping Liu, Junwei Zheng, Yufan Chen et al.
Absence Bench: Language Models Can’t See What’s Missing
Harvey Yiyun Fu, Aryan Shrivastava, Jared Moore et al.
Not All Degradations Are Equal: A Targeted Feature Denoising Framework for Generalizable Image Super-Resolution
Hongjun Wang, Jiyuan Chen, Zhengwei Yin et al.
Flexible Realignment of Language Models
Wenhong Zhu, Ruobing Xie, Weinan Zhang et al.
LABridge: Text–Image Latent Alignment Framework via Mean-Conditioned OU Process
Huiyang Shao, Xin Xia, Yuxi Ren et al.
GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices
Xudong Lu, Yinghao Chen, Renshou Wu et al.
Prompt-driven Transferable Adversarial Attack on Person Re-Identification with Attribute-aware Textual Inversion
Yuan Bian, Min Liu, Yunqi Yi et al.
Can Multi-Modal LLMs Provide Live Step-by-Step Task Guidance?
Apratim Bhattacharyya, Bicheng Xu, Sanjay Haresh et al.
A Minimalistic Unified Framework for Incremental Learning across Image Restoration Tasks
Xiaoxuan Gong, Jie Ma
Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs
Fangrui Zhu, Hanhui Wang, Yiming Xie et al.
POTEC: Off-Policy Contextual Bandits for Large Action Spaces via Policy Decomposition
Yuta Saito, Jihan Yao, Thorsten Joachims
Minimizing Labeled, Maximizing Unlabeled: An Image-Driven Approach for Video Instance Segmentation
Fangyun Wei, Jinjing Zhao, Kun Yan et al.
A General Framework for Off-Policy Learning with Partially-Observed Reward
Rikiya Takehi, Masahiro Asami, Kosuke Kawakami et al.
Beyond Clean Training Data: A Versatile and Model-Agnostic Framework for Out-of-Distribution Detection with Contaminated Training Data
Yuchuan Li, Jae-Mo Kang, Il-Min Kim
Feature Purification Matters: Suppressing Outlier Propagation for Training-Free Open-Vocabulary Semantic Segmentation
Shuo Jin, Siyue Yu, Bingfeng Zhang et al.
Heatmap Regression without Soft-Argmax for Facial Landmark Detection
Chiao-An Yang, Raymond A. Yeh
A deep inverse-mapping model for a flapping robotic wing
Hadar Sharvit, Raz Karl, Tsevi Beatus
Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start
Fuyang Liu, Jiaqi Xu, Xiaowei Hu
Posterior Contraction for Sparse Neural Networks in Besov Spaces with Intrinsic Dimensionality
Kyeongwon Lee, Lizhen Lin, Jaewoo Park et al.
SECODEPLT: A Unified Benchmark for Evaluating the Security Risks and Capabilities of Code GenAI
Yuzhou Nie, Zhun Wang, Yu Yang et al.
Learning Endogenous Attention for Incremental Object Detection
Xiang Song, Yuhang He, Jingyuan Li et al.
Deep Edge Filter: Return of the Human-Crafted Layer in Deep Learning
Dongkwan Lee, JunHoo Lee, Nojun Kwak
FairDICE: Fairness-Driven Offline Multi-Objective Reinforcement Learning
Woosung Kim, Jinho Lee, Jongmin Lee et al.
BAM-ICL: Causal Hijacking In-Context Learning with Budgeted Adversarial Manipulation
Rui Chu, Bingyin Zhao, Hanling Jiang et al.
Fine-Grained 3D Gaussian Head Avatars Modeling from Static Captures via Joint Reconstruction and Registration
Yuan Sun, Xuan Wang, Cong Wang et al.
Reinforced Active Learning for Large-Scale Virtual Screening with Learnable Policy Model
Yicong Chen, Jiahua Rao, Jiancong Xie et al.
Repurposing AlphaFold3-like Protein Folding Models for Antibody Sequence and Structure Co-design
Nianzu Yang, Songlin Jiang, Jian Ma et al.
Heterogeneous Graph Transformers for Simultaneous Mobile Multi-Robot Task Allocation and Scheduling under Temporal Constraints
Batuhan Altundas, Shengkang Chen, Shivika Singh et al.
Vocabulary-Guided Gait Recognition
Panjian Huang, Saihui Hou, Chunshui Cao et al.
Is Your Diffusion Model Actually Denoising?
Daniel Pfrommer, Zehao Dou, Christopher Scarvelis et al.
Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
Sihan Zeng, Benjamin Patrick Evans, Sujay Bhatt et al.
SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection
Xin Lin, Chong Shi, Zuopeng Yang et al.
RF-Agent: Automated Reward Function Design via Language Agent Tree Search
Ning Gao, Xiuhui Zhang, Xingyu Jiang et al.
Rethinking Graph Prompts: Unraveling the Power of Data Manipulation in Graph Neural Networks
Chenyi Zi, Bowen Liu, Xiangguo Sun et al.
NeuroRenderedFake: A Challenging Benchmark to Detect Fake Images Generated by Advanced Neural Rendering Methods
Chengdong Dong, B. V. K. Vijaya Kumar, Zhenyu Zhou et al.
Mamba Modulation: On the Length Generalization of Mamba Models
Peng Lu, Jerry Huang, Qiuhao Zeng et al.
Streaming Stochastic Submodular Maximization with On-Demand User Requests
Honglian Wang, Sijing Tu, Lutz Oettershagen et al.
Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity
Mutian He, Philip N. Garner
FLAME: Fast Long-context Adaptive Memory for Event-based Vision
Biswadeep Chakraborty, Saibal Mukhopadhyay
DiffPS: Leveraging Prior Knowledge of Diffusion Model for Person Search
Giyeol Kim, Sooyoung Yang, Jihyong Oh et al.
The Fragile Truth of Saliency: Improving LLM Input Attribution via Attention Bias Optimization
Yihua Zhang, Changsheng Wang, Yiwei Chen et al.
Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits
Shaoang Li, Jian Li
MVBoost: Boost 3D Reconstruction with Multi-View Refinement
Xiangyu Liu, Xiaomei Zhang, Zhiyuan Ma et al.
Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking
Yunhao Li, Yifan Jiao, Dan Meng et al.
3D Interaction Geometric Pre-training for Molecular Relational Learning
Namkyeong Lee, Yunhak Oh, Heewoong Noh et al.
IceDiff: High Resolution and High-Quality Arctic Sea Ice Forecasting with Generative Diffusion Prior
Jingyi Xu, Siwei Tu, Weidong Yang et al.
Can NeRFs "See" without Cameras?
Chaitanya Amballa, Yu-Lin Wei, Sattwik Basu et al.
ControlFusion: A Controllable Image Fusion Network with Language-Vision Degradation Prompts
Linfeng Tang, Yeda Wang, Zhanchuan Cai et al.
Dual-Res Tandem Mamba-3D: Bilateral Breast Lesion Detection and Classification on Non-contrast Chest CT
Jiaheng Zhou, Wei Fang, Luyuan Xie et al.
Forming Auxiliary High-confident Instance-level Loss to Promote Learning from Label Proportions
Tianhao Ma, Han Chen, Juncheng Hu et al.
Dual Focus-Attention Transformer for Robust Point Cloud Registration
Kexue Fu, Ming'zhi Yuan, Changwei Wang et al.
Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching
Yuhan Liu, Jingwen Fu, Yang Wu et al.
Learning with Noisy Triplet Correspondence for Composed Image Retrieval
Shuxian Li, Changhao He, Xiting Liu et al.
Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction
Marzieh Ajirak, Oded Bein, Ellen Bowen et al.
The Indra Representation Hypothesis
Jianglin Lu, Hailing Wang, Kuo Yang et al.
Rectification-specific Supervision and Constrained Estimator for Online Stereo Rectification
Rui Gong, Kim-Hui Yap, Weide Liu et al.
CausalVerse: Benchmarking Causal Representation Learning with Configurable High-Fidelity Simulations
Guangyi Chen, Yunlong Deng, Peiyuan Zhu et al.
MyoChallenge 2024: A New Benchmark for Physiological Dexterity and Agility in Bionic Humans
Huiyi Wang, Chun Kwang Tan, Balint Hodossy et al.
Predictive Coding Enhances Meta-RL To Achieve Interpretable Bayes-Optimal Belief Representation Under Partial Observability
Po-Chen Kuo, Han Hou, Will Dabney et al.
Neural Wave Equation for Irregularly Sampled Sequence Data
Arkaprava Majumdar, M Anand Krishna, P. K. Srijith
Residual Stream Analysis of Overfitting And Structural Disruptions
Quan Liu, Han Zhou, Wenquan Wu et al.
Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback
Yi-Lun Wu, Bo-Kai Ruan, Chiang Tseng et al.
DKC: Differentiated Knowledge Consolidation for Cloth-Hybrid Lifelong Person Re-identification
Zhenyu Cui, Jiahuan Zhou, Yuxin Peng
OrbitZoo: Real Orbital Systems Challenges for Reinforcement Learning
Alexandre Oliveira, Katarina Dyreby, Francisco Caldas et al.
CountSE: Soft Exemplar Open-set Object Counting
Shuai Liu, Peng Zhang, Shiwei Zhang et al.
Explaining in Diffusion: Explaining a Classifier with Diffusion Semantics
Tahira Kazimi, Ritika Allada, Pinar Yanardag
On the Stability of Graph Convolutional Neural Networks: A Probabilistic Perspective
Ning Zhang, Henry Kenlay, Li Zhang et al.
Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning
Gunshi Gupta, Karmesh Yadav, Zsolt Kira et al.
Reward-oriented Causal Representation Learning
Zirui Yan, Emre Acartürk, Ali Tajer
Distribution Learning Meets Graph Structure Sampling
Arnab Bhattacharyya, Sutanu Gayen, Philips George John et al.
NUTS: Eddy-Robust Reconstruction of Surface Ocean Nutrients via Two-Scale Modeling
Hao Zheng, Shiyu Liang, Yuting Zheng et al.
Adaptive Variance Inflation in Thompson Sampling: Efficiency, Safety, Robustness, and Beyond
Feng Zhu, David Simchi-Levi
Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization
Jian-Ting Guo, Yu-Cheng Chen, Ping-Chun Hsieh et al.
A Tiny Change, A Giant Leap: Long-Tailed Class-Incremental Learning via Geometric Prototype Alignment
Xinyi Lai, Luojun Lin, Weijie Chen et al.
Looking Into the Water by Unsupervised Learning of the Surface Shape
Ori Lifschitz, Tali Treibitz, Dan Rosenbaum
Explainably Safe Reinforcement Learning
Sabine Rieder, Stefan Pranger, Debraj Chakraborty et al.
Differentiable Sparsity via $D$-Gating: Simple and Versatile Structured Penalization
Chris Kolb, Laetitia Frost, Bernd Bischl et al.
Attribute-formed Class-specific Concept Space: Endowing Language Bottleneck Model with Better Interpretability and Scalability
Jianyang Zhang, Qianli Luo, Guowu Yang et al.
Exploiting Vision Language Model for Training-Free 3D Point Cloud OOD Detection via Graph Score Propagation
Tiankai Chen, Yushu Li, Adam Goodge et al.
Seeds of Structure: Patch PCA Reveals Universal Compositional Cues in Diffusion Models
Qingsong Wang, Zhengchao Wan, Misha Belkin et al.
Private Set Union with Multiple Contributions
Travis Dick, Haim Kaplan, Alex Kulesza et al.
Generalized Category Discovery under Domain Shift: A Frequency Domain Perspective
Wei Feng, Zongyuan Ge
Shape and Texture: What Influences Reliable Optical Flow Estimation?
Libo Long, Xiao Hu, Jochen Lang
Let's Verify and Reinforce Image Generation Step by Step
Renrui Zhang, Chengzhuo Tong, Zhizheng Zhao et al.
CORAL: Disentangling Latent Representations in Long-Tailed Diffusion
Esther Rodriguez, Monica Welfert, Samuel McDowell et al.
Online Time Series Forecasting with Theoretical Guarantees
Zijian Li, Changze Zhou, Minghao Fu et al.
Two Causally Related Needles in a Video Haystack
Miaoyu Li, Qin Chao, Boyang Li
ECO: Evolving Core Knowledge for Efficient Transfer
Fu Feng, Yucheng Xie, Ruixiao Shi et al.
Chain of Execution Supervision Promotes General Reasoning in Large Language Models
Nuo Chen, Zehua Li, Keqin Bao et al.
SSIMBaD: Sigma Scaling with SSIM-Guided Balanced Diffusion for AnimeFace Colorization
Junpyo Seo, Hanbin Koo, Jieun Yook et al.
Enhancing Prompt Generation with Adaptive Refinement for Camouflaged Object Detection
Xuehan Chen, Guangyu Ren, Tianhong Dai et al.
Implicit Modeling for Transferability Estimation of Vision Foundation Models
Yaoyan Zheng, Huiqun Wang, Nan Zhou et al.
Neural Networks for Learnable and Scalable Influence Estimation of Instruction Fine-Tuning Data
Ishika Agarwal, Dilek Hakkani-Tur
Near-Exponential Savings for Population Mean Estimation with Active Learning
Julian Morimoto, Jacob Goldin, Daniel Ho
Exploiting Hankel-Toeplitz Structures for Fast Computation of Kernel Precision Matrices
Frida Viset, Frederiek Wesel, Arno Solin et al.
Heterogeneous Diffusion Structure Inference for Network Cascade
Siyu Huang, Abdul Basit Adeel, Yubai Yuan
MetaDefense: Defending Fine-tuning based Jailbreak Attack Before and During Generation
Weisen Jiang, Sinno Pan
How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
Max Weltevrede, Moritz Zanger, Matthijs Spaan et al.
A Closed-Form Solution for Fast and Reliable Adaptive Testing
Yan Zhuang, Chenye Ke, Zirui Liu et al.
Projection-Manifold Regularized Latent Diffusion for Robust General Image Fusion
Lei Cao, Hao Zhang, Chunyu Li et al.
Decomposing motor units through elimination for real-time intention driven assistive neurotechnology
Nicholas Tacca, Bryan Schlink, Jackson Levine et al.
FlexWorld: Progressively Expanding 3D Scenes for Flexible-View Exploration
Luxi Chen, Zihan Zhou, Min Zhao et al.
User-Instructed Disparity-aware Defocus Control
Yudong Han, Yan Yang, Hao Yang et al.
Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks
Matthew Dutson, Nathan Labiosa, Yin Li et al.
Dynamic Group Detection using VLM-augmented Temporal Groupness Graph
Kaname Yokoyama, Chihiro Nakatani, Norimichi Ukita
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification
Zinan Lin, Enshu Liu, Xuefei Ning et al.
STEAD: Robust Provably Secure Linguistic Steganography with Diffusion Language Model
Yuang Qi, Na Zhao, Qiyi Yao et al.
Differentiable Cyclic Causal Discovery Under Unmeasured Confounders
Muralikrishnna Guruswamy Sethuraman, Faramarz Fekri
SegGraph: Leveraging Graphs of SAM Segments for Few-Shot 3D Part Segmentation
Yueyang Hu, Haiyong Jiang, Haoxuan Song et al.
Temporal-Difference Variational Continual Learning
Luckeciano Carvalho Melo, Alessandro Abate, Yarin Gal
Abstract Rendering: Certified Rendering Under 3D Semantic Uncertainty
Chenxi Ji, Yangge Li, Xiangru Zhong et al.
LoRA-EnVar: Parameter-Efficient Hybrid Ensemble Variational Assimilation for Weather Forecasting
Yi Xiao, Hang Fan, Kun Chen et al.
The Computational Advantage of Depth in Learning High-Dimensional Hierarchical Targets
Yatin Dandi, Luca Pesce, Lenka Zdeborová et al.
FDPT: Federated Discrete Prompt Tuning for Black-Box Visual-Language Models
Jiaqi Wu, Simin Chen, Jing Tang et al.
UniGTE: Unified Graph–Text Encoding for Zero-Shot Generalization across Graph Tasks and Domains
Duo Wang, Yuan Zuo, Guangyue Lu et al.
Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning
Tien Manh Luong, Khai Nguyen, Dinh Phung et al.
Beyond Pairwise Connections: Extracting High-Order Functional Brain Network Structures under Global Constraints
Ling Zhan, Junjie Huang, Xiaoyao Yu et al.
EVPGS: Enhanced View Prior Guidance for Splatting-based Extrapolated View Synthesis
Jiahe Li, Feiyu Wang, Xiaochao Qu et al.
ESCA: Enabling Seamless Codec Avatar Execution through Algorithm and Hardware Co-Optimization for Virtual Reality
Mingzhi Zhu, Ding Shang, Sai Qian Zhang
PANTHER: Generative Pretraining Beyond Language for Sequential User Behavior Modeling
Guilin Li, Yun Zhang, Xiuyuan Chen et al.
Loquetier: A Virtualized Multi-LoRA Framework for Unified LLM Fine-tuning and Serving
Yuchen Zhang, Hanyue Du, Chun Cao et al.
Detecting Open World Objects via Partial Attribute Assignment
Muli Yang, Gabriel James Goenawan, Huaiyuan Qin et al.
HistoFS: Non-IID Histopathologic Whole Slide Image Classification via Federated Style Transfer with RoI-Preserving
Farchan Hakim Raswa, Chun-Shien Lu, Jia-Ching Wang
A Technical Report on “Erasing the Invisible”: The 2024 NeurIPS Competition on Stress Testing Image Watermarks
Mucong Ding, Bang An, Tahseen Rabbani et al.
Deno-IF: Unsupervised Noisy Visible and Infrared Image Fusion Method
Han Xu, Yuyang Li, Yunfei Deng et al.
Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes
JunYong Choi, Min-Cheol Sagong, SeokYeong Lee et al.
Retro-R1: LLM-based Agentic Retrosynthesis
Wei Liu, Jiangtao Feng, Hongli Yu et al.
Random Search Neural Networks for Efficient and Expressive Graph Learning
Michael Ito, Danai Koutra, Jenna Wiens
Reasoning Is Not a Race: When Stopping Early Beats Going Deeper
Mohan Zhang, Jiaxuan Gao, Shusheng Xu et al.
World Models as Reference Trajectories for Rapid Motor Adaptation
Carlos Stein Brito, Daniel McNamee
Representation Shift: Unifying Token Compression with FlashAttention
Joonmyung Choi, Sanghyeok Lee, Byungoh Ko et al.
EddyFormer: Accelerated Neural Simulations of Three-Dimensional Turbulence at Scale
Yiheng Du, Aditi Krishnapriyan
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
Yuxi Liu, Renjia Deng, Yutong He et al.
Less is More: Unlocking Specialization of Time Series Foundation Models via Structured Pruning
Lifan Zhao, Yanyan Shen, Zhaoyang Liu et al.
Breaking the Order Barrier: Off-Policy Evaluation for Confounded POMDPs
Qi Kuang, Jiayi Wang, Fan Zhou et al.
DEAL: Diffusion Evolution Adversarial Learning for Sim-to-Real Transfer
Wentao Xu, Huiqiao Fu, Haoyu Dong et al.
ProbMED: A Probabilistic Framework for Medical Multimodal Binding
Yuan Gao, Sangwook Kim, Jianzhong You et al.
Sample-Efficient Tabular Self-Play for Offline Robust Reinforcement Learning
Na Li, Zewu Zheng, Wei Ni et al.
Rectifying Soft-Label Entangled Bias in Long-Tailed Dataset Distillation
Chenyang Jiang, Hang Zhao, Xinyu Zhang et al.
Learning Extremely High Density Crowds as Active Matters
Feixiang He, Jiangbei Yue, Jialin Zhu et al.
WaLRUS: Wavelets for Long range Representation Using State Space Methods
Hossein Babaei, Mel White, Sina Alemohammad et al.
Mixtures of Subspaces for Bandwidth Efficient Context Parallel Training
Sameera Ramasinghe, Thalaiyasingam Ajanthan, Hadi Mohaghegh Dolatabadi et al.
Differentiable Constraint-Based Causal Discovery
Jincheng Zhou, Mengbo Wang, Anqi He et al.
State Size Independent Statistical Error Bound for Discrete Diffusion Models
Shintaro Wakasugi, Taiji Suzuki
VLMLight: Safety-Critical Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning Architecture
Maonan Wang, Yirong Chen, Aoyu Pang et al.
One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding
Zheyu Zhang, Ziqi Pang, Shixing Chen et al.
Keep It on a Leash: Controllable Pseudo-label Generation Towards Realistic Long-Tailed Semi-Supervised Learning
Yaxin Hou, Bo Han, Yuheng Jia et al.
Dynamic Algorithm for Explainable $k$-medians Clustering under $\ell_p$ Norm
Konstantin Makarychev, Ilias Papanikolaou, Liren Shan
CaricatureBooth: Data-Free Interactive Caricature Generation in a Photo Booth
Zhiyu Qu, Yunqi Miao, Zhensong Zhang et al.
Factual Context Validation and Simplification: A Scalable Method to Enhance GPT Trustworthiness and Efficiency
Tianyi Huang
StyleGuard: Preventing Text-to-Image-Model-based Style Mimicry Attacks by Style Perturbations
Yanjie Li, Wenxuan Zhang, Xinqi Lyu et al.
Computational Hardness of Reinforcement Learning with Partial $q^{\pi}$-Realizability
Shayan Karimi, Xiaoqi Tan
LILO: Learning to Reason at the Frontier of Learnability
Thomas Foster, Anya Sims, Johannes Forkel et al.
FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment
Hang Xu, Jie Huang, Linjiang Huang et al.