Most Cited AAAI "task-prompt optimization" Papers
5,317 papers found • Page 6 of 27
Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging
Fulin Luo, Xi Chen, Xiuwen Gong et al.
Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification
YuTeng Ye, Hang Zhou, Jiale Cai et al.
Exploiting Auxiliary Caption for Video Grounding
Hongxiang Li, Meng Cao, Xuxin Cheng et al.
Knowledge Editing with Dynamic Knowledge Graphs for Multi-Hop Question Answering
Yifan Lu, Yigeng Zhou, Jing Li et al.
PNVC: Towards Practical INR-based Video Compression
Ge Gao, Ho Man Kwan, Fan Zhang et al.
Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space
Mohsin Hasan, Guojun Zhang, Kaiyang Guo et al.
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
Ming Dai, Jian Li, Jiedong Zhuang et al.
Progressive Feature Self-Reinforcement for Weakly Supervised Semantic Segmentation
Jingxuan He, Lechao Cheng, Chaowei Fang et al.
Spatial-Temporal Interplay in Human Mobility: A Hierarchical Reinforcement Learning Approach with Hypergraph Representation
Zhaofan Zhang, Yanan Xiao, Lu Jiang et al.
Situation-Dependent Causal Influence-Based Cooperative Multi-Agent Reinforcement Learning
Xiao Du, Yutong Ye, Pengyu Zhang et al.
HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models
Pei Lin
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Longtian Qiu, Shan Ning, Xuming He
M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object Tracking
Jiaming Liu, Yue Wu, Maoguo Gong et al.
Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting
Jingru Fei, Kun Yi, Wei Fan et al.
Boosting Segment Anything Model Towards Open-Vocabulary Learning
Xumeng Han, Longhui Wei, Xuehui Yu et al.
Fine-Grained Knowledge Selection and Restoration for Non-exemplar Class Incremental Learning
Jiang-Tian Zhai, Xialei Liu, Lu Yu et al.
GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent
Generalizing across Temporal Domains with Koopman Operators
Qiuhao Zeng, Wei Wang, Fan Zhou et al.
Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning
Yanqi Ge, Qiang Nie, Ye Huang et al.
Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery
Jialu Zhang, Xiaoying Yang, Wentao He et al.
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion
Yunlong Tang, Gen Zhan, Li Yang et al.
Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution
Yutao Yuan, Chun Yuan
OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving
Tianyi Yan, Junbo Yin, Xianpeng Lang et al.
Identifying Query-Relevant Neurons in Large Language Models for Long-Form Texts
Lihu Chen, Adam Dejl, Francesca Toni
Pre-Training Graph Neural Networks on Molecules by Using Subgraph-Conditioned Graph Information Bottleneck
Van Thuy Hoang, O-Joun Lee
Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training
Jianwu Li, Kaiyue Shi, Guo-Sen Xie et al.
AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing
Zhiyuan Ma, Guoli Jia, Bowen Zhou
Distilling Reliable Knowledge for Instance-Dependent Partial Label Learning
Dong-Dong Wu, Deng-Bao Wang, Min-Ling Zhang
Knowledge Is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis
Zhiang Dong, Jingyuan Chen, Fei Wu
DVP-MVS: Synergize Depth-Edge and Visibility Prior for Multi-View Stereo
Zhenlong Yuan, Jinguo Luo, Fei Shen et al.
Federated Causality Learning with Explainable Adaptive Optimization
Dezhi Yang, Xintong He, Jun Wang et al.
Object-level Geometric Structure Preserving for Natural Image Stitching
Wenxiao Cai, Wankou Yang
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation
Wanjiang Weng, Hongsong Wang, Junbo Wang et al.
TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning
Xiang Li, Yunshi Lan, Chao Yang
Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain
Xuanhua He, Tao Hu, Guoli Wang et al.
A Generalizable Anomaly Detection Method in Dynamic Graphs
Xiao Yang, Xuejiao Zhao, Zhiqi Shen
CoRA: Collaborative Information Perception by Large Language Model’s Weights for Recommendation
Yuting Liu, Jinghao Zhang, Yizhou Dang et al.
Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation
Xiuding Cai, Yaoyao Zhu, Dong Miao et al.
FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval
Yanzhe Chen, Huasong Zhong, Xiangteng He et al.
Parameterized Approximation Algorithms for Sum of Radii Clustering and Variants
Xianrun Chen, Dachuan Xu, Yicheng Xu et al.
Enhancing Ensemble Clustering with Adaptive High-Order Topological Weights
Jiaxuan Xu, Taiyong Li, Lei Duan
Mitigating Label Noise through Data Ambiguation
Julian Lienen, Eyke Hüllermeier
FairGP: A Scalable and Fair Graph Transformer Using Graph Partitioning
Renqiang Luo, Huafei Huang, Ivan Lee et al.
Language-Guided Transformer for Federated Multi-Label Classification
I-Jieh Liu, Ci-Siang Lin, Fu-En Yang et al.
Performative Federated Learning: A Solution to Model-Dependent and Heterogeneous Distribution Shifts
Kun Jin, Tongxin Yin, Zhongzhu Chen et al.
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Jinyi Liu, Zhi Wang, Yan Zheng et al.
Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation
Xinliang Zhang, Lei Zhu, Hangzhou He et al.
RhythmMamba: Fast, Lightweight, and Accurate Remote Physiological Measurement
Bochao Zou, Zizheng Guo, Xiaocheng Hu et al.
GAD-PVI: A General Accelerated Dynamic-Weight Particle-Based Variational Inference Framework
Fangyikang Wang, Huminhao Zhu, Chao Zhang et al.
Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking
Yan Gao, Haojun Xu, Jie Li et al.
DiffSED: Sound Event Detection with Denoising Diffusion
Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia et al.
Wavelet Dynamic Selection Network for Inertial Sensor Signal Enhancement
Yifeng Wang, Yi Zhao
On the Relationship Between Monotone and Squared Probabilistic Circuits
Benjie Wang, Guy Van den Broeck
Chronic Poisoning: Backdoor Attack against Split Learning
Fangchao Yu, Bo Zeng, Kai Zhao et al.
Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction
Zhixuan Chu, Mengxuan Hu, Qing Cui et al.
Learning Temporal Resolution in Spectrogram for Audio Classification
Haohe Liu, Xubo Liu, Qiuqiang Kong et al.
Neural Causal Abstractions
Kevin Xia, Elias Bareinboim
Approval-Based Committee Voting in Practice: A Case Study of (over-)Representation in the Polkadot Blockchain
Niclas Boehmer, Markus Brill, Alfonso Cevallos et al.
Local Conditional Controlling for Text-to-Image Diffusion Models
Yibo Zhao, Liang Peng, Yang Yang et al.
Identifiability of Direct Effects from Summary Causal Graphs
Simon Ferreira, Charles Assaad
Exploring More from Multiple Gait Modalities for Human Identification
Dongyang Jin, Chao Fan, Weihua Chen et al.
Full Bayesian Significance Testing via Neural Networks
Zehua Liu, Zimeng Li, Jingyuan Wang et al.
Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning
Mushui Liu, Fangtai Wu, Bozheng Li et al.
GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection
Jinqing Zhang, Yanan Zhang, Yunlong Qi et al.
Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration
Zhixuan Shen, Haonan Luo, Kexun Chen et al.
Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage
Md Rafi Ur Rashid, Jing Liu, Toshiaki Koike-Akino et al.
SimCalib: Graph Neural Network Calibration Based on Similarity between Nodes
Boshi Tang, Zhiyong Wu, Xixin Wu et al.
Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning
Tianle Xia, Liang Ding, Guojia Wan et al.
Federated Learning with Sample-level Client Drift Mitigation
Haoran Xu, Jiaze Li, Wanyi Wu et al.
ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detection
Yin Zhang, Yongqiang Zhang, Zian Zhang et al.
Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity
Yiyue Chen, Haris Vikalo, Chianing Wang
FSTA-SNN: Frequency-Based Spatial-Temporal Attention Module for Spiking Neural Networks
Kairong Yu, Tianqing Zhang, Hongwei Wang et al.
Bridging Traffic State and Trajectory for Dynamic Road Network and Trajectory Representation Learning
Chengkai Han, Jingyuan Wang, Yongyao Wang et al.
SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field
Ru Li, Jia Liu, Guanghui Liu et al.
SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression
Qingwen Bu, Sungrae Park, Minsoo Khang et al.
Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers
Ruiyuan Zhang, Jiaxiang Liu, Zexi Li et al.
Enhanced Fine-Grained Motion Diffusion for Text-Driven Human Motion Synthesis
Dong Wei, Xiaoning Sun, Huaijiang Sun et al.
Eve: Efficient Multimodal Vision Language Models with Elastic Visual Experts
Miao Rang, Zhenni Bi, Chuanjian Liu et al.
SocialCVAE: Predicting Pedestrian Trajectory via Interaction Conditioned Latents
Wei Xiang, Haoteng Yin, He Wang et al.
Generalized Planning for the Abstraction and Reasoning Corpus
Chao Lei, Nir Lipovetzky, Krista A. Ehinger
Gaussian Process Neural Additive Models
Wei Zhang, Brian Barr, John Paisley
Phoneme Hallucinator: One-Shot Voice Conversion via Set Expansion
Siyuan Shan, Yang Li, Amartya Banerjee et al.
ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning
Hongshu Guo, Zeyuan Ma, Jiacheng Chen et al.
Efficient Rectification of Neuro-Symbolic Reasoning Inconsistencies by Abductive Reflection
Wen-Chao Hu, Wang-Zhou Dai, Yuan Jiang et al.
Hyperbolic Graph Diffusion Model
Lingfeng Wen, Xuan Tang, Mingjie Ouyang et al.
OmniCount: Multi-label Object Counting with Semantic-Geometric Priors
Anindya Mondal, Sauradip Nag, Xiatian Zhu et al.
Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimization
Yue Zhang, Liqiang Jing, Vibhav Gogate
Exploring Vacant Classes in Label-Skewed Federated Learning
Kuangpu Guo, Yuhe Ding, Jian Liang et al.
MambaLCT: Boosting Tracking via Long-term Context State Space Model
Xiaohai Li, Bineng Zhong, Qihua Liang et al.
Let All Be Whitened: Multi-Teacher Distillation for Efficient Visual Retrieval
Zhe Ma, Jianfeng Dong, Shouling Ji et al.
Enhancing Trustworthiness of Graph Neural Networks with Rank-Based Conformal Training
Ting Wang, Zhixin Zhou, Rui Luo
Bidirectional Temporal Plan Graph: Enabling Switchable Passing Orders for More Efficient Multi-Agent Path Finding Plan Execution
Yifan Su, Rishi Veerapaneni, Jiaoyang Li
Generative Planning with 3D-Vision Language Pre-training for End-to-End Autonomous Driving
Tengpeng Li, Hanli Wang, Xianfei Li et al.
ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference Understanding
Ziyang Lu, Yunqiang Pei, Guoqing Wang et al.
Advancing Spiking Neural Networks Towards Multiscale Spatiotemporal Interaction Learning
Yimeng Shan, Malu Zhang, Rui-jie Zhu et al.
DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification
Kunlun Xu, Chenghao Jiang, Peixi Xiong et al.
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments
Xiulong Liu, Sudipta Paul, Moitreya Chatterjee et al.
UniMuMo: Unified Text, Music, and Motion Generation
Han Yang, Kun Su, Yutong Zhang et al.
Deep Copula-Based Survival Analysis for Dependent Censoring with Identifiability Guarantees
Weijia Zhang, Chun Kai Ling, Xuanhui Zhang
OmniSR: Shadow Removal Under Direct and Indirect Lighting
Jiamin Xu, Zelong Li, Yuxin Zheng et al.
A Generalized Shuffle Framework for Privacy Amplification: Strengthening Privacy Guarantees and Enhancing Utility
Chen E, Yang Cao, Ge Yifei
Jointly Improving the Sample and Communication Complexities in Decentralized Stochastic Minimax Optimization
Xuan Zhang, Gabriel Mancino-Ball, Necdet Serhat Aybat et al.
Symbolic Regression Enhanced Decision Trees for Classification Tasks
Kei Sen Fong, Mehul Motani
CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning
Qingsong Yan, Qiang Wang, Kaiyong Zhao et al.
Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration
Ziyang Ma, Guanrou Yang, Yifan Yang et al.
FLAME: Learning to Navigate with Multimodal LLM in Urban Environments
Yunzhe Xu, Yiyuan Pan, Zhe Liu et al.
ConcaveQ: Non-monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning
Huiqun Li, Hanhan Zhou, Yifei Zou et al.
Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation
Minqin Zhu, Anpeng Wu, Haoxuan Li et al.
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing
Xiaoyue Duan, Shuhao Cui, Guoliang Kang et al.
Learning Personalized Decision Support Policies
Umang Bhatt, Valerie Chen, Katherine M. Collins et al.
STD-PLM: Understanding Both Spatial and Temporal Properties of Spatial-Temporal Data with PLM
Yiheng Huang, Xiaowei Mao, Shengnan Guo et al.
Improving Robustness for Joint Optimization of Camera Pose and Decomposed Low-Rank Tensorial Radiance Fields
Boyu Chen, Wei-Chen Chiu, Yu-Lun Liu
Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models
Haoran Ye, Yuhang Xie, Yuanyi Ren et al.
SLIP: Spoof-Aware One-Class Face Anti-Spoofing with Language Image Pretraining
Pei-Kai Huang, Jun-Xiong Chong, Cheng-Hsuan Chiang et al.
Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation
Zhuohang Dang, Minnan Luo, Chengyou Jia et al.
Diffusion Models for Attribution
Xiongren Chen, Jiuyong Li, Jixue Liu et al.
Any-Stereo: Arbitrary Scale Disparity Estimation for Iterative Stereo Matching
Zhaohuai Liang, Changhe Li
Adaptive Discovering and Merging for Incremental Novel Class Discovery
Guangyao Chen, Peixi Peng, Yangru Huang et al.
Efficient 3D Recognition with Event-driven Spike Sparse Convolution
Xuerui Qiu, Man Yao, Jieyuan Zhang et al.
Dirichlet-Based Prediction Calibration for Learning with Noisy Labels
Chen-Chen Zong, Ye-Wen Wang, Ming-Kun Xie et al.
Yuan: Yielding Unblemished Aesthetics Through a Unified Network for Visual Imperfections Removal in Generated Images
Zhenyu Yu, Chee Seng Chan
Backdoor Attacks Against No-Reference Image Quality Assessment Models via a Scalable Trigger
Yi Yu, Song Xia, Xun Lin et al.
SDGMNet: Statistic-Based Dynamic Gradient Modulation for Local Descriptor Learning
Yuxin Deng, Jiayi Ma
Continuous Treatment Effect Estimation Using Gradient Interpolation and Kernel Smoothing
Lokesh Nagalapatti, Akshay Iyer, Abir De et al.
Robust Nonparametric Regression under Poisoning Attack
Puning Zhao, Zhiguo Wan
D3: A Methodological Exploration of Domain Division, Modeling, and Balance in Multi-Domain Recommendations
Pengyue Jia, Yichao Wang, Shanru Lin et al.
SegFace: Face Segmentation of Long-Tail Classes
Kartik Narayan, Vibashan Vs, Vishal M. Patel
From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Xilin Wang, Jia Zheng, Yuanchao Hu et al.
Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection
Hongquan Zhang, Bin-Bin Gao, Yi Zeng et al.
Transformer as Linear Expansion of Learngene
Shiyu Xia, Miaosen Zhang, Xu Yang et al.
RemDet: Rethinking Efficient Model Design for UAV Object Detection
Chen Li, Rui Zhao, Zeyu Wang et al.
Planning in the Dark: LLM-Symbolic Planning Pipeline Without Experts
Sukai Huang, Nir Lipovetzky, Trevor Cohn
VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models
Ziyi Yin, Muchao Ye, Tianrong Zhang et al.
Memory-Efficient Reversible Spiking Neural Networks
Hong Zhang, Yu Zhang
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors
Fengshuo Bai, Runze Liu, Yali Du et al.
Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation
Ling-An Zeng, Guohong Huang, Gaojie Wu et al.
AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference
Zhuomin He, Yizhen Yao, Pengfei Zuo et al.
Generalizable Fourier Augmentation for Unsupervised Video Object Segmentation
Huihui Song, Tiankang Su, Yuhui Zheng et al.
Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks
Chenyang Qiu, Guoshun Nan, Tianyu Xiong et al.
Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation
Yuzheng Wang, Zhaoyu Chen, Dingkang Yang et al.
Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection
Xiaoyu Huang, Weidong Chen, Bo Hu et al.
Understanding and Improving Optimization in Predictive Coding Networks
Nicholas Alonso, Jeffrey Krichmar, Emre Neftci
Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
Yifang Xu, Yunzhuo Sun, Benxiang Zhai et al.
Image Content Generation with Causal Reasoning
Xiaochuan Li, Baoyu Fan, Run Zhang et al.
Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting
Muyao Wang, Wenchao Chen, Bo Chen
Learning to Pivot as a Smart Expert
Tianhao Liu, Shanwen Pu, Dongdong Ge et al.
CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification
Chenyang Yu, Xuehu Liu, Jiawen Zhu et al.
Conformal Thresholded Intervals for Efficient Regression
Rui Luo, Zhixin Zhou
CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers
Jingyi Zheng, Tianyi Hu, Tianshuo Cong et al.
One Step Closer to Unbiased Aleatoric Uncertainty Estimation
Wang Zhang, Ziwen Martin Ma, Subhro Das et al.
Cross-Class Feature Augmentation for Class Incremental Learning
Taehoon Kim, JaeYoo Park, Bohyung Han
Pedestrian Attribute Recognition: A New Benchmark Dataset and a Large Language Model Augmented Framework
Jiandong Jin, Xiao Wang, Qian Zhu et al.
GRPose: Learning Graph Relations for Human Image Generation with Pose Priors
Xiangchen Yin, Donglin Di, Lei Fan et al.
Revisiting Tampered Scene Text Detection in the Era of Generative AI
Chenfan Qu, Yiwu Zhong, Fengjun Guo et al.
Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
Xianqiang Gao, Pingrui Zhang, Delin Qu et al.
Adversarial Attacks on the Interpretation of Neuron Activation Maximization
Géraldin Nanfack, Alexander Fulleringer, Jonathan Marty et al.
Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective
Wanying Wang, Yichen Zhu, Yirui Zhou et al.
Weisfeiler and Lehman Go Paths: Learning Topological Features via Path Complexes
Quang Truong, Peter Chin
MGNet: Learning Correspondences via Multiple Graphs
Dai Luanyuan, Xiaoyu Du, Hanwang Zhang et al.
Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding
Yichi Zhang, Zhihao Duan, Ming Lu et al.
Debiased Multimodal Understanding for Human Language Sequences
Zhi Xu, Dingkang Yang, Mingcheng Li et al.
Decouple Content and Motion for Conditional Image-to-Video Generation
Cuifeng Shen, Yulu Gan, Chen Chen et al.
DELTA: Pre-Train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment
Haitao Li, Qingyao Ai, Xinyan Han et al.
Robust Test-Time Adaptation for Zero-Shot Prompt Tuning
Ding-Chu Zhang, Zhi Zhou, Yufeng Li
SymmCompletion: High-Fidelity and High-Consistency Point Cloud Completion with Symmetry Guidance
Hongyu Yan, Zijun Li, Kunming Luo et al.
Debiased All-in-one Image Restoration with Task Uncertainty Regularization
Gang Wu, Junjun Jiang, Yijun Wang et al.
Towards Trustworthy Knowledge Graph Reasoning: An Uncertainty Aware Perspective
Bo Ni, Yu Wang, Lu Cheng et al.
Federated Modality-Specific Encoders and Multimodal Anchors for Personalized Brain Tumor Segmentation
Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling
Xinhao Tao, Junyan Cao, Yan Hong et al.
Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction
Zhaoxi Mu, Xinyu Yang, Sining Sun et al.
Learning Efficient and Robust Multi-Agent Communication via Graph Information Bottleneck
Shifei Ding, Wei Du, Ling Ding et al.
Rethinking Mesh Watermark: Towards Highly Robust and Adaptable Deep 3D Mesh Watermarking
Xingyu Zhu, Guanhui Ye, Xiapu Luo et al.
Exploring Transformer Extrapolation
Zhen Qin, Yiran Zhong, Hui Deng
Task-Disruptive Background Suppression for Few-Shot Segmentation
Suho Park, SuBeen Lee, Sangeek Hyun et al.
QLABGrad: A Hyperparameter-Free and Convergence-Guaranteed Scheme for Deep Learning
Fang-Xiang Wu, Minghan Fu
UCF-Crime-DVS: A Novel Event-Based Dataset for Video Anomaly Detection with Spiking Neural Networks
Yuanbin Qian, Shuhan Ye, Chong Wang et al.
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision
Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah
Double-Layer Hybrid-Label Identification Feature Selection for Multi-View Multi-Label Learning
Pingting Hao, Kunpeng Liu, Wanfu Gao
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines
Xinwei Long, Zhiyuan Ma, Ermo Hua et al.
Temporal Fair Division
Benjamin Cookson, Soroush Ebadian, Nisarg Shah
MPTSNet: Integrating Multiscale Periodic Local Patterns and Global Dependencies for Multivariate Time Series Classification
Yang Mu, Muhammad Shahzad, Xiao Xiang Zhu
Segment beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation
Renjie Wu, Hu Wang, Feras Dayoub et al.
Unraveling Batch Normalization for Realistic Test-Time Adaptation
Zixian Su, Jingwei Guo, Kai Yao et al.
Privacy-Preserving Low-Rank Adaptation Against Membership Inference Attacks for Latent Diffusion Models
Zihao Luo, Xilie Xu, Feng Liu et al.
StyO: Stylize Your Face in Only One-Shot
Bonan Li, Zicheng Zhang, Xuecheng Nie et al.
Delivering Inflated Explanations
Yacine Izza, Alexey Ignatiev, Peter Stuckey et al.
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
Jinmin He, Kai Li, Yifan Zang et al.
DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition
Sijie Wang, Rui She, Qiyu Kang et al.
MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios
Jiacheng Ruan, Wenzhen Yuan, Zehao Lin et al.
LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training
Khoi M. Le, Trinh Pham, Tho Quan et al.
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering
Youngsun Lim, Hojun Choi, Hyunjung Shim
Diff-Shadow: Global-guided Diffusion Model for Shadow Removal
Jinting Luo, Ru Li, Chengzhi Jiang et al.
Attention-Driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models Without Fine-Tuning
Hai-Ming Xu, Qi Chen, Lei Wang et al.
ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context
Chenxiao Wu, Ke Wenjun, Peng Wang et al.
Global Graph Propagation with Hierarchical Information Transfer for Incomplete Contrastive Multi-view Clustering
Guoqing Chao, Kaixin Xu, Xijiong Xie et al.
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers
Juncan Deng, Shuaiting Li, Zeyu Wang et al.
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
Xiuli Bi, Jian Lu, Bo Liu et al.
KnowPO: Knowledge-Aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models
Ruizhe Zhang, Yongxin Xu, Yuzhen Xiao et al.
Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention
Xin Yang, Wending Yan, Yuan Yuan et al.
Test-Time Personalization with Meta Prompt for Gaze Estimation
Huan Liu, Julia Qi, Zhenhao Li et al.