Most Cited AAAI "conditional coding" Papers
5,317 papers found • Page 4 of 27
Conference
ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Shuwei Shi, Wenbo Li, Yuechen Zhang et al.
Hierarchical Classification Auxiliary Network for Time Series Forecasting
Yanru Sun, Zongxia Xie, Dongyue Chen et al.
STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models
Mingyu Derek Ma, Xiaoxuan Wang, Po-Nien Kung et al.
SAVSR: Arbitrary-Scale Video Super-resolution via a Learned Scale-Adaptive Network
Zekun Li, Hongying Liu, Fanhua Shang et al.
ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-Order Optimization
Shuoran Jiang, Qingcai Chen, Yang Xiang et al.
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
Chao Xue, Di Liang, Pengfei Wang et al.
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training
Weiyu Huang, Yuezhou Hu, Guohao Jian et al.
Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement
Jing Wang, Jiangyun Li, Chen Chen et al.
Proportional Aggregation of Preferences for Sequential Decision Making
Nikhil Chandak, Shashwat Goel, Dominik Peters
HR-Pro: Point-Supervised Temporal Action Localization via Hierarchical Reliability Propagation
Huaxin Zhang, Xiang Wang, Xiaohao Xu et al.
Instrumental Variable Estimation for Causal Inference in Longitudinal Data with Time-Dependent Latent Confounders
Debo Cheng, Ziqi Xu, Jiuyong Li et al.
Quantifying and Analyzing Entity-Level Memorization in Large Language Models
Zhenhong Zhou, Jiuyang Xiang, Chaomeng Chen et al.
Boosting Neural Cognitive Diagnosis with Student’s Affective State Modeling
Shanshan Wang, Zhen Zeng, Xun Yang et al.
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
Jiaqi Huang, Zunnan Xu, Ting Liu et al.
Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification
Haojian Huang, Chuanyu Qin, Zhe Liu et al.
Argumentative Large Language Models for Explainable and Contestable Claim Verification
Gabriel Freedman, Adam Dejl, Deniz Gorur et al.
LogicAD: Explainable Anomaly Detection via VLM-based Text Feature Extraction
Er Jin, Qihui Feng, Yongli Mou et al.
Spatiotemporal-aware Trend-Seasonality Decomposition Network for Traffic Flow Forecasting
Lingxiao Cao, Bin Wang, Guiyuan Jiang et al.
Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation
Chengyang Ye, Yunzhi Zhuge, Pingping Zhang
GOODAT: Towards Test-Time Graph Out-of-Distribution Detection
Luzhi Wang, Di Jin, He Zhang et al.
Federated Learning with Extremely Noisy Clients via Negative Distillation
Yang Lu, Lin Chen, Yonggang Zhang et al.
T2MAC: Targeted and Trusted Multi-Agent Communication through Selective Engagement and Evidence-Driven Integration
Chuxiong Sun, Zehua Zang, Jiabao Li et al.
Is Sarcasm Detection a Step-by-Step Reasoning Process in Large Language Models?
Ben Yao, Yazhou Zhang, Qiuchi Li et al.
Training on the Benchmark Is Not All You Need
Shiwen Ni, Xiangtao Kong, Chengming Li et al.
Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms
Jianhao Ding, Zhaofei Yu, Tiejun Huang et al.
3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering
Qingyuan Zhou, Weidong Yang, Ben Fei et al.
Leveraging Partial Symmetry for Multi-Agent Reinforcement Learning
Xin Yu, Rongye Shi, Pu Feng et al.
On Oversquashing in Graph Neural Networks Through the Lens of Dynamical Systems
Alessio Gravina, Moshe Eliasof, Claudio Gallicchio et al.
Leaving the Nest: Going beyond Local Loss Functions for Predict-Then-Optimize
Sanket Shah, Bryan Wilder, Andrew Perrault et al.
Zero-Shot Aerial Object Detection with Visual Description Regularization
Chenyu Lin, Zhengqing Zang, Chenwei Tang et al.
Self-Distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang, Ning Lu, Minghui Liao et al.
Offline Model-Based Optimization via Policy-Guided Gradient Search
Yassine Chemingui, Aryan Deshwal, Nghia Hoang et al.
CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer
Yabing Wang, Fan Wang, Jianfeng Dong et al.
FedFixer: Mitigating Heterogeneous Label Noise in Federated Learning
Xinyuan Ji, Zhaowei Zhu, Wei Xi et al.
Improved Graph Contrastive Learning for Short Text Classification
Yonghao Liu, Lan Huang, Fausto Giunchiglia et al.
HGE: Embedding Temporal Knowledge Graphs in a Product Space of Heterogeneous Geometric Subspaces
Jiaxin Pan, Mojtaba Nayyeri, Yinan Li et al.
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Junyi Chen, Longteng Guo, Jia Sun et al.
Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening
Jie Huang, Rui Huang, Jinghao Xu et al.
Customizing Language Model Responses with Contrastive In-Context Learning
Xiang Gao, Kamalika Das
Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection
Songmin Dai, Yifan Wu, Xiaoqiang Li et al.
Conformal Autoregressive Generation: Beam Search with Coverage Guarantees
Nicolas Deutschmann, Marvin Alberts, María Rodríguez Martínez
ExpCLIP: Bridging Text and Facial Expressions via Semantic Alignment
Yicheng Zhong, Huawei Wei, Peiji Yang et al.
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang, Zuxuan Wu, Zhen Xing et al.
StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching
Jixun Yao, Yang Yuguang, Yu Pan et al.
What Kind of Visual Tokens Do We Need? Training-Free Visual Token Pruning for Multi-Modal Large Language Models from the Perspective of Graph
Yutao Jiang, Qiong Wu, Wenhao Lin et al.
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
Michael-Andrei Panaitescu-Liess, Zora Che, Bang An et al.
Occlusion-Embedded Hybrid Transformer for Light Field Super-Resolution
Zeyu Xiao, Zhuoyuan Li, Wei Jia
The Illusion of Empathy: How AI Chatbots Shape Conversation Perception
Tingting Liu, Salvatore Giorgi, Ankit Aich et al.
FLAME: A Small Language Model for Spreadsheet Formulas
Harshit Joshi, José Cambronero Sanchez, Abishai Ebenezer et al.
Enhancing Chain of Thought Prompting in Large Language Models via Reasoning Patterns
Yufeng Zhang, Xuepeng Wang, Lingxiang Wu et al.
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Siyu Zou, Jiji Tang, Yiyi Zhou et al.
Underwater Organism Color Fine-Tuning via Decomposition and Guidance
Xiaofeng Cong, Jie Gui, Junming Hou
SAFIRE: Segment Any Forged Image Region
Myung-Joon Kwon, Wonjun Lee, Seung-Hun Nam et al.
GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution
Jintong Hu, Bin Xia, Bin Chen et al.
LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition
Youbing Hu, Yun Cheng, Anqi Lu et al.
MESED: A Multi-Modal Entity Set Expansion Dataset with Fine-Grained Semantic Classes and Hard Negative Entities
Li Yangning, Tingwei Lu, Hai-Tao Zheng et al.
FedA3I: Annotation Quality-Aware Aggregation for Federated Medical Image Segmentation against Heterogeneous Annotation Noise
Nannan Wu, Zhaobin Sun, Zengqiang Yan et al.
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
Yuchi Wang, Junliang Guo, Jianhong Bai et al.
Upper Bounding Barlow Twins: A Novel Filter for Multi-Relational Clustering
Xiaowei Qian, Bingheng Li, Zhao Kang
Dense Audio-Visual Event Localization Under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration
Ziheng Zhou, Jinxing Zhou, Wei Qian et al.
Homophily-Related: Adaptive Hybrid Graph Filter for Multi-View Graph Clustering
Zichen Wen, Yawen Ling, Yazhou Ren et al.
TaskLAMA: Probing the Complex Task Understanding of Language Models
Quan Yuan, Mehran Kazemi, Xin Xu et al.
SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models
Shuaijie Shen, Chao Wang, Renzhuo Huang et al.
MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
Authors: Minh-Quan Le, Tam Nguyen, Trung-Nghia Le et al.
Controlling Large Language Models Through Concept Activation Vectors
Hanyu Zhang, Xiting Wang, Chengao Li et al.
Personalized LoRA for Human-Centered Text Understanding
You Zhang, Jin Wang, Liang-Chih Yu et al.
Exploring Diverse Representations for Open Set Recognition
Yu Wang, Junxian Mu, Pengfei Zhu et al.
Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning
Zhiyue Liu, Jinyuan Liu, Fanrong Ma
Towards Squeezing-Averse Virtual Try-On via Sequential Deformation
Sang-Heon Shim, Jiwoo Chung, Jae-Pil Heo
EAT: Towards Long-Tailed Out-of-Distribution Detection
Tong Wei, Bo-Lin Wang, Min-Ling Zhang
CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG
Boyi Deng, Wenjie Wang, Fengbin Zhu et al.
Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration
Gang Wu, Junjun Jiang, Kui Jiang et al.
Design Principle Transfer in Neural Architecture Search via Large Language Models
Xun Zhou, Xingyu Wu, Liang Feng et al.
Constrained Bayesian Optimization under Partial Observations: Balanced Improvements and Provable Convergence
Shengbo Wang, Ke Li
Rethinking Transformer-Based Blind-Spot Network for Self-Supervised Image Denoising
Junyi Li, Zhilu Zhang, Wangmeng Zuo
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi, Yicong Hong, Yuankai Qi et al.
Temporally and Distributionally Robust Optimization for Cold-Start Recommendation
Xinyu Lin, Wenjie Wang, Jujia Zhao et al.
Weighted Envy-Freeness for Submodular Valuations
Luisa Montanari, Ulrike Schmidt-Kraepelin, Warut Suksompong et al.
Diffusion Language-Shapelets for Semi-supervised Time-Series Classification
Zhen Liu, Wenbin Pei, Disen Lan et al.
Comparing the Robustness of Modern No-Reference Image- and Video-Quality Metrics to Adversarial Attacks
Anastasia Antsiferova, Khaled Abud, Aleksandr Gushchin et al.
Principal-Agent Reward Shaping in MDPs
Omer Ben-Porat, Yishay Mansour, Michal Moshkovitz et al.
MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls
Yuxuan Bian, Ailing Zeng, Xuan Ju et al.
Code-Style In-Context Learning for Knowledge-Based Question Answering
Zhijie Nie, Richong Zhang, Zhongyuan Wang et al.
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
Shengqiong Wu, Hao Fei, Liangming Pan et al.
Fine-Tuning Graph Neural Networks by Preserving Graph Generative Patterns
Yifei Sun, Qi Zhu, Yang Yang et al.
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking
Jiawen Zhu, Huayi Tang, Xin Chen et al.
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Chao Zeng, Songwei Liu, Yusheng Xie et al.
RPSC: Robust Pseudo-Labeling for Semantic Clustering
Sihang Liu, Wenming Cao, Ruigang Fu et al.
Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents
Arrasy Rahman, Jiaxun Cui, Peter Stone
POI-Enhancer: An LLM-based Semantic Enhancement Framework for POI Representation Learning
Jiawei Cheng, Jingyuan Wang, Yichuan Zhang et al.
Shaping Up SHAP: Enhancing Stability through Layer-Wise Neighbor Selection
Gwladys Kelodjou, Laurence Rozé, Véronique Masson et al.
PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction
Lirong Wu, Yufei Huang, Cheng Tan et al.
FusionFormer: A Concise Unified Feature Fusion Transformer for 3D Pose Estimation
Yanlu Cai, Weizhong Zhang, Yuan Wu et al.
Patch-Wise Graph Contrastive Learning for Image Translation
Chanyong Jung, Gihyun Kwon, Jong Chul Ye
Deep Incomplete Multi-View Learning Network with Insufficient Label Information
Zhangqi Jiang, Tingjin Luo, Xinyan Liang
WebVLN: Vision-and-Language Navigation on Websites
Qi Chen, Dileepa Pitawela, Chongyang Zhao et al.
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen, Yue Ma, Yu Qiao et al.
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Shaofei Huang, Rui Ling, Hongyu Li et al.
Enhancing Cognitive Diagnosis Using Un-interacted Exercises: A Collaboration-Aware Mixed Sampling Approach
Haiping Ma, Changqian Wang, Hengshu Zhu et al.
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement
Yunlong Lin, Tian Ye, Sixiang Chen et al.
Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detection
Hanzhe Liang, Guoyang Xie, Chengbin Hou et al.
Transformer-Based Selective Super-resolution for Efficient Image Refinement
Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo et al.
Repeated Fair Allocation of Indivisible Items
Ayumi Igarashi, Martin Lackner, Oliviero Nardi et al.
Agile Multi-Source-Free Domain Adaptation
Xinyao Li, Jingjing Li, Fengling Li et al.
AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scenes
Chaoran Feng, Wangbo Yu, Xinhua Cheng et al.
QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
Amir Zandieh, Majid Daliri, Insu Han
Decentralized Monte Carlo Tree Search for Partially Observable Multi-Agent Pathfinding
Alexey Skrynnik, Anton Andreychuk, Konstantin Yakovlev et al.
BCLNet: Bilateral Consensus Learning for Two-View Correspondence Pruning
Xiangyang Miao, Guobao Xiao, Shiping Wang et al.
ReGCL: Rethinking Message Passing in Graph Contrastive Learning
Cheng Ji, Zixuan Huang, Qingyun Sun et al.
Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection
Kaiqing Lin, Yuzhen Lin, Weixiang Li et al.
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors
Hao Shi, Weili Song, Xinting Zhang et al.
Text Image Inpainting via Global Structure-Guided Diffusion Models
Shipeng Zhu, Pengfei Fang, Chenjie Zhu et al.
Spectral Motion Alignment for Video Motion Transfer Using Diffusion Models
Geon Yeong Park, Hyeonho Jeong, Sang Wan Lee et al.
Aspect-Based Sentiment Analysis with Explicit Sentiment Augmentations
Jihong Ouyang, Zhiyao Yang, Silong Liang et al.
A New Mechanism for Eliminating Implicit Conflict in Graph Contrastive Learning
Dongxiao He, Jitao Zhao, Cuiying Huo et al.
Deep Quantum Error Correction
Yoni Choukroun, Lior Wolf
Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation
6428 Can Xu, Haosen Wang, Weigang Wang et al.
Kernel-Aware Graph Prompt Learning for Few-Shot Anomaly Detection
Fenfang Tao, Guo-Sen Xie, Fang Zhao et al.
Harnessing Holistic Discourse Features and Triadic Interaction for Sentiment Quadruple Extraction in Dialogues
Bobo Li, Hao Fei, Lizi Liao et al.
Project-Fair and Truthful Mechanisms for Budget Aggregation
Rupert Freeman, Ulrike Schmidt-Kraepelin
Gradient-Guided Modality Decoupling for Missing-Modality Robustness
Dense Projection for Anomaly Detection
Dazhi Fu, Zhao Zhang, Jicong Fan
DHGCN: Dynamic Hop Graph Convolution Network for Self-Supervised Point Cloud Learning
Jincen Jiang, Lizhi Zhao, Xuequan Lu et al.
Almost Envy-Free Allocations of Indivisible Goods or Chores with Entitlements
Max Springer, MohammadTaghi Hajiaghayi, Hadi Yami
MSP-MVS: Multi-Granularity Segmentation Prior Guided Multi-View Stereo
Zhenlong Yuan, Cong Liu, Fei Shen et al.
TIME-FS: Joint Learning of Tensorial Incomplete Multi-View Unsupervised Feature Selection and Missing-View Imputation
Yanyong Huang, Minghui Lu, Wei Huang et al.
CoPL: Contextual Prompt Learning for Vision-Language Understanding
Koustava Goswami, Srikrishna Karanam, Prateksha Udhayanan et al.
A Label-free Heterophily-guided Approach for Unsupervised Graph Fraud Detection
Junjun Pan, Yixin Liu, Xin Zheng et al.
Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective
Zhen Qin, Feiyi Chen, Chen Zhi et al.
S^3cMath: Spontaneous Step-Level Self-Correction Makes Large Language Models Better Mathematical Reasoners
Yuchen Yan, Jin Jiang, Yang Liu et al.
Generating Novel Leads for Drug Discovery Using LLMs with Logical Feedback
Shreyas Bhat Brahmavar, Ashwin Srinivasan, Tirtharaj Dash et al.
A Dual-Way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking
Shezheng Song, Shan Zhao, ChengYu Wang et al.
Clean-Label Graph Backdoor Attack in the Node Classification Task
Hui Xia, Xiangwei Zhao, Rui Zhang et al.
MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation
Zhifei Yang, Keyang Lu, Chao Zhang et al.
Every Node Is Different: Dynamically Fusing Self-Supervised Tasks for Attributed Graph Clustering
Pengfei Zhu, Qian Wang, Yu Wang et al.
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
Dezhi Peng, Chongyu Liu, Yuliang Liu et al.
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting
Tong Ye, Yangkai Du, Tengfei Ma et al.
Social Physics Informed Diffusion Model for Crowd Simulation
Hongyi Chen, Jingtao Ding, Yong Li et al.
Spectral-Based Graph Neutral Networks for Complementary Item Recommendation
Haitong Luo, Xuying Meng, Suhang Wang et al.
PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling
Ruizhe Zhong, Junjie Ye, Zhentao Tang et al.
ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer
Zachary Horvitz, Ajay Patel, Chris Callison-Burch et al.
Deep Homography Estimation for Visual Place Recognition
Feng Lu, Shuting Dong, Lijun Zhang et al.
Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport
Xuxin Cheng, Zhihong Zhu, Hongxiang Li et al.
Lifting by Image – Leveraging Image Cues for Accurate 3D Human Pose Estimation
Feng Zhou, Jianqin Yin, Peiyang Li
Structure-Adaptive Multi-View Graph Clustering for Remote Sensing Data
Renxiang Guan, Wenxuan Tu, Siwei Wang et al.
FatesGS: Fast and Accurate Sparse-View Surface Reconstruction Using Gaussian Splatting with Depth-Feature Consistency
Han Huang, Yulun Wu, Chao Deng et al.
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
Xu He, Zhiyong Wu, Xiaoyu Li et al.
A Novel Energy Based Model Mechanism for Multi-Modal Aspect-Based Sentiment Analysis
Tianshuo Peng, Zuchao Li, Ping Wang et al.
AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples
Antonio Emanuele Cinà, Jérôme Rony, Maura Pintor et al.
Relightable and Animatable Neural Avatars from Videos
Wenbin Lin, Chengwei Zheng, Jun-hai Yong et al.
COMBAT: Alternated Training for Effective Clean-Label Backdoor Attacks
Tran Huynh, Dang Nguyen, Tung Pham et al.
Traffic Flow Optimisation for Lifelong Multi-Agent Path Finding
Zhe Chen, Daniel Harabor, Jiaoyang Li et al.
STDiff: Spatio-Temporal Diffusion for Continuous Stochastic Video Prediction
Xi Ye, Guillaume-Alexandre Bilodeau
CognitionCapturer: Decoding Visual Stimuli from Human EEG Signal with Multimodal Information
Kaifan Zhang, Lihuo He, Xin Jiang et al.
Stitching Sub-trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim, Yunseon Choi, Daiki Matsunaga et al.
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference
Zihao Yu, Haoyang Li, Fangcheng Fu et al.
Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
Luyao Wang, Pengnian Qi, Xigang Bao et al.
Toward Falsifying Causal Graphs Using a Permutation-Based Test
Elias Eulig, Atalanti A. Mastakouri, Patrick Blöbaum et al.
Layer Collaboration in the Forward-Forward Algorithm
Guy Lorberbom, Itai Gat, Yossi Adi et al.
Diverse Person: Customize Your Own Dataset for Text-Based Person Search
Zifan Song, Guosheng Hu, Cairong Zhao
Chain of Generation: Multi-Modal Gesture Synthesis via Cascaded Conditional Control
Zunnan Xu, Yachao Zhang, Sicheng Yang et al.
MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding
HaiTao Yu, Mofei Song
Towards Adversarially Robust Dataset Distillation by Curvature Regularization
Eric Xue, Yijiang Li, Haoyang Liu et al.
What Are Step-Level Reward Models Rewarding? Counterintuitive Findings from MCTS-Boosted Mathematical Reasoning
Yiran Ma, Zui Chen, Tianqiao Liu et al.
Image Captioning with Multi-Context Synthetic Data
Feipeng Ma, Y. Zhou, Fengyun Rao et al.
Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEs
Seungjun Lee, TaeIL Oh
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
Wenxiang Guo, Yu Zhang, Changhao Pan et al.
DVSAI: Diverse View-Shared Anchors Based Incomplete Multi-View Clustering
Shengju Yu, Siwei Wang, Pei Zhang et al.
Decomposing Semantic Shifts for Composed Image Retrieval
Xingyu Yang, Daqing Liu, Heng Zhang et al.
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
Junjie Wang, Bin Chen, Bin Kang et al.
Scaling and Masking: A New Paradigm of Data Sampling for Image and Video Quality Assessment
Yongxu Liu, Yinghui Quan, Guoyao Xiao et al.
RGMComm: Return Gap Minimization via Discrete Communications in Multi-Agent Reinforcement Learning
Jingdi Chen, Tian Lan, Carlee Joe-Wong
SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration
Kezheng Xiong, Maoji Zheng, Qingshan Xu et al.
Music Style Transfer with Time-Varying Inversion of Diffusion Models
Sifei Li, Yuxin Zhang, Fan Tang et al.
Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs
Lei Zhang, Yunshui Li, Jiaming Li et al.
G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o
Tony Cheng Tong, Sirui He, Zhiwen Shao et al.
Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving
Yuhang Lu, Yichen Yao, Jiadong Tu et al.
TimePFN: Effective Multivariate Time Series Forecasting with Synthetic Data
Ege Onur Taga, Muhammed Emrullah Ildiz, Samet Oymak
Stable Unlearnable Example: Enhancing the Robustness of Unlearnable Examples via Stable Error-Minimizing Noise
Yixin Liu, Kaidi Xu, Xun Chen et al.
Multi-Source Collaborative Gradient Discrepancy Minimization for Federated Domain Generalization
Yikang Wei, Yahong Han
PRAGA: Prototype-aware Graph Adaptive Aggregation for Spatial Multi-modal Omics Analysis
Xinlei Huang, Zhiqi Ma, Dian Meng et al.
What Effects the Generalization in Visual Reinforcement Learning: Policy Consistency with Truncated Return Prediction
Shuo Wang, Zhihao Wu, X. Hu et al.
LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation
Yuchen Su, Zhineng Chen, Zhiwen Shao et al.
TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions
GraphMoRE: Mitigating Topological Heterogeneity via Mixture of Riemannian Experts
Zihao Guo, Qingyun Sun, Haonan Yuan et al.
Weakly Supervised Semantic Segmentation for Driving Scenes
Dongseob Kim, Seungho Lee, Junsuk Choe et al.
Cross-modulated Attention Transformer for RGBT Tracking
Yun Xiao, Jiacong Zhao, Andong Lu et al.
Controllable 3D Face Generation with Conditional Style Code Diffusion
Region-Aware Exposure Consistency Network for Mixed Exposure Correction
Jin Liu, Huiyuan Fu, Chuanming Wang et al.
A New Benchmark and Model for Challenging Image Manipulation Detection
Zhenfei Zhang, Mingyang Li, Ming-Ching Chang
ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference
Ziqian Zeng, Yihuai Hong, Hongliang Dai et al.
Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation
Xiang Gao, Zhengbo Xu, junhan Zhao et al.
Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network
Zhaoyang Wang, Dongyang Li, Mingyang Zhang et al.
Contrastive Continual Learning with Importance Sampling and Prototype-Instance Relation Distillation
Jiyong Li, Dilshod Azizov, Yang LI et al.
TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation
Jiajie Liu, Mengyuan Liu, Hong Liu et al.
UniDet3D: Multi-dataset Indoor 3D Object Detection
Maksim Kolodiazhnyi, Anna Vorontsova, Matvey Skripkin et al.
Video Diffusion Models Are Strong Video Inpainter
Minhyeok Lee, Suhwan Cho, Chajin Shin et al.
Personalized Federated Collaborative Filtering: A Variational AutoEncoder Approach
Zhiwei Li, Guodong Long, Tianyi Zhou et al.
Proportional Representation in Metric Spaces and Low-Distortion Committee Selection
Yusuf Kalayci, David Kempe, Vikram Kher
Local-Global Multi-Modal Distillation for Weakly-Supervised Temporal Video Grounding
6627 Peijun Bao, Yong Xia, Wenhan Yang et al.