Most Cited 2024 "semantic error rectification" Papers
12,324 papers found • Page 59 of 62
Conference
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Qing Su, Shihao Ji
Hierarchical Novelty Detection via Fine-Grained Evidence Allocation
Spandan Pyakurel, Qi Yu
DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models
Yuyang Huang, Yabo Chen, Yuchen Liu et al.
Efficient Constrained K-center Clustering with Background Knowledge
Longkun Guo, Chaoqi Jia, Kewen Liao et al.
Multi-Resolution Diffusion Models for Time Series Forecasting
Lifeng Shen, Weiyu Chen, James Kwok
Towards Efficient Training and Evaluation of Robust Models against $l_0$ Bounded Adversarial Perturbations
Xuyang Zhong, Yixiao HUANG, Chen Liu
Position: AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research
Riley Simmons-Edler, Ryan Badman, Shayne Longpre et al.
DeepSPF: Spherical SO(3)-Equivariant Patches for Scan-to-CAD Estimation
Driton Salihu, Adam Misik, Yuankai Wu et al.
TrajPrompt: Aligning Color Trajectory with Vision-Language Representations
Li-Wu Tsao, Hao-Tang Tsui, Yu-Rou Tuan et al.
MEPSI: An MDL-Based Ensemble Pruning Approach with Structural Information
Xiao-Dong Bi, Shao-Qun Zhang, Yuan Jiang
Decentralized Scheduling with QoS Constraints: Achieving O(1) QoS Regret of Multi-Player Bandits
Qingsong Liu, Zhixuan Fang
Position: Intent-aligned AI Systems Must Optimize for Agency Preservation
Catalin Mitelut, Benjamin Smith, Peter Vamplew
Semi-supervised Learning of Dynamical Systems with Neural Ordinary Differential Equations: A Teacher-Student Model Approach
Yu Wang, Yuxuan Yin, Karthik Somayaji NS et al.
On the Convergence of Projected Bures-Wasserstein Gradient Descent under Euclidean Strong Convexity
Junyi FAN, Yuxuan Han, Zijian Liu et al.
Prompt Gradient Projection for Continual Learning
Jingyang Qiao, Zhizhong Zhang, Xin Tan et al.
Beyond Seen Primitive Concepts and Attribute-Object Compositional Learning
Nirat Saini, Khoi Pham, Abhinav Shrivastava
A Plug-and-Play Quaternion Message-Passing Module for Molecular Conformation Representation
Angxiao Yue, Dixin Luo, Hongteng Xu
New Classes of the Greedy-Applicable Arm Feature Distributions in the Sparse Linear Bandit Problem
Koji Ichikawa, Shinji Ito, Daisuke Hatano et al.
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
ShahRukh Athar, Shunsuke Saito, Stanislav Pidhorskyi et al.
CcDPM: A Continuous Conditional Diffusion Probabilistic Model for Inverse Design
Yanxuan Zhao, Peng Zhang, Guopeng Sun et al.
Universal Weak Coreset
Ragesh Jaiswal, Amit Kumar
GenH2R: Learning Generalizable Human-to-Robot Handover via Scalable Simulation Demonstration and Imitation
Zifan Wang, Junyu Chen, Ziqing Chen et al.
Fixed Non-negative Orthogonal Classifier: Inducing Zero-mean Neural Collapse with Feature Dimension Separation
Hoyong Kim, Kangil Kim
PBWR: Parametric-Building-Wireframe Reconstruction from Aerial LiDAR Point Clouds
Shangfeng Huang, Ruisheng Wang, Bo Guo et al.
A Unified Adaptive Testing System Enabled by Hierarchical Structure Search
Junhao Yu, Yan Zhuang, Zhenya Huang et al.
Sparse Autoencoders Find Highly Interpretable Features in Language Models
Robert Huben, Hoagy Cunningham, Logan Smith et al.
Exploiting Human-AI Dependence for Learning to Defer
Zixi Wei, Yuzhou Cao, Lei Feng
Secure Distributed Sparse Gaussian Process Models Using Multi-Key Homomorphic Encryption
Adil Nawaz, Guopeng Chen, Muhammad Umair Raza et al.
Towards General Algorithm Discovery for Combinatorial Optimization: Learning Symbolic Branching Policy from Bipartite Graph
Yufei Kuang, Jie Wang, Yuyan Zhou et al.
CaMIL: Causal Multiple Instance Learning for Whole Slide Image Classification
Kaitao Chen, Shiliang Sun, Jing Zhao
Bootstrapping SparseFormers from Vision Foundation Models
Ziteng Gao, Zhan Tong, Kevin Qinghong Lin et al.
Flexible Residual Binarization for Image Super-Resolution
Yulun Zhang, Haotong Qin, Zixiang Zhao et al.
DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior
Ha-Yeong Choi, Sang-Hoon Lee, Seong-Whan Lee
Predicting Dose-Response Curves with Deep Neural Networks
Pedro A. Campana, Paul Prasse, Tobias Scheffer
UV-IDM: Identity-Conditioned Latent Diffusion Model for Face UV-Texture Generation
Hong Li, Yutang Feng, Song Xue et al.
Approximation Scheme for Weighted Metric Clustering via Sherali-Adams
Dmitrii Avdiukhin, Vaggos Chatziafratis, Konstantin Makarychev et al.
Contextual Pandora’s Box
Alexia Atsidakou, Constantine Caramanis, Evangelia Gergatsouli et al.
NAPGuard: Towards Detecting Naturalistic Adversarial Patches
Siyang Wu, Jiakai Wang, Jiejie Zhao et al.
Tilt and Average : Geometric Adjustment of the Last Layer for Recalibration
Gyusang Cho, Chan-Hyun Youn
Robust Distributed Gradient Aggregation Using Projections onto Gradient Manifolds
Kwang In Kim
Generative Model Perception Rectification Algorithm for Trade-Off between Diversity and Quality
Guipeng Lan, Shuai Xiao, Jiachen Yang et al.
A Physics-informed Low-rank Deep Neural Network for Blind and Universal Lens Aberration Correction
Jin Gong, Runzhao Yang, Weihang Zhang et al.
CAMEL: CAusal Motion Enhancement Tailored for Lifting Text-driven Video Editing
Guiwei Zhang, Tianyu Zhang, Guanglin Niu et al.
Exploring Pose-Aware Human-Object Interaction via Hybrid Learning
EASTMAN Z Y WU, Yali Li, Yuan Wang et al.
Implicit Representations for Constrained Image Segmentation
Jan Philipp Schneider, Mishal Fatima, Jovita Lukasik et al.
Towards Dynamic Spatial-Temporal Graph Learning: A Decoupled Perspective
Binwu Wang, Pengkun Wang, Yudong Zhang et al.
When is Transfer Learning Possible?
My Phan, Kianté Brantley, Stephanie Milani et al.
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
Wei Li, Hehe Fan, Yongkang Wong et al.
A Closer Look at Curriculum Adversarial Training: From an Online Perspective
Lianghe Shi, Weiwei Liu
You Only Query Once: An Efficient Label-Only Membership Inference Attack
Yutong Wu, Han Qiu, Shangwei Guo et al.
Scalable Safe Policy Improvement for Factored Multi-Agent MDPs
Federico Bianchi, Edoardo Zorzi, Alberto Castellini et al.
AttEXplore: Attribution for Explanation with model parameters eXploration
Zhiyu Zhu, Huaming Chen, Jiayu Zhang et al.
Generalization Analysis for Multi-Label Learning
Yi-Fan Zhang, Min-Ling Zhang
Dynamic Knowledge Injection for AIXI Agents
Samuel Yang-Zhao, Kee Siong Ng, Marcus Hutter
On Bias-Variance Alignment in Deep Models
Lin Chen, Michal Lukasik, Wittawat Jitkrittum et al.
iMatching: Imperative Correspondence Learning
Chen Wang, Dasong Gao, Yun-Jou Lin et al.
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”
Lukas Berglund, Meg Tong, Maximilian Kaufmann et al.
Lightweight Image Super-Resolution via Flexible Meta Pruning
Yulun Zhang, Kai Zhang, Luc Van Gool et al.
Feature Distribution Matching by Optimal Transport for Effective and Robust Coreset Selection
Weaker MVI Condition: Extragradient Methods with Multi-Step Exploration
Yifeng Fan, Yongqiang Li, Bo Chen
Fundamental Limits of Distributed Covariance Matrix Estimation Under Communication Constraints
Mohammad Reza Rahmani, Mohammad Hossein Yassaee, Mohammad Ali Maddah Ali et al.
Sign Rank Limitations for Inner Product Graph Decoders
Su Hyeong Lee, QINGQI ZHANG, Risi Kondor
AnyHome: Open-Vocabulary Large-Scale Indoor Scene Generation with First-Person View Exploration
Rao Fu, Zehao Wen, Zichen Liu et al.
On the Hardness of Online Nonconvex Optimization with Single Oracle Feedback
Ziwei Guan, Yi Zhou, Yingbin Liang
MMD Graph Kernel: Effective Metric Learning for Graphs via Maximum Mean Discrepancy
Yan Sun, Jicong Fan
AnyScene: Customized Image Synthesis with Composited Foreground
Ruidong Chen, Lanjun Wang, Weizhi Nie et al.
Dialogues Are Not Just Text: Modeling Cognition for Dialogue Coherence Evaluation
A Unified Self-Distillation Framework for Multimodal Sentiment Analysis with Uncertain Missing Modalities
Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic Mining
Mixed-Effects Contextual Bandits
Weiwei Xiao, Yongyong Chen, Qiben Shan et al.
ReLU Network with Width $d+\mathcal{O}(1)$ Can Achieve Optimal Approximation Rate
Chenghao Liu, Minghua Chen
Characterizing ResNet's Universal Approximation Capability
Chenghao Liu, Enming Liang, Minghua Chen
Position: Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback
Vincent Conitzer, Rachel Freedman, Jobstq Heitzig et al.
Beyond Attention: Breaking the Limits of Transformer Context Length with Recurrent Memory
Aydar Bulatov, Yuri Kuratov, Yermek Kapushev et al.
Minimum Norm Interpolation Meets The Local Theory of Banach Spaces
Gil Kur, Pedro Abdalla, Pierre Bizeul et al.
Transportable Representations for Domain Generalization
Kasra Jalaldoust, Elias Bareinboim
Exponential Hardness of Optimization from the Locality in Quantum Neural Networks
Hao-Kai Zhang, Chengkai Zhu, Geng Liu et al.
MFOS: Model-Free & One-Shot Object Pose Estimation
JongMin Lee, Yohann Cabon, Romain Brégier et al.
Training Graph Transformers via Curriculum-Enhanced Attention Distillation
Yisong Huang, Jin Li, Xinlong Chen et al.
DRF: Improving Certified Robustness via Distributional Robustness Framework
Zekai Wang, Zhengyu Zhou, Weiwei Liu
Forward $\chi^2$ Divergence Based Variational Importance Sampling
Chengrui Li, Yule Wang, Weihan Li et al.
Rethinking the Benefits of Steerable Features in 3D Equivariant Graph Neural Networks
Shih-Hsin Wang, Yung-Chang Hsu, Justin Baker et al.
Boosting Graph Anomaly Detection with Adaptive Message Passing
Jingyan Chen, Guanghui Zhu, Chunfeng Yuan et al.
HAGO-Net: Hierarchical Geometric Massage Passing for Molecular Representation Learning
Hongbin Pei, Taile Chen, Chen A et al.
Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships
Rangel Daroya, Aaron Sun, Subhransu Maji
A Simple Romance Between Multi-Exit Vision Transformer and Token Reduction
Dongyang Liu, Meina Kan, Shiguang Shan et al.
Latent Trajectory Learning for Limited Timestamps under Distribution Shift over Time
Qiuhao Zeng, Changjian Shui, Long-Kai Huang et al.
Efficient Model Stealing Defense with Noise Transition Matrix
Dong-Dong Wu, Chilin Fu, Weichang Wu et al.
Position: Video as the New Language for Real-World Decision Making
Sherry Yang, Jacob C Walker, Jack Parker-Holder et al.
Learning to Select Views for Efficient Multi-View Understanding
Yunzhong Hou, Stephen Gould, Liang Zheng
HOIAnimator: Generating Text-prompt Human-object Animations using Novel Perceptive Diffusion Models
Wenfeng Song, Xinyu Zhang, Shuai Li et al.
Position: Will we run out of data? Limits of LLM scaling based on human-generated data
Pablo Villalobos, Anson Ho, Jaime Sevilla et al.
Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation via Variance Reduction
Yichen Wu, Long-Kai Huang, Renzhen Wang et al.
Online Stabilization of Spiking Neural Networks
Yaoyu Zhu, Jianhao Ding, Tiejun Huang et al.
PROGRAM: PROtotype GRAph Model based Pseudo-Label Learning for Test-Time Adaptation
Haopeng Sun, Lumin Xu, Sheng Jin et al.
A Study of First-Order Methods with a Deterministic Relative-Error Gradient Oracle
Nadav Hallak, Kfir Levy
HDQMF: Holographic Feature Decomposition Using Quantum Algorithms
Prathyush Poduval, Zhuowen Zou, Mohsen Imani
Bilateral Adaptation for Human-Object Interaction Detection with Occlusion-Robustness
Guangzhi Wang, Yangyang Guo, Ziwei Xu et al.
Efficient Value Iteration for s-rectangular Robust Markov Decision Processes
Navdeep Kumar, Kaixin Wang, Kfir Levy et al.
GNNCert: Deterministic Certification of Graph Neural Networks against Adversarial Perturbations
Zaishuo Xia, Han Yang, Binghui Wang et al.
REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning
Jian Wang, Zhe Cao, Diogo Luvizon et al.
Parameter-Dependent Competitive Analysis for Online Capacitated Coverage Maximization through Boostings and Attenuations
Pan Xu
Promoting External and Internal Equities Under Ex-Ante/Ex-Post Metrics in Online Resource Allocation
Karthik Abinav Sankararaman, Aravind Srinivasan, Pan Xu
H-ViT: A Hierarchical Vision Transformer for Deformable Image Registration
Morteza Ghahremani, Mohammad Khateri, Bailiang Jian et al.
Going Beyond Multi-Task Dense Prediction with Synergy Embedding Models
Huimin Huang, Yawen Huang, Lanfen Lin et al.
MR-VNet: Media Restoration using Volterra Networks
Siddharth Roheda, Amit Unde, Loay Rashid
Mudslide: A Universal Nuclear Instance Segmentation Method
Jun Wang
Virtual Immunohistochemistry Staining for Histological Images Assisted by Weakly-supervised Learning
Jiahan Li, Jiuyang Dong, Shenjin Huang et al.
Autoencoding Conditional Neural Processes for Representation Learning
Victor Prokhorov, Ivan Titov, Siddharth N
BLO-SAM: Bi-level Optimization Based Finetuning of the Segment Anything Model for Overfitting-Preventing Semantic Segmentation
Li Zhang, Youwei Liang, Ruiyi Zhang et al.
U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation
li zhang, Weiqing Meng, Yan Zhong et al.
Model Adaptation for Time Constrained Embodied Control
Jaehyun Song, Minjong Yoo, Honguk Woo
SRTube: Video-Language Pre-Training with Action-Centric Video Tube Features and Semantic Role Labeling
Juhee Lee, Jewon Kang
SPAD: Spatially Aware Multi-View Diffusers
Yash Kant, Aliaksandr Siarohin, Ziyi Wu et al.
Traceable Federated Continual Learning
Qiang Wang, Bingyan Liu, Yawen Li
DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation
Chenyang Wang, Zerong Zheng, Tao Yu et al.
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes
Alexandros Delitzas, Ayça Takmaz, Federico Tombari et al.
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding
Xu Cao, Tong Zhou, Yunsheng Ma et al.
Weakly Supervised Point Cloud Semantic Segmentation via Artificial Oracle
Hyeokjun Kweon, Jihun Kim, Kuk-Jin Yoon
SEAS: ShapE-Aligned Supervision for Person Re-Identification
Haidong Zhu, Pranav Budhwant, Zhaoheng Zheng et al.
Construct to Associate: Cooperative Context Learning for Domain Adaptive Point Cloud Segmentation
Guangrui Li
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers
Sheng Yang, Jiawang Bai, Kuofeng Gao et al.
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning
Sijin Chen, Xin Chen, Chi Zhang et al.
Effective Federated Graph Matching
Yang Zhou, Zijie Zhang, Zeru Zhang et al.
Self-cognitive Denoising in the Presence of Multiple Noisy Label Sources
Yi-Xuan Sun, Ya-Lin Zhang, BIN HAN et al.
E3V-K5: An Authentic Benchmark for Redefining Video-Based Energy Expenditure Estimation
Shengxuming Zhang, Lei Jin, Yifan Wang et al.
Fourier Priors-Guided Diffusion for Zero-Shot Joint Low-Light Enhancement and Deblurring
Xiaoqian Lv, Shengping Zhang, Chenyang Wang et al.
Understanding and Improving Source-free Domain Adaptation from a Theoretical Perspective
Yu Mitsuzumi, Akisato Kimura, Hisashi Kashima
Federated Continual Learning via Prompt-based Dual Knowledge Transfer
Hongming Piao, Yichen WU, Dapeng Wu et al.
LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction
Linqing Zhao, Xiuwei Xu, Ziwei Wang et al.
Video Frame Interpolation via Direct Synthesis with the Event-based Reference
Yuhan Liu, Yongjian Deng, Hao Chen et al.
MCNet: Rethinking the Core Ingredients for Accurate and Efficient Homography Estimation
Haokai Zhu, Si-Yuan Cao, Jianxin Hu et al.
Uncertainty-Guided Never-Ending Learning to Drive
Lei Lai, Eshed Ohn-Bar, Sanjay Arora et al.
Feedback-Guided Autonomous Driving
Jimuyang Zhang, Zanming Huang, Arijit Ray et al.
Pathformer3D: A 3D Scanpath Transformer for 360° Images
Rong Quan, yantao Lai, Mengyu Qiu et al.
Easing Concept Bleeding in Diffusion via Entity Localization and Anchoring
Jiewei Zhang, Song Guo, Peiran Dong et al.
Visual Prompting via Partial Optimal Transport
MENGYU ZHENG, Zhiwei Hao, Yehui Tang et al.
LiteSAM is Actually what you Need for segment Everything
Jianhai Fu, Yuanjie Yu, Ningchuan Li et al.
Small Steps and Level Sets: Fitting Neural Surface Models with Point Guidance
Chamin Hewa Koneputugodage, Yizhak Ben-Shabat, Dylan Campbell et al.
Efficient Training of Spiking Neural Networks with Multi-Parallel Implicit Stream Architecture
Zhigao Cao, Meng Li, Xiashuang Wang et al.
Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration
Shihao Zhou, Duosheng Chen, Jinshan Pan et al.
LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering
Jaehoon Choi, Rajvi Shah, Qinbo Li et al.
Adversarial Text to Continuous Image Generation
Kilichbek Haydarov, Aashiq Muhamed, Xiaoqian Shen et al.
Improving Neural Logic Machines via Failure Reflection
Zhiming Li, Yushi Cao, Yan Zheng et al.
Less is More: on the Over-Globalizing Problem in Graph Transformers
Yujie Xing, Xiao Wang, Yibo Li et al.
Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
Biao Gong, Siteng Huang, Yutong Feng et al.
Leveraging Camera Triplets for Efficient and Accurate Structure-from-Motion
Lalit Manam, Venu Madhav Govindu
DIEM: Decomposition-Integration Enhancing Multimodal Insights
Xinyi Jiang, Guoming Wang, Junhao Guo et al.
HOI-M^3: Capture Multiple Humans and Objects Interaction within Contextual Environment
Juze Zhang, Jingyan Zhang, Zining Song et al.
CORES: Convolutional Response-based Score for Out-of-distribution Detection
Keke Tang, Chao Hou, Weilong Peng et al.
Energy-Efficient Gaussian Processes Using Low-Precision Arithmetic
Nicolas Alder, Ralf Herbrich
Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection
Jiacheng Deng, Jiahao Lu, Tianzhu Zhang
DeMatch: Deep Decomposition of Motion Field for Two-View Correspondence Learning
Shihua Zhang, Zizhuo Li, Yuan Gao et al.
Mol-AE: Auto-Encoder Based Molecular Representation Learning With 3D Cloze Test Objective
Junwei Yang, Kangjie Zheng, Siyu Long et al.
Structured Model Probing: Empowering Efficient Transfer Learning by Structured Regularization
Zhi-Fan Wu, Chaojie Mao, Xue Wang et al.
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving
Brian Yang, Huangyuan Su, Nikolaos Gkanatsios et al.
Domain Gap Embeddings for Generative Dataset Augmentation
Yinong Oliver Wang, Younjoon Chung, Chen Henry Wu et al.
Complexity Matters: Feature Learning in the Presence of Spurious Correlations
GuanWen Qiu, Da Kuang, Surbhi Goel
IQ-VFI: Implicit Quadratic Motion Estimation for Video Frame Interpolation
Mengshun Hu, Kui Jiang, Zhihang Zhong et al.
Towards Resource-friendly, Extensible and Stable Incomplete Multi-view Clustering
Shengju Yu, Dong Zhibin, Siwei Wang et al.
TCP:Textual-based Class-aware Prompt tuning for Visual-Language Model
Hantao Yao, Rui Zhang, Changsheng Xu
TransLoc4D: Transformer-based 4D Radar Place Recognition
Guohao Peng, Heshan Li, Yangyang Zhao et al.
Scalable Multiple Kernel Clustering: Learning Clustering Structure from Expectation
Weixuan Liang, En Zhu, Shengju Yu et al.
Decouple then Classify: A Dynamic Multi-view Labeling Strategy with Shared and Specific Information
Xinhang Wan, Jiyuan Liu, Xinwang Liu et al.
6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry
Sungho Chun, Ju Yong Chang
Higher-order Relational Reasoning for Pedestrian Trajectory Prediction
Sungjune Kim, Hyung-gun Chi, Hyerin Lim et al.
Absolute Pose from One or Two Scaled and Oriented Features
Jonathan Ventura, Zuzana Kukelova, Torsten Sattler et al.
Draw Step by Step: Reconstructing CAD Construction Sequences from Point Clouds via Multimodal Diffusion.
Weijian Ma, Shuaiqi Chen, Yunzhong Lou et al.
Open-Vocabulary 3D Semantic Segmentation with Foundation Models
Li Jiang, Shaoshuai Shi, Bernt Schiele
Training Vision Transformers for Semi-Supervised Semantic Segmentation
Xinting Hu, Li Jiang, Bernt Schiele
ESCAPE: Encoding Super-keypoints for Category-Agnostic Pose Estimation
Khoi D Nguyen, Chen Li, Gim Hee Lee
Dual-Consistency Model Inversion for Non-Exemplar Class Incremental Learning
Zihuan Qiu, Yi Xu, Fanman Meng et al.
S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition
Mohamed Abdelfattah, Alexandre ALahi
NC-TTT: A Noise Constrastive Approach for Test-Time Training
David OSOWIECHI, Gustavo Vargas Hakim, Mehrdad Noori et al.
FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees
Jiahao Liu, Yipeng Zhou, Di Wu et al.
Class Tokens Infusion for Weakly Supervised Semantic Segmentation
Sung-Hoon Yoon, Hoyong Kwon, Hyeonseong Kim et al.
Improving Training Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architecture
Huijie Zhang, Yifu Lu, Ismail Alkhouri et al.
Endow SAM with Keen Eyes: Temporal-spatial Prompt Learning for Video Camouflaged Object Detection
Wenjun Hui, Zhenfeng Zhu, Shuai Zheng et al.
Attack-free Evaluating and Enhancing Adversarial Robustness on Categorical Data
Yujun Zhou, Yufei Han, Haomin Zhuang et al.
Bridging Environments and Language with Rendering Functions and Vision-Language Models
Théo Cachet, Christopher Dance, Olivier Sigaud
NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning
Mustafa B Gurbuz, Jean Moorman, Constantine Dovrolis
DeconfuseTrack: Dealing with Confusion for Multi-Object Tracking
Cheng Huang, Shoudong Han, Mengyu He et al.
Improving Unsupervised Domain Adaptation: A Pseudo-Candidate Set Approach
Aveen Dayal, Rishabh Lalla, Linga Reddy Cenkeramaddi et al.
Fine-grained Local Sensitivity Analysis of Standard Dot-Product Self-Attention
Aaron Havens, Alexandre Araujo, Huan Zhang et al.
JoAPR: Cleaning the Lens of Prompt Learning for Vision-Language Models
YUNCHENG GUO, Xiaodong Gu
GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors
Yuan Dong, Qi Zuo, Xiaodong Gu et al.
Weakly-Supervised Audio-Visual Video Parsing with Prototype-based Pseudo-Labeling
Kranthi Kumar Rachavarapu, Kalyan Ramakrishnan, A. N. Rajagopalan
Enhancing Optimization Robustness in 1-bit Neural Networks through Stochastic Sign Descent
NianHui Guo, Hong Guo, Christoph Meinel et al.
Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration
Yixuan Sun, Zhangyue Yin, Haibo Wang et al.
View From Above: Orthogonal-View aware Cross-view Localization
Shan Wang, Chuong Nguyen, Jiawei Liu et al.
3DToonify: Creating Your High-Fidelity 3D Stylized Avatar Easily from 2D Portrait Images
Yifang Men, Hanxi Liu, Yuan Yao et al.
Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language Understanding
Wujian Peng, Sicheng Xie, Zuyao You et al.
DIOD: Self-Distillation Meets Object Discovery
Sandra Kara, Hejer AMMAR, Julien Denize et al.
Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Multi-Scale Aggregation and Anthropic Prior Knowledge
Bo Zou, Shaofeng Wang, Hao Liu et al.
Countering Personalized Text-to-Image Generation with Influence Watermarks
Hanwen Liu, Zhicheng Sun, Yadong Mu
Forecasting of 3D Whole-body Human Poses with Grasping Objects
yan haitao, Qiongjie Cui, Jiexin Xie et al.
Online Learning and Information Exponents: The Importance of Batch size & Time/Complexity Tradeoffs
Luca Arnaboldi, Yatin Dandi, FLORENT KRZAKALA et al.
Learning Degradation-unaware Representation with Prior-based Latent Transformations for Blind Face Restoration
Lianxin Xie, csbingbing zheng, Wen Xue et al.
Edge-Aware 3D Instance Segmentation Network with Intelligent Semantic Prior
Wonseok Roh, Hwanhee Jung, Giljoo Nam et al.
Don’t Drop Your Samples! Coherence-Aware Training Benefits Conditional Diffusion
Nicolas Dufour, Victor Besnier, Vicky Kalogeiton et al.