Most Cited AAAI "multi-frame analysis" Papers
5,317 papers found • Page 7 of 27
Conference
Test-Time Personalization with Meta Prompt for Gaze Estimation
Huan Liu, Julia Qi, Zhenhao Li et al.
Temporal Correlation Vision Transformer for Video Person Re-Identification
Pengfei Wu, Le Wang, Sanping Zhou et al.
MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation
Nhat Hoang, Kehong Gong, Chuan Guo et al.
Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models
Zheng Hu, Zhe Li, Ziyun Jiao et al.
Multi-Dimensional Fair Federated Learning
Cong Su, Guoxian Yu, Jun Wang et al.
BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation
Haotian Peng, Jiawei Liu, Jinsong Du et al.
UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution
Gengrui Zhang, Xiaoshuang Chen, Yao WANG et al.
Learning Spatially Collaged Fourier Bases for Implicit Neural Representation
Jason Chun Lok Li, Chang Liu, Binxiao Huang et al.
A Positive-Unlabeled Metric Learning Framework for Document-Level Relation Extraction with Incomplete Labeling
Ye Wang, Huazheng Pan, Tao Zhang et al.
MPTSNet: Integrating Multiscale Periodic Local Patterns and Global Dependencies for Multivariate Time Series Classification
Yang Mu, Muhammad Shahzad, Xiao Xiang Zhu
Cycle Self-Refinement for Multi-Source Domain Adaptation
Chaoyang Zhou, Zengmao Wang, Bo Du et al.
Non-parametric Representation Learning with Kernels
Hebaixu Wang, Meiqi Gong, Xiaoguang Mei et al.
FedST: Federated Style Transfer Learning for Non-IID Image Segmentation
Boyuan Ma, Yin Xiang, Jing Tan et al.
ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation
Chenglong Wang, Hang Zhou, Yimin Hu et al.
Knowledge Graph Error Detection with Contrastive Confidence Adaption
Xiangyu Liu, Yang Liu, Wei Hu
Segment beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation
Renjie Wu, Hu Wang, Feras Dayoub et al.
Understanding Emotional Body Expressions via Large Language Models
Haifeng Lu, Jiuyi Chen, Feng Liang et al.
UFDA: Universal Federated Domain Adaptation with Practical Assumptions
Xinhui Liu, Zhenghao Chen, Luping Zhou et al.
ViSTec: Video Modeling for Sports Technique Recognition and Tactical Analysis
Yuchen He, Zeqing Yuan, Yihong Wu et al.
CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization
Feize Wu, Yun Pang, Junyi Zhang et al.
Chain-of-Instructions: Compositional Instruction Tuning on Large Language Models
Shirley Anugrah Hayati, Taehee Jung, Tristan Bodding-Long et al.
Knowledge-Aware Parameter Coaching for Personalized Federated Learning
Mingjian Zhi, Yuanguo Bi, Wenchao Xu et al.
StyO: Stylize Your Face in Only One-Shot
Bonan Li, Zicheng Zhang, Xuecheng Nie et al.
MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios
Jiacheng Ruan, Wenzhen Yuan, Zehao Lin et al.
Delivering Inflated Explanations
Yacine Izza, Alexey Ignatiev, Peter Stuckey et al.
Global Graph Propagation with Hierarchical Information Transfer for Incomplete Contrastive Multi-view Clustering
Guoqing Chao, Kaixin Xu, Xijiong Xie et al.
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers
Juncan Deng, Shuaiting Li, Zeyu Wang et al.
The Complexity of Fair Division of Indivisible Items with Externalities
Argyrios Deligkas, Eduard Eiben, Viktoriia Korchemna et al.
Attention-Driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models Without Fine-Tuning
Hai-Ming Xu, Qi Chen, Lei Wang et al.
FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation
Yuntian Bo, Yazhou Zhu, Lunbo Li et al.
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering
Youngsun Lim, Hojun Choi, Hyunjung Shim
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer
Linglin Jing, Ying Xue, Xu Yan et al.
Learning Invariant Inter-pixel Correlations for Superpixel Generation
Sen Xu, Shikui Wei, Tao Ruan et al.
From GARCH to Neural Network for Volatility Forecast
Pengfei Zhao, Haoren ZHU, Wilfred Ng et al.
Temporal Fair Division
Benjamin Cookson, Soroush Ebadian, Nisarg Shah
Automated Design of Affine Maximizer Mechanisms in Dynamic Settings
Michael Curry, Vinzenz Thoma, Darshan Chakrabarti et al.
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation
Seyeon Kim, Siyoon Jin, Jihye Park et al.
Enhancing Non-English Capabilities of English-Centric Large Language Models Through Deep Supervision Fine-Tuning
Wenshuai Huo, Xiaocheng Feng, Yichong Huang et al.
NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization
Danial Kamali, Elham J. Barezi, Parisa Kordjamshidi
ConsistNER: Towards Instructive NER Demonstrations for LLMs with the Consistency of Ontology and Context
Chenxiao Wu, Ke Wenjun, Peng Wang et al.
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
Xiuli Bi, Jian Lu, Bo Liu et al.
Continuous Piecewise-Affine Based Motion Model for Image Animation
Hexiang Wang, Fengqi Liu, Qianyu Zhou et al.
Uncertainty Regularized Evidential Regression
Kai Ye, Tiejin Chen, Hua Wei et al.
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Thomy Phan, Taoan Huang, Bistra Dilkina et al.
MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents
Congchi Yin, Feng Li, Shu Zhang et al.
AT4CTR: Auxiliary Match Tasks for Enhancing Click-Through Rate Prediction
Qi Liu, Xuyang Hou, Defu Lian et al.
KnowPO: Knowledge-Aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models
Ruizhe Zhang, Yongxin Xu, Yuzhen Xiao et al.
Point Transformer with Federated Learning for Predicting Breast Cancer HER2 Status from Hematoxylin and Eosin-Stained Whole Slide Images
Bao Li, Zhenyu Liu, Lizhi Shao et al.
Poincaré Differential Privacy for Hierarchy-Aware Graph Embedding
Yuecen Wei, Haonan Yuan, Xingcheng Fu et al.
Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition
Bozheng Li, Mushui Liu, Gaoang Wang et al.
Accurate and Regret-Aware Numerical Problem Solver for Tabular Question Answering
Yuxiang Wang, Jianzhong Qi, Junhao Gan
Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming
Haotian Ling, Zhihai Wang, Jie Wang
BLADE: Box-Level Supervised Amodal Segmentation through Directed Expansion
Zhaochen Liu, Zhixuan Li, Tingting Jiang
I-rebalance: Personalized Vehicle Repositioning for Supply Demand Balance
Haoyang Chen, Peiyan Sun, Qiyuan Song et al.
FD3D: Exploiting Foreground Depth Map for Feature-Supervised Monocular 3D Object Detection
Zizhang Wu, Yuanzhu Gan, Yunzhe Wu et al.
FedLPS: Heterogeneous Federated Learning for Multiple Tasks with Local Parameter Sharing
Yongzhe Jia, Xuyun Zhang, Amin Beheshti et al.
Union Subgraph Neural Networks
Jiaxing Xu, Aihu Zhang, Qingtian Bian et al.
Open-Set Facial Expression Recognition
Yuhang Zhang, Yue Yao, Xuannan Liu et al.
Let There Be Sound: Reconstructing High Quality Speech from Silent Videos
Ji-Hoon Kim, Jaehun Kim, Joon Son Chung
Compressing Image-to-Image Translation GANs Using Local Density Structures on Their Learned Manifold
Alireza Ganjdanesh, Shangqian Gao, Hirad Alipanah et al.
Tuning-Free Accountable Intervention for LLM Deployment – a Metacognitive Approach
Zhen Tan, Jie Peng, Song Wang et al.
LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training
Khoi M. Le, Trinh Pham, Tho Quan et al.
Relational Programming with Foundational Models
Ziyang Li, Jiani Huang, Jason Liu et al.
Stability in Online Coalition Formation
Authors: Martin Bullinger, René Romen
Adaptive Prompting for Continual Relation Extraction: A Within-Task Variance Perspective
Minh Le, Tien Ngoc Luu, An Nguyen The et al.
Importance Weighting Can Help Large Language Models Self-Improve
Chunyang Jiang, Chi-Min Chan, Wei Xue et al.
Selective Visual Prompting in Vision Mamba
Yifeng Yao, Zichen Liu, Zhenyu Cui et al.
DG-Mamba: Robust and Efficient Dynamic Graph Structure Learning with Selective State Space Models
Haonan Yuan, Qingyun Sun, Zhaonan Wang et al.
ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Models
Yeji Park, Deokyeong Lee, Junsuk Choe et al.
Detect Any Keypoints: An Efficient Light-Weight Few-Shot Keypoint Detector
Changsheng Lu, Piotr Koniusz
Bridging the Gap for Test-Time Multimodal Sentiment Analysis
Zirun Guo, Tao Jin, Wenlong Xu et al.
1497 Once and for All: Universal Transferable Adversarial Perturbation against Deep Hashing-Based Facial Image Retrieval
Long Tang, Dengpan Ye, Yunna Lv et al.
Hierarchical Mixture of Experts: Generalizable Learning for High-Level Synthesis
Weikai Li, Ding Wang, Zijian Ding et al.
No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning
Dianyu Zhong, Yiqin Yang, Qianchuan Zhao
Improving Generalization for AI-Synthesized Voice Detection
Hainan Ren, Li Lin, Chun-Hao Liu et al.
Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction
Da Luo, Yanglei Gan, Rui Hou et al.
CR-SAM: Curvature Regularized Sharpness-Aware Minimization
Tao Wu, Tie Luo, Donald Wunsch
LINGO-Space: Language-Conditioned Incremental Grounding for Space
Dohyun Kim, Nayoung Oh, Deokmin Hwang et al.
Conformalized Interval Arithmetic with Symmetric Calibration
Rui Luo, Zhixin Zhou
One Node One Model: Featuring the Missing-Half for Graph Clustering
Xuanting Xie, Bingheng Li, Erlin Pan et al.
Tri-Ergon: Fine-Grained Video-to-Audio Generation with Multi-Modal Conditions and LUFS Control
Bingliang Li, Fengyu Yang, Yuxin Mao et al.
Combinatorial Stochastic-Greedy Bandit
Fares Fourati, Christopher John Quinn, Mohamed-Slim Alouini et al.
Early Concept Drift Detection via Prediction Uncertainty
Pengqian Lu, Jie Lu, Anjin Liu et al.
Colour Passing Revisited: Lifted Model Construction with Commutative Factors
Malte Luttermann, Tanya Braun, Ralf Möller et al.
Cross-Modal Match for Language Conditioned 3D Object Grounding
Yachao Zhang, Runze Hu, Ronghui Li et al.
Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion
Jingyuan Chen, Fuchen Long, Jie An et al.
A Sequentially Fair Mechanism for Multiple Sensitive Attributes
Francois HU, Philipp Ratz, Arthur Charpentier
Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB
Shengheng Liu, Xingkang Li, Zihuan Mao et al.
Omnipotent Distillation with LLMs for Weakly-Supervised Natural Language Video Localization:
Peijun Bao, Zihao Shao, Wenhan Yang et al.
Multi-Domain Recommendation to Attract Users via Domain Preference Modeling
Hyunjun Ju, SeongKu Kang, Dongha Lee et al.
Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera
Chengxu Liu, Xuan Wang, Yuanting Fan et al.
VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression
Won Jo, Geuntaek Lim, Gwangjin Lee et al.
Multi-View Dynamic Reflection Prior for Video Glass Surface Detection
Fang Liu, Yuhao Liu, Jiaying Lin et al.
Joint Learning Neuronal Skeleton and Brain Circuit Topology with Permutation Invariant Encoders for Neuron Classification
Minghui Liao, Guojia Wan, Bo Du
Low-Light Face Super-resolution via Illumination, Structure, and Texture Associated Representation
Chenyang Wang, Junjun Jiang, Kui Jiang et al.
Knowledge Graph Completion with Relation-Aware Anchor Enhancement
Duanyang Yuan, Sihang Zhou, Xiaoshu Chen et al.
Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons
Julien Fageot, Lê-Nguyên Hoang, Oscar Villemaud et al.
Cumulative Regret Analysis of the Piyavskii–Shubert Algorithm and Its Variants for Global Optimization
Kaan Gokcesu, Hakan Gökcesu
Adversarial Purification with the Manifold Hypothesis
Zhaoyuan Yang, Zhiwei Xu, Jing Zhang et al.
Collaborative Synthesis of Patient Records through Multi-Visit Health State Inference
Hongda Sun, Hongzhan Lin, Rui Yan
PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample Consensus
Florian Kluger, Bodo Rosenhahn
Few-Shot Neural Radiance Fields under Unconstrained Illumination
SeokYeong Lee, JunYong Choi, Seungryong Kim et al.
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-resolution
Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Hao Liu, Xin Li, Mingming Gong et al.
ParZC: Parametric Zero-Cost Proxies for Efficient NAS
Peijie Dong, Lujun Li, Zhenheng Tang et al.
Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction
Wei Qian, Chenxu Zhao, Yangyi Li et al.
Contributing Dimension Structure of Deep Feature for Coreset Selection
Zhijing Wan, Zhixiang Wang, Yuran Wang et al.
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering
Chun-Mei Feng, Yang Bai, Tao Luo et al.
Confidence Estimation for Error Detection in Text-to-SQL Systems
Oleg Somov, Elena Tutubalina
DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval
Yating Liu, Zimo Liu, Xiangyuan Lan et al.
PowerMLP: An Efficient Version of KAN
Ruichen Qiu, Yibo Miao, Shiwen Wang et al.
FlexiTex: Enhancing Texture Generation via Visual Guidance
Dadong Jiang, Xianghui Yang, Zibo Zhao et al.
SlerpFace: Face Template Protection via Spherical Linear Interpolation
Zhizhou Zhong, Yuxi Mi, Yuge Huang et al.
UniAP: Towards Universal Animal Perception in Vision via Few-Shot Learning
Meiqi Sun, Zhonghan Zhao, Wenhao Chai et al.
Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching
Huatian Zhang, Lei Zhang, Kun Zhang et al.
Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge
Seong-Tae Kim, Hyungil Kim, Y. Ro
Parsing All Adverse Scenes: Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency
Fuhao Li, Ziyang Gong, Yupeng Deng et al.
CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation
Han He, Qianchu Liu, Lei Xu et al.
MobileInst: Video Instance Segmentation on the Mobile
Renhong Zhang, Tianheng Cheng, Shusheng Yang et al.
HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval
Zexuan Qiu, Jiahong Liu, Yankai Chen et al.
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model Using 3D Whole-Body CT Scans
Heng Guo, Jianfeng Zhang, Jiaxing Huang et al.
Learning Deformable Hypothesis Sampling for Accurate PatchMatch Multi-View Stereo
Hongjie Li, Yao Guo, Xianwei Zheng et al.
Disentangled Motion Modeling for Video Frame Interpolation
Jaihyun Lew, Jooyoung Choi, Chaehun Shin et al.
OctOcc: High-Resolution 3D Occupancy Prediction with Octree
Wenzhe Ouyang, Xiaolin Song, Bailan Feng et al.
Exact ASP Counting with Compact Encodings
Mohimenul Kabir, Supratik Chakraborty, Kuldeep S Meel
Beyond Federated Prototype Learning: Learnable Semantic Anchors with Hyperspherical Contrast for Domain-Skewed Data
Lele Fu, Sheng Huang, Yanyi Lai et al.
Coupling Graph Neural Networks with Fractional Order Continuous Dynamics: A Robustness Study
Qiyu Kang, Kai Zhao, Yang Song et al.
PTMQ: Post-training Multi-Bit Quantization of Neural Networks
Ke Xu, Zhongcheng Li, Shanshan Wang et al.
Modeling Inter-Intra Heterogeneity for Graph Federated Learning
Wentao Yu, Shuo Chen, Yongxin Tong et al.
Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
Jiaqi Chen, Xiaoye Zhu, Tianyang Liu et al.
Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks
Fuzhi Wu, Jiasong Wu, Youyong Kong et al.
Label-Free Backdoor Attacks in Vertical Federated Learning
Wei Shen, Wenke Huang, Guancheng Wan et al.
ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder
Jungho Kim, Changwon Kang, Dongyoung Lee et al.
De-biased Attention Supervision for Text Classification with Causality
Yiquan Wu, Yifei Liu, Ziyu Zhao et al.
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
Chengrui Wang, Pengfei Liu, Min Zhou et al.
ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
Mengyang Wu, Yuzhi Zhao, Jialun Cao et al.
Towards Optimal Subsidy Bounds for Envy-Freeable Allocations
Yasushi Kawase, Kazuhisa Makino, Hanna Sumita et al.
LoRID: Low-Rank Iterative Diffusion for Adversarial Purification
Geigh Zollicoffer, Minh N. Vu, Ben Nebgen et al.
Scalable Surrogate Verification of Image-Based Neural Network Control Systems Using Composition and Unrolling
Feiyang Cai, Chuchu Fan, Stanley Bak
Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning
Hang Du, Xuejun Yan, Jingjing Wang et al.
Radiology Report Generation via Multi-objective Preference Optimization
Ting Xiao, Lei Shi, Peng Liu et al.
FedCFA: Alleviating Simpson’s Paradox in Model Aggregation with Counterfactual Federated Learning
Zhonghua Jiang, Jimin Xu, Shengyu Zhang et al.
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Yirui Chen, Xudong Huang, Quan Zhang et al.
How to Use the Metropolis Algorithm for Multi-Objective Optimization?
Weijie Zheng, Mingfeng Li, Renzhong Deng et al.
Seeing Your Speech Style: A Novel Zero-Shot Identity-Disentanglement Face-based Voice Conversion
Yan Rong, Li Liu
Graph-Based Prediction and Planning Policy Network (GP3Net) for Scalable Self-Driving in Dynamic Environments Using Deep Reinforcement Learning
Jayabrata Chowdhury, Venkataramanan Shivaraman, Suresh Sundaram et al.
Domain Generalization with Vital Phase Augmentation
Ingyun Lee, WooJu Lee, Hyun Myung
Federated Graph Condensation with Information Bottleneck Principles
Bo Yan, Sihao He, Cheng Yang et al.
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement
Renyuan Peng, Xinyue Cai, Hang Xu et al.
Simplifying Complex Observation Models in Continuous POMDP Planning with Probabilistic Guarantees and Practice
Idan Lev-Yehudi, Moran Barenboim, Vadim Indelman
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Hangzhou He, Lei Zhu, Xinliang Zhang et al.
Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection
Shunxin Chen, Ajian Liu, Junze Zheng et al.
Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal
Haoran Lian, Yizhe Xiong, Jianwei Niu et al.
Revolutionizing Encrypted Traffic Classification with MH-Net: A Multi-View Heterogeneous Graph Model
Haozhen Zhang, Haodong Yue, Xi Xiao et al.
Patched Line Segment Learning for Vector Road Mapping
Jiakun Xu, Bowen Xu, Gui-Song Xia et al.
Empowering CAM-Based Methods with Capability to Generate Fine-Grained and High-Faithfulness Explanations
Changqing Qiu, Fusheng Jin, Yining Zhang
Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix
Kewei Wang, Yizheng Wu, Zhiyu Pan et al.
Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought
Li Zheng, Hao Fei, Fei Li et al.
Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution
Karam Park, Jae Woong Soh, Nam Ik Cho
Self-Training Based Few-Shot Node Classification by Knowledge Distillation
Zongqian Wu, Yujie Mo, Peng Zhou et al.
Maximizing Nash Social Welfare under Two-Sided Preferences
Pallavi Jain, Rohit Vaish
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
Xinghao Wang, Junliang He, Pengyu Wang et al.
Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation
Changshuo Wang, Shuting He, Xiang Fang et al.
Symmetric Self-Paced Learning for Domain Generalization
Di Zhao, Yun Sing Koh, Gillian Dobbie et al.
Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling
Hanyang Kong, Xingyi Yang, Xinchao Wang
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao, Xinggang Wang, Lianghui Zhu et al.
GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians
Xiaobao Wei, Peng Chen, Ming Lu et al.
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.
ADBA: Approximation Decision Boundary Approach for Black-Box Adversarial Attacks
Feiyang Wang, Xingquan Zuo, Hai Huang et al.
Geometry-Guided Domain Generalization for Monocular 3D Object Detection
Fan Yang, Hui Chen, Yuwei He et al.
Detection and Defense of Unlearnable Examples
Yifan Zhu, lijia Yu, Xiao-Shan Gao
CatFormer: Category-Level 6D Object Pose Estimation with Transformer
Sheng Yu, Dihua Zhai, Yuanqing Xia
Completing Priceable Committees: Utilitarian and Representation Guarantees for Proportional Multiwinner Voting
Markus Brill, Jannik Peters
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang, Yuxiang Wei, Xianhui Lin et al.
FilterTS: Comprehensive Frequency Filtering for Multivariate Time Series Forecasting
Yulong Wang, Yushuo Liu, Xiaoyi Duan et al.
Incomplete Multi-view Clustering via Diffusion Contrastive Generation
Yuanyang Zhang, Yijie Lin, Weiqing Yan et al.
FACL-Attack: Frequency-Aware Contrastive Learning for Transferable Adversarial Attacks
Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon
Enhancing Low-Resource Relation Representations through Multi-View Decoupling
Curved Representation Space of Vision Transformers
Juyeop Kim, Junha Park, Songkuk Kim et al.
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation
Shiqi Huang, Shuting He, Bihan Wen
B-spine: Learning B-spline Curve Representation for Robust and Interpretable Spinal Curvature Estimation
Hao Wang, Qiang Song, Ruofeng Yin et al.
patchDPCC: A Patchwise Deep Compression Framework for Dynamic Point Clouds
Authors: Zirui Pan, Mengbai Xiao, Xu Han et al.
BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining
Minjun Kim, SeungWoo Song, Youhan Lee et al.
Debiased Novel Category Discovering and Localization
Juexiao Feng, Yuhong Yang, Yanchun Xie et al.
Causal Inference over Visual-Semantic-Aligned Graph for Image Classification
Lei Meng, Xiangxian Li, Xiaoshuo Yan et al.
RG-GAN: Dynamic Regenerative Pruning for Data-Efficient Generative Adversarial Networks
Divya Saxena, Jiannong Cao, Jiahao Xu et al.
Revisiting Multimodal Fusion for 3D Anomaly Detection from an Architectural Perspective
Kaifang Long, Guoyang Xie, Lianbo Ma et al.
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation
Abhinav Jain, Vaibhav Unhelkar
2043 Improved MLP Point Cloud Processing with High-Dimensional Positional Encoding
Yanmei Zou, Hongshan Yu, Zhengeng Yang et al.
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
Yujie Chen, Jiangyan Yi, Cunhang Fan et al.
Identifying Macro Conditional Independencies and Macro Total Effects in Summary Causal Graphs with Latent Confounding
Simon Ferreira, Charles K. Assaad
Self-Explainable Graph Transformer for Link Sign Prediction
Lu Li, Jiale Liu, Xingyu Ji et al.
UniPCGC: Towards Practical Point Cloud Geometry Compression via an Efficient Unified Approach
Kangli Wang, Wei Gao
Federated Foundation Models on Heterogeneous Time Series
Shengchao Chen, Guodong Long, Jing Jiang et al.
SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views
Shi-Sheng Huang, Zixin Zou, Yichi Zhang et al.
PG-LBO: Enhancing High-Dimensional Bayesian Optimization with Pseudo-Label and Gaussian Process Guidance
Taicai Chen, Yue Duan, Dong Li et al.
M2OST: Many-to-one Regression for Predicting Spatial Transcriptomics from Digital Pathology Images
Hongyi Wang, Xiuju Du, Jing Liu et al.
LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies
Ameer Hamza, Abdullah, Yong Hyun Ahn et al.
Human and AI Perceptual Differences in Image Classification Errors
Minghao Liu, Jiaheng Wei, Yang Liu et al.
Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resection with Pringle Maneuver
Diandian Guo, Weixin Si, Zhixi Li et al.