Yu Shen
27
papers
393
total citations
papers (27)
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
NEURIPS 2025
arXiv
138
citations
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection
ICLR 2024
arXiv
39
citations
Rethinking Reward Modeling in Preference-based Large Language Model Alignment
ICLR 2025
27
citations
What Makes a Good Diffusion Planner for Decision Making?
ICLR 2025
arXiv
27
citations
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Model
ICLR 2025
arXiv
20
citations
How Do Large Language Models Understand Graph Patterns? A Benchmark for Graph Pattern Comprehension
ICLR 2025
arXiv
20
citations
Framer: Interactive Frame Interpolation
ICLR 2025
arXiv
20
citations
VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model
NEURIPS 2025
17
citations
Refine Knowledge of Large Language Models via Adaptive Contrastive Learning
ICLR 2025
arXiv
16
citations
DivBO: Diversity-aware CASH for Ensemble Learning
NEURIPS 2022
arXiv
12
citations
KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows
NEURIPS 2025
arXiv
9
citations
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists
ICLR 2025
arXiv
8
citations
Orientation Matters: Making 3D Generative Models Orientation-Aligned
NEURIPS 2025
arXiv
7
citations
Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions
CVPR 2025
arXiv
5
citations
Reflection-Window Decoding: Text Generation with Selective Refinement
ICML 2025
arXiv
5
citations
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators
NEURIPS 2025
arXiv
5
citations
SysBench: Can LLMs Follow System Message?
ICLR 2025
5
citations
API Pack: A Massive Multi-Programming Language Dataset for API Call Generation
ICLR 2025
arXiv
4
citations
Habitizing Diffusion Planning for Efficient and Effective Decision Making
ICML 2025
arXiv
3
citations
FairViT: Fair Vision Transformer via Adaptive Masking
ECCV 2024
arXiv
2
citations
Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs
NEURIPS 2025
arXiv
2
citations
Fast, Accurate Manifold Denoising by Tunneling Riemannian Optimization
ICML 2025
arXiv
1
citation
GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning
NEURIPS 2025
arXiv
1
citation
UniRestore3D: A Scalable Framework For General Shape Restoration
ICLR 2025
0
citations
CausalVerse: Benchmarking Causal Representation Learning with Configurable High-Fidelity Simulations
NEURIPS 2025
arXiv
0
citations
GAN-based Garment Generation Using Sewing Pattern Images
ECCV 2020
0
citations
Gradient-Free Adversarial Training Against Image Corruption for Learning-based Steering
NEURIPS 2021
0
citations