α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Zihao Wang
Zihao Wang
2
Affiliations
Affiliations
Peking University
Hong Kong University of Science and Technology
26
papers
715
total citations
papers (26)
OnePose: One-Shot Object Pose Estimation Without CAD Models
CVPR 2022
arXiv
213
citations
ProAgent: Building Proactive Cooperative Agents with Large Language Models
AAAI 2024
arXiv
120
citations
Concept Algebra for (Score-Based) Text-Controlled Generative Models
NEURIPS 2023
arXiv
60
citations
Weakly-supervised 3D Shape Completion in the Wild
ECCV 2020
arXiv
60
citations
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
CVPR 2023
arXiv
46
citations
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
ICLR 2024
arXiv
38
citations
Selecting Large Language Model to Fine-tune via Rectified Scaling Law
ICML 2024
arXiv
29
citations
Transforming and Combining Rewards for Aligning Large Language Models
ICML 2024
arXiv
26
citations
Posterior Collapse of a Linear Latent Variable Model
NEURIPS 2022
arXiv
24
citations
MCU: An Evaluation Framework for Open-Ended Game Agents
ICML 2025
arXiv
18
citations
Where am I? Cross-View Geo-localization with Natural Language Descriptions
ICCV 2025
arXiv
16
citations
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting
CVPR 2025
arXiv
12
citations
Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap
ECCV 2022
arXiv
11
citations
ACE: Anti-Editing Concept Erasure in Text-to-Image Models
CVPR 2025
arXiv
9
citations
Learning Hierarchical Polynomials with Three-Layer Neural Networks
ICLR 2024
arXiv
7
citations
NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning
AAAI 2024
arXiv
7
citations
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image
AAAI 2024
arXiv
7
citations
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
ICCV 2025
arXiv
6
citations
Learning Hierarchical Polynomials of Multiple Nonlinear Features
ICLR 2025
arXiv
4
citations
Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
AAAI 2025
arXiv
2
citations
Open-World Skill Discovery from Unsegmented Demonstration Videos
ICCV 2025
0
citations
Theoretical Analysis of the Inductive Biases in Deep Convolutional Networks
NEURIPS 2023
0
citations
Learning Transformation-Predictive Representations for Detection and Description of Local Features
CVPR 2023
0
citations
Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents
NEURIPS 2023
0
citations
ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance
AAAI 2025
0
citations
MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds
AAAI 2025
0
citations