α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Jie Tang
Jie Tang
OpenReview
29
papers
5,813
total citations
papers (29)
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
arXiv
1,409
citations
CogView: Mastering Text-to-Image Generation via Transformers
NEURIPS 2021
arXiv
934
citations
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
NEURIPS 2023
arXiv
803
citations
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
arXiv
629
citations
Graph Random Neural Networks for Semi-Supervised Learning on Graphs
NEURIPS 2020
arXiv
470
citations
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
NEURIPS 2022
arXiv
402
citations
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
NEURIPS 2022
arXiv
372
citations
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
arXiv
229
citations
Robust Object Modeling for Visual Tracking
ICCV 2023
arXiv
152
citations
KoLA: Carefully Benchmarking World Knowledge of Large Language Models
ICLR 2024
arXiv
88
citations
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
arXiv
70
citations
Bilateral Propagation Network for Depth Completion
CVPR 2024
arXiv
58
citations
Scaling Speech-Text Pre-training with Synthetic Interleaved Data
ICLR 2025
arXiv
41
citations
Towards Efficient Exact Optimization of Language Model Alignment
ICML 2024
arXiv
32
citations
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
CVPR 2025
arXiv
27
citations
CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-Resolution
CVPR 2025
arXiv
26
citations
Sketch and Refine: Towards Fast and Accurate Lane Detection
AAAI 2024
arXiv
22
citations
TriSampler: A Better Negative Sampling Principle for Dense Retrieval
AAAI 2024
arXiv
14
citations
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
ICCV 2025
arXiv
13
citations
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
ICLR 2025
arXiv
13
citations
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning
CVPR 2025
arXiv
5
citations
A Matrix Chernoff Bound for Markov Chains and Its Application to Co-occurrence Matrices
NEURIPS 2020
arXiv
3
citations
Small Language Model Makes an Effective Long Text Extractor
AAAI 2025
arXiv
1
citations
A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems
NEURIPS 2021
0
citations
UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis
NEURIPS 2021
0
citations
Residual Feature Aggregation Network for Image Super-Resolution
CVPR 2020
0
citations
CogLTX: Applying BERT to Long Texts
NEURIPS 2020
0
citations
BodyGAN: General-Purpose Controllable Neural Human Body Generation
CVPR 2022
0
citations
Adaptive Diffusion in Graph Neural Networks
NEURIPS 2021
0
citations