α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Shengqiong Wu
Shengqiong Wu
10
papers
1,114
total citations
papers (10)
NExT-GPT: Any-to-Any Multimodal LLM
ICML 2024
arXiv
726
citations
Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition
ICML 2024
arXiv
146
citations
LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model
NEURIPS 2022
arXiv
104
citations
Towards Semantic Equivalence of Tokenization in Multimodal LLM
ICLR 2025
arXiv
58
citations
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
CVPR 2024
arXiv
46
citations
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning
AAAI 2025
arXiv
19
citations
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
NEURIPS 2025
arXiv
7
citations
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene
CVPR 2025
arXiv
4
citations
Universal Scene Graph Generation
CVPR 2025
arXiv
4
citations
Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion
NEURIPS 2023
0
citations