Maosong Sun
OpenReview
24 papers, 3,751 total citations
Papers (24)
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
ICLR 2024
arXiv
1,197
citations
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
NEURIPS 2023
arXiv
767
citations
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors
ICLR 2024
arXiv
503
citations
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
CVPR 2024
arXiv
361
citations
ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback
ICML 2024
arXiv
214
citations
Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations
NEURIPS 2023
arXiv
134
citations
Fine-Grained Scene Graph Generation with Data Transfer
ECCV 2022
arXiv
112
citations
A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks
NEURIPS 2022
arXiv
96
citations
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
ICLR 2024
arXiv
77
citations
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025
arXiv
60
citations
Towards Interpretable Natural Language Understanding with Explanations as Latent Variables
NEURIPS 2020
arXiv
47
citations
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
ICLR 2025
arXiv
41
citations
Visual Distant Supervision for Scene Graph Generation
ICCV 2021
arXiv
40
citations
Predicting Emergent Abilities with Infinite Resolution Evaluation
ICLR 2024
arXiv
27
citations
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?
CVPR 2025
arXiv
26
citations
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
ICLR 2025
arXiv
17
citations
WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models
ICLR 2025
arXiv
15
citations
Exploring the Benefit of Activation Sparsity in Pre-training
ICML 2024
arXiv
6
citations
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
CVPR 2025
arXiv
6
citations
DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection
NEURIPS 2025
arXiv
3
citations
Rational Decision-Making Agent with Learning Internal Utility Judgment
ICLR 2025
2
citations
Sparse Structure Search for Delta Tuning
NEURIPS 2022
0
citations
H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training
NEURIPS 2023
0
citations
Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models
NEURIPS 2022
0
citations