Yuandong Tian
29 papers, 2,503 total citations
papers (29)
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection (ICML 2024, arXiv): 371 citations
Training Large Language Models to Reason in a Continuous Latent Space (COLM 2025, arXiv): 357 citations
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions (CVPR 2020, arXiv): 321 citations
TravelPlanner: A Benchmark for Real-World Planning with Language Agents (ICML 2024, arXiv): 319 citations
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024, arXiv): 195 citations
Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search (NeurIPS 2020, arXiv): 151 citations
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs (ICML 2025, arXiv): 132 citations
Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer (NeurIPS 2023, arXiv): 105 citations
FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining (CVPR 2021, arXiv): 75 citations
On the Importance of Asymmetry for Siamese Representation Learning (CVPR 2022, arXiv): 59 citations
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning (ICML 2025, arXiv): 52 citations
MADE: Exploration via Maximizing Deviation from Explored Regions (NeurIPS 2021, arXiv): 50 citations
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions (NeurIPS 2025, arXiv): 49 citations
JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP and Attention (ICLR 2024, arXiv): 48 citations
Understanding Deep Contrastive Learning via Coordinate-wise Optimization (NeurIPS 2022, arXiv): 42 citations
DreamShard: Generalizable Embedding Table Placement for Recommender Systems (NeurIPS 2022, arXiv): 35 citations
FP-NAS: Fast Probabilistic Neural Architecture Search (CVPR 2021, arXiv): 26 citations
Towards General-Purpose Model-Free Reinforcement Learning (ICLR 2025, arXiv): 24 citations
Joint Policy Search for Multi-agent Collaboration with Imperfect Information (NeurIPS 2020, arXiv): 23 citations
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications (ICML 2025, arXiv): 20 citations
Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information (NeurIPS 2023, arXiv): 20 citations
LoCoCo: Dropping In Convolutions for Long Context Compression (ICML 2024, arXiv): 16 citations
Learning Space Partitions for Path Planning (NeurIPS 2021, arXiv): 11 citations
GenCO: Generating Diverse Designs with Combinatorial Constraints (ICML 2024, arXiv): 2 citations
Latent Execution for Neural Program Synthesis Beyond Domain-Specific Languages (NeurIPS 2021): 0 citations
Param$\Delta$ for Direct Mixing: Post-Train Large Language Model At Zero Cost (ICLR 2025): 0 citations
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models (NeurIPS 2023): 0 citations
NovelD: A Simple yet Effective Exploration Criterion (NeurIPS 2021): 0 citations
Contrastive Predict-and-Search for Mixed Integer Linear Programs (ICML 2024): 0 citations