rleak.com - Spot the Future of AI Research

#1

SAM 2: Segment Anything in Images and Videos

Nikhila Ravi, Valentin Gabeur, Yuan-Ting Hu et al.

ICLR 2025

2,393

citations

#2

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Clemencia Siro, Guy Gur-Ari, Gaurav Mishra et al.

ICLR 2025

2,226

citations

#3

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Zhuoyi Yang, Jiayan Teng, Wendi Zheng et al.

ICLR 2025

1,409

citations

#4

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Naman Jain, King Han, Alex Gu et al.

ICLR 2025

1,108

citations

#5

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Javier Rando, Tony Wang, Stewart Slocum et al.

ICLR 2025

750

citations

#6

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Haipeng Luo, Qingfeng Sun, Can Xu et al.

ICLR 2025

655

citations

#7

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

Jipeng Zhang, Hanze Dong, Tong Zhang et al.

ICLR 2025

642

citations

#8

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Jinheng Xie, Weijia Mao, Zechen Bai et al.

ICLR 2025

483

citations

#9

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Iman Mirzadeh, Keivan Alizadeh-Vahid, Hooman Shahrokhi et al.

ICLR 2025

436

citations

#10

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Chankyu Lee, Rajarshi Roy, Mengyao Xu et al.

ICLR 2025

419

citations

#11

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Terry Yue Zhuo, Minh Chien Vu, Jenny Chim et al.

ICLR 2025

410

citations

#12

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Songming Liu, Lingxuan Wu, Bangguo Li et al.

ICLR 2025

409

citations

#13

Causal Reasoning and Large Language Models: Opening a New Frontier for Causality

Chenhao Tan, Robert Ness, Amit Sharma et al.

ICLR 2025

403

citations

#14

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks

Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

ICLR 2025

401

citations

#15

OpenHands: An Open Platform for AI Software Developers as Generalist Agents

Xingyao Wang, Boxuan Li, Yufan Song et al.

ICLR 2025

387

citations

#16

Generative Verifiers: Reward Modeling as Next-Token Prediction

Lunjun Zhang, Arian Hosseini, Hritik Bansal et al.

ICLR 2025

375

citations

#17

Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Sihyun Yu, Sangkyung Kwak, Huiwon Jang et al.

ICLR 2025

342

citations

#18

Scaling and evaluating sparse autoencoders

Leo Gao, Tom Dupre la Tour, Henk Tillman et al.

ICLR 2025

326

citations

#19

Training Language Models to Self-Correct via Reinforcement Learning

Aviral Kumar, Vincent Zhuang, Rishabh Agarwal et al.

ICLR 2025

324

citations

#20

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Chunting Zhou, Lili Yu, Arun Babu et al.

ICLR 2025

318

citations

Top Papers in ICLR 2025
