"monte carlo tree search" Papers
38 papers found
Conference
AFlow: Automating Agentic Workflow Generation
Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu et al.
AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise
Dhruv Agarwal, Bodhisattwa Prasad Majumder, Reece Adamson et al.
CARTS: Advancing Neural Theorem Proving with Diversified Tactic Calibration and Bias-Resistant Tree Search
Xiao-Wen Yang, Zhi Zhou, Haiming Wang et al.
Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks
Bowei He, Lihao Yin, Huiling Zhen et al.
CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models
Shengzhuang Chen, Yikai Liao, Xiaoxiao Sun et al.
DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding
Geng Li, Jinglin Xu, Yunzhen Zhao et al.
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning
Xiao Yu, Baolin Peng, Vineeth Vajipey et al.
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs
Yifan Shen, Yuanzhe Liu, Jingyuan Zhu et al.
Grammar Reinforcement Learning: path and cycle counting in graphs with a Context-Free Grammar and Transformer approach
Jason Piquenot, Maxime Berar, Romain Raveaux et al.
Graph-based Symbolic Regression with Invariance and Constraint Encoding
Ziyu Xiang, Kenna Ashen, Xiaofeng Qian et al.
HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning
Zhi Jing, Siyuan Yang, Jicong Ao et al.
Implicit Search via Discrete Diffusion: A Study on Chess
Jiacheng Ye, Zhenyu Wu, Jiahui Gao et al.
Improving Monte Carlo Tree Search for Symbolic Regression
Zhengyao Huang, Daniel Huang, Tiannan Xiao et al.
MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning
Sizhe Tang, Jiayu Chen, Tian Lan
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver
Zhenting Qi, Mingyuan MA, Jiahang Xu et al.
PlanU: Large Language Model Reasoning through Planning under Uncertainty
Ziwei Deng, Mian Deng, Chenjing Liang et al.
RF-Agent: Automated Reward Function Design via Language Agent Tree Search
Ning Gao, Xiuhui Zhang, Xingyu Jiang et al.
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Baiting Luo, Ava Pettet, Aron Laszka et al.
SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents
Yifu Guo, Jiaye Lin, Huacan Wang et al.
SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents
Wanxin Tian, Shijie Zhang, Kevin Zhang et al.
SIGMA: Refining Large Language Model Reasoning via Sibling-Guided Monte Carlo Augmentation
Yanwei Ren, Haotian Zhang, Fuxiang Wu et al.
SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search
Dong Li, Xujiang Zhao, Linlin Yu et al.
Strength Estimation and Human-Like Strength Adjustment in Games
Chun Jung Chen, Chung-Chin Shih, Ti-Rong Wu
Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves
Martin Kurečka, Václav Nevyhoštěný, Petr Novotný et al.
Uncertainty-Guided Exploration for Efficient AlphaZero Training
Scott Cheng, Meng-Yu Tsai, Ding-Yong Hong et al.
Understanding Methods for Scalable MCTS
Will Knipe
WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration
Yao Zhang, Zijian Ma, Yunpu Ma et al.
Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search
Yuichi Inoue, Kou Misaki, Yuki Imajuku et al.
A Bayesian Approach to Online Planning
Nir Greshler, David Ben Eli, Carmel Rabinovitz et al.
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
Zongxin Yang, Guikun Chen, Xiaodi Li et al.
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning
Yizhe Huang, Anji Liu, Fanqi Kong et al.
Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models
Andy Zhou, Kai Yan, Michal Shlapentokh-Rothman et al.
Monte Carlo Tree Search in the Presence of Transition Uncertainty
Farnaz Kohankhaki, Kiarash Aghakasiri, Hongming Zhang et al.
Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing
Amutheezan Sivagnanam, Ava Pettet, Hunter Lee et al.
Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems
Yifan Xia, Xianliang Yang, Zichuan Liu et al.
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm, Abdeslam Boularias
Sample-and-Bound for Non-convex Optimization
Yaoguang Zhai, Zhizhen Qin, Sicun Gao
Scalable Safe Policy Improvement for Factored Multi-Agent MDPs
Federico Bianchi, Edoardo Zorzi, Alberto Castellini et al.