OpenClaw-Skill: Collective Skill Tree Search for Agentic Large Language Models

2026-06-15 • Artificial Intelligence

Artificial IntelligenceComputation and Language

AI summaryⓘ

The authors developed a new method called Collective Skill Tree Search (CSTS) to help large language models build and use skills more effectively for complex tasks. CSTS uses teamwork among multiple models to create, evaluate, and select skills in a structured tree form, ensuring these skills are useful and can be applied in many situations. They also introduced a way to combine multiple skills during learning to find better solutions and avoid relying on just one skill. Their trained model, OpenClaw-Skill, showed strong abilities in planning, using tools, and generalizing to difficult problems.

Large Language ModelsSkill ConstructionTree SearchCollective IntelligenceMulti-step ReasoningSkill TransferabilityReinforcement LearningTool UseGeneralizationOpenClaw

Authors

Tianyi Lin, Chuanyu Sun, Jingyi Zhang, Changxu Wei, Huanjin Yao, Shunyu Liu, Xikun Zhang, Liu Liu, Jiaxing Huang

Abstract

Equipping Large Language Model (LLM) agents with effective skills is crucial for solving complex tasks in real-world systems like OpenClaw. In this work, we aim to develop a framework that automatically constructs such reusable skills to enhance LLMs in tool use, multi-step reasoning, and dynamic environment interaction. To this end, we propose Collective Skill Tree Search (CSTS), a novel tree-search-based skill construction framework that constructs structured, diverse and generalizable tree of skills. The core idea of CSTS is to leverage collective intelligence to jointly search, identify and compose effective skills via two iterative phases: Collective Skill Node Generation (CSN-Gen) and Collective Skill Node Assessment (CSN-Assess). CSN-Gen exploits collective knowledge from multiple models to explore diverse candidate skills for each subtask, enabling comprehensive skill exploration. CSN-Assess employs multiple models as judges to evaluate and select skill nodes with two scoring mechanisms: (1) collective quality scoring that aggregates independent evaluations to produce a robust estimate of skill effectiveness, and (2) collective transferability scoring that explicitly verifies whether a skill generalizes well across different models. With CSTS, we construct a set of comprehensive tree of skills along with skill-augmented training data, enabling models to effectively learn and utilize skills. Besides, we introduce Collective Skill Reinforcement Learning, which actively selects multiple relevant skills from the tree to broaden solution-space exploration, avoid being trapped by a single skill and its resulting homogeneous or suboptimal solutions. As a result, our trained model, OpenClaw-Skill, exhibits outstanding agentic capabilities in long-horizon planning, tool use and generalization over challenging benchmarks.

View PDFOpen arXiv