Blogs

Latest updates, announcements, and insights from the SkillsBench team.

June 16, 2026

SkillsBench 1.1: Agent Skills Benchmark Release

SkillsBench 1.1 updates the Agent Skills benchmark to 87 native BenchFlow task.md packages across 8 domains. The paper reports 18 model-harness configurations; the public leaderboard currently tracks 24 in total including previous results. In the paper aggregate, curated Skills raise mean resolution rate from 33.9% to 50.5% (+16.6 points).

February 10, 2026

Research

Introducing SkillsBench: The First Benchmark for Agent Skills

Historical launch post for the paper-v1 SkillsBench snapshot; the current v1.1 registry contains 87 native BenchFlow task.md packages.

Launches

Follow our journey building SkillsBench.

January 5, 2026

SkillsBench Launch

Introducing SkillsBench and our call for open-source contributors.

January 12, 2026

Week 1 Update

Announcing SkillsBench week one updates and early project progress.