Local Agent Bench: Testing 21 Open-Weight Models on Tool Calling

Benchmarking small open-weight models on a $1,000 laptop to see which ones know when to use tools — and when not to.

February 6, 2026