cline-bench is designed to help open-source labs train and evaluate on real-world coding work, not just sanitized benchmarks. That is why it resonates with leaders in post-training like @Teknium, Head of Post Training at @nousresearch. The benchmark provides a set of challenging, verified coding environments drawn directly from how developers already use coding agents in open-source projects.