← All benchmarks

ClawBio Skill Correctness Bench

Third-party (Biostochastics LLC) benchmark of bio-analysis skills on safety / correctness / honesty. 10 skills × 182 tests.
Composite score: 74.2

Rubric (1–5 per criterion)

rigor
5/5
coverage
2/5
maintenance
5/5
adoption
2/5
quality
4/5
accessibility
5/5
industry_relevance
3/5

Metadata

Stages
Disease ModelingTarget IDClinical Development
Modalities
cross-modality
Task types
correctness-auditsafety-audit
License
MIT
First release
2026-04
Last updated
2026-05-03
Flags
none

Size & scope

Primary paper

Title
clawbio_bench README (v0.1.5)
Authors
Biostochastics LLC
Year
2026
DOI / arXiv
N/A — repo
Citations
5

Links

Hosted by (initiatives)

Experts (primary authors / maintainers)

Groups (host labs / companies / consortia)

Related benchmarks

Notes (honest caveats)

Independent third-party bench structurally precludes self-reference. Coverage narrow but rigor exemplary.