ClawBio Skill Correctness Bench
Third-party (Biostochastics LLC) benchmark of bio-analysis skills on safety / correctness / honesty. 10 skills × 182 tests.
Composite score: 74.2
Rubric (1–5 per criterion)
rigor
5/5
coverage
2/5
maintenance
5/5
adoption
2/5
quality
4/5
accessibility
5/5
industry_relevance
3/5
Metadata
Stages
Disease ModelingTarget IDClinical Development
Modalities
cross-modality
Task types
correctness-auditsafety-audit
License
MIT
First release
2026-04
Last updated
2026-05-03
Flags
none
Size & scope
- skills: 10
- tests: 182
- pass_rate_pct: 92.3
Primary paper
Title
clawbio_bench README (v0.1.5)
Authors
Biostochastics LLC
Year
2026
DOI / arXiv
N/A — repo
Citations
5
Links
- Official site: https://clawbio.ai/benchmarks.html
- GitHub: https://github.com/biostochastics/clawbio_bench
- Leaderboard: https://clawbio.ai/benchmarks.html
Hosted by (initiatives)
Experts (primary authors / maintainers)
- none linked
Groups (host labs / companies / consortia)
Related benchmarks
- none
Notes (honest caveats)
Independent third-party bench structurally precludes self-reference. Coverage narrow but rigor exemplary.