Private / Industry benchmarks (28)

Access note: the benchmarks below reference datasets that are not publicly accessible. They are catalogued only to indicate industry relevance and to surface the public proxies that academic and open-source work can use. Access is typically gated by a collaboration or a data-sharing agreement; otherwise the dataset remains closed entirely.

Compiled from public sources: publications, SEC filings, conference talks, and press releases. Each entry links to the best public proxy benchmark.

| Benchmark | Owner | Type | Stage | Modality | Access | Estimated size | Public proxy |
| --- | --- | --- | --- | --- | --- | --- | --- |
| AstraZeneca CAS-backed DMPK Benchmarks | AstraZeneca | pharma | Lead ID / ADMET | small-molecule | closed | ~200k measured DMPK endpoints | TDC ADMET Group |
| Bristol-Myers Squibb Internal SAR Benchmark | Bristol-Myers Squibb | pharma | Hit ID | small-molecule | closed | ~1M assay points | ChEMBL |
| Chugai Antibody Engineering Benchmark | Chugai Pharmaceutical (Roche) | pharma | Hit ID | biologic | collaboration | undisclosed | Therapeutic Antibody Design Benchmark 2026 |
| Deep Genomics RNA Therapeutics Benchmark | Deep Genomics | biotech | Hit ID | rna-therapeutic | collaboration | undisclosed | mRNA Design Benchmark (CodonBench 2026) |
| Exscientia Precision Medicine Benchmark | Exscientia | biotech | Hit ID | small-molecule | closed | undisclosed | ChEMBL |
| FDA CDRH Internal AI Validation Sets | FDA Center for Devices and Radiological Health | regulatory | Post-market / RWE | cross-modality | closed | undisclosed | FAERS (raw) |
| Flatiron Health Real-World Oncology Benchmark | Flatiron Health (Roche) | pharma | Post-market / RWE | cross-modality | data-sharing-agreement | ~4M oncology patients | MIMIC-IV Benchmark Tasks |
| Genentech gRED Structure-Activity Dataset | Genentech gRED | pharma | Hit ID | small-molecule | closed | undisclosed; referenced as 'millions of assay points' | ChEMBL |
| Gilead Internal Antiviral Benchmark | Gilead Sciences | pharma | Hit ID | small-molecule | closed | ~500k compounds antiviral screened | ASAP Discovery Antiviral 2025 |
| Ginkgo Bioworks Biologics Design Benchmark | Ginkgo Bioworks | biotech | Hit ID | biologic | collaboration | ~millions of enzyme variants with activity measurements | Protein Design Benchmark 2026 |
| IBM RXN Internal Retrosynthesis Benchmark | IBM Research | tech | Hit ID | small-molecule | collaboration | ~5-10M proprietary reactions beyond USPTO | USPTO-50K / USPTO-MIT (Retrosynthesis) |
| Insilico Longevity Benchmark (Full Dataset) | Insilico Medicine | pharma | Virtual Cell | cross-modality | conditional-access | ~1M individuals across NHANES + internal cohorts, 500k methylation samples | Longevity Benchmark (Insilico) |
| Isomorphic Labs Internal Structure/Docking Benchmark | Isomorphic Labs (Alphabet) | biotech | Hit ID | cross-modality | closed | undisclosed | PLINDER v2 Protein-Ligand Benchmark |
| Merck Internal ADMET Benchmark (Demystifying ADMET) | Merck & Co. | pharma | Lead ID / ADMET | small-molecule | closed | ~150k compounds across 17 endpoints | Polaris ADMET |
| Meta FAIR Protein Design Internal Eval | Meta FAIR / EvolutionaryScale | biotech | Hit ID | biologic | closed | undisclosed | ProteinGym |
| Moderna mRNA Design Internal Benchmark | Moderna | pharma | Hit ID | rna-therapeutic | closed | ~100k constructs with HEK293 and primary-cell expression readout | mRNA Design Benchmark (CodonBench 2026) |
| NIBR Therapeutics Data (NTD) | Novartis NIBR | pharma | Lead ID / ADMET | small-molecule | collaboration | ~4M compounds, 20M assays | ChEMBL |
| Open Problems Sponsor-Private Challenge Data | Open Problems consortium (sponsors: 10x Genomics, Chan Zuckerberg Biohub) | consortium | Virtual Cell | cross-modality | conditional-access | varies per competition (~100k cells each) | Open Problems: Perturbation Prediction |
| Open Targets Pharma Partner Extensions | Open Targets consortium | consortium | Target ID | cross-modality | data-sharing-agreement | undisclosed | Open Targets Platform |
| Pfizer Phase II Trial Benchmark Dataset | Pfizer | pharma | phase-ii | cross-modality | closed | ~1500 trials, 40,000 patients | HINT / TrialBench |
| Pfizer mRNA/LNP Internal Benchmark | Pfizer | pharma | Hit ID | rna-therapeutic | closed | undisclosed | mRNA Design Benchmark (CodonBench 2026) |
| Recursion Full Phenomics Dataset | Recursion Pharmaceuticals | pharma | Hit ID | small-molecule | conditional-access | ~50 PB images, 18M compound wells | RxRx3 Phenomics Benchmark |
| Roche pRED ADMET Benchmark | Roche pRED | pharma | Lead ID / ADMET | small-molecule | closed | ~300k compound-endpoint pairs | TDC ADMET Group |
| Sanofi Internal PK/PD Benchmark | Sanofi | pharma | IND-enabling | small-molecule | closed | undisclosed | Obach PK Dataset |
| Takeda Internal AI Pipeline Benchmark | Takeda | pharma | Lead ID / ADMET | small-molecule | collaboration | undisclosed | TDC ADMET Group |
| Tempus Oncology AI Benchmark | Tempus Labs | biotech | Target ID | cross-modality | data-sharing-agreement | ~200k sequenced patients + outcomes | CPTAC Proteogenomic Benchmarks |
| Valence Labs Internal ADMET Extensions | Valence Labs (Recursion) | biotech | Lead ID / ADMET | small-molecule | conditional-access | ~150k compounds | Polaris ADMET |
| Xaira Therapeutics Foundation Benchmark | Xaira Therapeutics | biotech | Virtual Cell | cross-modality | closed | undisclosed; built on Illumina-scale partnerships | Virtual Cell Benchmark Suite 2026 |
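
As a rough illustration of how the catalogue above might be queried, the Python sketch below tallies which public proxies stand in for the most private benchmarks and lists the entries under a given access level. The tuple layout and the names `CATALOG`, `proxies_by_usage`, and `with_access` are hypothetical and introduced only for this example; the rows are transcribed from the Benchmark, Access, and Public proxy columns.

```python
from collections import Counter

# (benchmark, access, public proxy), transcribed from the table above.
# The structure and names here are illustrative assumptions, not published tooling.
CATALOG = [
    ("AstraZeneca CAS-backed DMPK Benchmarks", "closed", "TDC ADMET Group"),
    ("Bristol-Myers Squibb Internal SAR Benchmark", "closed", "ChEMBL"),
    ("Chugai Antibody Engineering Benchmark", "collaboration", "Therapeutic Antibody Design Benchmark 2026"),
    ("Deep Genomics RNA Therapeutics Benchmark", "collaboration", "mRNA Design Benchmark (CodonBench 2026)"),
    ("Exscientia Precision Medicine Benchmark", "closed", "ChEMBL"),
    ("FDA CDRH Internal AI Validation Sets", "closed", "FAERS (raw)"),
    ("Flatiron Health Real-World Oncology Benchmark", "data-sharing-agreement", "MIMIC-IV Benchmark Tasks"),
    ("Genentech gRED Structure-Activity Dataset", "closed", "ChEMBL"),
    ("Gilead Internal Antiviral Benchmark", "closed", "ASAP Discovery Antiviral 2025"),
    ("Ginkgo Bioworks Biologics Design Benchmark", "collaboration", "Protein Design Benchmark 2026"),
    ("IBM RXN Internal Retrosynthesis Benchmark", "collaboration", "USPTO-50K / USPTO-MIT (Retrosynthesis)"),
    ("Insilico Longevity Benchmark (Full Dataset)", "conditional-access", "Longevity Benchmark (Insilico)"),
    ("Isomorphic Labs Internal Structure/Docking Benchmark", "closed", "PLINDER v2 Protein-Ligand Benchmark"),
    ("Merck Internal ADMET Benchmark (Demystifying ADMET)", "closed", "Polaris ADMET"),
    ("Meta FAIR Protein Design Internal Eval", "closed", "ProteinGym"),
    ("Moderna mRNA Design Internal Benchmark", "closed", "mRNA Design Benchmark (CodonBench 2026)"),
    ("NIBR Therapeutics Data (NTD)", "collaboration", "ChEMBL"),
    ("Open Problems Sponsor-Private Challenge Data", "conditional-access", "Open Problems: Perturbation Prediction"),
    ("Open Targets Pharma Partner Extensions", "data-sharing-agreement", "Open Targets Platform"),
    ("Pfizer Phase II Trial Benchmark Dataset", "closed", "HINT / TrialBench"),
    ("Pfizer mRNA/LNP Internal Benchmark", "closed", "mRNA Design Benchmark (CodonBench 2026)"),
    ("Recursion Full Phenomics Dataset", "conditional-access", "RxRx3 Phenomics Benchmark"),
    ("Roche pRED ADMET Benchmark", "closed", "TDC ADMET Group"),
    ("Sanofi Internal PK/PD Benchmark", "closed", "Obach PK Dataset"),
    ("Takeda Internal AI Pipeline Benchmark", "collaboration", "TDC ADMET Group"),
    ("Tempus Oncology AI Benchmark", "data-sharing-agreement", "CPTAC Proteogenomic Benchmarks"),
    ("Valence Labs Internal ADMET Extensions", "conditional-access", "Polaris ADMET"),
    ("Xaira Therapeutics Foundation Benchmark", "closed", "Virtual Cell Benchmark Suite 2026"),
]


def proxies_by_usage(rows):
    """Count how many private benchmarks point at each public proxy."""
    return Counter(proxy for _, _, proxy in rows)


def with_access(rows, access):
    """Return the benchmark names whose access level matches `access`."""
    return [name for name, level, _ in rows if level == access]


if __name__ == "__main__":
    # Proxies that stand in for more than one private benchmark.
    for proxy, count in proxies_by_usage(CATALOG).most_common():
        if count > 1:
            print(f"{count} x {proxy}")
    # Entries reachable through a data-sharing agreement.
    for name in with_access(CATALOG, "data-sharing-agreement"):
        print(name)
```

Grouping by proxy in this way shows where open work is concentrated: a small number of public resources serve as stand-ins for many closed datasets.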