HuggingFace — Bio/Chem Datasets

HuggingFace Datasets hub, filtered for bio/chem benchmarks (tag search plus the curated orgs tdc, bigbio, and InstaDeepAI).

Kind
data-platform
Composite
67.8
Benchmarks tracked
310
Direct-linked
341
As of
2026-05-12
Host organisation
HuggingFace + community uploaders
Primary contacts
HF community
Founded
2020
License model
Per-dataset
Count methodology
Tag search on huggingface.co/datasets (biology, chemistry, medical, drug-discovery) plus the curated orgs tdc, bigbio, and InstaDeepAI, run 2026-05: ~310 entries, including duplicates.
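The count methodology notes that the tally includes duplication. As a rough illustration only, the sketch below groups Hub repo ids by the dataset name after the slash, so that mirrored uploads of the same dataset under different orgs collapse into one group. The grouping rule and the function names `group_mirrors` and `count_unique` are assumptions made for this example, not the method used to produce the counts on this page; the sample ids are taken from the hosted list.

```python
from collections import defaultdict


def group_mirrors(repo_ids):
    """Group Hub repo ids ("org/name") by lower-cased dataset name.

    Assumption: two repos with the same name under different orgs are
    treated as mirrors. This is a heuristic; identical names do not
    guarantee identical content.
    """
    groups = defaultdict(list)
    for repo_id in repo_ids:
        # partition() returns ("org", "/", "name"); name is "" if no slash
        _, _, name = repo_id.partition("/")
        groups[(name or repo_id).lower()].append(repo_id)
    return dict(groups)


def count_unique(repo_ids):
    """Number of distinct dataset names after collapsing mirrors."""
    return len(group_mirrors(repo_ids))


# Sample ids copied from the hosted list on this page.
sample = [
    "tahoebio/Tahoe-100M",
    "slaf-project/Tahoe-100M",
    "nds029/Tahoe-100M",
    "mteb/BRIGHT",
    "xlangai/BRIGHT",
    "songlab/clinvar",
]

print(len(sample), "entries,", count_unique(sample), "distinct names")
```

In practice the input ids could come from a tag query such as `huggingface_hub.HfApi().list_datasets(filter="biology")`, which needs network access and is therefore left out of the sketch. Any groups with more than one member would still need manual review before being counted as true duplicates.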

Rubric

rigor
2
coverage
5
maintenance
4
adoption
4
quality
2
accessibility
5
industry_relevance
2

Breakdown

molecular
90
protein
70
clinical_text
80
genomic
40
other
30

Notes

High discoverability, low quality floor.

Hosted benchmarks (341)

Direct-linked: each entry links to the benchmark's leaderboard or detail page on the host portal.
HF-biology (200)
MMMU/MMMU
pulmo/ncbi-genbank-complete
MMMU/MMMU_Pro
tahoebio/Tahoe-100M
huggingworld/ncbi-refseq-complete
DSIMB/PATHOS-PLM-EMBEDDINGS
RosettaCommons/ProteinMPNN
SciCode/SciCode-Domain-Code
AI4Math/MathVista
SciCodePile/SciCode-Domain-Code
qyp111/mdCATH
derek-thomas/ScienceQA
xlangai/BRIGHT
compsciencelab/mdCATH
ddrg/MUSES
ProteinMPNN/group_mpnn
CohereLabs/include-base-44
imageomics/TreeOfLife-200M
allenai/peS2o
futurehouse/lab-bench
hyf015/EgoExoLearn
BrentLab/mahendrawada_2025
jackkuo/arXiv-metadata-oai-snapshot
EMBO/SourceData
nicoboou/IDRCell100k
EarthSpeciesProject/NatureLM-audio-training
slaf-project/Tahoe-100M
songlab/TraitGym
mansoorbaloch/chimera-bench
1aurent/PatchCamelyon
arcinstitute/opengenome2
FreedomIntelligence/medical-o1-reasoning-SFT
perturbai/wholebrain_crispr_atlas
ShengyouDuan/SwarmEvo
SciCode1/SciCode
fish-gang/deepscan-dataset
haitengzhao/molecule_property_instruction
R-Bench/R-Bench
GreatCaptainNemo/BioProBench
nphamdinh/mobilemold
jiawennnn/STimage-1K4M
yatin-superintelligence/Edge-Agent-Reasoning-WebSearch-260K
PatoFlamejanteTV/geral-edu-ptbr
Xiaoxin888888/STimage-1K4M
Open-Orca/SlimOrca-Dedup
BrentLab/rossi_2021
hicai-zju/SciKnowEval
stanford-crfm/image2struct-latex-v1
hheiden/PubChem-124M-SMILES-SELFIES-InChI-IUPAC
usersaico/modular-s2orc
FIFCO/De_vuelta_a_casa
CohereLabs/include-lite-44
DataQuests/Dyna_Repo_Inria_MDPosit_Files
ConvergeBio/uniref100
RosettaCommons/SAbDab_raw
sankalpa1998/cat-dog-yolo-dataset
introvoyz041/ZINC20
DEMIRUNC/Edge-Agent-Reasoning-WebSearch-260K
imageomics/fish-vista
futurehouse/hle-gold-bio-chem
laion/Wikipedia-Abstract
ScienceOne-AI/S1-MMAlign
BlueIsGreen/Edge-Agent-Reasoning-WebSearch-260K
claran/modular-s2orc
JosselinSom/Latex-VLM
satputekuldip/opengenome2
introvoyz041/vesm_scores
nds029/Tahoe-100M
Torenn/Edge-Agent-Reasoning-WebSearch-260K
cgeorgiaw/merfish
LumiOpen/opengpt-x_mmlux
AllTheBacteria/BacCorpus-intergenic-dna-90
imageomics/invasive_plants_hawaii
imageomics/mmla_mpala
Animas1024/MarineEval
bens-bots/marrs-global-coral-reef-soundscapes
mm1109/scRegNet
SZU-ADDG/ZINC-Curated
imageomics/TreeOfLife-10M
Mxode/BiST
CoastalEcology/SeagrassMapper
OATML-Markslab/ProteinGym_v1
kanepi-1977/Agent-Reasoning-WebSearch-260K
GrunCrow/BIRDeep_AudioAnnotations
RosettaCommons/SAbDab
AllTheBacteria/BacCorpus-prot-90
zjunlp/Mol-Instructions
mteb/BRIGHT
UniqueData/brain-mri-dataset
kunikohunter/IEDB_pHLA_binding_data
longevity-genie/cell2sentence4longevity-data
HPAI-BSC/CareQA
BGLab/BioTrove
tsynbio/ProteinLMBench
silicobio/peleke_antibody-antigen_sabdab
mohanty/PlantVillage
DORI-SRKW/DORI-ONC
ConvergeBio/uniref90
Mxode/I_Wonder_Why-Chinese
Horama/wow_scraped
SZU-ADDG/MLM-Scaling-datasets
NeuroTec/SWEC_iEEG_Dataset
YinkaiW/LSV
zgcarvalho/uniref50-test
MMMGBench/MMMG
haydn-jones/PubChem
jiahuizhang/PepBenchData
VerisimilitudeX/mobius-data
instruction-pretrain/medicine-instruction-augmented-corpora
imageomics/2018-NEON-beetles
Nekochu/Luminia-mixture
hardlyworking/Vagina-Vision-Image-Folder-Captions-Included
FreedomIntelligence/medical-o1-verifiable-problem
1aurent/NCT-CRC-HE
datatab/serbian-llm-benchmark
IVN-RIN/BioBERT_Italian
imageomics/IDLE-OO-Camera-Traps
RosettaCommons/FPbase
imageomics/VLM4Bio
imageomics/mmla_wilds
MBZUAI/medix-rl-data
Petra-AI/ZalmatiAI
EPFL-ECEO/coralscapes
songlab/clinvar_vs_benign
JierunChen/MMMU_with_difficulty_level
ginkgo-datapoints/GDPx3
InternScience/SFE
computage/computage_bench
velaiola/Tahoe-100M
InternScience/SGI-DeepResearch
ConvergeBio/uniclust30
xuan-liu/FGBench
SMAResearch/sma-evidence-graph
BIOMEDICA/biomedica_webdataset_24M
farmaieu/plantorgans
jmhb/microvqa
FreedomIntelligence/PubMedVision
FreedomIntelligence/CMB
imageomics/rare-species
zacharielegault/PatchCamelyon
restor/tcd
rootsautomation/pubmed-ocr
ConvergeBio/uniref50
EarthSpeciesProject/BEANS-Zero
OpenMed/Medical-Reasoning-SFT-GPT-OSS-120B
ArlingtonCL2/Dog-Vocal-Separation
findshuo/Protap
kadzaki/FrBMedQA
songlab/cosmic
UniqueData/brain-anomaly-detection
scikit-fingerprints/MoleculeNet_BACE
Niche-Squad/COLO
wanglab/kegg
Feibo-112358/SpatialCorpus-110M
polymathic-ai/active_matter
songlab/ldsc
gasparyanartur/things-eeg2
biomap-research/enzyme_catalytic_efficiency
Joemgu/sumstew
UniqueData/multiple-sclerosis-dataset
RosettaCommons/PISCES-CulledPDB
kewserseid/biomedical-instruction-dataset
biomap-research/metal_ion_binding
GenerTeam/prokaryotic-gener-tasks
AgMMU/AgMMU_v1
MBARI-org/DeepSea-MOT
Jimut123/RV-PBS
lukaskim/ChEMBL-36
galaxyMindAiLabs/stem-reasoning-complex
U4R/DocGenome
songlab/omim
alejoacelas/uniref50-2025-10
introvoyz041/ZINC-Curated
biomap-research/peptide_HLA_MHC_affinity
slaab/NCERT-Parallel-Dataset-Indic
BrentLab/hughes_2006
ai-forever/POLLUX
introvoyz041/Gargantua-R1-Compact
RainPPR/china-textbook
songlab/gnomad_balanced
FreedomIntelligence/Medical_Multimodal_Evaluation_Data
BUAADreamer/llava-med-zh-instruct-60k
scikit-fingerprints/MoleculeNet_PCBA
songlab/clinvar
allenai/us-patents
xuejun72/HR-VILAGE-3K3M
Eurolingua/mmlux
Yangximiao/PlantCAD2_zero_shot_tasks
biomap-research/fitness_prediction
wanglab/variant_effect_coding
biomap-research/stability_prediction
KokosDev/single-cell-brain-zarr
chandar-lab/CoPeP
DavidVivancos/NeuraxonLife2.5-100K-TimeSeries
biomap-research/fold_prediction
scikit-fingerprints/MoleculeNet_ESOL
songlab/gpn-msa-sapiens-dataset
imageomics/upper-waiakea-PAM
agentlans/FreedomIntelligence-medical
proteinglm/fluorescence_prediction
HF-chemistry (67)
gaianet/chemistry
XythicK/Chemistry
Narimanka/Abuhuesos
Airtel4141/Sssdmdsff
UniParser/MolParser-7M
jablonkagroup/chempile-mlift
openadmet/pxr-challenge-train-test
jablonkagroup/corral-QAs-reports
Battery-Life/BatteryLife_Processed
jablonkagroup/ChemBench
jablonkagroup/corral-traces
LeMaterial/LeMat-Bulk
jablonkagroup/corral-QAs-topic_reports
nfsrulesFR/mega-moledit-large
datamol-io/safe-gpt
osunlp/SMolInstruct
jglaser/binding_affinity
jglaser/pdb_protein_ligand_complexes
yqj01/SimNMR-PubChem
roman-bushuiev/MassSpecGym
jablonkagroup/chempile-lift
daman1209arora/jeebench
jablonkagroup/MaCBench
DasKunststoffZentrumSKZ/Errors_Additive_Manufacturing_Nozzle_Cam
Metanova/SAVI-2020
InternSVG/SAgoge
rouskinlab/PDB
DeweiFeng/SmellNet
avaliev/ChemistryQA
camel-ai/loong
jablonkagroup/chempile-reasoning
jablonkagroup/chempile-paper
YuRiVeRTi/V1Q
zihaojing/MuMo-Pretraining
WithinUsAI/Chemistry_25k
jablonkagroup/corral-oss-trace-logprobs
laion/chemrXiv-pdf
AI4Chem/ChemPref-DPO-for-Chemistry-data-en
scikit-fingerprints/MoleculeNet_HIV
RosettaCommons/MegaScale
liuganghuggingface/moltextnet
EricLu/SCP-116K
ZehuaZhao/SUPERChem
karina-zadorozhny/ATOMICA
scikit-fingerprints/MoleculeNet_Tox21
katielink/moleculenet-benchmark
AlexLoong/S1-MMAlign
jablonkagroup/chempile-education
akselformo/Vex
nvidia/ProfBench
sequelbox/Celestia3-DeepSeek-R1-0528
scikit-fingerprints/MoleculeNet_BBBP
liupf/ChEBI-20-MM
zake7749/OpenScience-Chinese-Reasoning-SFT
QizhiPei/BioMatrix-SFT
Mxode/CAMS
AmirMasoud/GMD
microsoft/msr-acc-tae25
scikit-fingerprints/MoleculeNet_Lipophilicity
AnonymouScientist/MaterialsSaddles
prithivMLmods/Gargantua-R1-Compact
RosettaCommons/MIP
scikit-fingerprints/MoleculeNet_ToxCast
scikit-fingerprints/MoleculeNet_SIDER
scikit-fingerprints/MoleculeNet_ClinTox
jablonkagroup/questions4manual_annotation
liuhangbiao/SciCode-Domain-Code
HF-medical (74)
mrmrx/CADS-dataset
ibrahimhamamci/CT-RATE
opendatalab/Sci-Base
Williamsanderson/MedQA-Darija-MultiLingual
Forithmus/MR-RATE
sunghong/CADS-dataset
kantor3/CADS-dataset
General-Medical-AI/GMAI-VL-5.5M
bio-nlp-umass/MedThinkVQA
TokyoTechMagicYang/RAM-H1200-v1
RichardChenZH/MedForge-90K
AVS-Net/knee_fast_mri
junma/CVPR-BiomedSegFM
CYX1998/Meissa-SFT
Chandanmanvi/MedDialog-Audio
kyegorov/mcd_rppg
SimulaMet/Kvasir-VQA-x1
flaviagiammarino/vqa-rad
Forithmus/MR-RATE-coreg
huggingface/CADS-dataset
mercor/APEX-v1-extended
MedRAG/pubmed
harvardairobotics/FairSeg
CTPLab-DBE-UniBas/staining-robustness-evaluation
RichardChenZH/Med-Banana-50K
flaviagiammarino/path-vqa
wanglab/CT_DeepLesion-MedSAM2
SnivellusSnape/iiyi_dataset
novcor/CADS-dataset
AbdomenAtlas/AbdomenAtlas1.0Mini
loupk/CADS-dataset
ctmedtech/DDR-dataset
BoKelvin/SLAKE
FLARE-MedFM/FLARE-Task4-CT-FM
TLAIM/TAIX-Ray
martagm17/test
Dearcat/CPathPatchFeature
Voxel51/cholect50
TsinghuaC3I/MedXpertQA
Forithmus/MR-RATE-nvseg-ctmr
ikala/tmmluplus
AbdomenAtlas/AbdomenAtlas1.0MiniBeta
Angelou0516/totalsegmentator-organs
msancheza/microCube-Duke-Breast-MRI
MIL-UT/Japanese-Medical-VQA-12m
alexanderdann/CTSpine1K
DermaVLM/PMC-Clinical-VQA
Forithmus/MR-RATE-atlas
eltorio/ROCOv2-radiology
FreedomIntelligence/huatuo26M-testdatasets
rajpurkarlab/3DReasonKnee
WY-0206/CADS-dataset
staryutong999/GAMMA
medalpaca/medical_meadow_medqa
mks-logic/SPY
YongchengYAO/MedVision
AbdomenAtlas/AbdomenAtlas3.0Mini
Angelou0516/Adrenal-ACC-Ki67-Seg
HajihajihaJimmy/Totalsegmentor_Pelvis_Bone_Recon_Dataset
Flmc/MedRCube
Voxel51/SLAKE
OpenMed/synthvision-annotated-qwen
TwanAPI/data1
Voxel51/MedXpertQA
Amod/mental_health_counseling_conversations
lavita/medical-qa-datasets
gOLIVES/OLIVES_Dataset
Voxel51/BTCV-CT-as-video-MedSAM2-dataset
arekborucki/CADS-dataset
lavita/MedQuAD
Nj-1111/Glaucoma_Dataset
luojunyu/SemiEvol
anon-meddial-2026/meddialbench
FLARE-MedFM/FLARE-Task1-PancancerRECIST-to-3D
