Google Dataset Search Meta-search across millions of datasets on the web with schema.org-powered indexing. 0400 Datasets & Labeling# catalog# dataset search# metadata
SuperAnnotate AI-assisted annotation and services with workflow, QA, and multimodal support. 0390 Datasets & Labeling# annotation# multimodal# QA
Zenodo CERN-hosted open repository for research data with DOIs and long-term preservation. 0390 Datasets & Labeling# CERN# DOI# open repository
Data.gov (US Open Data) The U.S. government’s open data portal aggregating hundreds of thousands of datasets. 0390 Datasets & Labeling# API# catalog# open government
LAION-5B Massive open image–text dataset (multilingual) widely used for generative models. 0380 Datasets & Labeling# CLIP# image-text# LAION
OpenML Open platform to share datasets, tasks, and benchmarks for machine learning. 0380 Datasets & Labeling# benchmarks# datasets# experiments
Hugging Face Datasets Hub Huge catalog of ML-ready datasets with cards, viewers, and the 🤗 Datasets library. 0370 Datasets & Labeling# dataset cards# datasets# hub
CVAT Leading open-source image/video annotation with auto-annotation and team workflows. 0360 Datasets & Labeling# annotation# computer vision# CVAT
data.world Community Social data catalog to share, query, and collaborate on open datasets. 0330 Datasets & Labeling# catalog# community# data.world