Datasets

BigDocs

An open, permissively licensed dataset for training multimodal models on document and code tasks.

SYNTHIA

A pioneering synthetic dataset for semantic segmentation of urban scenes, licensed to Intel, Audi, and Huawei.