Skip to main content

Datasets

The Datasets function of Chariot handles the storage and versioning of the data that is used for model training and evaluation.

At this time, Chariot supports image and text datasets. Image dataset types include annotations for image classification, object detection, and semantic segmentation. Text dataset types include annotations for text classification, text token classification, text summarization, and text translation. Additionally, dataset annotations can be completed in the Annotation feature of Chariot.

Getting Started With Datasets

Uploading Data

Learn about uploading data into Chariot, including:

  • Supported file formats and archive structures
  • Annotation format specifications for different task types
  • Custom metadata configuration
  • Upload options via UI and SDK

Using Datasets

Once your data is uploaded, explore how to work with it:

  • Browse, filter, and perform exploratory data analysis on your dataset contents
  • Create Views to organize customizable subsets of your data
  • Generate Snapshots for model training
  • Track dataset history and changes