Skip to main content

Uploading Datasets

Chariot supports uploading either annotated or unannotated data.

Uploading Data

To upload a dataset into Chariot, click the Datasets tab on the left-hand navigation bar, and then click Create Dataset.

When choosing data to upload, you can either upload images individually (with a maximum of 25) or in a compressed archive file (which is required to upload annotated data) of all of the images. The archive file must be either a compressed .tar or .zip file with the images in it and a .jsonl file. Annotation format information can be found here.

When prompted, provide the following information:

  • Data Type: Choose which kind of data—image or text—you would like to bring into Chariot. Each dataset type has different annotations and expected outcomes.
  • Name: Provide a name for the dataset. This is used to reference the dataset in the project.
  • Description: Provide a short description of the dataset for your own reference. This may be helpful in differentiating between similar datasets in a project.
  • Upload Files: Select the local file(s) you want to upload into the dataset.

Click Submit to finish creating the dataset.

note

During the upload process, relevant EXIF metadata is extracted from each image, with GPSDateTime, GPSLatitude, and GPSLongitude being particularly useful for filtering datums later. If needed, these fields can be overwritten using custom metadata when uploading a dataset. For more details, refer to the Metadata Format section.

Uploading Data to Specific Splits

If you want to upload data into a specific split—to specify immediately if your data is going to be used for training, testing, or validation purposes—click the Upload as train/test/val split checkbox before uploading your data file(s). This will assign that data to a specific default split that will be used when creating a View of that dataset later on. When selecting this option, the data assigned to each split will be locked to that split type unless explicitly ignored when creating a View. Each split will then be processed as an individual upload as shown below. See the Views section for more information.

warning

Large image files (>1GB) uploaded into Chariot may not populate EXIF metadata. Consider chipping your image data prior to uploading it. For additional information, please contact support@striveworks.com or your Striveworks administrator.

Ignore File

Archive upload contents may be ignored by providing a .chariotignore file at the root of archive. This allows upload archives to contain additional information that is not intended for Chariot. The .chariotignore file uses the same syntax as Git's ignore file. See https://git-scm.com/docs/gitignore for more information.

Example Ignore File

# ignore files foo and bar
foo
bar
# ignore any hidden files
.*
# ignore any json files other than annotations and metadata
*.json
!annotations.json
!metadata.json
# ignore contents of the test directory
test/