Upload Data

Uploading data to your datasets is the foundation of training effective vision language models. Datature Vi provides flexible options for uploading both images (assets) and their corresponding annotations, whether you're working with a few dozen images or thousands.

This guide covers everything you need to know about getting your data into Datature Vi quickly and efficiently.

💡
Quick workflow
Create dataset → Upload data (you are here) → Train a model → Deploy

What you'll upload

Your dataset needs two types of data:

1. Assets (images)

Images are the visual data your model will learn from. Datature Vi supports a wide range of image formats including JPEG, PNG, TIFF, WebP, HEIF, and more.

Upload methods:

Web interface — Drag-and-drop or file browser for quick uploads
Vi SDK — Programmatic uploads for large datasets and automation

Learn more about uploading images →

2. Annotations (labels)

Annotations define what your model should learn to recognize—bounding boxes, phrases, or question-answer pairs depending on your task type.

Upload options:

Import existing annotations — From COCO, YOLO, Pascal VOC, CSV, or Vi JSONL formats
Create new annotations — Using the visual annotator or AI-assisted tools

Learn more about uploading annotations →

Upload workflow

Follow this sequence for best results:

1. Upload images first

Start by uploading your images to the dataset. Supports drag-and-drop, bulk uploads, and SDK integration.

2. Add annotations

Import existing annotations or create them manually. Annotations link to images by filename.

📘
Important upload order
Always upload images before annotations. The annotation system matches labels to images by filename, so images must exist in the dataset first.

Choose your upload method

Web interface uploads

Best for: Small to medium datasets (under 1,000 images), visual workflows, one-time uploads

Features:

Intuitive drag-and-drop interface
Real-time progress tracking
Visual file selection and preview
No coding or technical setup required
Background processing while you work

Get started:

SDK (programmatic) uploads

Best for: Large datasets (1,000+ images), automation, CI/CD integration, repeated workflows

Features:

Efficient batch processing
Advanced error handling and retry logic
Integration with existing data pipelines
Automated workflows and scheduling
Precise control over upload behavior

Get started:

Supported formats

Datature Vi supports a wide range of image formats (JPEG, PNG, TIFF, WebP, HEIF, and more) and annotation formats (COCO, YOLO, Pascal VOC, CSV, Vi JSONL).

Learn more:

Best practices

Key tips:

Use JPEG or PNG formats for images (under 10 MB recommended)
Always upload images before annotations
Ensure annotation filenames exactly match image filenames (case-sensitive)
Upload in batches of 10-50 images for reliability
For large datasets (1,000+ images), use the SDK instead of web interface

Detailed guidelines:

Troubleshooting

Having issues with uploads? Check the detailed troubleshooting guides:

Image upload troubleshooting → — Slow uploads, unsupported formats, file size issues
Annotation upload troubleshooting → — Missing annotations, format errors, filename mismatches

Next steps

Once your images and annotations are uploaded:

Annotate data

Add or edit annotations using the visual annotator

View dataset insights

Analyze your dataset statistics and distributions

Train a model

Start fine-tuning a VLM with your dataset

Detailed guides

Upload assets (images) → — Complete guide to uploading images via web and SDK
Upload annotations → — Import annotations from all supported formats

Related resources

Create a dataset — Learn how to create datasets for different task types
Manage datasets — Rename, delete, and organize your datasets
Download data — Export datasets and annotations
Vi SDK getting started — Get started with programmatic uploads
Phrase grounding concepts — Learn about object detection tasks
Visual question answering concepts — Learn about VQA tasks
Upload assets — Step-by-step image upload guide
Upload annotations — Import existing annotations
Annotate data — Create annotations manually
View dataset insights — Check upload progress and statistics
Train a model — Use your data for VLM training
Quickstart — End-to-end workflow tutorial

Need help?

We're here to support your VLMOps journey. Reach out through any of these channels:

Contact Support

Get help from our team via our website or email us at [email protected]

Join Our Community

Connect with other Datature users, share ideas, and get community support on Slack

Explore Resources

Read our Blog
Check out GitHub
Watch Tutorials

Schedule a Demo

Book a personalized demo to see how Datature Vi can accelerate your vision AI projects