Sync From External Buckets
Connect Datature Vi to your existing object storage and sync image and video assets without uploading them to our servers. Read-only access keeps your source data in place.
If your training data already lives in a cloud storage bucket, you do not need to copy it into Datature Vi. Connect the bucket once and Vi reads the asset metadata in place. Your image and video files stay in your storage account, and the platform tracks references to them so you can annotate, train, and evaluate the same dataset you keep in production.
External bucket connections are available on paid account tiers. Check your plan in Billing before you begin setup.
How bucket sync works
Vi uses a read-only metadata sync. The platform fetches filenames, dimensions, and EXIF data, then renders thumbnails and previews from your bucket on demand. The actual image and video bytes stay where they already live.
A few practical consequences of this design:
- Bucket changes flow one way. New objects appear in Vi after the next sync. Annotations and labels you create in Vi never write back to your bucket.
- Deletions in Vi remove the reference, not the file. The original object stays in your bucket until you remove it there.
- Synced assets count toward your monthly data row quota the same way uploaded assets do.
- Connecting multiple buckets to the same dataset merges them. Objects with identical filenames overwrite each other in the dataset view.
- Only image and video assets sync from buckets. Annotations are not pulled from your storage; upload them directly to Vi after the sync finishes.
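Because identical filenames overwrite each other when several buckets feed one dataset, it can help to check for collisions before connecting a second bucket. The sketch below is illustrative only: it assumes you already have a listing of object keys per bucket, and it assumes that "filename" means the last path segment of the key, which may not match how Vi compares names. The function name and the sample bucket names are hypothetical.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def find_filename_collisions(buckets):
    """Return filenames that appear under more than one object key.

    `buckets` maps a bucket name to an iterable of object keys.
    Assumption: two objects collide when the last path segment matches.
    """
    seen = defaultdict(list)
    for bucket, keys in buckets.items():
        for key in keys:
            filename = PurePosixPath(key).name
            seen[filename].append((bucket, key))
    # Keep only the filenames that occur in more than one place.
    return {name: places for name, places in seen.items() if len(places) > 1}


# Hypothetical listings for two buckets headed for the same dataset.
listings = {
    "factory-a": ["cam1/0001.jpg", "cam1/0002.jpg"],
    "factory-b": ["raw/0002.jpg", "raw/0003.jpg"],
}
collisions = find_filename_collisions(listings)
print(collisions)
```

Any filename reported here is a candidate for renaming in the source bucket before the next sync.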
Choose your storage provider
- AWS S3: Connect an Amazon S3 bucket using IAM roles and a trust policy.
- Azure Blob Storage: Connect a Storage Account container using a service principal and a role assignment.
- Google Cloud Storage: Connect a GCS bucket using a service account and an IAM policy binding.
- MinIO: Connect a self-hosted MinIO deployment using access keys and a read policy.
- S3-Compatible Storage: Connect Wasabi, Backblaze B2, Cloudflare R2, or any other S3-compatible service.
What works the same across providers
Every connection follows the same four-step wizard inside the Dataset tab under Connect to External Buckets:
1. Bucket Details: Enter the connection name, bucket or container name, and an optional folder prefix to scope the sync to a subset of your data.
2. Access Credentials or Policy: Apply the IAM policy, role assignment, or access keys that Vi generates for you. Each provider has a different mechanism, but the goal is the same: grant read-only access to the bucket.
3. Connection Status: Vi tests the connection. A green status means Vi can list and read objects. A red status means a permission, region, or endpoint setting is wrong.
4. Sync Assets: Run an initial sync now or schedule it for later. Sync takes 5 to 40 minutes depending on object count.
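To get a feel for what a folder prefix includes before you enter it in the Bucket Details step, you can filter a key listing locally. This is a minimal sketch under an assumption: it treats the prefix as a folder path and matches keys that start with it followed by a slash. Vi's actual matching rules are not described here and may differ. The function name and sample keys are hypothetical.

```python
def in_scope(key, prefix=""):
    """Return True if an object key falls under the optional folder prefix.

    Assumption: the prefix is treated as a folder, so "train" matches
    "train/..." but not "train-old/...".
    """
    if not prefix:
        return True  # no prefix means the whole bucket is in scope
    folder = prefix if prefix.endswith("/") else prefix + "/"
    return key.startswith(folder)


# Hypothetical object keys in one bucket.
keys = ["train/img_001.jpg", "train-old/img_001.jpg", "val/img_002.jpg"]
scoped = [k for k in keys if in_scope(k, "train")]
print(scoped)
```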
Asset requirements
Vi reads metadata directly from your bucket, so files have to meet the same format requirements as direct uploads.
If a video fails to sync, run ffprobe against the file locally and confirm that its major brand (major_brand) and pixel format (pix_fmt) match the format requirements for direct uploads.
If major_brand or pix_fmt differs, re-encode the file with FFmpeg before syncing.
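The check and re-encode can be scripted along the following lines. Treat this as a sketch, not the official procedure: the target values `isom` and `yuv420p`, the choice of H.264 (`libx264`) with AAC audio, and all function names are assumptions made for illustration. Substitute the values from Vi's format requirements.

```python
import json
import subprocess

# Placeholder targets (assumptions): replace with the documented requirements.
REQUIRED_MAJOR_BRAND = "isom"
REQUIRED_PIX_FMT = "yuv420p"


def probe(path):
    """Run ffprobe and return its JSON report for the first video stream."""
    result = subprocess.run(
        ["ffprobe", "-v", "error", "-select_streams", "v:0",
         "-show_entries", "format_tags=major_brand:stream=pix_fmt",
         "-of", "json", path],
        capture_output=True, text=True, check=True,
    )
    return json.loads(result.stdout)


def mismatches(report):
    """Return the probed fields that differ from the placeholder targets."""
    tags = report.get("format", {}).get("tags", {})
    streams = report.get("streams") or [{}]
    found = {
        # Brands are four characters and may be space padded, e.g. "qt  ".
        "major_brand": tags.get("major_brand", "").strip(),
        "pix_fmt": streams[0].get("pix_fmt", ""),
    }
    wanted = {"major_brand": REQUIRED_MAJOR_BRAND, "pix_fmt": REQUIRED_PIX_FMT}
    return {name: value for name, value in found.items() if value != wanted[name]}


def reencode_command(src, dst):
    """Build an FFmpeg command that re-encodes video to the target pixel format."""
    return ["ffmpeg", "-i", src, "-c:v", "libx264",
            "-pix_fmt", REQUIRED_PIX_FMT, "-c:a", "aac", dst]


# Example with a hand-written report instead of a real file.
sample = {"streams": [{"pix_fmt": "yuv422p10le"}],
          "format": {"tags": {"major_brand": "qt  "}}}
print(mismatches(sample))
print(" ".join(reencode_command("input.mov", "output.mp4")))
```

For a real file, call `mismatches(probe("clip.mov"))`. If it returns anything, run the command from `reencode_command` and probe the output again before syncing.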
CORS allowlist
Image and video previews load directly from your bucket in the browser, so the bucket has to allow cross-origin requests from the Vi web app. Add the Vi origin to your CORS configuration:
https://vi.datature.com
Each provider page has the exact CORS payload to apply.
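For orientation, an S3-style CORS rule that allows the Vi origin might look like the output of the sketch below. The allowed methods, allowed headers, and max-age value are assumptions chosen for a read-only preview use case. The provider page remains the source of truth for the payload you apply.

```python
import json

VI_ORIGIN = "https://vi.datature.com"


def s3_style_cors(origin=VI_ORIGIN):
    """Build an S3-style CORS configuration that permits read requests from one origin."""
    return {
        "CORSRules": [
            {
                "AllowedOrigins": [origin],
                # Assumption: previews only need read access.
                "AllowedMethods": ["GET", "HEAD"],
                "AllowedHeaders": ["*"],
                "MaxAgeSeconds": 3600,
            }
        ]
    }


config = s3_style_cors()
print(json.dumps(config, indent=2))
```

Other providers express the same idea in their own schema, so use this only to recognize the pieces: an origin allowlist plus read-only methods.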
