Annotate Videos

Annotate video frame sequences with freeform text to train a vision-language model on temporal and visual reasoning tasks.

Datature Vi supports freeform text annotations for videos. Unlike image annotation, video annotation works with frame sequences: you select a range of frames using the video scrubber, then write text annotations that describe what happens across those frames.

Before You Start

You need a dataset with uploaded videos. Create a dataset if you don't have one yet.


Annotation types

Freeform Text

Write open-ended text annotations for video frame sequences. Describe actions, events, and scene changes in any format.

Freeform text

Freeform text annotation for videos lets you select a sequence of frames using the timeline scrubber, then write any structured or unstructured text for that sequence. This is ideal for describing actions, events, physics behaviors, and temporal relationships that span multiple frames.

The result: a model trained to understand and reason about video content over time.

Typical use cases:

  • Action recognition and description
  • Temporal event analysis
  • Physics and behavior rule verification
  • Video captioning and scene understanding
  • Activity monitoring and compliance checking



How video annotation differs from image annotation

Video annotation adds a temporal dimension. Instead of annotating a single static image, you work with frame sequences on a timeline.

                      Image annotation        Video annotation
Input                 Single image            Sequence of frames
Selection             Entire image            Frame range via scrubber
Context               Spatial only            Spatial and temporal
Annotations describe  What is in the image    What happens across frames
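To make the idea of a frame-range annotation concrete, the sketch below models one as a small Python record: an inclusive start frame, an inclusive end frame, and the freeform text. The class name `FrameRangeAnnotation`, its fields, and the example frame rate are all hypothetical and chosen for illustration; this is not Datature Vi's storage or export format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FrameRangeAnnotation:
    """Hypothetical record: freeform text attached to an inclusive frame range."""

    start_frame: int  # first frame of the selected sequence (0-based, assumed)
    end_frame: int    # last frame of the selected sequence (inclusive)
    text: str         # any structured or unstructured description

    def __post_init__(self) -> None:
        # Reject ranges that could not come from a valid scrubber selection.
        if self.start_frame < 0 or self.end_frame < self.start_frame:
            raise ValueError("expected 0 <= start_frame <= end_frame")

    @property
    def num_frames(self) -> int:
        """Number of frames in the range, counting both endpoints."""
        return self.end_frame - self.start_frame + 1

    def duration_seconds(self, fps: float) -> float:
        """Length of the range in seconds for a given frame rate."""
        return self.num_frames / fps


# Example: a 120-frame sequence described in plain language.
annotation = FrameRangeAnnotation(
    start_frame=120,
    end_frame=239,
    text="A forklift reverses out of the loading bay while a worker "
         "waits behind the safety line.",
)
```

An image annotation would need only the `text` field; the two frame indices are what carry the temporal context described in the table above.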

Annotation workflow

  1. Upload videos to your dataset
  2. Open the annotator from your dataset's Annotate tab
  3. Select a video from the thumbnail strip
  4. Use the timeline scrubber to select frame sequences
  5. Write freeform text annotations for each sequence
  6. Review coverage using the dataset overview
  7. Train your model using the annotated dataset
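As a rough illustration of what reviewing coverage in step 6 can mean, the sketch below computes the fraction of a video's frames that fall inside at least one annotated range. It assumes annotations are available as inclusive `(start_frame, end_frame)` pairs; the function names are hypothetical, and the dataset overview in Datature Vi may define or compute coverage differently.

```python
def covered_frames(ranges):
    """Count frames covered by at least one inclusive (start, end) range.

    Overlapping ranges are counted once.
    """
    total = 0
    current_end = -1  # last frame already counted
    for start, end in sorted(ranges):
        if end <= current_end:
            continue  # range lies entirely inside frames already counted
        # Count only the part of this range that is not yet covered.
        total += end - max(start, current_end + 1) + 1
        current_end = end
    return total


def coverage_ratio(ranges, total_frames):
    """Fraction of a video's frames that have at least one annotation."""
    if total_frames <= 0:
        raise ValueError("total_frames must be positive")
    return covered_frames(ranges) / total_frames


# Example: three annotated sequences in a 1000-frame video, two overlapping.
example_ranges = [(0, 99), (50, 149), (300, 399)]
example_ratio = coverage_ratio(example_ranges, total_frames=1000)
```

A low ratio is not necessarily a problem, since some stretches of a video may contain nothing worth describing, but it can help spot videos that were skipped entirely.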

Next steps

Annotate With Freeform Text

Step-by-step guide to annotating video frame sequences with freeform text.

Dataset Overview

Check annotation coverage and quality across your video dataset.