Get your timestamped transcription in seconds.

Upload any audio or video file and receive a full transcript with precise timestamps and speaker labels. Up to 99% accuracy on clear audio, processed in under a minute per hour of audio, and hosted entirely in the EU.

Transcribe my file

See pricing

Audio transcribed in under a minute with over 98% accuracy — New York Times

Timestamped transcription

MP3, WAV, MP4, M4A, MOV, OGG, +14 more

Trusted by over 75,000 people worldwide

99% accuracy

1 free transcription per day

With or without a plan

Accuracy on clear audio: 99%
Per hour of audio: < 1 min
Languages supported: 100+
Professionals trust Vook.ai: 75k+

How it works

From file to timestamped transcript in three steps

No software to install, no forms to fill out. Drop your file and we'll handle the rest.

Upload your file

Drag and drop your file or pick it from your computer. Files up to 6 GB are accepted, no installation needed.

Vook.ai transcribes in minutes

Vook.ai detects speakers, adds timestamps, and produces a clean, punctuated transcript. Processing typically takes under one minute per hour of audio.

Edit, export, ask

Review in our editor, export to PDF, DOCX, MD, SRT or HTML, and ask the chat to summarize, extract quotes, or pull themes.

Why Vook

The transcription AI that doesn't read your data.

European sovereignty isn't a feature, it's the foundation. Your files stay yours: encrypted, EU-hosted, and never used for training.

Hosted in the EU

Your files stay on French infrastructure and never cross the Atlantic. GDPR-native, with no CLOUD Act exposure.

AES-256 encryption

Encrypted at rest with AES-256. Only you can access your transcripts.

Never used for training

Your audio and transcripts are never used for training, never resold, never analyzed for ads.

GDPR-native

Built from day one for European compliance. DPA on request, full audit trail, your right to deletion respected.

Formats

Works with every major audio and video format

Vook.ai reads every common audio and video format, and exports to whatever your workflow needs.

We built Vook.ai so that professionals handling sensitive recordings could get accurate, timestamped transcripts without ever worrying about where their data goes.

Vook.ai engineering team

Input formats

.mp3: Most common
.wav: Lossless
.mp4: Video audio
.m4a: Apple devices
.mov: QuickTime
.ogg: Open source
.mpga: MPEG audio
.mpeg: MPEG audio
.opus: Low-bitrate
.flac: Studio quality
.aac: Streaming
.webm: Web recordings
.wma: Windows
.avi: Video
.mts: AVCHD video
.m4v: Apple video
.mkv: Matroska video
.wmv: Windows video
.flv: Flash video
.3gp: Mobile video

Export to

.pdf: Print-ready
.docx: Word document
.md: Markdown
.srt: Subtitles
.html: Web page

For your profession

Made for people who work with words.

Timestamps turn a raw transcript into a navigable document, whether you are a journalist verifying quotes, a researcher citing sources, or an editor planning cuts.

Interview transcription for journalists and newsrooms

Interview transcription, without typing a line

“Every speaker identified”
“Quotes ready to extract”
“Accurate transcripts in minutes”

Learn more

Guide

Timestamped Transcription: Everything You Need to Know

What Is a Timestamped Transcription?

A timestamped transcription is a text document that pairs spoken words with the exact time they occur in the original audio or video file. Each segment, sentence, or speaker turn is tagged with a time code, typically in hours:minutes:seconds format, so readers can locate any moment in the recording without listening to the whole file.

Timestamped transcripts are the standard output format for professional transcription services, subtitling workflows, legal depositions, and qualitative research. They differ from plain transcripts in that the time codes are embedded directly in the text, making the document both human-readable and machine-parseable.
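To make "human-readable and machine-parseable" concrete, here is a minimal Python sketch. It assumes a hypothetical line layout of `[HH:MM:SS] Speaker: text`; this layout and the function names are illustrative and are not taken from Vook.ai's actual export files.

```python
import re

# Hypothetical line layout: "[HH:MM:SS] Speaker: text".
# This is an assumption for illustration, not a documented Vook.ai format.
LINE_PATTERN = re.compile(r"^\[(\d{2}):(\d{2}):(\d{2})\]\s+([^:]+):\s+(.*)$")


def format_timecode(seconds: float) -> str:
    """Render a number of seconds as an hours:minutes:seconds time code."""
    total = int(seconds)  # drop fractional seconds
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def parse_line(line: str) -> dict:
    """Split one timestamped line into start time, speaker, and text."""
    match = LINE_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"not a timestamped line: {line!r}")
    hours, minutes, secs, speaker, text = match.groups()
    start = int(hours) * 3600 + int(minutes) * 60 + int(secs)
    return {"start_seconds": start, "speaker": speaker.strip(), "text": text}
```

The same line is easy for a person to read and easy for a script to index, which is what makes the format useful for subtitling and research tools.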

Why Timestamps Matter in a Transcript

Timestamps turn a static text document into a navigable reference. Without them, finding a specific quote in a two-hour recording means reading the entire transcript or scrubbing through audio manually. With timestamps, you jump directly to the relevant second.

Fact-checking: journalists can verify every quote against the original recording in seconds.
Legal and compliance use: precise time codes provide an auditable trail for depositions and meeting records.
Video editing: editors use timestamps to plan cuts and sync subtitles without watching full footage.
Academic citation: researchers can cite a specific moment in an interview with a verifiable reference.
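The uses listed above all come down to looking up a phrase and jumping to its time code. A minimal Python sketch follows, assuming each segment is a dictionary with a `start` value in seconds and a `text` value; these field names are hypothetical and chosen only for the example.

```python
def format_timecode(seconds: float) -> str:
    """Render a number of seconds as HH:MM:SS."""
    total = int(seconds)
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def find_quote(segments: list[dict], phrase: str) -> list[tuple[str, str]]:
    """Return (time code, text) for every segment containing the phrase.

    The match is case-insensitive. Segment keys "start" and "text" are
    assumptions made for this sketch.
    """
    needle = phrase.casefold()
    return [
        (format_timecode(seg["start"]), seg["text"])
        for seg in segments
        if needle in seg["text"].casefold()
    ]
```

With a two-hour interview, a call such as `find_quote(segments, "budget")` returns the time codes to check in the recording, so there is no need to scrub through the audio.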

How AI Generates Timestamps Automatically

Modern AI transcription models process audio in short overlapping windows, aligning each recognized word to its position in the audio stream. The result is word-level or segment-level timestamps that reflect the actual timing of speech, not an estimate. Vook.ai's engine processes one hour of audio in under one minute, returning a transcript where every segment carries its start time.

Automatic timestamps are accurate on clear recordings but can drift slightly on heavily compressed audio or recordings with significant background noise. Vook.ai's built-in editor lets you review and adjust any segment before exporting, so you can correct any remaining errors in the final document.
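The overlapping-window idea can be sketched in a few lines of Python. This is a simplified illustration, not Vook.ai's engine: the 30-second window and 5-second overlap are assumed values, and `to_absolute` is a hypothetical helper that shifts word offsets measured inside a window back onto the timeline of the full recording.

```python
def window_bounds(duration: float, window: float = 30.0, overlap: float = 5.0):
    """Split a recording into overlapping (start, end) windows, in seconds.

    The window and overlap sizes are illustrative assumptions.
    """
    if window <= 0 or not 0 <= overlap < window:
        raise ValueError("window must be positive and overlap smaller than window")
    step = window - overlap
    bounds = []
    start = 0.0
    while start < duration:
        end = min(start + window, duration)
        bounds.append((start, end))
        if end >= duration:
            break  # the last window reached the end of the recording
        start += step
    return bounds


def to_absolute(window_start: float, words: list[tuple[str, float]]):
    """Convert word offsets within a window to positions in the full recording."""
    return [(word, window_start + offset) for word, offset in words]
```

The overlap gives a word that straddles a window boundary a second chance to be recognized in full, which is one reason such schemes are used.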

Speaker Diarization and Timestamps Together

Speaker diarization is the process of identifying who is speaking at each moment in a recording. Combined with timestamps, diarization produces a transcript that shows not just what was said and when, but who said it. This is essential for multi-speaker recordings such as interviews, panel discussions, focus groups, and meetings.

Vook.ai applies diarization automatically alongside timestamping. Each speaker is assigned a label (Speaker 1, Speaker 2, and so on), and you can rename or merge speakers in the editor before export. All speaker labels and timestamps are preserved in every export format, including PDF, DOCX, Markdown, SRT, and HTML.
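Renaming and merging speakers can be pictured as two small operations on a list of diarized segments. The Python sketch below assumes segments are dictionaries with `speaker`, `start`, `end`, and `text` keys; these names are hypothetical and do not describe the editor's internal data model.

```python
def rename_speakers(segments: list[dict], names: dict[str, str]) -> list[dict]:
    """Return new segments with speaker labels replaced according to `names`.

    Mapping two labels to the same name merges those speakers.
    Labels missing from `names` are kept unchanged.
    """
    return [
        {**seg, "speaker": names.get(seg["speaker"], seg["speaker"])}
        for seg in segments
    ]


def merge_consecutive(segments: list[dict]) -> list[dict]:
    """Join adjacent segments spoken by the same speaker into one turn."""
    merged: list[dict] = []
    for seg in segments:
        if merged and merged[-1]["speaker"] == seg["speaker"]:
            last = merged[-1]
            last["end"] = seg["end"]  # the turn now ends where this segment ends
            last["text"] = f'{last["text"]} {seg["text"]}'
        else:
            merged.append(dict(seg))  # copy so the input list is not modified
    return merged
```

For example, mapping both "Speaker 1" and "Speaker 3" to "Host" and then calling `merge_consecutive` collapses their adjacent segments into a single turn that keeps the earliest start time and the latest end time.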

Choosing the Right Export Format

The best export format depends on how you plan to use the transcript. Vook.ai supports five formats, each suited to a different workflow:

PDF: print-ready version for archiving, legal records, or client delivery.
DOCX: formatted Word document with speaker labels and timestamps, ready for annotation and sharing with colleagues.
Markdown: structured text for developers, note-taking apps, and documentation platforms.
SRT: subtitle format with time codes, ready to drop into your video editor or player.
HTML: web-ready transcript you can publish or embed directly.
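Of the five formats, SRT has the strictest layout: a running index, a `start --> end` line with millisecond time codes, the subtitle text, and a blank line between entries. The Python sketch below shows how timestamped segments map onto that layout. The segment keys and the choice to prefix each line with the speaker label are assumptions for illustration, not a description of how Vook.ai builds its SRT files.

```python
def srt_timecode(seconds: float) -> str:
    """Render seconds in the SRT time code form HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments: list[dict]) -> str:
    """Build SRT text from segments with "start", "end", "speaker", "text" keys."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        timing = f"{srt_timecode(seg['start'])} --> {srt_timecode(seg['end'])}"
        # Prefixing the speaker label is an assumption made for this sketch.
        blocks.append(f"{index}\n{timing}\n{seg['speaker']}: {seg['text']}\n")
    return "\n".join(blocks)  # a blank line separates consecutive entries
```

Because every entry carries both a start and an end time, a video editor or player can display each line for exactly the span in which it was spoken.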

Privacy and Data Security for Sensitive Recordings

Many recordings that require transcription contain sensitive information: medical consultations, legal interviews, confidential business meetings, or personal conversations. Choosing a transcription service that handles this data responsibly is not optional.

Vook.ai is hosted entirely in France, within the EU, and uses AES-256 encryption at rest for all files. Vook.ai never uses your audio to train AI models, never sells your data, and never analyzes it for advertising. A Data Processing Agreement is available on request for organizations that require formal GDPR documentation.

FAQ

Frequently Asked Questions

Have a different question and can’t find the answer you’re looking for? Contact us.

What is a timestamped transcription?

A timestamped transcription is a text version of your audio or video file where each line or segment is tagged with the exact time it occurs. This makes it easy to navigate long recordings, find specific moments, and reference quotes precisely.

How accurate is Vook.ai's timestamped transcription?

Vook.ai reaches up to 99% accuracy on clear audio in supported languages. Accuracy may be lower for recordings with overlapping voices, low-quality phone audio, or strong accents. The built-in editor lets you correct any errors quickly before exporting.

How long does it take to transcribe a file?

Vook.ai processes audio at less than one minute per hour of content. A 60-minute recording is typically ready in under a minute. Processing time may vary slightly depending on file size and server load.

Do I need to create an account to get a timestamped transcript?

No account is required to try Vook.ai. You get one free transcription per day with no sign-up and no credit card. Paid plans unlock longer files and more transcriptions.

What file formats does Vook.ai accept for timestamped transcription?

Vook.ai accepts MP3, WAV, M4A, MP4, MOV, OGG and many more audio and video formats, up to 6 GB per file with no duration limit. Both audio and video files are supported.

Are timestamps included in the exported file?

Yes. Timestamps and speaker labels are preserved in all export formats: PDF, DOCX, Markdown, SRT, and HTML. You can choose which details to include before downloading.

Is my audio file kept private?

Your files are encrypted at rest with AES-256 and stored on EU servers in France. Vook.ai never uses your audio to train AI models and never sells your data.

Free plan

Get 1 free transcript per day. Upgrade for unlimited power.

Subscribe now, cancel anytime

Get 4 months free with annual plans

API plan

Integrate Vook.ai into your stack

Custom pricing and features

Explore

Dedicated API access
Custom-built features
Centralized billing

Credits never expire

10h pass - no subscription

Use these hours whenever you want; they never expire.

per hour

Buy hours

Start your first timestamped transcript now

Free for occasional use. No credit card. One file per day, every day, forever.

Try now

Related conversion tools

Speaker identification
Transcribe long audio
Audio to SRT
MP3 to text
MP4 to text
WAV to text