On-premise
Speech-To-Text models

State of the art WER.

diagram

Available languages

GBEnglish
NLDutch
FRFrench
PTPortuguese
ESSpanish
DEGerman
ITItalian
plusReady soon:
ILHebrew

Basic

Essential features for a solid start. Perfect for individuals and small projects.

12 500

hours included
250 $  / month
0.04$ / hour
Contact us

Standard

Balanced features for a growing need. Ideal for businesses seeking a comprehensive solution.

50 000

hours included
750 $  / month
0.02$ / hour
Contact us

Need more?

Premium features for maximum performance, tailored for professionals.

>50 000

 
Contact us

Model Features

verifiedUltimate Privacy. Your hardware. Data never leaves your servers.
verifiedNo GPU needed
verified50x cheaper than Google ASR (starting at 0.01 $/hour)
verifiedState-of-the-art WER (word error rate)
voice_selection

Streaming Models

Great for
Low latency
Live captions
Accessibility
Voice command services
Chatbots
Virtual assistants
View Plans
x10  real-time on a single CPU core
voice_chat

Post-processed Models

Great for
Meeting transcripts - online and offline
Visual voicemail
Voice memo transcripts
Content creation
Productivity and analytics
Movie subtitles
View Plans
x20  real-time on a single CPU core

Optimized for calls (8kHz or 16kHz)

Models optimized for Call Centers with highest accuracy, for processing recorded speech.
verifiedWebsockets API
verifiedPunctuation
verifiedWord timestamps
verifiedCapitalization
verifiedNo hallucinations

Frequently Asked Questions

Try the models. No login or credit card required.
Demo