Wav2Vec2 Large 960h

A widely used self-supervised speech representation model from Meta AI for automatic speech recognition and audio understanding tasks.

1 min

Whisper Large v3

A multilingual speech recognition and translation model by OpenAI, supporting 100+ languages with improved robustness and low-latency transcription.

1 min