Open-Source

Wav2Vec2 Large 960h

A widely used self-supervised speech representation model from Meta AI for automatic speech recognition and audio understanding tasks.

A multilingual speech recognition and translation model by OpenAI, supporting 100+ languages with improved robustness and low-latency transcription.