Wav2Vec2 Large 960h
A widely used self-supervised speech representation model from Meta AI for automatic speech recognition and audio understanding tasks.
A widely used self-supervised speech representation model from Meta AI for automatic speech recognition and audio understanding tasks.
A multilingual speech recognition and translation model by OpenAI, supporting 100+ languages with improved robustness and low-latency transcription.