What is Whisper from OpenAI?
Whisper is a speech-to-text model, also known as an automatic speech recognition (ASR) model.
The team at OpenAI trained a speech recognition system to predict the transcripts of audio collected from the internet. They used 680,000 hours of labeled audio data to train the system. The amazing part is that, without any fine-tuning, the resulting system is often competitive with prior fully supervised models.
What can it do?
- English Transcription
- Any-to-English Speech Translation
- Non-English Transcription
- No Speech Prediction
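The capabilities above can be tried with the open-source `openai-whisper` Python package. The following is a minimal sketch, assuming the package and `ffmpeg` are installed; the helper names `decode_options_for`, `speech_segments`, and `run` are hypothetical and only illustrate how the list maps onto the `task` and `language` options of `model.transcribe`. The no-speech filter is a simplification of what the library does internally, and the 0.6 threshold is assumed to match the library default.

```python
# Sketch only: assumes `pip install openai-whisper` and ffmpeg on the PATH.
# The helper functions below are hypothetical, not part of the whisper library.

NO_SPEECH_THRESHOLD = 0.6  # assumed to match the default used by transcribe()


def decode_options_for(capability, language=None):
    """Map a capability from the list above to transcribe() keyword arguments."""
    if capability == "english_transcription":
        return {"task": "transcribe", "language": "en"}
    if capability == "non_english_transcription":
        # language=None lets Whisper detect the spoken language itself
        return {"task": "transcribe", "language": language}
    if capability == "any_to_english_translation":
        # "translate" always produces English text, whatever the source language
        return {"task": "translate", "language": language}
    raise ValueError(f"unknown capability: {capability}")


def speech_segments(result, threshold=NO_SPEECH_THRESHOLD):
    """Keep only segments whose no-speech probability is below the threshold."""
    return [s for s in result["segments"] if s["no_speech_prob"] < threshold]


def run(audio_path, capability, language=None, model_name="base"):
    """Load a Whisper model and run one capability on an audio file."""
    import whisper  # imported lazily so the helpers above work without the package

    model = whisper.load_model(model_name)
    options = decode_options_for(capability, language)
    return model.transcribe(audio_path, **options)
```

For example, `run("interview.mp3", "any_to_english_translation")` (with a placeholder file name) would return a dictionary whose `"text"` entry holds the English translation, and `speech_segments(result)` would drop the segments Whisper considers likely to be silence or noise.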
In conclusion, Whisper shows that strong speech recognition results can be achieved without the need for complicated self-supervision and self-training techniques. By simply training on a large and diverse dataset, the team made the speech recognition system more robust and accurate!
- OpenAI
- Whisper: Robust Speech Recognition via Large-Scale Weak Supervision
- Hugging Face: Whisper