Skip to playerSkip to main content
  • 3 years ago
If you want to transcribe your videos and audio into text for free but with high quality, you have come to the correct video.

In this tutorial video, I will guide you on how to use #OpenAI #Whisper model. I will show you how to install and run Open AI's Whisper from scratch. I will demonstrate to you how to convert audio/speech into text.

Our Discord server ⤵️
https://bit.ly/SECoursesDiscord

If I have been of assistance to you and you would like to show your support for my work, please consider becoming a patron on ⤵️
https://www.patreon.com/SECourses

Whisper is a general-purpose speech recognition model released for free by Open AI. I claim that Whisper is the best available Speech-to-Text model (Natural Language Processing - #NLP) released to public usage including premium paid ones such as Amazon Web Services, Microsoft Azure Cloud Platform, or Google Cloud API. And Whisper is free to use.

I will show you how to install the necessary Python code and the dependent libraries. I will show you how to download a video from YouTube with YT-DLP, how to cut certain parts of the video with LosslessCut, and how to extract the audio of a video with FFMPEG. I will show you how to do a transcription of a video or a sound. I will show you how to generate subtitles for any video. Finally, I will show you how to generate translated transcription and subtitles of any language video.

With the translation feature of the Whisper model, you can watch any language (Whisper supports 99 languages) with English subtitles. Let's say you can find English subtitles for your favorite video in German or Japanese or Arabic. It is not a problem. Just follow my tutorial and generated English translated subtitles.

The links and the commands I have shown in the video below:

Open AI Whisper : https://openai.com/blog/whisper/

The links and the commands I have shown in the video below:

Open AI Whisper : https://openai.com/blog/whisper/

Whisper Code : https://github.com/openai/whisper

Python : https://www.python.org/downloads/release/python-399/

Whisper install : pip install git+https://github.com/openai/whisper.git

How to install CUDA support for using GPU when doing transcription of audio :
First, delete existing Pytorch : pip3 uninstall torch
Then install Pytorch with CUDA support : pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu116

FFMPEG : https://github.com/BtbN/FFmpeg-Builds/releases

LosslessCut : https://github.com/mifi/lossless-cut/releases

How to extract sound of any video with FFMPEG : ffmpeg -i "test_video.webm" -q:a 0 -map a test_video.mp3

How to transcribe an English video : whisper "C:\speech to text\test_video.mp3" --language en --model base.en --device cpu --task transcribe

How to transcribe an English video with CUDA support : whisper "C:\speech to text\test_video.mp3" --language en --model base.en --device cuda --task transcribe

Category

🤖
Tech
Be the first to comment
Add your comment

Recommended