testxlog

How to convert speech to text without an internet connection?

May 16, 2024281

AI Translation

This post is translated from Chinese into English through AI.View Original

AI-generated summary

The project "Xlog" has been completed in 2024. Various methods are listed for achieving speech-to-text conversion without internet connection, including online tools that require internet connection and offline tools like FasterWhisper. Instructions are provided for setting up FasterWhisper on a computer system, including downloading FFmpeg, the GUI, and a suitable model. The process involves selecting a local model, loading the model, choosing processing options, executing the transcription, editing subtitles, and saving the subtitle file in the desired format.

Method 1: Implement through Xunjie Voice Recognition Function online (requires internet connection)#

Note: Limit individual audio files to no more than 20M. Can be refreshed multiple times.

Method 2: Implement through Weizheng Online (requires internet connection)#

Note: 3 chances in total, try to merge the audio into one before uploading.

Method 3: Implement through iDi Cloud Dictation online (requires internet connection)#

Method 4: Log in to Yuelu with your mobile phone number (requires internet connection)#

Method 5: Convert audio to text through converter.app (requires internet connection)#

Method 6: Convert speech to text using Faster-Whisper (no internet connection required)#

First, download and install FFmpeg suitable for your computer system from Github. Installation tutorial can be found at How to download and install FFmpeg on Windows 10?
Then, download faster-whisper-GUI.exe from Github and install it as an administrator by right-clicking.
Next, search and download a model ending with "base" from huggingface, and copy it to a suitable directory folder.
Run FasterWhisperGUI as an administrator.
Select "Use local model" and choose the downloaded model file, then click "Load Model".
If you are using an Nvidia graphics card, select "cuda" in the processing device options.
Click "Execute Transcription".
Click the plus sign to select the video file to be transcribed.
After transcription is complete, click "Jump to whisperX and subtitle editing".
Click "Save Subtitle File".
You can also choose the subtitle format to save.

Ownership of this post data is guaranteed by blockchain and smart contracts to the creator alone.

Blockchain ID
#59476-20
Owner
0x2468683ff691bf0d7c8ac63afbbc0f157985a600
Transaction Hash
Creation 0x3a5a404a...5c1f57d088 Last Update 0x3a5a404a...5c1f57d088
IPFS Address
ipfs://QmeMtfjnbTCVPpszHTJRtJudZVkwv2xV5z9AqV3sAp7Umu