Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Autosub | 525 | 4 months ago | 9 | mit | Python | |||||
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui | ||||||||||
Whisper Standalone Win | 488 | 4 months ago | 1 | |||||||
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python. | ||||||||||
Whishper | 443 | 4 months ago | 27 | agpl-3.0 | Svelte | |||||
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models! | ||||||||||
Whisper Website | 174 | 7 months ago | 2 | mit | Python | |||||
Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model | ||||||||||
Audiototext | 132 | 3 months ago | 1 | Jupyter Notebook | ||||||
Transcribe and translate audio to text using Whisper and DeepL. | ||||||||||
Simple Obs Stt | 91 | a year ago | gpl-3.0 | TypeScript | ||||||
Speech-to-text and keyboard input captions for OBS. | ||||||||||
Content Localization On Aws | 30 | 6 months ago | 17 | apache-2.0 | Vue | |||||
Automatically generate multi-language subtitles using AWS AI/ML services. Machine generated subtitles can be edited to improve accuracy and downstream tracks will automatically be regenerated based on the edits. Built on Media Insights Engine (https://github.com/awslabs/aws-media-insights-engine) | ||||||||||
Karaok Ai | 29 | 5 months ago | Java | |||||||
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text) | ||||||||||
Autosub | 22 | 4 years ago | n,ull | gpl-3.0 | Python | |||||
GUI utility to transcribe/translate from video/audio/subtitles to subtitles | ||||||||||
Voice Data Extract | 18 | a year ago | 3 | mit | Python | |||||
A command line interface to combine text information from subtitles with voice data in the video. Provides a convenient way to generate training data for speech-recognition purposes. |