speech-to-text

Here are 2,893 public repositories matching this topic...

Amir-Hofo / Speech-commands-Classification

In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.

machine-learning ai deep-learning cnn pytorch artificial-intelligence speech-recognition convolutional-neural-networks speech-to-text audio-classification torchaudio speech-classification

Updated Jun 12, 2024

KevKibe / African-Whisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Jun 12, 2024
Python

HenestrosaDev / audiotext

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

python speech-recognition speech-to-text transcriber video-to-text audio-to-text speech-to-text-api subtitles-generator customtkinter whisperx

Updated Jun 12, 2024
Python

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, and JavaScript.

android windows macos linux raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v asr arm32 onnx vits openkylin

Updated Jun 12, 2024
C++

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Jun 12, 2024
C

modelscope / FunClip

Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated Jun 12, 2024
Python

akscf / mod_whisper_asr

FreeSWITCH ASR module for working with whisper.cpp

speech-recognition freeswitch speech-to-text whisper-cpp

Updated Jun 12, 2024
C

OpenVoiceOS / status

Open Voice OS Status Page

status text-to-speech translator monitoring alerting cuda sam nvidia tts uptime stats speech-to-text stt piper ovos upptime openvoiceos fasterwhisper mimic3

Updated Jun 12, 2024
Markdown

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated Jun 12, 2024
Jupyter Notebook

jianchang512 / pyvideotrans

Translate a video from one language to another and add dubbing.

text-to-speech speech-to-text video-transition

Updated Jun 12, 2024
Python

ErcinDedeoglu / WhisperDock

Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.

api docker machine-learning speech-to-text audio-transcription whisper-cpp

Updated Jun 12, 2024
C++

deepgram / deepgram-go-sdk

Go SDK for Deepgram's automated speech recognition APIs.

go speech-recognition speech-to-text hacktoberfest deepgram

Updated Jun 11, 2024
Go

occ-ai / obs-localvocal

OBS plugin for local speech recognition and captioning using AI

plugin translation ai livestream live-streaming speech-recognition speech-to-text obs transcription obs-studio whisper realtime-translator obs-studio-plugin realtime-transcribe openai-whisper whisper-cpp real-time-transcription

Updated Jun 11, 2024
C++

barrylee111 / voicechat-LLM

A chatbot with both prompt and voicechat capabilities. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.

react python text-to-speech websocket speech-to-text whisper fastapi largelanguagemodel

Updated Jun 11, 2024
Python

richardrigutins / my-transcripts

Web app that converts speech to a text transcript and lets you save the generated transcripts to OneDrive using Microsoft Graph

graph dotnet azure dotnet-core speech-to-text cognitive-services microsoft-graph microsoft-graph-sdk blazor blazor-server hacktogether hack-together

Updated Jun 11, 2024
C#

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jun 11, 2024
Python

mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

text-to-speech translator translation offline machine-translation sailfishos tts speech-synthesis speech-recognition speech-to-text nmt linux-desktop stt asr flatpak-applications

Updated Jun 11, 2024
C++

aws-solutions / content-localization-on-aws

Automatically generate multi-language subtitles using AWS AI/ML services. Machine-generated subtitles can be edited to improve accuracy, and downstream tracks are automatically regenerated based on the edits. Built on Media Insights Engine (https://github.com/awslabs/aws-media-insights-engine)

audio nlp video localization vod media localisation captions subtitles speech-to-text amazon-polly nlp-machine-learning content-analysis mie video-on-demand amazon-comprehend amazon-translate amazon-transcribe aws-media-insights-engine

Updated Jun 11, 2024
Vue

baharudin-yusup / salingsapa

A video call app that enables deaf people to communicate with hearing people using sign language recognition and speech-to-text

android ios text-to-speech firebase webrtc clean-architecture speech-to-text flutter bloc agora sign-language-recognition tensorflow-lite codemagic

Updated Jun 11, 2024
Dart

leon-ai / leon

🧠 Leon is your open-source personal assistant.

Updated Jun 11, 2024
Python
