A PyTorch-based Speech Toolkit
Updated Jul 16, 2024 - Python
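This project uses several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports several data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank.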
This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. The project also supports the MelSpectrogram and Spectrogram data preprocessing methods.
A simple Python-based virtual voice assistant that can take and execute commands. It can also recognize multiple users at a time.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Captain's log and 3D star map for Elite Dangerous
Casual chess app with strong engine
This project combines a chat bot with voice recognition capabilities to create an interactive Q&A system. The chat bot can recognize speech input, find the best match for a given question from a knowledge base, and provide an appropriate response. Additionally, it can learn new responses from the user.
Tracker presents: a casual standalone version of the Super Metroid & A Link to the Past Crossover Randomizer with built-in automatic item tracking and Twitch integration
A Rust voice assistant made to be easy to set up, customize, and learn from.
🗣 Discord voice-chat speech recognition
Example of an API that uses Google libraries to transform audio into text and text into audio, in an easy and simple way.
On-device streaming speech-to-text engine powered by deep learning
On-device voice assistant platform powered by deep learning
On-device speech-to-text engine powered by deep learning
A machine learning model integrated into a web app that classifies bird species based on their sound
Chrome/Edge BROWSER EXTENSION that can RECOGNIZE any live audio/video streaming then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE!
HTML Web template that can RECOGNIZE any live audio/video streaming (using Chrome webkitSpeechRecognition API) then TRANSLATE it for FREE (using unofficial online Google Translate API) then display it as LIVE CAPTION / LIVE SUBTITLE
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Kibernikto is an app to easily run telegram bots connected to AI models.