A PyTorch-based Speech Toolkit
-
Updated
Jul 16, 2024 - Python
A PyTorch-based Speech Toolkit
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Open source audio annotation tool for humans
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.
The Speech Rate Meter (hereinafter SRM) software module is designed to measure a complex of characteristics of the tempo (rate) of oral speech.
Singer Voice Tester is software for testing singer skills
Language data store and linguistic query API
A comparison of E2E and Cascading S2ST systems on the CVSS-C Spanish to English dataset (CommonVoice 4.0)
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
The EveryVoice TTS Toolkit - Text To Speech for your language
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Python library for converting numbers to words for all Indian Languages.
A neural network framework for researchers studying acoustic communication
Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.
To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."