# speech-processing

Here are 576 public repositories matching this topic...

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jul 16, 2024
Python

clement-pages / gryannote

Provides Gradio custom components that make diarization-based audio labeling easier and faster.

audio annotation-processing gradio speech-processing annotation-tool speaker-diarization pyannote gradio-custom-component

Updated Jul 16, 2024
Svelte

r9y9 / pysptk

A Python wrapper for the Speech Signal Processing Toolkit (SPTK).

python dsp speech speech-synthesis python-wrapper digital-signal-processing speech-processing sptk

Updated Jul 16, 2024
Python

midas-research / audino

Open source audio annotation tool for humans

python machine-learning datasets speech-processing audio-processing annotation-tool audio-annotation

Updated Jul 16, 2024
JavaScript

DigitalPhonetics / IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated Jul 16, 2024
Python

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Jul 15, 2024
Python

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Jul 14, 2024
Jupyter Notebook

gtreshchev / RuntimeSpeechRecognizer

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on OpenAI's Whisper technology and whisper.cpp.

voice-recognition speech-recognition openai unreal-engine ue4 speech-to-text whisper speech-processing audio-processing unreal-engine-4 ue4-plugin speech-detection whis ue5 unreal-engine-5 ue5-plugin whisper-cpp whisper-ai

Updated Jul 13, 2024
C++

Yuan-ManX / audio-development-tools

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

audio music machine-learning deep-learning signal-processing dsp speech artificial-intelligence speech-synthesis music-generation speech-processing audio-processing audio-generation

Updated Jul 13, 2024

zhitko / speech-rate-meter

The Speech Rate Meter (hereinafter SRM) software module is designed to measure a complex of characteristics of the tempo (rate) of oral speech.

qml speech qt5 speech-processing intonation

Updated Jul 11, 2024
QML

zhitko / singer-voice-tester

Singer Voice Tester is software for testing singer skills

qml speech qt5 speech-processing singi

Updated Jul 11, 2024
QML

MontrealCorpusTools / PolyglotDB

Language data store and linguistic query API

database influxdb neo4j rest-api speech-processing acoustics speech-analysis

Updated Jul 11, 2024
Python

Aadit3003 / s2st-cascading-e2e

A comparison of E2E and Cascading S2ST systems on the CVSS-C Spanish to English dataset (CommonVoice 4.0)

nlp text-to-speech translation meteor comet speech-to-text speech-processing asr bleu-score speech-to-speech cascaded-speech-translation end-to-end-speech-translation speech-to-speech-translation

Updated Jul 11, 2024
Python

chimechallenge / chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

speech-recognition automatic-speech-recognition speech-processing speech-separation speech-enhancement far-field-speech-recognition diarization multi-speaker-asr meeting-transcription

Updated Jul 10, 2024
Python

EveryVoiceTTS / EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

python text-to-speech speech pytorch tts speech-synthesis speech-processing language-revitalization low-resource-languages pytorch-lightning

Updated Jul 16, 2024
Python

nafiuny / ICRCycleGAN-VC

ICRCycleGAN-VC: non-parallel voice conversion based on CycleGAN and the Inception-ResNet module, by Afiuny

audio python deep-learning signal-processing voice speech speech-synthesis gan speech-recognition persian song inception voice-conversion speech-processing inception-resnet audio-processing cyclegan non-parallel pytorch-implementation

Updated Jul 10, 2024
Python

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

machine-learning awesome deep-learning speech-recognition awesome-list speech-processing speaker-diarization

Updated Jul 8, 2024

raj-sutariya / indic-num2words

Python library for converting numbers to words for all Indian languages.

python nlp preprocessing speech-processing indic indian-languages

Updated Jul 13, 2024
Python

vocalpy / vak

A neural network framework for researchers studying acoustic communication

python torch python3 pytorch birdsong speech-processing torchvision bioacoustics animal-communication bioacoustic-analysis vocalizations spectrograms animal-vocalizations

Updated Jul 6, 2024
Python

veeravignesh1 / veeravignesh1.github.io

Professional site made with Quarto

nlp speech-processing audio-ml

Updated Jul 6, 2024
SCSS
