acoustic-model

Here are 42 public repositories matching this topic...

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Updated Oct 19, 2023

openvpi / DiffSinger

Star

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

midi diffusion svs acoustic-model singing-voice pitch-prediction singing-voice-synthesis rectified-flow melody-frontend diffussion-model

Updated Jul 15, 2024
Python

MontrealCorpusTools / Montreal-Forced-Aligner

Star

Command line utility for forced alignment using Kaldi

python kaldi pronunciation-dictionary forced-alignment grapheme-to-phone acoustic-model

Updated Jul 16, 2024
Python

My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.

python-library speech-analysis praatscript acoustic-model voice-analysis

Updated Aug 31, 2021
Python

Shahabks / myprosody

Star

A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.

python-library voice-recognition prosody phonemes speech-analysis acoustic-model acoustic-features speech-patterns

Updated Nov 28, 2022
Python

guanlongzhao / fac-via-ppg

Star

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

speech-synthesis acoustic-model accent-conversion

Updated Jul 6, 2023
Python

cvqluu / Factorized-TDNN

Star

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

neural-network pytorch speech-recognition neural-networks kaldi speaker-recognition speaker-verification embedding speaker-diarization tdnn acoustic-model acoustic-models x-vector tdnn-f factorized-tdnn

Updated Jan 6, 2020
Python

aluo-x / Learning_Neural_Acoustic_Fields

Star

Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)

pytorch impulse-response spatial-audio acoustics 3d-audio reverberation acoustic-model acoustic-models neural-fields implicit-functions neural-field spatial-audio-reproduction

Updated Jan 20, 2024
Python

HumBug-Mosquito / HumBugDB

Star

Acoustic mosquito detection code with Bayesian Neural Networks

audio pytorch feature-extraction keras-tensorflow bayesian-neural-networks acoustic-model acoustic-features

Updated Oct 4, 2021
Jupyter Notebook

zhaoyu611 / Automatic_Speech_Recognition_with_Multi_Models

Star

A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.

deep-learning tensorflow lstm rnn automatic-speech-recognition ctc timit acoustic-model

Updated Jan 18, 2018
Python

nemoramo / acoustic_model

Star

This is a sub-repository in building to create acoustic model in Mandarin speech recognition.

deep-neural-networks pytorch asr ctc-loss acoustic-model horovod

Updated Oct 24, 2019
Python

X-LANCE / UniCATS-CTX-txt2vec

Star

[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS

text-to-speech tts speech-synthesis acoustic-model unicats vq-diffusion ctx-txt2vec

Updated Feb 23, 2024
Python

mozilla / deepspeech-playbook

Star

A crash course for training speech recognition models using DeepSpeech.

speech-recognition language-model acoustic-model deepspeech common-voice

Updated May 16, 2021

hcy71o / SC-CNN

Star

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems

text-to-speech tts speech-synthesis zero-shot feature-extractor acoustic-model multi-speaker-tts

Updated Nov 1, 2023
Python

ronggong / jingjuSingingPhraseMatching

Star

Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information

score cnn-model phoneme singing-phrase acoustic-model hsmm

Updated Jul 9, 2017
Python

sooftware / End-to-End-Speech-Recognition-Models

Sponsor

Star

PyTorch implementation of automatic speech recognition models.

end-to-end pytorch transformer las vad e2e asr acoustic-model voice-activity-detection deepspeech2 listen-attend-and-spell

Updated Jan 10, 2021
Python

mntabassm / SAEN-LARS

Star

Sequential adaptive elastic net (SAEN) approach, complex-valued LARS solver for weighted Lasso/elastic-net problems, and sparsity (or model) order detection with an application to single-snapshot source localization.

adaptive-learning sparse-regression matlab-toolbox regularized-linear-regression elastic-net sparse-reconstruction lasso-regression source-localization acoustic-model regularization-paths direction-of-arrival sparse-regularization compressed-beamforming complex-valued-data solution-path

Updated Mar 5, 2020
MATLAB

jraleman / 42_Kift

Star

A voice user interface that recognizes the user's voice via the Sphinx library to execute some commands. The system responds with a computer generated voice and sound clips. Finally, there's a server for storing and reacting to the data, and a client for connecting to the system.

voice-recognition kevin acoustic-model ecole42 kift

Updated Aug 29, 2017
C

nvmoyar / aind2-speech-recognition

Star

Some approaches based on deep learning to build the acoustic model for an end-to-end automatic speech recognition (ASR) pipeline.

speech-recognition automatic-speech-recognition asr speech-recognizer acoustic-model librispeech asr-pipeline

Updated Apr 25, 2018
Jupyter Notebook

jim-schwoebel / sound_event_detection

Star

🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.

machine-learning acoustic-fingerprinting object-detection event-detection acoustics object-detection-pipelines audioset acoustic-model sound-event-detection acoustic-features object-detection-label common-voice common-voice-tool voice-computing object-detection-accuracy voicebook surveylex neurolex

Updated Feb 20, 2022
Python

Improve this page

Add a description, image, and links to the acoustic-model topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the acoustic-model topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

acoustic-model

Here are 42 public repositories matching this topic...

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

openvpi / DiffSinger

MontrealCorpusTools / Montreal-Forced-Aligner

Shahabks / my-voice-analysis

Shahabks / myprosody

guanlongzhao / fac-via-ppg

cvqluu / Factorized-TDNN

aluo-x / Learning_Neural_Acoustic_Fields

HumBug-Mosquito / HumBugDB

zhaoyu611 / Automatic_Speech_Recognition_with_Multi_Models

nemoramo / acoustic_model

X-LANCE / UniCATS-CTX-txt2vec

mozilla / deepspeech-playbook

hcy71o / SC-CNN

ronggong / jingjuSingingPhraseMatching

sooftware / End-to-End-Speech-Recognition-Models

mntabassm / SAEN-LARS

jraleman / 42_Kift

nvmoyar / aind2-speech-recognition

jim-schwoebel / sound_event_detection

Improve this page

Add this topic to your repo