A multi-purpose, cat-themed web app created for college students, by college students.
-
Updated
Jul 16, 2024 - JavaScript
A multi-purpose, cat-themed web app created for college students, by college students.
A voice-operated emailing mobile application that allows you to compose and send email messages through voice commands.
End-to-End Speech Processing Toolkit
MARS5 speech model (TTS) from CAMB.AI
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Lidya is an AI assistant powerd by plugins
🍦 ChatTTS-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.
A web UI Project In order to learn the large language model. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.
Frontend of TradEmploi
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Free and simple to use app for augmentative and alternative communication (AAC) with offline support, flexible input methods and media access
Fully automated video maker using motion graphics and text-to-speech synthesis to turn newsletters into daily YouTube videos.
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
A generative speech model for daily dialogue.
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter
Converts text to speech in realtime
An advanced tool designed for creating automated news bulletins. It generates dynamic news scripts using OpenAI, and provides voice narration using either OpenAI or ElevenLabs text-to-speech services. The system supports seamless integration with AzuraCast and S3 for content storage and distribution, making it a versatile solution for broadcasters.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Add a description, image, and links to the text-to-speech topic page so that developers can more easily learn about it.
To associate your repository with the text-to-speech topic, visit your repo's landing page and select "manage topics."