Webui for using XTTS and for finetuning it
-
Updated
Jun 9, 2024 - Python
Webui for using XTTS and for finetuning it
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
End-to-end platform for building voice first multimodal agents
A simple FastAPI Server to run XTTSv2
Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with subtitles and more) using local models (XTTS, Silero or VoiceCraft), plus voice cloning, LLM pre-processing, RVC enhancement, and automatic evaluation
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
A User Interface for XTTS-2 Text-Based Voice Cloning with 10 seconds
Converts epub e-book files to mp3 audiobook files.
A command line utility to easily finetune XTTS models in a fully automated way. Developed for Pandrator.
This is an interface that will offline convert anything pdf document you give it into an interview between two people discussing it.
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.
Add a description, image, and links to the xtts topic page so that developers can more easily learn about it.
To associate your repository with the xtts topic, visit your repo's landing page and select "manage topics."