Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Updated Sep 23, 2024 - Python
A project that lets you use a microphone with OpenAI's Whisper.
Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, AI images, webcam, recorder, voice control, in under 4 GiB of VRAM.
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
whisper.cpp bindings for Python
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
A working Speech to Speech AI assistant that can interact with you, manage your system, and more!
This repository provides a Flask app that processes voice messages recorded through Twilio or Twilio Studio, transcribes them using OpenAI's Whisper ASR, generates responses with GPT-3.5, and sends the replies as SMS using Twilio.
Drop-in replacement for OpenAI's Whisper API that exposes the same API but runs locally
YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023
Discord bot that downloads and transcribes Twitter Spaces audio files
Live translation tool that uses OpenAI's Whisper model for real-time audio transcription and translation into the language of your choice, using your own OpenAI API key (BYOK).
YouTube video summarization app built with open-source LLMs and frameworks such as Llama 2, Haystack, Whisper, and Streamlit. The app runs smoothly on CPU because the Llama 2 model is in GGUF format and is loaded through llama.cpp.
A simple UI tool written in Python, for recording audio from a microphone and automatically transcribing the recording using OpenAI's Whisper model via OpenAI's API.
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications.
Home Manager GPT is a text-to-speech ChatGPT assistant that can be used to control your entire house. Ask questions aloud and get spoken answers, with Google speech recognition.
Asynchronously transcribe audio files split into chunks in parallel and intelligently join results, yielding nearly identical transcriptions to full audio transcriptions but in a fraction of the time.
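The chunk-and-join idea described above can be sketched roughly as follows. This is a minimal illustration, not that repository's code: `split_with_overlap`, `join_transcripts`, `fake_transcribe`, and `transcribe_parallel` are hypothetical names, a list of words stands in for audio samples, and the "transcriber" is a stub that simply echoes its chunk. The join step assumes neighbouring chunks overlap and drops the longest matching run of words at each seam, which could over-merge if the speech itself repeats a phrase at a boundary.

```python
import asyncio


def split_with_overlap(samples, chunk_size, overlap):
    """Split a sequence into chunks; consecutive chunks share `overlap` items."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(samples):
        chunks.append(samples[start:start + chunk_size])
        if start + chunk_size >= len(samples):
            break  # the last chunk already reaches the end of the input
        start += step
    return chunks


def join_transcripts(parts):
    """Concatenate word lists, dropping the longest suffix/prefix overlap at each seam."""
    merged = []
    for words in parts:
        duplicated = 0
        for n in range(min(len(merged), len(words)), 0, -1):
            if merged[-n:] == words[:n]:
                duplicated = n
                break
        merged.extend(words[duplicated:])
    return merged


async def fake_transcribe(chunk):
    """Stub standing in for a real speech-to-text call (assumption: it is awaitable)."""
    await asyncio.sleep(0)  # yield control, as a network or worker call would
    return list(chunk)


async def transcribe_parallel(samples, chunk_size=5, overlap=2):
    """Transcribe all chunks concurrently, then stitch the results in order."""
    chunks = split_with_overlap(samples, chunk_size, overlap)
    # asyncio.gather returns results in the order the awaitables were passed in.
    parts = await asyncio.gather(*(fake_transcribe(c) for c in chunks))
    return join_transcripts(parts)


if __name__ == "__main__":
    words = "the quick brown fox jumps over the lazy dog again".split()
    print(" ".join(asyncio.run(transcribe_parallel(words))))
```

With a real transcriber, `fake_transcribe` would be replaced by a call that sends one audio chunk to a model or API; the splitting and joining logic would stay the same in outline.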
PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.
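As an illustration of the audio-conversion step mentioned above, the sketch below builds an `ffmpeg` command that converts an input file to 16 kHz mono 16-bit PCM WAV, a format commonly fed to Whisper-style models. It is an assumption about how such a tool might work, not PyScribe's actual code: `build_ffmpeg_command` and `convert_to_wav` are hypothetical helper names, the output naming scheme is invented for the example, and the transcription step is omitted.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(src, dst):
    """Return an ffmpeg argument list that converts src to 16 kHz mono PCM WAV."""
    return [
        "ffmpeg",
        "-y",                 # overwrite the output file if it already exists
        "-i", str(src),       # input file
        "-ar", "16000",       # resample to 16 kHz
        "-ac", "1",           # downmix to a single channel
        "-c:a", "pcm_s16le",  # 16-bit little-endian PCM audio
        str(dst),
    ]


def convert_to_wav(src):
    """Convert src next to itself as '<stem>.16k.wav' (hypothetical naming) and return the path."""
    src = Path(src)
    dst = src.with_name(src.stem + ".16k.wav")
    # Requires ffmpeg on PATH; check=True raises CalledProcessError on failure.
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
    return dst
```

Keeping command construction separate from execution makes the conversion settings easy to inspect or test without invoking `ffmpeg`.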