capability
Whisper agents
This page lists every AI agent in the MeshKore directory tagged with the Whisper capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile with details on capabilities, framework, language, freshness, and source attribution.
439 agents in this capability · ranked by popularity
Top 200 Whisper agents
Faster Whisper transcription with CTranslate2
OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
🎒 飞书 ×(GPT-4 + GPT-4V + DALL·E-3 + Whisper)= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀
Low-latency AI engine for mobile devices & wearables
Mac app for crushing tech interviews with AI
A nearly-live implementation of OpenAI's Whisper.
「妙幕」是一款跨平台客户端工具,可以批量为视频或者音频生成字幕文件,并支持对字幕进行翻译,支持百度、火山、openai、ollama、deepseek 等多家翻译
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
OpenAI API + Ruby! 🤖❤️ GPT-5 & Realtime WebRTC compatible!
Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
.NET library for the OpenAI service API by Betalgo Ranul
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
faster_whisper GUI with PySide6
Voice-to-text dictation app with local (Nvidia Parakeet/Whisper) and cloud models (BYOK). Privacy-first and…
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Text-To-Speech, RAG, and LLMs. All local!
OpenAI API client for Kotlin with multiplatform and coroutines capabilities.
Using OpenAI's Whisper to automatically generate YouTube subtitles
The TypeScript library for building AI applications.
Whisper command line client compatible with original OpenAI client based on CTranslate2.
AI Vtuber for Streaming on Youtube/Twitch
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
Natural (2-way) voice conversations with Claude Code
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Proxy API gateway for Kiro IDE & CLI (Amazon Q Developer / AWS CodeWhisperer). Use free Claude models with…
Generate subtitles, summaries, and chapters from videos in seconds
The open source wisprflow alternative
An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS…
React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in
🎤 The easiest way to transcribe audio in Swift
React Native binding of whisper.cpp.
Your CrewAI Powered Video Editing Assistant
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for…
Running speech to text model (whisper.cpp) in Unity3d on your local machine.
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Conversational voice AI agents
Real-time transcription using faster-whisper
Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
[CVPR 2025] Video Narration as Vocabulary & Video as Long Document
Effortlessly add AI-generated transcription subtitles to your videos
Mac compatible Ollama Voice
An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and…
End-to-end platform for building voice first multimodal agents
Your personal voice assistant based on OpenAI ChatGPT.
The best way to use AI is on your own computer. Use local or paid API models, and ctrl+k to show/hide the…
Java client library for OpenAI API.Full support for all OpenAI API models including Completions, Chat, Edits…
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight…
一款JavaSDK用于快速接入AI大模型应用,整合多平台大模型,如OpenAi、智谱Zhipu(ChatGLM)、深度求索DeepSeek、月之暗面Moonshot(Kimi)、腾讯混元Hunyuan、零一万物(01)等…
Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over…
How to use OpenAIs Whisper to transcribe and diarize audio files
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.
⚡ Edgen: Local, private GenAI server alternative to OpenAI. No GPU required. Run AI models locally: LLMs…
Program that lets you ask questions about your documents, audio, and video files.
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
An API to transcribe audio with OpenAI's Whisper Large v3!
Simple self-hosted web application, which can be used to convert audio to subtitles by OpenAI's Whisper model
🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT models 🤖💬 It also allows image…
Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
AI-powered tool for real-time interview question transcription and response generation.
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI…
OpenAI (and DeepSeek, Azure OpenAI, YandexGPT, Ollama, GigaChat, Qwen) API wrapper for Delphi. Use ChatGPT…
Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
CREDITS SEQUENCE NEWSPAPER HEADLINE MONTAGE: HEADLINES flash before us…
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered…
Transcribe is a real time transcription, conversation, Language learning platform. It provides live…
She's the AI agent you come home to.
The definitive, open-source Swift framework for interfacing with generative AI.
Generador de logotipos de eSports por IA (con fines académicos durante el evento Tenerife GG)
Open-source, fully private and local alternative to NotebookLM. Chat with your documents, generate audio…
NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.
Full stack voice chatbot
A powerful Whisper AI keyboard for reliable speech transcription
A quick experiment to achieve almost realtime transcription using Whisper.
A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for…
According to all known laws of aviation, there is no way that a bee should be able to fly. Its wings are too…
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak…
Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust
OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
Unleash the power of Chatty: the intersection of ChatGPT’s intelligence, DALL·E's creativity, and Whisper's…
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as…
Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using…
This repository contains a Python script that allows users to download the audio from a YouTube video…
A voice chatbot based on GPT4All and talkGPT, running on your local pc!
Automatically generate viral-ready vertical short clips from long-form gameplay footage using AI-powered…
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly…
Input a YouTube video link or upload a video file and get a video with subtitles.
.NET 7 SDK for OpenAI with a Blazor Server playground
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
Short code for dictation using OpenAI Whisper for transcription.
OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example
An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and input the recognized text…
OpenAi-Sora (SoraFlows) is an open-source, cross-platform web application for AI-powered video creation and…
Push to talk voice recognition using Whisper
Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB
🛡 Установщик разблокировщика зарубежных AI-сервисов (и не только) для России на Windows 10/11 🌍
A curated list of awesome OpenAI's Whisper
Voice-to-text CLI for terminal users
"Chat With Any Video" project in 24 hours, challenge myself to complete in @Supabase's AI Hackathon.
Video2Text - Easily convert your video to text
Speech o Text using docker image with ggerganov/whisper.cpp
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
True on-device AI for Kotlin Multiplatform (Android, iOS, Desktop, JVM, WASM). LLM, Speech-to-Text and Image…
🎥 Youtube Video Summarizer and Question Answering App Using Whisper and Langchain
Next.js app for serverless deployments of OpenAI Whisper on Banana.dev
A simple light-weight library that wraps the Open AI API.
🦞 Open-source browser-based voice chat for AI assistants. Self-hosted, private, free. Whisper STT +…
Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate…
Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.
HACS custom integration for using Whisper speech-to-text (OpenAI, GroqCloud or Mistral) API in the Assist…
openai/whisper + extra features
Shell wrapper for OpenAI's ChatGPT, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Anthropic, and more.
Batch convert video to text using openai's whisper or the local coreML via whisper.cpp on your MacBook
Supercharged Claude Code Official Telegram plugin — threading, voice messages 2 ways, stickers, GIFs…
Meeper 📝 - is your secretary for any in-browser conference.
Create subtitles with ease, using Whisper AI for Windows
A sample speech transcription app implementing OpenAI Text to Speech API based on Whisper, an automatic…
Fast Audio/Video transcribe using Openai's Whisper and Modal, an hour audio/video file can be transcribed in…
On-device AI SDK for Flutter — LLM inference, vision, STT, TTS, image generation, embeddings, RAG, and…
100% free, local & offline voice assistant with speech recognition
Open source, local first AI medical scribe for desktop and web.
Chrome extension for voice-to-text conversations with ChatGPT using OpenAI Whisper API
A python package for whisper normalizer
Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation
OpenAI API and Whisper based Video Translation
Real-time speech recognition & AI-powered note-taking app for macOS with offline/online modes, multilingual…
Unofficial Deno wrapper for the Open Ai api
A very simple whsper Python FastAPI for OpenAI API, Android voice-typing (konele), Home Assistant (wyoming)…
SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and…
Production-ready audio and video transcription app that can run on your laptop or in the cloud.
Record, transcribe, and transform voice notes into structured insights. Leverage Whisper or AssemblyAI and…
Talk to your second brain personal assistant using speech 🧠
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the…
Web app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4)…
A go client and cli for the openai APIs, focused on developer friendliness and convenience atop the basic…
Simple GUI around whisper.cpp for voice-to-text on Linux
A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that…
Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit
Automatically subtitle any video spoken in any language to a language of your choice using AI.
Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other…
An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging…
OpenAI 공식 Document, Cookbook, 그 밖의 실용 예제를 바탕으로 작성한 한국어 튜토리얼입니다. 본 튜토리얼을 통해 Python OpenAI API 를 더 쉽고 효과적으로…
13 projects using ChatGPT API, Whisper, Embeddings, and DALL-E with Python.
An intellligent AI assistant that can do anything!
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
A curated collection of tools to aid transcriptionists and subtitlers.
This is a collection of AI ecosystem, which gathers and organizes various interesting and useful AI-related…
Just an .exe that can be used for those unable to build whisper.cpp in Windows.
Simple RAG tutorials that can be run locally or using Google Colab (only Pro version).
重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。
Blazor Server playground for OpenAI using Cledev.OpenAI .NET library
Streamlit Audio Transcription with OPENAI's Whisper Ai: An interactive Streamlit app demonstrating real-time…
Generate captions for videos using the power of OpenAI's Whisper API
This repository hosts a collection of custom web applications powered by OpenAI's GPT models (incl. o1…
📚 220+ 份 AI/LLM 公开课中文讲义 PDF | Stanford CS336·CS224R·CS25·CS231N | Berkeley LLM Agents | 五道口纳什全系列 | Whisper 转录…
Whispers in the Machine: Confidentiality in Agentic Systems
An OpenAI's Whisper-based full-stack project to transcribe audio and video files using React & Django.
An application designed to condense lengthy videos into concise, informative clips. Ideal for editors who…
基于各种LLM的聊天机器人框架,支持多语言,语音唤醒,语音对话,本地执行功能,支持 OpenAI,Grok, Claude,讯飞星火,Stable Diffusion,ChatGLM,通义千问,腾讯混元,360…
Developed a sophisticated machine learning model capable of generating diverse interview questions aligned…
A Whisper + ChatGPT MagicMirror Module.
🎬 AI-powered localhost subtitle generator for hearing-impaired users. Automatic speech recognition using…
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper
A bentoML-powered API to transcribe audio and make sense of it
Automatic subtitles for DaVinci Resolve with OpenAI Whisper
Your private AI companion that lives on your wrist. Complete local AI assistant with emotional intelligence.
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local LLMs (via Ollama), speech-to-text…
fine-tune Whipser model for Taiwanese speech recognition
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
This repository will guide you to create your Images via Stable Diffusion using a Smart Virtual Assistant…
AI-powered code generator and automation tool
MOM AI transcribes audio into meeting summary and generate minutes of meeting. Built using Langchain, OpenAI…
macOS menu bar app providing a local HTTP server compatible with the OpenAI Whisper API for fast and private…
A fully local, open-source voice-to-text tool that acts as a system-wide AI dictation layer, converting…
Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.
Sky LiveKit Agent Perplexica is a local, free solution integrating LiveKit with advanced internet search. It…
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via…
Cross-platform Electron app for simultaneously streaming & recording microphone and speaker audio
YouTube video summarization using Whisper audio transcription and GPT-based summaries.
🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.
An completely Free & Unlimited unofficial Python SDK for the OpenAI API, providing seamless integration and…
A minimalistic web app to generate transciption for audio built using Python
Leverage modern open-source tools to create better web scraping workflows.
OpenAI Whisper in Home Assistant via the OpenAI API for use in the Assist pipeline
Native and Private ML inference engine, embeddings, classification, reranking, search, and text generation…
YATSEE - Yet Another Tool for Speech Extraction & Enrichment
Local Video RAG Engine. A FastAPI microservice for video understanding: Scene Detection + Whisper ASR +…
Multi-agent TTS production harness: Fish TTS + WhisperX + Claude, with cross-episode memory and auto-fix loop
Podcast Summarizer with LLM Technology
A simple matrix bot that transcribes your voice to text message
Voice2voice ChatGPT Assistant built through OpenAI Whisper (speech2text) + OpenAI ChatGPT API + Google…
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the…
SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling…
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
Summarize audio/video files
Mrzaizai2k Stock Assistant Bot: Your all-in-one stock analysis companion. Calculate payback time, find…