Transcri agents
This page lists every AI agent in the MeshKore directory tagged with the Transcri capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile with details on capabilities, framework, language, freshness, and source attribution.
243 agents in this capability · ranked by popularity
Top 200 Transcri agents
Faster Whisper transcription with CTranslate2
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili…
Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
faster_whisper GUI with PySide6
World's first AI meeting copilot → The Invisible Companion for Work + Life
Voice-to-text dictation app with local (Nvidia Parakeet/Whisper) and cloud models (BYOK). Privacy-first and…
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time…
Using OpenAI's Whisper to automatically generate YouTube subtitles
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Natively - Free open-source AI interview copilot & meeting assistant. The best Cluely alternative, Final…
Generate subtitles, summaries, and chapters from videos in seconds
React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in
🎤 The easiest way to transcribe audio in Swift
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for…
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Real-time transcription using faster-whisper
Effortlessly add AI-generated transcription subtitles to your videos
Make your meetings accessible to AI Agents
An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and…
Evolvable, distributed agent framework & harness for data science.
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs…
How to use OpenAI's Whisper to transcribe and diarize audio files
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
An API to transcribe audio with OpenAI's Whisper Large v3!
Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and…
Discord AI Chatbot using DialoGPT, trained on the game transcript of The World Ends With You
AI-powered tool for real-time interview question transcription and response generation.
TranscriberBot for Telegram
Talk to ChatGPT in real time using LiveKit
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered…
Open-source AI meeting copilot - real-time transcription, echo cancellation, and AI assistance. Captures…
Transcribe is a real-time transcription, conversation, and language-learning platform. It provides live…
Turn meetings into live agent loops. Record, transcribe, and analyze meetings with real-time AI intelligence…
Music Analysis, Chord Recognition, Beat Tracking, Guitar Diagrams, Piano Visualizer, Lyrics Transcription…
🎙️ AI generated subtitles and segmented chapters for podcasts
A powerful Whisper AI keyboard for reliable speech transcription
A quick experiment to achieve almost realtime transcription using Whisper.
A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for…
Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust
OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
This repository contains a Python script that allows users to download the audio from a YouTube video…
A Personal Tool for Transcribing & Translating My Vlogs into Japanese
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly…
Multi-agent LLM driven cell type annotation for single-cell RNA-Seq data
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
Short code for dictation using OpenAI Whisper for transcription.
Conversational & memory-enabled AI research partner for multi-omics analysis. From biological idea to full…
A free MCP server to analyze and extract insights from public filings, earnings transcripts, financial…
Voice-to-text CLI for terminal users
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
YouTube Transcript API skills for AI agents. Get transcripts, search videos, browse channels. Works with…
Realtime Interview Copilot is a web application that assists users in crafting responses during interviews…
Meeper 📝 - is your secretary for any in-browser conference.
A sample speech transcription app implementing OpenAI Speech to Text API based on Whisper, an automatic…
Fast audio/video transcription using OpenAI's Whisper and Modal; an hour-long audio/video file can be transcribed in…
SemantiClip is an intelligent video processing application that transforms video content into rich…
Callytics is an advanced call analytics solution that leverages speech recognition and large language models…
OpenAI API and Whisper based Video Translation
Real-time speech recognition & AI-powered note-taking app for macOS with offline/online modes, multilingual…
Fast transcript search for humans & agents. Supports Claude Code, Codex CLI & OpenCode
Production-ready audio and video transcription app that can run on your laptop or in the cloud.
Record, transcribe, and transform voice notes into structured insights. Leverage Whisper or AssemblyAI and…
A Python function package that calculates the character error rate (CER) and word error rate (WER) of output scripts from Korean-sentence STT recognizers
Talk to your second brain personal assistant using speech 🧠
A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that…
Automatically subtitle any video spoken in any language to a language of your choice using AI.
Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other…
Real-time translation copilot for your browser
The AI Powered Speech Analytics for Amazon Connect solution provides the combination of speech to text…
OpenAI/ChatGPT library for Java - Requires JDK 11 at minimum.
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
A curated collection of tools to aid transcriptionists and subtitlers.
Open-source free alternative to Cluely & Parakeet AI — real-time AI interview copilot with live…
Just an .exe that can be used for those unable to build whisper.cpp in Windows.
STAgent is a multimodal LLM-based AI agent that enables deep research about spatial transcriptomics data
A cutting-edge AI SaaS platform that enables users to create, discover, and enjoy podcasts with advanced…
Streamlit Audio Transcription with OpenAI's Whisper AI: An interactive Streamlit app demonstrating real-time…
An OpenAI Whisper-based full-stack project to transcribe audio and video files using React & Django.
🎬 AI-powered localhost subtitle generator for hearing-impaired users. Automatic speech recognition using…
A bentoML-powered API to transcribe audio and make sense of it
AI agent skill: read Lark meeting transcripts, extract action items, and actually get them done
This is a fun Python project that allows you to chat with a chatbot about the PDF you uploaded and generate…
Learning chatbot that can automatically fetch lecture transcript
Multi-agent LLM driven cell type annotation for single-cell RNA-Seq data
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
MOM AI transcribes audio into a meeting summary and generates minutes of the meeting. Built using Langchain, OpenAI…
macOS menu bar app providing a local HTTP server compatible with the OpenAI Whisper API for fast and private…
Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.
Record audio from a meeting, then transcribe, conclude and send the conclusion and a piece of advice to Slack
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via…
Cross-platform Electron app for simultaneously streaming & recording microphone and speaker audio
YouTube video summarization using Whisper audio transcription and GPT-based summaries.
A minimalistic web app to generate transcriptions for audio, built using Python
A simple matrix bot that transcribes your voice to text message
Real-time transcription, AI-driven answer suggestions, and interview simulation using Next.js, React, Azure…
Three Claude production tiers generated functional exploit code against live infrastructure when…
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes
Streamline your note-taking with ChatGPT's AI expertise and Whisper's precise transcription, enabling fast…
🧬 Analyze spatial transcriptomics data through natural language conversation. Stop writing code, start having…
Code for the OpenCV demo of a recipe transcription OCR agent.
VOXRAD is a voice transcription application for radiologists leveraging locally deployed ASR and LLM models.
D-PC Messenger, a decentralized, Privacy-First Infrastructure for Human-AI-Team Collaboration
Speakscribe is a web application that allows users to transcribe audio using OpenAI and also interact with a…
Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation…
🎙️ Fast CLI tool to transcribe audio/video files to SRT format using OpenAI Whisper API
Transcription from mp3 files to html with or without embedded player
A stand-alone application with GUI for OpenAI's Whisper
Interview Amigo is an AI-powered SaaS platform designed to help users enhance their job interview skills…
A beautiful, native macOS desktop application for transcribing audio and video files using whisper.cpp
Shell scripts for automated transcription on macOS: Integrates whisper.cpp with QuickTime Player and…
Enterprise-grade browser extension bringing multilingual voice interaction to AI chatbots (Pi, Claude…
An open source recorder integrating OpenAI Whisper and ChatGPT.
Video URL transcriber and translator using AI. Download from Youtube and translate automatically by adding…
Summarization web service via the use of OpenAI Whisper and GPT-3 models
Whisper.cpp with diarization
Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the…
🤖 A WhatsApp bot to transcribe and summarize audio messages.
End-to-end AI-powered video call application where you implement real-time calls with customized AI agents…
Generate summaries of Udemy video transcripts using the OpenAI API
Smart assistant in Telegram bot format for transcribing online meetings
format whisper transcripts to .srt
Audio transcription UI for OpenAI Whisper, GPT4o Transcribe and AssemblyAI APIs
Real-time conversation assistant with dual audio transcription and GPT-powered responses, perfect for…
This repository houses a Python application for extracting YouTube video transcripts and summarizing its…
The Advanced Interview Responder uses AI to generate tailored responses from a user’s resume and real-time…
When an audio message is received, the bot downloads the audio file, converts it to a numpy array, loads the…
One memory, three terminals. Shared memory layer for Claude Code, Codex, and Gemini CLI — hybrid retrieval…
Give your AI coding agent eyes and ears. Screen + voice capture → structured Markdown. MCP server, CLI, and…
Voice-to-text with MCP support. System-wide dictation (hold fn) and AI agent mode (hold fn+ctrl) that…
A web-based application enabling users to interact with and extract insights from YouTube video transcripts…
Precision Medicine MCP Platform: A set of bioinformatics servers + tools - production multiomics/genomics +…
Platypus is one app you need to organize your data. Note-taker, meeting transcriber and knowledge management…
An end-to-end AI agent project that transcribes audio files, embeds user queries, and searches in Qdrant and…
A full-stack application that allows practitioners to record voice notes and also export them to Google…
Chrome extension to copy YouTube transcripts with AI-friendly features
Experimental voice user interface (VUI) to interact with an agentic AI assistant
AI tool that turns meeting transcripts into Jira tickets. Claude analyzes your meetings, checks your codebase…
SpiralSafe is a self-maintaining, inherently coherent reposystem | Documentation, code, and physical hardware…
AI YouTube Video Chat application, to ask questions to a YouTube video bot and get answers.
🤖🎙️ Explore Lex Fridman Podcast Transcripts with a smart chatbot!
Discord AI bot capable of chatting and moderating, trained on conversation transcripts of Elon Musk
A sample Nuxt 3 application that listens to chatter in the background and transcribes it using the powerful…
FastAPI + Whisper + Ollama: Audio transcription and LLM processing API. Convert speech to text with OpenAI…
A platform to enhance medical e-Shadowing.
This repository guides you through automatically generating YouTube transcriptions using OpenAI's…
▶️ Video Fact Finder for YouTube, using CrewAI agents and Perplexity to verify facts.
Educational CLI for transcription with OpenAI Whisper
🎙️ Lightweight macOS menubar app for voice-to-text dictation using OpenAI Whisper API. Hold-to-record with Fn…
Transcribe YouTube videos, extract topics, and answer questions interactively.
Faster Whisper with Speaker Diarization
Privacy-first macOS transcription app with global hotkey recording. 100% on-device transcription and AI…
PyTranscriptorAi - Transcribe videos to text with AI and add subtitles - OpenAI
Japanese meeting transcription & minutes generation app with local ASR (Kotoba Whisper) + LLM (Ollama) + RAG…
Voice calling plugin for OpenClaw — give your AI agent a phone number. Inbound/outbound calls, batch calling…
Generate a WhatsApp-style HTML page from an exported chat, with support for images, videos, audio, PDFs, and…
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
Self-hosted Japanese immersion player — clickable subs, Whisper transcription & Anki export
Demo Multilingual Near Real-Time Transcriber
Generate audio transcripts and summaries by using OpenAI.
This project is an advanced WhatsApp bot that leverages artificial intelligence for automated audio…
Automatic description of voice messages in private Telegram conversations
🧪⚡ Lightweight, hackable multi-agent orchestration lab. YAML configs, CLI + Python API, 10 presets…
🎧💡 EchoSummarize: A YouTube video summarizer using the Phi-3.5-mini LLM, providing fast and accurate…
Intelligent FFMPEG agent node for ComfyUI - transforms natural language video editing prompts into automated…
Chatbot with voice transcription and Homeassistant integration
medScribe transcribes patient clinical reports dictated by medical personnel to text, which is then cleaned and…
Audio transcriber using OpenAI's Whisper ML model, deployed to Banana.dev
A macOS menu bar app that turns speech into refined, ready-to-send text anywhere you type — powered by local…
Golang RAG/LLM framework with Memory and Transcriber - All-in-One Platform
🕷️ Open-source web crawling toolkit — Video, OCR, NLP, Stealth, 10+ parsers
AI agent skill: tell Claude Code, Codex, Gemini or OpenClaw to upload your recording to YouTube — it…
An OpenClaw skill that uses faster-whisper (a faster implementation of the Whisper transcription model) to…
Batch-process meeting transcripts from Obsidian vault into structured summaries with knowledge graph updates…
The first Minecraft AI that doesn't just talk—it lives in your world. High-performance Gemini-driven…
AI-based YouTube summarizer with chat, notes, and chapter generation using LangChain + MERN.
✨ Instant timestamped chapters for your YouTube videos with just one click! ✨
This is a YouTube Q&A Chatbot powered by a Large Language Model (LLM) and FastAPI. Users can enter a YouTube…
An intelligent audio analysis tool that automatically transcribes and enables semantic search of audio…
AI-powered YouTube video summarizer using GPT-4/Gemini - Extract transcripts and generate concise summaries…
Open-source web dashboard for Vexa – manage meeting transcriptions, view real-time transcripts, and chat with…
CLI agent that analyses meeting transcripts using Recursive Language Models (RLMs) to extract decisions…
Turn your Android phone into an MCP (Model Context Protocol) server. AI agents and desktop scripts can call…
MCP server for semantic search through Apple developer documentation, WWDC transcripts, and code examples…
An app that summarises and answers questions about arbitrary YouTube videos using LangChain and LLMs
End-to-end AI-powered video call application where you implement real-time calls with customized AI agents…
A bot that can join voice channels using the OpenAI API and Microsoft's free Text-to-Speech (TTS) services…
This repository contains a chat bot for WhatsApp that can be used to send messages, create stickers, archive…
An LLM-based CLI tool to summarize Bilibili videos via subtitles or audio transcription…
A FastAPI based chatbot server using OpenAI to respond to textual and audio queries.
Turn any audio into text and any text into speech — menu bar app with system/app audio capture, live…
A Python tool that transcribes video and audio files to text using Whisper, with ChatGPT-powered summarization
A PowerShell script that automatically generates subtitles in bulk for video files using whisper-ctranslate2.
Transcriptr is a modern web application that converts audio files to text using artificial intelligence. It…
Open-source AI-powered platform for analyzing customer discovery calls. Extract pain points, feature…
ChatGPT API-based video game audio translator application
A simple UI tool written in Python, for recording audio from a microphone and automatically transcribing the…
A powerful Video RAG system that enables users to upload videos, automatically builds searchable indexes from…