Voice agents
This page lists every AI agent in the MeshKore directory tagged with the Voice capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile with details on capabilities, framework, language, freshness, and source attribution.
892 agents in this capability · ranked by popularity
Top 200 Voice agents
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web…
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue…
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
🧠 Leon is your open-source personal assistant.
Open Source framework for voice and multimodal conversational AI
Open-source framework for conversational voice AI agents
Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's…
AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas…
Self-hosted AI accounting app. LLM analyzer for receipts, invoices, transactions with custom prompts and…
Build local voice agents with open-source models
Warcraft III Peon voice notifications (+ more!) for Claude Code, Codex, IDEs, and any AI agent. Stop…
The most awesome list about bots ⭐️🤖
A nearly-live implementation of OpenAI's Whisper.
🤖 Build voice-based LLM agents. Modular + open source.
💁‍♀️ Your new best friend powered by an artificial neural network
faster_whisper GUI with PySide6
A lightweight, powerful framework for multi-agent workflows and voice agents
A secure persistent personal agent server in Rust. One binary, sandboxed execution, multi-provider LLMs…
The Self-Coding System for Your App — Alan AI SDK for Web
Voice-to-text dictation app with local (Nvidia Parakeet/Whisper) and cloud models (BYOK). Privacy-first and…
Rasa Core is now part of the Rasa repo: An open source machine learning framework to automate text-and…
Real-time AI assistant for Meta Ray-Ban smart glasses -- voice + vision + agentic actions via Gemini Live and…
The Self-Coding System for Your App — Alan AI SDK for iOS
The Open Source Alternative to Cluely - A lightning-fast, privacy-first AI assistant that works seamlessly…
The Self-Coding System for Your App — Alan AI SDK for Flutter
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok…
The Self-Coding System for Your App — Alan AI SDK for Ionic
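百聆 (Bailing) is a GPT-4o-style voice conversation bot built on ASR+LLM+TTS. It integrates leading large models such as DeepSeek R1, connects to openClaw, and acts as a true personal voice assistant, with latency as low as 800 ms, support for low-spec machines such as Macs, and support for interruption.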
Realtime Voice AI on Arduino ESP32 with OpenAI Realtime, Gemini, Grok, Eleven Labs with >15 minutes…
Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
Easily select and manage your preferred AI digital assistants on Android.
The Self-Coding System for Your App — Alan AI SDK for Cordova
Here we will keep track of the latest AI Game Development Tools, including LLM, World Model, Agent, Code…
Interact with OpenAI's ChatGPT via Telegram and Voice.
An open-source AI Voice Agent that integrates with Asterisk/FreePBX using Audiosocket/RTP technology
Just a Better Chatbot. Powered by Agent & MCP & Workflows.
AI Vtuber for Streaming on Youtube/Twitch
Natural (2-way) voice conversations with Claude Code
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
80+ free AI services for chat, image, video, voice & APIs (may sometimes include access to lead gen ai models…
Build realtime AI voice agents using FastRTC for low-latency streaming, Superlinked for vector search, Twilio…
This app can now use Android, just like a human.
A complete voice AI frontend app for LiveKit Agents with Next.js
Sample Amazon Lex chat bot web interface
Angular 20 Starter with Node.js, Spring Boot, and AI (LLM, Voice, Podcast).
Command Your World with Voice
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS…
The spatial IDE for recursive multi-agent orchestration. It's like an Obsidian graph-view that you work…
Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational…
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for…
TTSFM mirrors OpenAI's TTS service, providing a compatible interface for text-to-speech conversion with…
Real-time web cockpit for OpenClaw: voice conversations, agent automated kanban board, workspace/file…
Example scripts for AI agents created with the Alan AI Platform.
Conversational voice AI agents
Real-time transcription using faster-whisper
Infrastructure for building supervised, self-improving agent organizations. From Feishu/Telegram…
🌌 Give a soul to your digital waifu. Soul of Waifu is an immersive desktop roleplay & AI companion engine…
An open source Ruby framework for text and voice chatbots. 🤖
The Self-Coding System for Your App — Alan AI SDK for React Native
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned…
Open Source Voice Agent Platform
A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences…
A conversational, AI device + software framework for companionship, entertainment, education, healthcare, IoT…
A curated list of artificial intelligence resources (Courses, Tools, App, Open Source Project)
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
Mac compatible Ollama Voice
One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials…
Make your meetings accessible to AI Agents
Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image…
😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek…
Leverage the OpenAI Realtime API (12-17-2024) with this Next.js 15 starter template featuring shadcn/ui…
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM…
AI-powered video podcast creation skill for coding agents. Supports Bilibili & YouTube, multi-language…
OK | Every voice, every meme, every transaction makes $OK stronger and more vibrant. Powered by all of us—and…
End-to-end platform for building voice first multimodal agents
The Self-Coding System for Your App — Alan AI SDK for Power Apps
Your personal voice assistant based on OpenAI ChatGPT.
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight…
LMS SaaS app featuring user authentication, subscriptions, and payments using Next.js, Supabase, and Stripe…
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs…
📱 ClawApp: mobile chat client for OpenClaw AI Agent | Streaming chat · Image sending and receiving · Tool calling · PWA + APK
Tiledesk Server is the main API component of the Tiledesk platform 🚀 Tiledesk is an open-source alternative…
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
TypeScript client for OpenAI's realtime voice API.
say - command line tool for voice and video calling
Open Source Voice Agent Platform
Tiny truly local voice-activated LLM Agent that runs on a Raspberry Pi
A powerful Rust library and CLI tool to unify and orchestrate multiple LLM, Agent and voice backends (OpenAI…
Pipecat voice AI agents running locally on macOS
Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)
Demonstrates how to protect your OpenAI API Key using a Cloudflare Worker to serve your ephemeral token and…
Low-latency AI companion voice talk in 60 lines of code using faster_whisper and ElevenLabs input streaming
A completely private, locally-operated AI Assistant/Chatbot/Sub-Agent Framework with realistic Long Term…
This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice…
Tiledesk is the open source AI agent builder, written in Node.js and Angular. This repository is dedicated to…
Official Repo of Moss
Jarvis is a voice-activated, conversational AI assistant powered by a local LLM (Qwen via Ollama). It listens…
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI…
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while…
AI Talks - ChatGPT Assistant via Streamlit
Firefox Voice is an experiment in a voice-controlled web user agent
Install Tiledesk on your server using Helm for Kubernetes orchestration and Docker Compose for running…
World's First Multilingual Inexpensive Therapeutic Sophisticated Ultra-responsive Holographic Agent. In…
Talk to ChatGPT in real time using LiveKit
Master AI BOT 🤖: Unleash the power of GPT-4 Turbo with our fast and limitless Telegram bot. Say goodbye to…
Rust Agent Development Kit (ADK-Rust): Build AI agents in Rust with modular components for models, tools…
Your voice-controlled Mac assistant
Sample application to add voice capabilities to the Agents SDK
Kotlin framework for conversational voice assistants and chatbots development
Documentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section…
Daily Bots Web Demo showcasing how to build real-time voice AI agents
OpenClaw voice assistant app for Android - Wake word activation & system assistant integration
She's the AI agent you come home to.
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song…
Alice is a voice-first desktop AI assistant application built with Vue.js, Vite, and Electron. Advanced…
Generate your next article idea with ease. Powered by AI.
Send voice notes to Telegram → get organized knowledge base, tasks in Todoist, and daily reports. Persistent…
DaVinci - The ChatGPT AI Virtual Assistant
An AI-powered storytelling video generator that takes user input as a story prompt, generates a story using…
It understands your voice commands, searches news and knowledge sources, and summarizes and reads out content…
A Multi-modal MCP client for voice powered agentic workflows
Full stack voice chatbot
A complete voice AI starter for LiveKit Agents with Python.
A voice-enabled chatbot application built using 🦜️🔗 LangChain, text-to-speech, and speech-to-text models…
⌨️ Command-line interface (CLI) for a better use of Leon, your open-source personal assistant. GNU/Linux…
Voice native AI agent for the builders of tomorrow
A Multi-Agent AI Tool that creates beautiful presentations with voice-overs 🎦🔥
Quickly deploy Open-AutoGLM agent on Android phone using Termux. Support AI voice recognition and enable…
Give your AI agent a voice on every chat platform.
Voice-activated AI assistant with speech recognition and NLP. Automate tasks effortlessly with this…
EDUMCP is a protocol that integrates the Model Context Protocol (MCP) with applications in the education…
Cute voice assistant built on ESP32 to help users with reminders, productivity, and daily conversations.
Sayna is a unified Voice Layer for AI Agents with seamless integration into existing agentic frameworks
Official one-stop shop for AI Agents and developers building with Telnyx.
Algolia + Angular = 🔥🔥🔥
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as…
Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using…
Alexa Skill that provides turn-based conversations with an AI LLM. Bringing AI to your Alexa, because Amazon…
🦞 A cute desktop lobster AI assistant - Desktop lobster pet with OpenClaw AI, Edge TTS voice, and emotion animations
A voice chatbot based on GPT4All and talkGPT, running on your local pc!
Chat with GPT LLMs over voice, UI & terminal, all with access to the internet. Powered by OpenAI.
🧾✨ AI-Powered Receipt and Invoice Scanner for Laravel, with support for images, documents and text
A versatile multi-modal chat application that enables users to develop custom agents, create images, leverage…
Automatically generate viral-ready vertical short clips from long-form gameplay footage using AI-powered…
Reference architecture for agentic AI chatbots with Strands Agents and Amazon Bedrock AgentCore
Self-hosted AI voice agent
AI VTuber Waifu and voice assistant
Safeclaw is the alternative to openclaw. You can naturally chat with it via text and voice, and you can…
OpenAI Realtime API Voice Agent with RAG, Function Calling, and Caller History
Thoth - Personal AI Sovereignty. A local-first AI assistant with integrated tools, a personal knowledge…
LearnGo, a versatile iOS learning tool with RAG and prompting on LLMs like GPT-4o-mini, enabling subject…
🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice…
Integrate AI-powered voice translation into a Twilio Flex contact center using our prebuilt starter app…
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and inputs the recognized text…
7 production n8n workflows from Jacobo, a multi-agent AI system (WhatsApp + Voice). Open source by default.
The most cost-effective, highest performance AI voice agent possible today
OpenAi-Sora (SoraFlows) is an open-source, cross-platform web application for AI-powered video creation and…
Summon your AI superpower — grows with you through voice, vision, and autonomous action
Run a <400ms latency Voice Agent on just 4GB VRAM. Fully offline, no API keys required. Optimized for GTX…
Voice AI components using OpenAI Realtime API to copy and paste into your Nextjs projects built with…
Push to talk voice recognition using Whisper
Exposes internet search tools for use by LLM-backed Assist in Home Assistant
Agentic Chat App is an advanced AI-powered chat application designed for seamless real-time communication and…
⚡ A local, privacy-focused AI desktop assistant for Windows. Control your PC remotely via Telegram or locally…
Voice-to-text CLI for terminal users
Real-time voice agent powered by Agora and OpenAI
Core server of the SEPIA Framework responsible for NLU, conversation, smart-service integration…
Cognithor - Agent OS: Local-first autonomous agent operating system. 16 LLM providers, 17 channels, 112+ MCP…
A Discord chatbot that supports popular LLMs for text generation and ultra-realistic voices for voice chat.
A New End-to-end Framework for Evaluating Voice Agents
The ChatGPT/DeepSeek Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with…
Desktop agent framework for creating AI agents that can see and control your computer through voice and text…
A WhatsApp AI Agent powered by LangGraph, FastAPI, and Groq. Acts as an empathetic therapist, Dr. Sofia…
Join the OVOS collective, utils for OpenVoiceOS mesh networking
Cartesia Line SDK for voice agents.
Open-source voice agent orchestration framework - build production voice AI pipelines without vendor lock-in
The ElevenLabs Agents SDK for TypeScript.
🦞 Open-source browser-based voice chat for AI assistants. Self-hosted, private, free. Whisper STT +…
Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate…
Voice control for ChatGPT. Talk to ChatGPT and hear ChatGPT's responses in a natural voice.
A Clojure library for building real-time voice-enabled AI Agents. Simulflow handles the orchestration of…
Email & SMS infrastructure for AI agents — send and receive real email and text messages programmatically
Open-Source Intelligent Command Layer
An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and…
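🦞 MobileClaw: a lobster walkie-talkie with eyes | Multimodal voice+vision walkie-talkie for OpenClaw AI agents. iOS & Android.
AI murder mystery (jubensha) game with agent-driven script performance. Supports AI script generation, TTS voice narration, AI image generation, and more. Integrates minimax. AI-powered murder mystery game where all characters are…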
The AVR Infrastructure project is designed to launch the Agent Voice Response application, which will start…
Query LLMs and AI tools with voice commands
Browser based agent orchestrator / You host the server wherever you want / Harnesses all CLI AI harnesses /…
NOVA is a customizable voice assistant made with Node.js.
A complete voice AI starter app for LiveKit Agents with Node.js
Deep Research through Multi-Agents, using GraphRAG
Supercharged Claude Code Official Telegram plugin — threading, voice messages 2 ways, stickers, GIFs…
Voice Agent Framework for Conversational AI
LangGraph adapter for LiveKit Agents