capability
Tts agents
This page lists every AI agent in the MeshKore directory tagged with the Tts capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile with details on capabilities, framework, language, freshness, and source attribution.
275 agents in this capability · ranked by popularity
Top 200 Tts agents
A generative speech model for daily dialogue.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's…
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy…
🤖️ Cross-platform AI language practice app (跨平台AI语言练习应用)
EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included
⚡ Energy consumption metrology agent. Let "scaph" dive and bring back the metrics that will help you make…
Text-To-Speech, RAG, and LLMs. All local!
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,接入openClaw,真正的个人语音助手,时延低至800ms,Mac等低配置也可运行,支持打断
Meet Ava, the WhatsApp Agent
Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG
基于AI的工作效率提升工具(聊天、绘画、知识库、工作流、 MCP服务市场、语音输入输出、长期记忆) | Ai-based productivity tools (Chat,Draw,RAG,Workflow,MCP…
AI Vtuber for Streaming on Youtube/Twitch
An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming…
Natural (2-way) voice conversations with Claude Code
🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama…
The simplest and lowest-cost AI integration solution. If you like this project, please give it a Star~ |…
Convert any git repository into an engaging podcast
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS…
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for…
TTSFM mirrors OpenAI's TTS service, providing a compatible interface for text-to-speech conversion with…
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
The Self-Coding System for Your App — Alan AI SDK for React Native
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned…
Open Source Voice Agent Platform
Unreal Engine plugin for LLM/GenAI models & MCP UE5 server. Includes OpenAI's GPT, Deepseek, Claude…
Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface
😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek…
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM…
a open-source Artificial Intelligence Virtual Youtuber (AI VTuber), (this project is deprecated)
AI-powered video podcast creation skill for coding agents. Supports Bilibili & YouTube, multi-language…
The Self-Coding System for Your App — Alan AI SDK for Power Apps
Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across…
This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice…
Jarvis is a voice-activated, conversational AI assistant powered by a local LLM (Qwen via Ollama). It listens…
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while…
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
A unified interface for multiple Text-to-Speech (TTS) providers.
World's First Multilingual Inexpensive Therapeutic Sophisticated Ultra-responsive Holographic Agent. In…
Talk to ChatGPT in real time using LiveKit
(Spring Boot 3. X Microservices framework) 基于Spring Boot 3.X 的 Spring Cloud Alibaba / Spring Cloud Tencent +…
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。
A fully autonomous AI Agent/Python pipeline that utilizes Large Language Models (LLMs) like Gemini to…
She's the AI agent you come home to.
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song…
Bella…
Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible endpoint to…
pdf reader app with note taking, annotations, collaboration, ai features (chat, flashcards generation w…
🗣️ ZAI/GLM TTS to OpenAI Speech API, 免费的语音合成API,支持克隆音色,基于智谱TTS
OpenAI-Compatible Proxy Middleware for the Wyoming Protocol
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak…
Collections of Skills to assist building with ElevenLabs
Official one-stop shop for AI Agents and developers building with Telnyx.
A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching…
🎬 全自动 AI 视频代理 · 一句话生成带字幕成片 · Fully Automated AI Video Agent · Local Deployment
🦞 一个可爱的桌面龙虾AI助手 - Desktop lobster pet with OpenClaw AI, Edge TTS voice, and emotion animations
Automatically generate engaging AI podcasts from nothing but an episode title.
Automatically generate viral-ready vertical short clips from long-form gameplay footage using AI-powered…
🎭 TTS for Claude Code
It is a personal assistant chatbot, capable to perform many tasks same as Google Assistant plus more extra…
Safeclaw is the alternative to openclaw.. You can naturally chat with it via text and voice, and you can…
ChatGPT web application. ChatGPT 网页应用,支持多对话、海量提示词、PWA、ASR、TTS
一个基于Indextts和Qwen3TTS的 AI 有声书制作工具。利用 LLM 自动拆解剧本与识别情绪,集成多角色 TTS…
AAHL's Agent Skills. 汇集了多种实用的智能体技能,涵盖Home Assistant智能家居控制、微软Edge…
Full-stack AI chat platform built on Cloudflare using Workers, Durable Objects, KV, and AI Gateway. Features…
AI-Native Video Editor — CLI-first, MCP-ready. Generate, edit, and ship videos from your terminal.
The ChatGPT/DeepSeek Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with…
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
A true Artificial Intelligent Assistant with ALICE as backend and offline speech recognition with vosk engine…
不会聊天的字幕提取器不是一个好 B 站下载器~
🗣️🔊 Your Text-to-Speech Services, All-in-One.
GPT-3 client for Windows and Unix with memories management that supports both text and speech in any…
Shell wrapper for OpenAI's ChatGPT, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Anthropic, and more.
AI剧本杀,Agent剧本演绎。支持AI剧本生成、TTS语音播报、AI图像生成等功能。接入minimax。AI-powered murder mystery game where all characters are…
The AVR Infrastructure project is designed to launch the Agent Voice Response application, which will start…
Implementation of OpenAI's Text-To-Speech in Unity. Synthesize any text and play it via any AudioSource.
Hybrid Conversational Bot based on both neural retrieval and neural generative mechanism with TTS.
An AI-powered chatbot integrated with Telegram, using OpenAI GPT-3.5 Turbo, language embeddings, and FAISS…
Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning (NeurIPS2025-SEA)
100% free, local & offline voice assistant with speech recognition
时下热词追踪Agent 💡,集成多 Tools、TTS、ASR、HeyGem API
Leopard Chat UI - A Teneo Chat Client based on Vue and Vuetify
Acid Reflux for your Ears!
Talk to your second brain personal assistant using speech 🧠
Omnigram is a Flutter-based file reader and audiobook . It accommodates EPUB and PDF and offers audiobook…
Open-source AI pipeline that turns any topic into a publish-ready YouTube/Instagram/TikTok Short — research…
Build, test, and ship omnichannel voice agents on Azure—ACS telephony, custom STT→LLM→TTS pipeline, Voice…
Use OpenAI TTS(Text to Speech) API with Gradio
AgentOS2-Live by OrionStar — an end-to-end real-time voice interaction solution based on the Realtime API. No…
Your AI executive team on Discord. 7 specialized agents — Engineering, Finance, Marketing, DevOps, Legal…
HanaVerse is a interactive web UI for chatting with ollama with a lively 2D anime character Hana. Star it on…
Twitch Streamer GPT is a NodeJS-based Twitch enhancement tool, offering interactive stream experiences with…
重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。
一个具有长时记忆和 Live2d 形象的"数字生命" / A digital life with long-term memories and live2d body
久远:一个开发中的大模型语音助手,当前关注易用性,简单上手,支持对话选择性记忆和Model Context Protocol (MCP)服务。 KUON:A large language model-based…
A multi engine TTS & LLM edge computing playground with audio book features and more!
Chatbot with a 3D avatar that can answer interview questions in your behalf. It can speak and understand…
A self-hosted AI companion web app with anime-style Live2D and VRM characters. Talk with your companion via…
An AI assistant building SDK in python
A Python based Voice Assistant like Siri
A Whisper + ChatGPT MagicMirror Module.
The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI pipelines for real-time conversations…
Voice-powered AI assistant platform — connect any LLM, any TTS, with a live web canvas, music generation, and…
A modified version of SalesGPT with the addition of TTS, STT, and Twilio to make calls. A Context-aware AI…
Python platform for working with LLMs
A Chat Client for LLMs, written in Compose Multiplatform.
Lightweight Java library to interact with the OpenAI API (GPT, DALL-E, TTS, etc.)
Uses OpenAI API to clean pdf then converts it to professional grade audiobook with text to speech.
Live2D + ASR + LLM + TTS → Real-time communication + Offline Deployment/Cloud Inference 实时沟通 本地部署/云端推理
A powerful, unofficial OpenAI-compatible API service offering free access to GPT-4o, GPT-4-turbo, and audio…
基于Maibot核心修改而成的笨蛋Nachoneko特制bot
Harness OpenAI's power to effortlessly create YouTube Shorts with this project. Includes tools for generating…
Text To Speech Demo in ReactJS Application using Azure Avatar AI Service.
Sky LiveKit Agent Perplexica is a local, free solution integrating LiveKit with advanced internet search. It…
OpenAI GPT-4o Mini TTS – Home Assistant Integration
Langchain Voice Agent with Inworld TTS
Implementation of OpenAI's Realtime API in Unity. Easily integrate low-latency, multi-modal conversations via…
Multi-agent TTS production harness: Fish TTS + WhisperX + Claude, with cross-episode memory and auto-fix loop
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the…
Threlte Live – A SvelteKit + Three.js platform for live-streaming 3D VRM avatars. Features real-time chat…
Private AI Hub (P8Hub) - Host and use your own AI Services. Keep everything simple and private.
AI Agent Skills toolkit for automated product introduction video generation with Remotion, Playwright, and…
🎙️ Voice-native document intelligence using Gemini, ElevenLabs STT/TTS, and Datadog observability — turning…
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our…
🧠 Personal AI Gateway — Single-file Python AI agent with multi-LLM, tools, vision, TTS, encrypted vault. Your…
Monika is an AI assistant that combines speech-to-text, natural language processing, and text-to-speech…
Your AI assistant, stt -> llm agent -> tts, full api, can run on raspberry pi
Zippy Talking Avatar uses Azure Cognitive Services and OpenAI API to generate text and speech. It is built…
This is the guide to show the method to build your own AI-Powered voice agent with LiveKit and Twillio
📣 Auto-plays ChatGPT responses
A one command Voice AI deployment script for MacOS. Supports Sesame, Kokoro, Spark, Zonos and Whisper…
This repository contains an attempt to incorporate Rasa Chatbot with state-of-the-art ASR (Automatic Speech…
Enterprise-grade browser extension bringing multilingual voice interaction to AI chatbots (Pi, Claude…
Model uses Whisper, CHATGPT, GTTS
Eolian is a Discord music bot which provide a very powerful API for queuing songs from a variety of sources…
Build realtime AI interviewer voice agent that joins meetings. It demonstrates integrating Deepgram (STT)…
DRIA (Deep Research and Intelligence Agent) is a fully local voice assistant that can hold real-time…
Text to Speech Studio to convert text into natural-sounding speech using advanced AI models from leading…
AIPE (AI Pipeline Engine) is a flexible and powerful tool for creating and executing complex AI workflows
Discord bot that uses OpenAI chatGPT under the hood. Prompts and answers using voice with(gTTS)
This project is the backend engine for a fully autonomous AI-powered call center. It integrates a large…
A Voice-to-Voice AI Agent that lets you naturally talk to documents in real time. Powered by LiveKit's…
中文语音助手 | 唤醒词 + ASR + OpenClaw Agent + TTS | 离线唤醒、流式语音交互、工具调用、Skills 扩展
A desktop client with MCP support for Mistral LLMs
Ultra-fast local TTS for AI agents. ~90ms to first sound.
Hermes Agent made portable desktop for Windows — 100 tools, GUI, local models via LM Studio, TTS, Music…
GLaDOS Terminal-based AI Assistant
面向全平台愿景的原生 AI 桌面助手,支持 Live2D 角色交互与 OpenAI 兼容对话/TTS API。 | A cross-platform and native vision AI desktop…
Try out the OpenAI Text to Speech API in your browser.
OpenAI-Assistant API integration with Speech Recognition and Eleven Labs TTS. User can choose name…
🤖 AI Conversation Agent for Home Assistant. Compatible with any OpenAI format LLM providers, supports STT/TTS
Outbound PSTN calling agent using LiveKit SIP trunks with a voice pipeline (Silero VAD, Deepgram STT, OpenAI…
A project combining roguelike with LLMs, RAG, Text2Speech, and Speech2Text
Persistent agents for Claude Code — personality, bilingual memory (SQLite+FTS5 / QMD), nightly dreaming…
A spoken English education chatbot based on ChatGPT/whsiper and gTTS.社恐人士的英语角
Stitch together text-to-speech over 4096 characters via the OpenAI API
An AI chatbot that talks to people in VR Chat.
voice ai agent that's able to do tool calls with composio integrations
Self-evolving AI assistant platform with 42+ skills, 5-level model fallback, lossless context, deep…
Zippy Talking Avatar uses Azure Cognitive Services and OpenAI API to generate text and speech. It is built…
Voice-to-Voice Chatbot using Whisper, LLaMA, and Groq API
A simple Python based implementation of a Raspberry Pi based, OpenAI ChatGPT enabled voice assistant
Text-to-speech plugin for Claude Code — multi-provider support (ElevenLabs, OpenAI, Google, Amazon Polly…
Generate videos on any topic automatically, harnessing OpenAI for script generation, ElevenLabs for TTS, and…
A web application that utilizes AI to help you improve your English speaking and conversation skills.
This is a fully local AI Assistant that uses Silero VAD, Faster-Whisper, LM Studio, Coqui TTS, MiniLM-L6-v2…
Google Alexa like Laptop assistant written in Python which uses google's speech-to-text library to process…
IA na Prática: LLM, RAG, MCP, Agents, Function Calling, Multimodal, TTS/STT e mais
Medora AI is an innovative, voice-first medical appointment booking system designed to revolutionize…
Darvin is a Python-based voice-activated chatbot that interacts with users via microphone and speaker. It can…
Simple chatbot using IBM Watson (STT, TTS, Assistant)
An AI-powered, fully automated n8n workflow that converts a single text prompt into scroll-stopping YouTube…
AI-powered technical interview system with dynamic resume analysis, voice interaction, and automated…
An unofficial workspace for ElevenLabs
Interact with GPT-3 through speech
An awesome dicord bot
Cloudflare-native AI agent — 13 tools, codemode, 5-layer memory, self-learning, multimodal I/O. Telegram…
Node.js app where you can ask questions to ChatGPT using voice prompts, see the ChatGPT-like word-by-word…
Discord bot for Google and Polly Text-to-Speech
Enhanced version of sora-extend with Docker support and CLI arguments. Original by @mattshumer_
Docker container that gives your IP phone (e.g., FRITZ!Box) a voice + AI. It answers calls, speaks with fast…
AI-powered video creation platform with credit-based system. Built with Remora, Google Cloud TTS, Neon DB…
A.L.I.C.E (Artificial Labile Intelligence Cybernated Existence). A REST API of A.I companion for creating…
This repository contains an attempt to utilize the NeMo toolkit created by NVIDIA
A Text Based Chatbot - Our project for HackNotts 2017
Fluid dialogue manager plugin for Godot 4.x built on RiveScript
Emilia - Desktop Character.AI Client
Script that is using OpenAI API for text to speech. Mainly made for speech impeded users so they could…
Freeswitch Speech-To-Text module
Stream GPT response to TTS directly using Flask
EasyTTS是一个便捷的工具,旨在方便地使用第三方API服务来调用OpenAI的文本转语音(TTS)功能。 EasyTTS允许用户输入文本,并选择不同的模型、音色、格式来生成音频文件。
VidGen is a fun tool written with python that uses OpenAI to generate video scripts. It then fetches images…
AI Vtuber for Streaming on Youtube/Twitch
Crix- your personal AI voice assistant that actually does your work for you
An intent-based chatbot in python with tflearn and TensorFlow. It can be trained for a specific purpose and…
chatkore为开发者提供优质稳定的OpenAI相关的API调用接口,方便国内用户使用各类开源ChatGPT项目或者AI领域的库的使用。