capability
Test agents
This page lists every AI agent in the MeshKore directory tagged with the Test capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile with details on capabilities, framework, language, freshness, and source attribution.
1,143 agents in this capability · ranked by popularity
Top 200 Test agents
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with…
Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 100 clases, 44 horas…
Open-source platform for creating safe, isolated production sandboxes for API, integration, and E2E testing.
Fully autonomous AI Agents system capable of performing complex penetration testing tasks
🕷 Super-agent driven library for testing node.js HTTP servers using a fluent API. Maintained for…
Automated Penetration Testing Agentic Framework Powered by Large Language Models
MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models…
HexStrike AI MCP Agents is an advanced MCP server that lets AI agents (Claude, GPT, Copilot, etc.)…
The most powerful Android RPA agent framework, next generation mobile automation.
VPS 融合怪服务器测评项目 更推荐使用无环境依赖的Go版本 VPS Fusion Monster Server Test Script – More recommended to use the Go version…
A Modern Orchestration Engine for Security
QA via natural language AI tests
Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build…
🐢 Open-Source Evaluation & Testing library for LLM Agents
The fastest and the most accurate file search toolkit for AI agents, Neovim, Rust, C, and NodeJS
Open-source, vision-first browser agent
AI agent framework for plan-first development workflows with approval-based execution. Multi-language support…
Product Management skills framework built on battle-tested methods for Claude Code, Cowork, Codex, and AI…
Expect tests your agent's code in a real browser
CyberStrikeAI is an AI-native security testing platform built in Go. It integrates 100+ security tools, an…
Write tests against structured configuration data using the Open Policy Agent Rego query language
The fastest business intelligence tool for humans and agents.
基金投资管理回测引擎
A Burp Suite extension that integrates OpenAI's GPT to perform an additional passive scan for discovering…
The fastest way to build robust AI agents
AI Skills, MCP Tools, and CLI for Unity Engine. Full AI develop and test loop. Use cli for quick setup…
PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty, red-team, and…
Claude Code learns from your corrections: self-correcting memory that compounds over 50+ sessions. Context…
Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
AI-powered icon generation CLI for React Native & Expo developers. Generate stunning app icons in seconds…
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository…
An AI-powered agentic red team framework that automates offensive security operations, from reconnaissance to…
"Vibe-Trading: Your Personal Trading Agent"
Agent Skills for optimizing web quality based on Lighthouse and Core Web Vitals.
CLI to control iOS and Android devices for AI agents
🎉Agent of Sonic cloud real machine platform. Sonic云真机平台Agent端。
☸️ Testkube is a Test Orchestration Platform for Cloud Native Applications
Neovate Code is a code agent to enhance your development. You can use it to generate code, fix bugs, review…
The Claude Agent Skill for Terraform and OpenTofu - testing, modules, CI/CD, and production patterns
Standards for building agents, better
A secure sandbox environment for malware developers and red teamers to test payloads against detection…
Backtesting and Trading Bots Made Easy for Crypto, Stocks, Options, Futures, FOREX and more. Lumibot also…
Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level…
An MCP server that autonomously evaluates web applications.
Autonomous Hacking Agent for Red Team Testing
A curated list of awesome AI assistants. Example Telegram bot with all these assistants can be tested on the…
Testsigma is an agentic test automation platform powered by AI-coworkers that work alongside QA teams to…
AIPex: AI browser automation assistant, no migration and privacy first. Alternative to Manus Browser…
An offensive/defense security toolset for discovery, recon and ethical assessment of AI Agents
Production-grade multi-agent orchestration platform - JSON-defined agents, multi-tier memory, and built-in…
AI Agent Evaluator & Red Team Platform
Hercules is the world’s first open-source testing agent, enabling UI, API, Security, Accessibility, and…
Intelligent enterprise-grade reference architecture for JavaScript, featuring OpenAI integration, Azure…
A complete guide to start and improve your LLM skills in 2026 with little background in the field and stay…
🚀 MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating…
Burp Suite extension that adds built-in MCP tooling, AI-assisted analysis, privacy controls, passive and…
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Live validation proxy tool for testing web app vulnerabilities
Latest Advances on Agentic AI & AI Agents for Healthcare
A suite of test scenarios for multi-agent reinforcement learning.
A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It…
An IOS Simulator Skill for ClaudeCode. Use it to optimise Claude's ability to build, run and interact with…
The fastest JavaScript BPE Tokenizer Encoder Decoder for OpenAI's GPT models (gpt-5, gpt-o*, gpt-4o, etc.)…
Demo of a UI testing agent using the OpenAI CUA model and the Responses API.
Cookiecutter template for FastAPI projects using: Machine Learning, uv, Github Actions and Pytests
Autonomous software engineering fleet of AI agents for production-grade PRs on AgentField: plan, code, test…
LuaN1aoAgent is a cognitive-driven AI hacker. It is a fully autonomous AI penetration testing agent powered…
Agent that empowers software testing with LLMs; industrial-first in China
📄 Production-ready MCP server for PDF processing - 5-10x faster with parallel processing and 94%+ test…
DevoxxGenie is a plugin for IntelliJ IDEA that uses local LLM's (Ollama, LMStudio, GPT4All, Jan and…
A Multi-Agent Trading System Based on Internal Contest Mechanism
MCP configuration to connect AI agent to a Linux machine.
Backtrader-powered backtesting framework for algorithmic trading, featuring 20+ strategies, multi-market…
My personal Claude Code and OpenAI Codex setup with battle-tested skills, plugins, hooks and agents that I…
AI Agent for testing Android, iOS, and Web apps. Get Started in 5 Minutes. Arbigent's intuitive UI and…
AI-Hedge-Fund for Crypto 🚀 AI-powered hedge fund for cryptocurrency trading, leveraging LLM agents for…
The fastest PDF library for Python and Rust. Text extraction, image extraction, markdown conversion, PDF…
🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI…
Claude Code Skills for software engineering workflows - Git automation, testing, and code review
The fastest way to bring multi-agent workflows to production.
AI-Powered Penetration Testing Assistant
Connect Cursor, Copilot & Claude AI directly to Cheat Engine via MCP. Automate reverse engineering, pointer…
Official plugin for OpenClaw that exports agent traces to Opik. See and monitor agent behaviour, cost…
暴走皮皮虾之代码发布系统,是现代的持续集成发布系统,由后台管理系统和agent两部分组成,一个运行着的agent就是一个节点,本系统并不是造轮子,是"鸟枪"到"大炮"的创新,对"前朝遗老"的革命.
A program synthesis agent that autonomously fixes its output by running tests!
Automated web vulnerability scanning with LLM agents
Pentest Copilot is an AI-powered browser based ethical hacking assistant tool designed to streamline…
RAG LLM Ops App for easy deployment and testing
The CLI for AI agents to control Chrome. Zero config, agent-agnostic, battle-tested.
Deeper Seeker is an simpler OSS version of OpenAI's latest Deep Research feature in ChatGPT.It is an agentic…
MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in…
DeepV Code - A highly customizable AI coding assistant compatible with all major AI models. The perfect…
EVA is an AI-assisted penetration testing agent that enhances offensive security workflows by providing…
A virtual AI-based pit crew for Sim Racing. Use the latest GPT technology to create a real life like…
3D rendered proc-gen world test. C++ homebrew voxel engine for agent-driven prodedural generation / world…
User-Agent , X-Forwarded-For and Referer SQLI Fuzzer
AIRecon is an autonomous cybersecurity agent that combines a self-hosted Large Language Model (Ollama) with a…
"Unit tests" for your agent skills
The easiest, and fastest way to run AI-generated Python code safely
SwarmGo (agents-sdk-go) is a Go package that allows you to create AI agents capable of interacting…
A generative AI-powered framework for testing virtual agents.
👻 A LAN dropbox chatbot controllable via Telegram
The AMLSim project is intended to provide a multi-agent based simulator that generates synthetic banking…
PAO is agent-optimized output for PHP testing tools.
Using Agents To Automate Pentesting
An agent skill focused entirely on Swift Testing, helping you write better tests, migrate from XCTest…
此仓库存储我在YouTube频道和B站频道关于AI Agent相关分享,所有资源全部开源免费
Agent skill for AWP RootNet protocol (testnet) — query, stake, govern, and monitor on-chain
Halberd : Multi-Cloud Agentic Attack Tool
正规子群.AI Agent | SubgroupX: A high-performance AI Agent for offensive security, Coding, CTF operations, and…
💀 It's headless WordPress!
Production-tested templates for deploying multi-agent AI teams on OpenClaw with Telegram supergroup…
The easiest way to run the fastest MLX-based LLMs locally
ReconNess is a platform to allow continuous recon (CR) where you can set up a pipeline of #recon tools…
Mock everything your AI app talks to — LLM APIs, MCP, A2A, AG-UI, vector DBs, search. One package, one port…
🚀 19 AI Agents + 44 Commands for Gemini CLI - Code 10x faster with auto planning, testing, review & security
Getting the latest versions of Disco Diffusion to work locally, instead of colab. Including how I run this on…
Agentic QE Fleet is an open-source AI-powered QA/QE platform designed for use with Coding Agents (works best…
Automate the tedious development tasks with AI
LLM Agent and Evaluation Framework for Autonomous Penetration Testing
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players…
gpttools extends gptstudio for package development to help you document code, write tests, or even explain…
OpenAI GPT3/3.5 and GPT4 ChatGPT API Client Library for Go, simple, less dependencies, and well-tested
Build, test, and deploy intelligent AI agents the Laravel way
Hulken is a stress testing tool for everything speaking HTTP. Hulken supports multiple urls, GETs and POSTs…
AUITestAgent is the first automatic, natural language-driven GUI testing tool for mobile apps, capable of…
A Test Project for a Network Security-oriented LLM Tool Emulating AutoGPT
✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models
🛡⚔️AI-Powered Penetration Testing Framework with automated vulnerability scanning, multi-agent system, and…
AiScan-N 来了!这是一款基于人工智能驱动的Ai自动化网络安全(运维)工具,专注于网络安全评估、漏洞扫描、运维、应急响应、渗透测试自动化,Ai大模型工具集【CLI Agent】…
The definitive benchmark for AI agents on OpenClaw. 45 tasks across 4 tiers. Powered by MyClaw.ai
Cross-browser web performance testing agent
CLI for TDD — you write the test, GPT writes the code to pass it ✅
🐤 AI chat & search summaries in DuckDuckGo, powered by the latest LLMs
GenAI powered OpenSource IDE for API first workflows
Fake Sora API is an open-source project that simulates the yet-to-be-released OpenAI Sora API, enabling…
This repo houses Rubber Ducky scripts integrated with OpenAI's GPT. Designed for ethical hackers and…
Web Testing AI Agent - Write your specs, it does the rest
Swift Testing agent skill for Claude Code, Codex, and other AI tools.
Continuous Integration for LLM powered applications
Tree-sitter-powered code indexing server that gives LLM agents precise, on-demand access to symbols…
19 production-ready Claude Code plugins: git workflows, code review, spec-driven development, architecture…
使用CrewAI+FastAPI搭建多Agent协作应用并对外提供API服务,同时支持gpt、国产大模型、Ollama本地大模型。
Build Agentic AI solutions on AWS, using latest OSS Agentic Frameworks.
🚀 ERA Connect by VYNECT™ — The evolution of secure WhatsApp automation ERA Connect is part of the VYNECT™…
The Selenium for Chatbots - Bots Testing Bots
🚀 JoySafeter: An enterprise AI Agent Platform—Not just chatting. building、running、testing, and tracing…
AI-powered offensive security testing using autonomous agents, directly in your terminal.
SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange…
🤖 LLM-powered agent for automated Google Dorking in bug hunting & pentesting.
The AI framework for Go developers. Build powerful AI applications and agents using our free, open-source…
Prompts for performing tests on your Kali Linux using Gemini-cli, ChatGPT, DeepSeek, CursorAI, Claude Code…
The Android Agent for the Drozer Security Assessment Framework.
Autonomous penetration testing using a swarm of AI agents. Orchestrates recon, classification, exploitation…
Universal development automation for ANY project. Claude Code implements features, runs tests, creates PRs…
Agentic pentest tooling. Currently achieving 81% (KIMI K2.5) on XBOW's benchmark in full black-box…
Testing WASM-powered AI agents
Paper trading simulator for Polymarket — built for AI agents. MCP server, live order books, strategy…
Computer-Use SDK for E2E QA Testing
Repo for AI Agents The Definitive Guide
Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents…
Turn Claude Code into your offensive security research assistant. Specialized AI subagents for authorized…
AlphaSuite is an open-source quantitative analysis platform that gives you the power to build, test, and…
🦁 AI chat & search summaries in Brave Search, powered by the latest LLMs
Red Teaming python-framework for testing chatbots and GenAI systems.
An environment for testing AI pentesting agents against a simulated network.
An Open Source Playground with Agent Datasets and APIs for building and testing your own Autonomous Web Agents
AI powered cli coding agent that monitors your dev/test server and fixes errors and adds features
Unified Emacs interface supporting OpenAI Codex, GitHub Copilot CLI, Claude Code, Gemini CLI, Opencode, and…
Autonomous web browser agent that audits performance, functionality & UX for engineers and vibe-coding…
A collection of agents and skills to aid in the planning, implementation, documentation and testing of…
All-in-one security testing toolbox that brings together popular open source tools through a single MCP…
Put the world's smartest AI agents & plugins in your pocket
API Key 测活工具 - 用于批量检测 OpenAI、Claude、Gemini 等 API 密钥有效性 | Modern API Key Tester - Used for batch testing the…
A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research…
AI-powered E2E testing for 10 platforms. 253 MCP tools. Zero config. Works with Claude, Cursor, Windsurf…
A high-fidelity, general-purpose platform for embodied agent training and testing.
ChatGPT加持的,多人在线协同信息安全报告编写平台。目前支持的报告类型:渗透测试报告,APP隐私合规报告。
🤖 AI chat & search summaries in Google Search, powered by the latest LLMs
A QA testing framework for your coding agent.
A comprehensive codebase of best practices, coding rules, and workflow automation for AI-assisted development…
[arxiv: 2503.23895] Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement
A CLI tool to control Unity Editor - enabling both humans and AI agents to run compilations, tests, and…
A comprehensive list of document parsers, covering PDF-to-text conversion and layout extraction. Each tested…
编程导航 2025 年 AI + 全栈新项目,基于 Spring Boot 3 + LangChain4j + Vue 3 构建的程序员技术练兵场平台,检验程序员水平。支持 AI…
skUnit is a testing tool for AI units, such as IChatClient, MCP Servers and agents.
SEOBuild Onpage - The first AI agent that writes pages Google ranks AND LLMs cite. One command in, ranking…
Agent skills that make AI coding assistants write production-grade robotics software. ROS1, ROS2, design…
A fast, native browser automation CLI built from the ground up for AI agents, powered by Chrome DevTools…
AI-powered offensive security agent. Autonomous pentesting with 13+ specialized agents, 120+ OWASP test…
This repo is deprecated. Please go to langchain-ai/docs.
Open source version of Claude Managed Agents. Fastest way to build and deploy reliable AI agents, MCP tools…
AI Agents are missing the UI! We're here to change it. Build Business AI Agents for your company: business…
It's like Auto-GPT met Brew. The easiest and fastest way to get started with AutoGPT with any backend of your…
AI QA Agent for mobile apps
Curated, production-grade skills for AI coding agents. Battle-tested workflows for developers who use AI…
Setting up QA testing agents using playwright and crewAI
Open-Source RAG app with LLM Observability (Langfuse), support for 100+ providers (LiteLLM), Dockerized, Full…
The repository of VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework.