
Data agents

5,097 Data AI agents indexed on MeshKore — the most complete public catalog, ranked by popularity and updated daily.


Top 100 Data agents

autoresearch 83,620

AI agents running research on single-GPU nanochat training automatically

OpenBB 68,138

Financial data platform for analysts, quants and AI agents.

MinerU 65,083

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

milvus 44,464

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

qlib 43,542

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with RD-Agent (https://github.com/microsoft/RD-Agent) to automate the R&D process.

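BettaFish 41,065

Weiyu (微舆): a multi-agent public-opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the original picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, with no framework dependencies.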

daily_stock_analysis 39,013

LLM-powered stock analysis system for A/H/US markets: multi-source market data, real-time news, an LLM decision dashboard, and multi-channel push notifications. Runs on a schedule at zero cost, entirely free to use.

khoj 34,724

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

PageIndex 32,198

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

FastGPT 28,159

FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

chroma 28,099

Search infrastructure for AI

RAG_Techniques 27,577

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.

dexter 26,553

An autonomous agent for deep financial research

DeepTutor 24,326

DeepTutor -- Agent-native, Open-sourced Personalized Tutoring. https://deeptutor.info/.

FinceptTerminal 24,197

FinceptTerminal is a modern finance application offering advanced market analytics, investment research, and economic data tools, designed for interactive exploration and data-driven decision-making in a user-friendly environment.

opendataloader-pdf 21,629

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

DeepResearch 18,967

Tongyi Deep Research, the Leading Open-source Deep Research Agent

memvid 15,570

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

unstructured 14,788

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.

easy-dataset 14,347

A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval

RD-Agent 13,224

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through R&D-Agent, which lets AI drive data-driven AI. 🔗https://aka.ms/RD-Agent-Tech-Report

llm-universe 13,113

A tutorial on LLM application development for beginner developers. Read online: https://datawhalechina.github.io/llm-universe/

AutoResearchClaw 12,753

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

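ai-goofish-monitor 12,180

A multi-task real-time and scheduled monitoring and intelligent analysis system for Xianyu (闲鱼), built with Playwright and AI and equipped with a full-featured admin UI. It helps users find the products they want among Xianyu's massive listings.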

LEANN 11,761

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

dolly 10,789

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

UI-TARS 10,756

Pioneering Automated GUI Interaction with Native Agents

electric 10,211

The agent platform built on sync.

ua-parser-js 10,127

UAParser.js - The Essential Web Development Tool for User-Agent Detection. Detect Browsers, OS, Devices, Bots, Apps, AI Crawlers, and more. Run in Browser (client-side) or Node.js (server-side).

Crucix 10,067

Your personal intelligence agent. Watches the world from multiple data sources and pings you when something changes.

unopim 9,928

Unopim is a free and open-source Laravel-based Product Information Management (PIM) system that helps businesses manage and enrich product data from a single platform. Built to scale beyond 10M+ products, now evolving with Agentic PIM capabilities.

phoenix 9,859

AI Observability & Evaluation

pyod 9,859

A Python library for anomaly detection across tabular, time series, graph, text, and image data. 60+ detectors, benchmark-backed ADEngine orchestration, and an agentic workflow for AI agents.

zvec 9,709

A lightweight, lightning-fast, in-process vector database

deeplake 9,140

Deeplake is an AI Data Runtime for Agents. It provides serverless Postgres with a multimodal datalake, enabling scalable retrieval and training.

hermes-webui 8,869

Hermes WebUI: The best way to use Hermes Agent from the web or from your phone!

Shadowbroker 8,845

Open-source intelligence for the global theater. Track everything from the corporate/private jets of the wealthy, and spy satellites, to seismic events in one unified interface. Hook an AI agent up to have it parse through data and find previously unseen correlations. The knowledge is available to all but rarely aggregated in the open, until now.

reor 8,563

Private & local AI personal knowledge management app for high entropy people.

datahaven 7,961

An EVM compatible Substrate chain, powered by StorageHub and secured by EigenLayer

all-in-rag 7,960

🔍 LLM Application Development in Practice, Part 1: a full-stack guide to RAG techniques. Read online: https://datawhalechina.github.io/all-in-rag/

deep-searcher 7,845

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

lab 7,359

A customisable 3D platform for agent-based AI research

browser-tools-mcp 7,217

Monitor browser logs directly from Cursor and other MCP compatible IDEs.

flyte 7,048

Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.

opencompass 7,033

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

vespa 6,926

AI + Data, online. https://vespa.ai

llm-scraper 6,749

Turn any webpage into structured data using LLMs

MeshCentral 6,600

A complete web-based remote monitoring and management website. Once set up, you can install agents and perform remote desktop sessions to devices on the local network or over the Internet.

nlp.js 6,572

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identification, and more

plano 6,546

Plano is an AI-native proxy and data plane for agentic apps, with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents' core logic.

rags 6,535

Build ChatGPT over your data, all with natural language

ChatLab 6,510

Local-first chat history analyzer with AI.

open-deep-research 6,238

An open source deep research clone. AI Agent that reasons over large amounts of web data extracted with Firecrawl

hermes-web-ui 6,178

Web dashboard for Hermes Agent — multi-platform AI chat, session management, scheduled jobs, usage analytics

kube-state-metrics 6,126

Add-on agent to generate and expose cluster-level metrics.

genkit 6,051

Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google

opensre 5,962

Build your own AI SRE agents. The open source toolkit for the AI era.

TaxHacker 5,920

Self-hosted AI accounting app. LLM analyzer for receipts, invoices, transactions with custom prompts and categories

AgentLaboratory 5,621

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

MineContext 5,335

MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)

superduper 5,281

Superduper: End-to-end framework for building custom AI applications and agents.

ai-data-science-team 5,230

An AI-powered data science team of agents to help you perform common data science tasks 10X faster.

sparrow 5,159

Structured data extraction and instruction calling with ML, LLM and Vision LLM

argilla 4,985

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

hermes-workspace 4,939

Native web workspace for Hermes Agent — chat, terminal, memory, skills, inspector.

ml-road 4,798

Machine Learning and Agentic AI Resources, Practice and Research

AutoRAG 4,793

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

llm-graph-builder 4,707

Neo4j graph construction from unstructured data using LLMs

solace-agent-mesh 4,679

An event-driven framework designed to build and orchestrate multi-agent AI systems. It enables seamless integration of AI agents with real-world data sources and systems, facilitating complex, multi-step workflows.

thunderbolt 4,659

AI You Control: Choose your models. Own your data. Eliminate vendor lock-in.

helix-db 4,580

HelixDB is an open-source graph-vector database built from scratch in Rust.

Olares 4,555

Olares: An Open-Source Personal Cloud to Reclaim Your Data

infinity 4,528

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

myGPTReader 4,421

A community-driven way to read and chat with AI bots - powered by chatGPT.

cognita 4,409

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

OpenAlice 4,260

Your one-person Wall Street. An AI trading agent covering equities, crypto, commodities, forex, and macro — from research through position entry, ongoing management, to exit.

m_flow 4,251

A bio-inspired cognitive memory engine — a new paradigm for Graph RAG.

llama_cloud_services 4,251

Knowledge Agents and Management in the Cloud

csghub 4,169

CSGHub is a brand-new open-source platform for managing LLMs, developed by the OpenCSG team. It offers both open-source and on-premise/SaaS solutions, with features comparable to Hugging Face. Gain full control over the lifecycle of LLMs, datasets, and agents, with Python SDK compatibility with Hugging Face. Join us! ⭐️

acme 3,988

A library of reinforcement learning components and agents

hermes-agent-orange-book 3,883

Hermes Agent from beginner to mastery · Orange Book series · a hands-on guide to the open-source AI agent framework from Nous Research

LazyLLM 3,833

Easiest and laziest way to build multi-agent LLM applications.

docetl 3,752

A system for agentic LLM-powered data processing and ETL

mesa 3,670

Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.

datadog-agent 3,629

Main repository for Datadog Agent

morphik-core 3,601

The most accurate document search and store for building AI apps

MIRIX 3,554

Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured memories, Mirix transforms raw inputs into a rich knowledge base that adapts to your digital experiences.

semantic-router 3,552

Superfast AI decision making and intelligent processing of multi-modal data.

Acontext 3,474

Agent Skills as a Memory Layer

awesome-hermes-agent 3,469

A curated list of awesome skills, tools, integrations, and resources for Hermes Agent by Nous Research

surf 3,421

Personal AI Notebooks. Organize files & webpages and generate notes from them. Open source, local & open data, open model choice (incl. local).

OB1 3,413

Open Brain — The infrastructure layer for your thinking. One database, one AI gateway, one chat channel — any AI plugs in. No middleware, no SaaS.

DeepResearchAgent 3,403

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coordinate multiple specialized lower-level agents, enabling automated task decomposition and efficient execution across diverse and complex domains.

LLMDataHub 3,386

A quick guide (especially) for trending instruction finetuning datasets

trulens 3,346

Evaluation and Tracking for LLM Experiments and AI Agents

Sidekick 3,246

A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.

distilabel 3,231

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

EvoScientist 3,183

🔬 Harness Vibe Research with Self-evolving AI Scientists

oracle-ai-developer-hub 3,113

Technical resources for AI developers to build applications, agents, and systems using Oracle AI Database and OCI services

LlamaIndexTS 3,079

Data framework for your LLM applications. Focused on server-side solutions.
