Top AI GitHub Repos Q2 2026

01

Agent Frameworks

Autonomous and multi-agent orchestration. The fastest-moving category in Q2 2026.

Build stateful, multi-step agent workflows with explicit graph control.

Why notableThe graph model beats linear chains for agents that need to loop, branch, and retry. Active development, strong production adoption.

~32,200 stars Python MIT

AutoGen

Multi-agent conversation framework. Agents talk to each other to complete tasks.

Why notablev0.4 shipped a complete rewrite with async-first architecture. Still the most cited framework in multi-agent research.

~58,100 stars Python CC BY 4.0

CrewAI

Role-based agent orchestration — assign agents jobs, watch them collaborate.

Why notableFastest-growing framework for non-research use cases. Simple API, large community, integrates with most LLM providers out of the box.

~51,600 stars Python MIT

smolagents

Minimal agent library built around code execution as the default action.

Why notableUnder 1,000 lines of core code. Code-first agents outperform tool-call agents on most benchmarks.

~27,400 stars Python Apache 2.0

openai-agents-python

OpenAI's own lightweight multi-agent SDK. Replaces Swarm.

Why notableOfficial library, active maintenance, clean handoff primitives. If you're building on the OpenAI API, this is now the canonical starting point.

~26,400 stars Python MIT

Agno

Build, run, and manage agent platforms. Focused on production deployment.

Why notableTargets the gap between "runs in a notebook" and "runs in production at scale." Good observability hooks built in.

~40,200 stars Python MIT

pydantic-ai

Agent framework built on top of Pydantic. Type-safe LLM outputs by default.

Why notablePydantic already owns structured output parsing. This extends that to full agent workflows. Production-grade from day one.

Python MIT

12-factor-agents

Design principles for production-grade LLM-powered software. Not a framework, a spec.

Why notableThe clearest articulation of what separates toy agents from production ones. Essential reading before picking a framework.

~19,900 stars Markdown MIT

MetaGPT

Multi-agent framework that simulates a software company — PM, engineer, QA all as agents.

Why notableResearch-forward, but increasingly practical. Best for complex code generation tasks that require multiple review passes.

~68,100 stars Python MIT

CopilotKit

Frontend stack for embedding agents and generative UI in React apps.

Why notableMakers of the AG-UI Protocol. The only serious framework for building agent interactions directly into web UIs.

~31,500 stars TypeScript MIT

02

LLM Serving and Inference

Run models faster and cheaper. The infrastructure layer for everyone building on open weights.

Ollama

Get open-weight models running locally in one command.

Why notableSupports Kimi-K2.5, GLM-5, MiniMax, DeepSeek, Qwen, Gemma, and more. The easiest on-ramp to local inference by a wide margin.

~171,600 stars Go MIT

llama.cpp

LLM inference in pure C/C++. Runs on CPUs, Apple Silicon, and consumer GPUs.

Why notableThe engine behind most local inference tools. Quantization quality and speed continue to improve quarter over quarter.

~110,600 stars C/C++ MIT

vLLM

High-throughput, memory-efficient LLM serving with PagedAttention.

Why notableThe default choice for serving open-weight models at scale. Continuous batching, OpenAI-compatible API, multi-GPU support.

~80,300 stars Python Apache 2.0

SGLang

High-performance serving for LLMs and multimodal models.

Why notableBenchmarks consistently faster than vLLM on certain workloads. Worth testing if you are latency-sensitive.

~27,900 stars Python Apache 2.0

BitNet

Official inference framework for 1-bit LLMs.

Why notable1-bit quantization is the most aggressive size reduction available. BitNet makes it practical on commodity hardware.

~39,000 stars C++ MIT

LiteLLM

Unified API proxy for 100+ LLM providers in OpenAI-compatible format.

Why notableSingle integration point for every major provider. Cost tracking, load balancing, and guardrails included. Production-proven.

~47,300 stars Python MIT

LocalAI

Run LLMs, vision, voice, image, and video models on any hardware without a GPU.

Why notableThe only single-binary solution for running all modalities locally. Privacy-first alternative to cloud APIs.

~46,300 stars Go MIT

03

Fine-Tuning and Training

Adapt open-weight models to specific domains and tasks.

LLaMA-Factory

Unified fine-tuning interface for 100+ LLMs and vision-language models.

Why notableLoRA, QLoRA, full fine-tune, and RLHF all in one tool. ACL 2024 paper. The most complete fine-tuning harness available.

~71,300 stars Python Apache 2.0

Unsloth

Fine-tune and run open models with a web UI. Supports Gemma 4, Qwen 3, DeepSeek.

Why notableDramatically reduces memory footprint vs. standard training. Studio UI makes it accessible without writing training scripts.

~64,500 stars Python Apache 2.0

PEFT

Parameter-efficient fine-tuning. LoRA, prefix tuning, prompt tuning, and more.

Why notableThe canonical library for LoRA. Integrates with Transformers, Diffusers, and Accelerate. Used by nearly every other fine-tuning tool.

~21,100 stars Python Apache 2.0

Axolotl

Config-driven fine-tuning for LLMs. Write YAML, not training loops.

Why notableBest option for teams who want reproducible, version-controlled training runs without custom code.

~11,900 stars Python Apache 2.0

easy-dataset

Build fine-tuning datasets from unstructured content with LLM assistance.

Why notableThe dataset problem is often the real bottleneck. This automates the extraction and formatting step.

~14,300 stars TypeScript MIT

H2O LLM Studio

No-code GUI for fine-tuning LLMs.

Why notableLowers the barrier for non-ML teams. Export to GGUF for local deployment. Solid option for domain-specific model builds.

~5,000 stars Python Apache 2.0

NeMo

End-to-end platform for LLM training, fine-tuning, and alignment.

Why notableEnterprise-grade. Used for training production models, not just fine-tuning. Requires NVIDIA hardware but has no ceiling on scale.

~14,000 stars Python Apache 2.0

TorchTune

PyTorch-native fine-tuning library. No abstractions above the framework.

Why notableMaximum control for teams that need to customize the training loop. Official PyTorch project with strong long-term support.

~5,000 stars Python BSD

04

RAG and Vector Infrastructure

Retrieval-augmented generation and the databases that make it work.

LlamaIndex

The leading document agent and OCR platform for RAG pipelines.

Why notableMoved well beyond naive chunking. Structured retrieval, agent-driven queries, and a large integrations ecosystem.

~49,500 stars Python MIT

mem0

Universal memory layer for AI agents. Persistent, queryable agent memory.

Why notableSolves the statelessness problem without requiring a custom vector store setup. Integrates with most agent frameworks.

~55,900 stars Python Apache 2.0

Milvus

High-performance, cloud-native vector database built for ANN search.

Why notableBattle-tested at scale. Handles billions of vectors. The production-grade option when Chroma becomes a bottleneck.

~44,300 stars Go/Python Apache 2.0

Qdrant

High-performance vector database and search engine, written in Rust.

Why notableRust performance, strong filtering capabilities, and a clean API. The preferred choice for teams prioritizing speed and reliability.

~31,400 stars Rust Apache 2.0

Chroma

Search infrastructure for AI. Embeddable, open-source vector store.

Why notableFastest path from zero to working RAG. Runs in-memory or persisted. The default starting point for most RAG prototypes.

~28,000 stars Python Apache 2.0

Haystack

Modular pipelines for context-engineered, production-ready LLM applications.

Why notableExplicit control over retrieval routing, memory, and generation. Built for teams that outgrow LangChain's abstractions.

~25,300 stars Python Apache 2.0

GraphRAG

Graph-based RAG. Builds knowledge graphs from documents before retrieval.

Why notableOutperforms standard RAG on complex, multi-hop questions. Slower to index but meaningfully better answers.

~33,000 stars Python MIT

RAG Techniques

Notebook-based showcase of advanced RAG patterns with runnable code.

Why notableNot a library — a learning resource. The fastest way to understand what's beyond naive chunking.

~27,400 stars Jupyter MIT

AnythingLLM

All-in-one AI productivity tool. On-device, privacy-first RAG with no setup.

Why notableThe non-technical user's path to private RAG. Desktop app, no cloud requirement, works with local and remote models.

~60,200 stars JavaScript MIT

05

Coding Agents and IDE Assistants

AI systems that write, review, and execute code.

Cline

Autonomous coding agent as SDK, IDE extension, or CLI.

Why notableDeep VS Code integration, full file system access, terminal execution. The most capable open-source coding agent in active use.

~61,900 stars TypeScript Apache 2.0

Aider

AI pair programming in your terminal. Works with any Git repo.

Why notablePolyglot, diff-aware, commit-native. Benchmarks consistently high on SWE-bench. The terminal-native alternative to IDE extensions.

~44,900 stars Python Apache 2.0

OpenHands

Open-source software development agent. Reads, writes, and runs code autonomously.

Why notableFull sandbox environment, browser control, and shell access. The open-source closest approximation to Devin.

~51,000 stars Python MIT

Continue

Source-controlled AI code checks, enforceable in CI.

Why notableShifted from pure IDE autocomplete to CI-enforced AI review. Adds AI gatekeeping to the PR pipeline.

~33,200 stars TypeScript Apache 2.0

Codex CLI

OpenAI's CLI coding agent. Point it at a task, it writes and runs code.

Why notableOfficial OpenAI release. Sandboxed execution, multi-file edits, works with any codebase.

~30,000 stars TypeScript MIT

Gemini CLI

Open-source AI agent bringing Gemini directly into your terminal.

Why notable104,000 stars in weeks after launch. The fastest-rising coding CLI of Q2 2026. Gemini 2.5 Pro backend with 1M token context.

~104,200 stars TypeScript Apache 2.0

06

Image Generation

Open-source image synthesis tools for professionals and builders.

Stable Diffusion Web UI

The original Stable Diffusion web interface. Still the most widely used.

Why notablePlugin ecosystem is enormous. Every new model, ControlNet, and sampler ships support here first.

~163,100 stars Python AGPL-3.0

ComfyUI

Node-based Stable Diffusion UI and inference backend.

Why notablePipelines are graphs — infinitely composable, exportable, and reproducible. Industry standard for production image workflows.

~75,000 stars Python GPL-3.0

Fooocus

Simplified Stable Diffusion UI focused on prompting and generating.

Why notableZero config, just prompt and generate. Best quality-to-effort ratio for users who don't need node control.

~48,500 stars Python GPL-3.0

InvokeAI

Professional creative engine for Stable Diffusion. Industry-grade web UI.

Why notableThe production tool used by professional artists and studios. Canvas, workflow editor, and team features.

~27,200 stars Python Apache 2.0

Forge

Optimized fork of AUTOMATIC1111 with lower VRAM usage.

Why notableRuns newer models (Flux, SD3.5) faster on consumer hardware. Drop-in replacement for A1111 with meaningfully better performance.

~12,600 stars Python AGPL-3.0

Diffusers

State-of-the-art diffusion model library for image, video, and audio.

Why notableThe code layer that most serious tools build on. Start here to understand how generation works or build custom pipelines.

~33,600 stars Python Apache 2.0

stable-diffusion.cpp

Diffusion model inference in pure C/C++. Supports Flux, Wan, SD, Qwen Image.

Why notableRuns on CPU and low-end hardware. The llama.cpp equivalent for image generation.

~6,000 stars C++ MIT

07

Video Generation

Open-source video synthesis and motion models.

Wan2.1

Open and advanced large-scale video generative models from Alibaba.

Why notableThe highest-quality open-weight video model available as of Q2 2026. Outperforms most commercial APIs on standard benchmarks.

~16,100 stars Python Apache 2.0

LTX-Video

Real-time-class video generation. Official Lightricks repository.

Why notableFirst open model to hit near-real-time generation speeds. The last_frame_uri parameter enables true seamless loops.

~10,300 stars Python Apache 2.0

AnimateDiff

Animate any personalized text-to-image model without specific tuning.

Why notableThe original plug-and-play motion adapter. Still the reference implementation for controlled animation from image inputs.

~12,100 stars Python Apache 2.0

CogVideo

Open-source video generation models from Tsinghua University.

Why notableCogVideoX series delivers strong text-to-video quality from a research lab without a commercial API paywall.

~12,000 stars Python Apache 2.0

Open-Sora

Open-source attempt to replicate and improve on OpenAI's Sora.

Why notableTraining code included. The only open project that lets you train video generation models at scale.

~25,000 stars Python Apache 2.0

Mochi

High-fidelity video generation models from Genmo.

Why notablePrioritizes motion quality and physical plausibility over raw resolution. Good for realistic motion content.

~3,700 stars Python Apache 2.0

08

Voice and Audio

Open-source TTS, STT, voice conversion, and audio generation.

Whisper

Robust speech recognition via large-scale weak supervision.

Why notableStill the most accurate open ASR model. Multi-language, multi-task, runs locally. The baseline everything else is compared to.

~99,600 stars Python MIT

faster-whisper

Whisper re-implementation with CTranslate2. Significantly faster inference.

Why notable4x faster than the original Whisper with comparable accuracy. The production choice for transcription workloads.

~23,000 stars Python MIT

WhisperX

Whisper with word-level timestamps and speaker diarization.

Why notableAdds the two features standard Whisper lacks. Indispensable for subtitle generation and meeting transcription.

~21,900 stars Python BSD

F5-TTS

Flow-matching TTS — ultra-realistic speech synthesis.

Why notableAmong the best open-source TTS models on naturalness benchmarks. Fast inference, clean API.

~14,500 stars Python CC BY-NC

CosyVoice

Multi-lingual large voice generation model from Alibaba.

Why notableSupports zero-shot voice cloning across languages. Full training and deployment stack included.

~21,100 stars Python Apache 2.0

DIA

TTS model capable of generating ultra-realistic dialogue in one pass.

Why notableDialogue-native — handles conversation, interruptions, and speaker transitions that break standard TTS systems.

~19,300 stars Python Apache 2.0

Kokoro

82M parameter TTS model. Fast, local, surprisingly high quality.

Why notableRuns on CPU in real time. The lightweight TTS option when ElevenLabs latency or cost is a problem.

~7,100 stars Python Apache 2.0

pipecat

Framework for voice and multimodal conversational AI.

Why notableHandles the real-time audio pipeline — VAD, STT, LLM, TTS, all stitched together. Building voice agents without pipecat is building your own media server.

~12,300 stars Python BSD

LiveKit Agents

Framework for realtime voice AI agents with video and audio.

Why notableLiveKit infrastructure plus agent SDK. The production path for voice agents that need WebRTC-grade reliability.

~10,500 stars Python Apache 2.0

fish-speech

State-of-the-art open-source TTS with voice cloning.

Why notableOne of the best-sounding open TTS models. Fast inference, multilingual, active development.

~30,400 stars Python CC BY-NC-SA

09

MLOps and LLM Observability

Track what your models do in production. Catch regressions. Control costs.

Langfuse

Open-source LLM engineering platform. Observability, evals, prompt management.

Why notableThe most complete open-source LLMOps tool. OpenTelemetry-based tracing, session-level analysis, prompt versioning. YC W23.

~27,300 stars TypeScript MIT

MLflow

Open-source AI engineering platform for agents, LLMs, and ML models.

Why notableDebugging, evaluation, monitoring, and cost control in one platform. LLM tracing is now production-grade.

~26,000 stars Python Apache 2.0

Helicone

One-line LLM observability. Monitor, evaluate, and experiment.

Why notableLiterally one line of code to instrument. Best time-to-first-insight of any tool in the category. YC W23.

~5,700 stars TypeScript Apache 2.0

AgentOps

Observability and testing for AI agents. Session replay, cost tracking, evals.

Why notableAgent-specific. Tracks multi-step sessions, not just individual LLM calls. The Datadog equivalent for agent workflows.

~3,500 stars Python MIT

OpenLLMetry

Open-source observability for GenAI based on OpenTelemetry.

Why notableStandard-based. If your stack already uses OpenTelemetry, this is the cleanest integration path.

~7,100 stars Python Apache 2.0

Coze Loop

Full-lifecycle AI agent optimization — development, debugging, eval, monitoring.

Why notableOne of the few tools that covers the full loop from prompt to production. Strong debugging UI for complex agent pipelines.

~5,500 stars Go Apache 2.0

10

Browser and UI Automation

AI agents that operate the web.

browser-use

Make websites accessible for AI agents. Automate tasks online.

Why notableThe dominant Python library for LLM-driven browser control. 94,000 stars is not inflated — this is the tool the community converged on.

~94,300 stars Python MIT

Stagehand

SDK for browser agents. Built for TypeScript.

Why notableThe TypeScript-native alternative to browser-use. Robust session management, cloud-friendly, strong abstraction layer.

~22,700 stars TypeScript MIT

Skyvern

Automate browser-based workflows with AI. Vision-first approach.

Why notableUses vision to understand pages instead of DOM parsing. Works on sites that break selector-based automation.

~21,600 stars Python AGPL-3.0

OmniParser

Parse and understand any UI screenshot for agent interaction.

Why notableConverts arbitrary UI screenshots into structured data an agent can act on. The universal UI understanding layer.

~35,000 stars Python CC BY-NC-SA

UI-TARS Desktop

Open-source multimodal AI agent stack from Bytedance.

Why notableHandles the full stack from visual understanding to action execution. Production-grade, open-sourced by Bytedance.

~34,400 stars TypeScript Apache 2.0

chrome-devtools-mcp

Chrome DevTools for coding agents via MCP.

Why notableOfficial Chrome team release. Gives agents direct access to DevTools protocol — debugging, network inspection, DOM access.

~39,800 stars TypeScript Apache 2.0

playwright-mcp

Playwright as an MCP server. Browser automation for Claude and other agents.

Why notableOfficial Microsoft release. The standard way to wire Playwright into Claude Code and other MCP-compatible agents.

~15,000 stars TypeScript MIT

Open Interpreter

Natural language interface for computers. Code execution, file system, browser.

Why notable63,000 stars, years of production use. The original "LLM with computer access" project. Still the most general-purpose option.

~63,600 stars Python AGPL-3.0

11

MCP Servers

The Model Context Protocol ecosystem — servers that give agents real-world capabilities.

MCP Servers (official)

The official MCP server reference collection from Anthropic.

Why notableFilesystem, GitHub, Slack, memory, fetch, and more — all in one repo. The canonical starting point for understanding what MCP can do.

~85,800 stars TypeScript MIT

github-mcp-server

GitHub's official MCP server. Full GitHub API access for agents.

Why notableOfficial GitHub release. Agents can read issues, create PRs, search code, and manage repos without custom integration work.

~29,900 stars Go MIT

Context7

Up-to-date code documentation for LLMs and AI code editors.

Why notableSolves the hallucination-from-stale-docs problem. Pulls live, version-specific documentation into the agent's context at query time.

~55,500 stars TypeScript MIT

Serena

MCP toolkit for coding — semantic retrieval and editing capabilities.

Why notableSemantic code search and structured editing over an MCP interface. The IDE capability layer for coding agents.

~24,300 stars Python MIT

GPT Researcher

Autonomous agent for deep research on any topic using any LLM.

Why notableMCP-compatible research agent. Gathers, synthesizes, and reports from the web autonomously.

~27,100 stars Python MIT

Activepieces

AI agents, MCPs, and workflow automation with ~400 MCP server integrations.

Why notablen8n-style workflow automation with deep MCP integration. The broadest set of ready-made agent action connectors.

~22,200 stars TypeScript MIT

n8n-mcp

MCP server for building n8n workflows via Claude Code or Cursor.

Why notableLets agents construct n8n automation flows through natural language. Practical bridge between agent instructions and workflow automation.

~21,000 stars TypeScript MIT

Top AI GitHub Repos to Save This Quarter