2026-04 · 172 projects · Ranked by HubLens score
FlashMLA is a library of high-performance attention kernels specifically designed to power DeepSeek-V3 and DeepSeek-V3.2 models. It provides optimized implementations for both sparse and dense attention mechanisms during prefill and decoding stages. The library supports advanced features like FP8 KV cache and is compatible with various GPU architectures including SM90 and SM100.
PaddlePaddle is a comprehensive industrial deep learning platform that provides core frameworks, model libraries, and end-to-end development tools. It supports advanced features like unified dynamic and static graphs, automatic parallelism, and high-order differentiation for scientific computing. The platform is designed to facilitate large-scale model training and inference across diverse industrial sectors.
PaddleOCR is a comprehensive toolkit designed to convert images and PDF documents into structured, LLM-ready data formats like Markdown and JSON. It features state-of-the-art vision-language models and high-performance text recognition engines that support over 100 languages. The platform is widely integrated into major AI agent and RAG frameworks, offering efficient deployment options across various hardware backends.
ncnn is a high-performance neural network forward computation framework deeply optimized for mobile platforms. The framework has no third-party dependencies and features cross-platform capabilities, outperforming all known open-source frameworks on mobile CPUs. Developers can easily port deep learning models to mobile devices using ncnn to build various intelligent applications.
ncnn is a high-performance neural network forward computation framework deeply optimized for mobile platforms. The framework has no third-party dependencies and features cross-platform capabilities, outperforming all known open-source frameworks on mobile CPUs. Developers can easily port deep learning models to mobile devices using ncnn to build various intelligent applications.
PaddlePaddle is a comprehensive industrial deep learning platform that provides core frameworks, model libraries, and end-to-end development tools. It supports advanced features like unified dynamic and static graphs, automatic parallelism, and high-order differentiation for scientific computing. The platform is designed to facilitate large-scale model training and inference across diverse industrial sectors.
Thinking with Visual Primitives introduces a novel approach to Multimodal Large Language Models by interleaving spatial markers directly into the reasoning process. This method addresses the reference gap in complex structural tasks by anchoring abstract language to concrete physical coordinates. The framework achieves frontier-competitive performance while maintaining high visual token efficiency through a compressed architecture.
Page Agent is a client-side library that enables natural language control of web interfaces directly within the browser. It utilizes text-based DOM manipulation to interact with elements without requiring screenshots or complex headless browser setups. Developers can easily integrate this tool to build AI copilots, automate form filling, or enhance web accessibility.
OpenSandbox is a versatile sandbox platform designed for AI applications, supporting diverse runtimes like Docker and Kubernetes. It provides multi-language SDKs and a unified API to facilitate tasks such as code execution, agent evaluation, and browser automation. The platform ensures secure isolation through container runtimes while offering robust network controls and lifecycle management.
The skills CLI provides a unified interface for managing reusable instruction sets across a wide range of coding agents. It allows developers to easily install, update, and remove skills from various sources including GitHub, GitLab, and local directories. By standardizing skill definitions through YAML-based markdown files, it enables consistent agent behavior across different development environments.
World Monitor is an AI-powered platform that aggregates global news, geopolitical data, and infrastructure tracking into a unified situational awareness interface. It features a dual-map engine with 45 data layers and supports local AI processing via Ollama for enhanced privacy. The project provides a native desktop experience across multiple platforms and offers specialized variants for finance, technology, and commodity monitoring.
TileKernels provides a collection of high-performance GPU kernels specifically designed for large language model operations using the TileLang framework. The project includes specialized implementations for Mixture of Experts routing, advanced quantization techniques, and manifold hyper-connection operations. These kernels are built to maximize hardware performance and are currently utilized in internal training and inference workflows.
OpenSandbox is a versatile sandbox platform designed for AI applications, supporting diverse runtimes like Docker and Kubernetes. It provides multi-language SDKs and a unified API to facilitate tasks such as code execution, agent evaluation, and browser automation. The platform ensures secure isolation through container runtimes while offering robust network controls and lifecycle management.
MNN is a high-performance, lightweight deep learning framework designed for efficient model inference and training on mobile and embedded devices. It supports a wide range of neural network architectures and provides versatile tools for model conversion, compression, and general-purpose computation. The framework is widely used in production environments, including various Alibaba applications, to enable device-cloud collaborative machine learning.
Weft is a programming language designed to integrate LLMs, human interactions, and infrastructure into a unified, visual workflow. It features durable execution to ensure programs survive crashes and supports complex logic through a typed, modular node system. Developers can build and manage sophisticated agentic systems by wiring together native nodes without the need for manual plumbing.
DeepEP is a high-performance communication library designed for modern machine learning training and inference, specifically focusing on expert parallelism. The library utilizes a lightweight Just-In-Time compilation module and the NCCL Gin backend to deliver high-throughput, low-latency GPU kernels. It supports advanced features like pipeline parallelism and remote memory access while significantly reducing SM resource consumption compared to previous versions.
DeepGEMM is a unified CUDA library providing high-performance tensor core kernels specifically optimized for modern large language models. It features a lightweight Just-In-Time compilation module that eliminates the need for CUDA compilation during installation. The library delivers expert-tuned performance for various matrix operations, including FP8, FP4, and BF16 GEMMs, as well as fused MoE and MQA scoring.
DeerFlow is an open-source super agent harness designed to orchestrate sub-agents, memory, and sandboxes for complex task execution. The platform features a ground-up rewrite in version 2.0, offering enhanced extensibility through a modular skill and tool architecture. It supports diverse deployment environments, including local development and Docker-based production setups, with integrated support for multiple messaging channels.
WeKnora is an open-source, LLM-powered framework designed for enterprise-grade document understanding, semantic retrieval, and autonomous reasoning. It features a ReAct agent for complex multi-step tasks and a Wiki mode that distills raw documents into a structured, interlinked knowledge base. The platform supports multi-source data ingestion, various LLM integrations, and flexible deployment options to ensure complete data sovereignty.
RuView is an edge-based sensing platform that utilizes WiFi Channel State Information (CSI) to detect human presence, vital signs, and activities without the need for cameras or wearables. The system processes radio signal disturbances through low-cost ESP32 hardware to provide real-time spatial intelligence and environment mapping. It supports advanced features like 3D point cloud generation, pose estimation, and persistent data storage using local neural networks.
Hermes WebUI provides a lightweight, dark-themed browser interface that offers full parity with the Hermes Agent CLI. It features a three-panel layout for chat, file management, and session navigation without requiring complex build steps or frameworks. Users can securely access their self-hosted agent via SSH tunnels or mobile devices while maintaining persistent memory and cross-session context.
Voicebox is a comprehensive, local-first voice synthesis studio that allows users to clone voices and generate speech using seven different TTS engines. The platform features a multi-track timeline editor for creating complex narratives and supports advanced post-processing effects to refine audio output. Designed for privacy and performance, it runs natively on major operating systems while providing a robust REST API for developer integrations.
skills-manage is a Tauri-based desktop application designed to centralize the management of AI coding agent skills across multiple platforms. It utilizes a single source of truth to drive various AI tools through symlinks, supporting a wide range of coding and lobster-related platforms. The application features a comprehensive interface for browsing marketplaces, organizing collections, and performing local discovery of skill libraries.
CC Switch is a desktop application designed to centralize the management of Claude Code, Codex, Gemini CLI, OpenCode, and OpenClaw. It eliminates the need for manual configuration file editing by providing a visual interface with over 50 built-in provider presets and system tray quick-switching. The tool also features unified management for MCP servers, prompts, and skills, alongside cross-device cloud synchronization.
CL4R1T4S is a comprehensive repository dedicated to exposing the hidden system prompts, guidelines, and tools used by major AI models and agents. By documenting these unseen instructions, the project aims to provide users with a clearer understanding of the underlying frameworks that shape AI behavior and decision-making. The platform encourages community contributions to maintain an up-to-date collection of extracted system prompts from various industry-leading AI providers.
This repository provides a curated collection of DESIGN.md files that define the visual identity and design systems of popular websites. These markdown-based documents allow AI coding agents to understand and replicate specific UI styles without needing complex tooling or Figma exports. Each entry includes detailed design tokens, typography rules, and component styling to ensure consistent and pixel-perfect AI-generated interfaces.
RAG-Anything is a comprehensive framework designed to process and query diverse document types including text, images, tables, and mathematical equations. Built on LightRAG, it provides an end-to-end pipeline that integrates multimodal content into a unified knowledge graph for intelligent retrieval. This system eliminates the need for multiple specialized tools by offering a single, cohesive interface for complex document analysis.
OpenClaude is an open-source coding-agent CLI that supports a wide range of cloud and local model providers. It offers a unified terminal-first workflow featuring tools for file management, bash execution, and agentic tasks. Users can easily integrate various backends, including OpenAI, Ollama, and Gemini, while leveraging advanced features like agent routing and gRPC support.
Claude Code Game Studios transforms a standard AI coding session into a structured, professional game development environment. It utilizes a hierarchy of 49 specialized agents, 72 workflow skills, and automated validation hooks to maintain project organization and quality. The system ensures developers remain in control while benefiting from expert-level guidance across design, programming, and production phases.
AgentKit Code Workshop is an AI Agent development platform sample repository launched by Volcengine, designed to help developers quickly master the construction and deployment of intelligent agents. The project provides a variety of code examples ranging from basic introductions to complex scenarios, covering core functions such as multi-agent collaboration, RAG retrieval enhancement, and tool invocation. Developers can use these tutorials to gain an in-depth understanding of the AgentKit development toolchain and integrate it efficiently into various business applications.
Slime is a specialized post-training framework designed to scale reinforcement learning for large language models. It integrates Megatron-LM for high-performance training with SGLang to provide flexible, efficient data generation workflows. The architecture decouples training and rollout processes, enabling researchers to build and deploy complex agentic RL systems.
Omi is an open-source platform that functions as a second brain by capturing and transcribing your screen and conversations in real-time. It provides AI-driven summaries, action items, and a chat interface that remembers everything you have seen or heard. The system supports cross-platform integration across desktop, mobile devices, and specialized AI wearables.
ANOLISA is an evolution of Anolis OS designed specifically to support AI agent workloads at the server-side operating system level. The project provides a comprehensive suite of components including an AI-powered terminal, security kernels, and observability tools. Users can easily integrate these features into their systems through standard RPM package installations.
Toonflow-app is an AI workbench designed for short drama production, achieving full-process automation from script to video through an infinite canvas and a three-layer Agent collaboration system. The platform supports chapter event graph-driven adaptation and provides a programmable provider system to flexibly integrate various AI models. Users can leverage its persistent memory system and modular skill configuration to significantly improve the efficiency and consistency of short drama creation.
PaddleX 3.0 is a low-code development tool built on the PaddlePaddle framework, integrating a vast array of out-of-the-box pre-trained models to support full-process development. Through a minimalist Python API and a graphical interface, the tool enables rapid implementation from model training to inference deployment. Furthermore, it is widely compatible with mainstream domestic and international hardware, helping developers efficiently complete industrial practices.
ROLL is an efficient, user-friendly library designed for scaling reinforcement learning workflows for large language models across large-scale GPU clusters. It supports diverse training paradigms including RLVR, agentic interaction, and distillation, while integrating advanced backends like Megatron-Core, vLLM, and SGLang. The framework provides robust observability and flexible resource management to enhance performance in complex reasoning and human preference alignment tasks.
Hermes Agent is a self-improving AI assistant designed by Nous Research that creates and refines skills through a built-in learning loop. It supports a wide range of LLM providers and can be deployed across various platforms including Telegram, Discord, and local terminal environments. The system features persistent memory, scheduled automations, and the ability to spawn subagents for complex, parallelized tasks.
FastDeploy is an inference deployment toolkit for large language models and vision-language models based on PaddlePaddle, designed to provide out-of-the-box production-grade deployment solutions. This tool supports various mainstream hardware platforms and integrates load-balanced PD separation, unified KV cache transmission, and multiple advanced acceleration technologies. Developers can achieve rapid deployment through OpenAI API-compatible interfaces and optimize inference performance using full quantization format support.
RTP-LLM is a high-performance LLM inference acceleration engine developed by the Alibaba Foundation Model Inference team. This engine has been widely applied in various Alibaba business scenarios such as Taobao and Tmall, supporting multiple mainstream model formats and hardware backends. It provides efficient production-level services for large language models by integrating advanced operator optimization, quantization techniques, and distributed inference capabilities.
The OpenAI Agents SDK is a lightweight framework designed for building complex multi-agent workflows. It supports a wide range of LLMs and provides essential features like tool integration, guardrails, and human-in-the-loop capabilities. Developers can also utilize sandbox agents for long-running tasks and leverage built-in tracing to debug and optimize their agentic applications.
Claude Code Local provides a suite of high-performance AI models that run entirely on Apple Silicon hardware without requiring cloud connectivity. The project features a native MLX server that enables local execution of Claude Code, browser automation, and voice interaction while ensuring complete data privacy. By eliminating outbound network calls and telemetry, it offers a secure, air-gapped environment for handling sensitive professional tasks.
Protenix is an open-source framework designed for high-accuracy biomolecular structure prediction, offering models that perform competitively with state-of-the-art methods. The project provides multiple versions, including the enhanced Protenix-v2, which demonstrates significant improvements in antibody-antigen structure prediction and ligand-related plausibility. It is released under the Apache 2.0 license, making it freely accessible for both academic and commercial research applications.
MedgeClaw is an open-source biomedical research assistant that integrates OpenClaw and Claude Code to automate complex scientific workflows. Users can interact with the system via messaging platforms like WhatsApp, Slack, or Discord to trigger analyses in R and Python environments. The platform provides a real-time research dashboard for monitoring progress, viewing code, and accessing interactive outputs.
Open Agents is an open-source reference application designed for building and running background coding agents on the Vercel platform. The system utilizes a three-layer architecture that separates the web interface, durable agent workflows, and isolated sandbox execution environments. This modular design allows developers to perform complex coding tasks, such as repository management and automated pull requests, without requiring active local machine involvement.
EvoCUA is a high-performance open-source multimodal model designed for end-to-end computer automation across various desktop applications. It currently holds the top ranking on the OSWorld benchmark and demonstrates superior cross-OS generalization capabilities. Additionally, the model is recognized for its robust safety profile, exhibiting the lowest unintended-behavior rate among leading computer-use agents.
AngelSlim is a highly integrated toolkit designed to provide efficient compression solutions for large language, vision, and diffusion models. It supports a wide range of techniques including advanced quantization, speculative decoding, and token pruning to optimize model performance. The framework offers developers a unified interface for training, deployment, and performance evaluation across various hardware environments.
Humanizer-zh is a skill tool designed for Claude Code that helps users identify and remove common AI-generated traces in text. The project analyzes 24 AI writing patterns to guide users in rewriting mechanical content into more natural and personalized human expressions. It not only provides automated rewriting features but also helps creators improve the authenticity and readability of their articles through specific writing principles.
Tair KVCache is an Alibaba Cloud system designed to accelerate Large Language Model inference through distributed memory pooling and dynamic multi-level caching. The project provides a centralized manager for global KVCache metadata and storage capacity, ensuring efficient data reliability and resource utilization. Additionally, it includes a high-fidelity simulation tool that allows developers to predict performance metrics without requiring actual GPU resources.
Chrome DevTools for Agents is an MCP server that enables AI coding assistants to control and inspect live Chrome browser instances. It provides a comprehensive suite of tools for browser automation, performance analysis, and in-depth debugging. The project supports seamless integration with various AI coding platforms to enhance developer workflows through reliable browser interaction.
MedgeClaw is an open-source biomedical research assistant that integrates OpenClaw and Claude Code to automate complex scientific workflows. Users can interact with the system via messaging platforms like WhatsApp, Slack, or Discord to trigger analyses in R and Python environments. The platform provides a real-time research dashboard for monitoring progress, viewing code, and accessing interactive outputs.
OpenCLI transforms websites, browser sessions, and desktop applications into deterministic command-line interfaces for both humans and AI agents. It leverages existing browser authentication to provide secure, reliable automation without requiring additional credentials. Users can utilize over 90 built-in adapters or create custom ones to streamline workflows and integrate external tools into a unified CLI hub.
Agent Sprite Forge is a tool designed to convert natural-language prompts into game-ready 2D sprites and layered maps using Codex. It automates the asset pipeline by combining AI image generation with deterministic local post-processing for cleanup and export. The system supports various outputs, including animation sheets, transparent GIFs, collision data, and complex scene layouts.
Harmonist is a portable multi-agent framework that enforces development protocols through mechanical IDE-level hooks rather than relying on LLM prompts. It provides a structured, validated memory system and supply-chain verification to ensure that code changes meet non-negotiable quality and security standards. The framework integrates seamlessly with popular AI coding assistants like Cursor and Claude Code, offering a catalogue of 186 specialized agents without requiring external runtimes or databases.
Pipcook is a modular JavaScript application framework designed to help front-end engineers integrate machine learning into their workflows. It provides a comprehensive pipeline system that allows users to train, validate, and deploy machine learning models directly within the Node.js environment. By bridging access to Python packages, the framework enables developers to leverage powerful machine learning tools without requiring deep expertise in the field.
This tutorial provides a comprehensive guide for users to build an AI work assistant from scratch, covering installation, configuration, core features, and advanced techniques. The content is proofread based on the stable OpenClaw v2026.4.14 version and offers multiple deployment options to meet various scenario requirements. Through rich practical cases and detailed command cheat sheets, it helps users achieve a significant boost in personal efficiency.
Index-AniSora is a powerful open-source framework designed specifically for high-quality anime video generation and animation production. The system features a comprehensive data processing pipeline, a controllable generation model with spatiotemporal masking, and a specialized evaluation benchmark. It supports diverse creative tasks including character 3D generation, video style transfer, and multimodal guidance for precise motion control.
Humanizer-zh is a skill tool designed for Claude Code that helps users identify and remove common AI-generated traces in text. The project analyzes 24 AI writing patterns to guide users in rewriting mechanical content into more natural and personalized human expressions. It not only provides automated rewriting features but also helps creators improve the authenticity and readability of their articles through specific writing principles.
OpenMontage is an open-source, agentic system that transforms AI coding assistants into comprehensive video production studios. It automates the entire creative workflow, including research, scripting, asset generation, editing, and final composition. The platform supports both AI-generated visuals and real-footage documentary montages using a variety of free and premium tools.
OpenDataLoader PDF is a high-performance, open-source parser designed to convert PDF documents into structured formats like Markdown, JSON, and HTML for AI and RAG pipelines. It features a hybrid processing mode that combines deterministic local parsing with AI-driven analysis to achieve industry-leading extraction accuracy for complex tables, formulas, and scanned documents. Additionally, the project provides automated accessibility solutions, including end-to-end Tagged PDF generation compliant with international standards.
AutoFlow is an open-source knowledge base tool that utilizes graph RAG technology built on TiDB Vector, LlamaIndex, and DSPy. The platform provides a Perplexity-style conversational search experience powered by an advanced built-in website crawler. Users can also integrate a customizable search widget into their own websites using a simple JavaScript snippet.
ROCK is a scalable environment management framework designed specifically for agentic reinforcement learning applications. It utilizes a client-server architecture with robust isolation mechanisms to ensure stable and secure sandbox operations. The platform provides a unified SDK and is fully compatible with GEM protocols to standardize environment interactions.
Xiaomi Miloco is an open-source smart home solution that utilizes on-device large language models to integrate and control IoT devices. By leveraging camera data streams, the system enables natural language interaction for complex home automation and event analysis. It prioritizes user privacy by performing visual understanding and task planning locally on the user's hardware.
GBrain provides a persistent, self-wiring knowledge graph that enables AI agents to store and retrieve complex information across meetings, emails, and documents. The system automatically extracts entity relationships and maintains a structured timeline, allowing agents to answer queries that standard vector search cannot reach. By utilizing a durable job queue and modular skill system, it ensures that agents become smarter and more reliable over time.
VoxCPM2 is a tokenizer-free, 2B parameter text-to-speech system that utilizes a diffusion autoregressive architecture to generate high-quality, expressive audio. The model supports 30 languages and offers advanced capabilities including voice design, controllable voice cloning, and studio-quality 48kHz output. It is fully open-source under the Apache-2.0 license and provides production-ready deployment options via vLLM-Omni and Nano-vLLM.
pi-autoresearch is an extension for the pi AI coding agent that enables autonomous optimization loops by testing, benchmarking, and refining code changes. It supports various optimization targets such as test speed, bundle size, and LLM training metrics through a persistent session workflow. The tool includes a live dashboard, confidence scoring to filter out noise, and the ability to finalize experiments into clean, reviewable branches.
TimesFM is a decoder-only foundation model developed by Google Research specifically for time-series forecasting tasks. The latest 2.5 version features a 200M parameter architecture that supports up to 16k context length and continuous quantile forecasting. The repository provides comprehensive tools for inference, fine-tuning with LoRA, and integration with agentic workflows.
Caveman is a specialized plugin for AI agents that significantly reduces output token usage by enforcing a concise, telegraphic communication style. It maintains full technical accuracy while cutting approximately 75% of output tokens and 46% of input tokens through its compression tools. The project supports various agents including Claude Code, Cursor, and Gemini, offering multiple intensity levels and specialized modes like 文言文.
The OpenClaw Chinese Edition provides full Chinese interface support for the open-source personal AI assistant platform, covering both the CLI tool and the Dashboard web console. This project automatically synchronizes with official updates every hour, ensuring users can experience the latest features and enjoy a deeply localized interface. Additionally, the project includes the ClawPanel management console and ClawApp mobile client, significantly enhancing the usability and cross-platform interaction capabilities of the AI assistant.
Cua provides a unified ecosystem for building, benchmarking, and deploying autonomous agents capable of interacting with computer interfaces. The platform includes specialized tools for background macOS automation, cross-platform sandboxing, and high-performance virtualization. Developers can leverage these components to create agents that perform tasks, execute code, and navigate complex GUI environments seamlessly.
OpenSpec is a lightweight specification framework designed to align human intent with AI coding assistants before implementation begins. It organizes development changes into structured folders containing proposals, technical designs, and implementation tasks. The tool integrates with over 20 existing AI coding assistants to provide a predictable and fluid development workflow.
EvoCUA is a high-performance open-source multimodal model designed for end-to-end computer automation across various desktop applications. It currently holds the top ranking on the OSWorld benchmark and demonstrates superior cross-OS generalization capabilities. Additionally, the model is recognized for its robust safety profile, exhibiting the lowest unintended-behavior rate among leading computer-use agents.
Web-Bench is a comprehensive benchmark designed to evaluate how effectively large language models handle real-world web development tasks. It consists of 50 complex projects featuring sequential dependencies that simulate professional engineering workflows. The benchmark provides a challenging environment where even state-of-the-art models currently demonstrate significant room for improvement.
PaddleCustomDevice is the custom hardware integration solution provided by the PaddlePaddle framework. Through standardized interface design, this project enables developers to integrate various third-party hardware backends into the PaddlePaddle ecosystem. It currently covers support for mainstream hardware platforms including Ascend, Cambricon, Intel GPU, and Apple MPS.
Rowboat is an open-source AI coworker that integrates with your email and meeting notes to build a persistent, local knowledge graph. It utilizes this context to assist with tasks like drafting documents, preparing for meetings, and tracking projects while keeping all data in an editable Markdown format. By maintaining long-lived memory on your machine, it allows for compounding context that improves over time without relying on external cloud storage.
LiteRT-LM is a high-performance, production-ready inference framework designed by Google for deploying Large Language Models on edge devices. It supports a wide range of platforms including Android, iOS, desktop, and IoT, while leveraging GPU and NPU hardware acceleration for optimal performance. The framework enables advanced capabilities such as multi-modality and function calling, powering on-device AI experiences in various Google products.
JaQMC is a modular, JAX-based framework designed for performing neural network quantum Monte Carlo simulations. It utilizes deep neural networks as variational wavefunctions to solve the electronic Schrödinger equation without relying on traditional basis sets. The project supports various quantum systems, including molecules, solids, and fractional quantum Hall states, through a highly configurable and extensible architecture.
fireworks-tech-graph enables users to generate professional SVG and PNG technical diagrams directly from natural language descriptions. The tool supports 14 UML diagram types and includes 7 distinct visual styles tailored for various documentation needs. It is specifically optimized for AI and agent-based domain patterns, allowing for rapid visualization without manual drawing.
oh-my-claudecode provides a multi-agent orchestration layer designed to enhance the Claude Code experience with zero learning curve. It enables advanced features like team-based task execution, intelligent model routing, and persistent autonomous workflows directly within your terminal. The tool simplifies complex development tasks by automating delegation, parallelization, and Socratic requirement clarification.
Qwen Code is an open-source AI agent designed to operate directly within the terminal to help developers understand codebases and automate tasks. It supports multiple authentication methods, including Qwen OAuth and various OpenAI-compatible API providers, to offer flexible model integration. The tool provides a feature-rich agentic workflow and can be integrated into popular IDEs like VS Code, Zed, and JetBrains.
Pipcook is a modular JavaScript application framework designed to help front-end engineers integrate machine learning into their workflows. It provides a comprehensive pipeline system that allows users to train, validate, and deploy machine learning models directly within the Node.js environment. By bridging access to Python packages, the framework enables developers to leverage powerful machine learning tools without requiring deep expertise in the field.
Thunderbolt is an open-source, cross-platform AI client designed for on-premise deployment and data ownership. It supports a wide range of frontier, local, and on-premise models across desktop and mobile environments. The project is currently under active development with a focus on enterprise readiness and security.
JoyAI-Image is a unified multimodal foundation model that integrates an 8B Multimodal Large Language Model with a 16B Multimodal Diffusion Transformer to support image understanding, generation, and editing. The model utilizes a closed-loop collaboration between understanding and generation to enhance spatial reasoning and controllable editing capabilities. It provides a scalable training pipeline and supports advanced features like multi-view generation and precise spatial manipulation.
Goose is a general-purpose AI agent designed to run locally on your machine for tasks ranging from coding and research to automation and data analysis. It is built in Rust to ensure high performance and portability across macOS, Linux, and Windows via desktop, CLI, and API interfaces. The project supports over 15 AI providers and integrates with more than 70 extensions through the Model Context Protocol.
Awesome-finance-skills is a plug-in skill collection that provides financial analysis capabilities for large language models. It supports various professional financial functions such as real-time news aggregation, stock data queries, sentiment analysis, and market forecasting. Users can integrate these skills into mainstream AI Agent frameworks through simple installation to quickly enhance their financial analysis level.
Magika is an AI-powered tool that utilizes deep learning to provide highly accurate file type identification for over 200 content types. It features a highly optimized model that delivers inference results in milliseconds while maintaining approximately 99% accuracy. The project offers a versatile command-line interface and language bindings for Python, JavaScript, and Rust to support diverse developer workflows.
Archon is an open-source workflow engine that allows developers to define AI coding processes using deterministic YAML workflows. By structuring tasks like planning, implementation, and validation, it ensures that AI-driven development is repeatable, isolated, and reliable across projects. Users can compose workflows that mix deterministic operations with AI-driven steps to automate complex software development tasks.
Awesome-finance-skills is a plug-in skill collection that provides financial analysis capabilities for large language models. It supports various professional financial functions such as real-time news aggregation, stock data queries, sentiment analysis, and market forecasting. Users can integrate these skills into mainstream AI Agent frameworks through simple installation to quickly enhance their financial analysis level.
The OpenClaw Chinese Edition provides full Chinese interface support for the open-source personal AI assistant platform, covering both the CLI tool and the Dashboard web console. This project automatically synchronizes with official updates every hour, ensuring users can experience the latest features and enjoy a deeply localized interface. Additionally, the project includes the ClawPanel management console and ClawApp mobile client, significantly enhancing the usability and cross-platform interaction capabilities of the AI assistant.
Vibe-Trading is an AI-powered multi-agent workspace that translates natural language requests into executable trading strategies and portfolio analysis. It features 71 specialized finance skills and 29 pre-built swarm workflows to automate research, backtesting, and risk management across global markets. Users can easily export their generated strategies to platforms like TradingView, TDX, and MetaTrader 5 with a single command.
Recursive Language Models (RLMs) provide a task-agnostic inference paradigm that enables language models to handle near-infinite contexts through programmatic decomposition and recursive self-calling. The framework replaces standard completion calls with an RLM-specific interface that offloads context into a REPL environment for interactive execution. This repository offers an extensible engine supporting various local and cloud-based sandbox environments to facilitate complex, multi-step language model reasoning.
PersonaPlex is a real-time, full-duplex speech-to-speech model built on the Moshi architecture that enables precise persona control through text prompts and audio voice conditioning. The model is trained on a mix of synthetic and real-world conversational data to deliver natural, low-latency interactions. Users can deploy the model via a provided server interface or perform offline evaluations using specific voice embeddings and role-based prompts.
Chinese-novelist is a skill plugin designed for Claude Code, aimed at helping users complete the entire process of writing Chinese novels through simple interactions. Users only need to answer five core questions, and the AI can automatically generate detailed outlines, character profiles, and coherent chapter content. The tool incorporates professional writing principles and quality checklists to ensure the coherence and appeal of the novel's plot.
AI Daily Digest is an automated tool that scrapes top technical blogs from Hacker News and uses AI for multi-dimensional scoring and summary generation. It supports quick article filtering via command line or interactive interface and automatically summarizes macro trends in the tech circle. The project is written in pure TypeScript and supports Gemini as well as various OpenAI-compatible API models.
This repository gathers 49 verified real-world use cases for OpenClaw personal AI agents, designed to help users improve work and life efficiency through automation. The content covers a wide range of applications from domestic ecosystem adaptation to international general scenarios, providing detailed configuration guides and reproducible prompts. Whether you are a beginner or a developer, you can quickly get started and build your own AI agents through these structured cases.
Memvid is a database-free, single-file memory layer designed to provide AI agents with instant retrieval and long-term memory capabilities. Through an innovative "smart frame" design, it encapsulates data, embeddings, and indexes into a single file, achieving efficient compression and parallel reading. The system is model-agnostic and requires zero infrastructure dependencies, supporting persistent memory in various offline or online scenarios.
Reversa is a framework that coordinates specialized AI agents to analyze legacy codebases and generate comprehensive, traceable technical specifications. It functions as a bridge between existing systems and modern coding agents by creating operational contracts that ensure safe and informed development. The tool operates with a strict immutability guarantee, ensuring that no existing project files are ever modified or deleted during the analysis process.
dot-skill is a versatile AI framework that distills individuals into interactive digital skills by analyzing their unique thought patterns and communication styles. The platform supports three distinct character families, including professional colleagues, personal relationships, and public figures. It integrates seamlessly with multiple AI agent hosts to provide a unified, automated experience for creating and invoking personalized AI personas.
OmniVoice is an advanced large-scale multilingual zero-shot speech synthesis model based on a diffusion language model architecture, supporting over 600 languages. The model features exceptional inference speed and enables high-quality voice cloning and voice design capabilities. Users can easily perform speech generation via Python API or command-line tools, with support for fine-grained non-linguistic symbols and pronunciation control.
This project provides a systematic and beginner-friendly tutorial in Chinese, covering the official Anthropic programming tool Claude Code and the open-source AI assistant framework OpenClaw. The tutorial includes 25 in-depth guides, over 70 runnable code examples, and more than 170 FAQs, aiming to help developers quickly master AI programming and automated workflows. The content stays up-to-date with the latest versions, using dual learning paths to help users advance from zero-based knowledge to enterprise-level practical applications.
MemPalace is a local-first AI memory system that stores conversation history as verbatim text for high-accuracy semantic retrieval. It utilizes a structured indexing approach with pluggable backends to organize content into wings, rooms, and drawers without requiring external API calls. The platform also features a temporal knowledge graph, MCP tools, and agent-specific diaries to provide comprehensive context management.
Evolver is a GEP-powered self-evolution engine designed to transform ad hoc AI agent prompts into auditable and reusable evolution assets. It scans runtime logs to identify patterns and emits protocol-bound prompts that guide agents through structured self-repair and optimization cycles. The system supports various host runtimes and offers optional network features for collaborative skill sharing and decentralized validation.
Paseo provides a unified interface to manage and run various coding agents like Claude Code, Codex, and OpenCode on your local machine. It supports cross-device workflows, allowing users to interact with agents through desktop, mobile, web, or CLI applications. The platform prioritizes privacy by operating without telemetry or forced logins while enabling powerful agent orchestration capabilities.
Claude-Mem is a persistent memory compression system designed to maintain context across sessions for Claude Code and similar CLI tools. It automatically captures tool usage and generates semantic summaries to ensure continuity of project knowledge. The system includes a web viewer, hybrid search capabilities, and fine-grained privacy controls for developers.
Camofox-browser is a specialized server designed to provide AI agents with reliable web browsing capabilities by leveraging the Camoufox engine for C++ level fingerprint spoofing. It offers a REST API that simplifies interactions through accessibility snapshots, stable element references, and built-in search macros. The system is optimized for efficiency and deployment, featuring automatic idle shutdown and session isolation to support scalable agent operations.
Onyx is a feature-rich open source AI platform designed to provide an easy-to-deploy application layer interface for large language models. The platform supports RAG, deep research, code execution, and various AI agent capabilities, while remaining compatible with mainstream self-hosted and proprietary LLMs. Users can deploy via the standard or lightweight versions to meet different needs ranging from personal use to enterprise-level collaboration.
ROCK is a scalable environment management framework designed specifically for agentic reinforcement learning applications. It utilizes a client-server architecture with robust isolation mechanisms to ensure stable and secure sandbox operations. The platform provides a unified SDK and is fully compatible with GEM protocols to standardize environment interactions.
Chinese-novelist is a skill plugin designed for Claude Code, aimed at helping users complete the entire process of writing Chinese novels through simple interactions. Users only need to answer five core questions, and the AI can automatically generate detailed outlines, character profiles, and coherent chapter content. The tool incorporates professional writing principles and quality checklists to ensure the coherence and appeal of the novel's plot.
Claude Context is an MCP plugin that enables semantic code search for AI coding agents by indexing your entire codebase into a vector database. It significantly reduces costs and improves retrieval quality by providing only relevant code snippets to the AI instead of entire directories. The tool supports incremental indexing, AST-based code chunking, and integrates seamlessly with various AI assistants and IDEs.
Clicky is an open-source AI teaching assistant that integrates directly into your macOS environment to provide real-time guidance. The application uses screen recording, voice interaction, and cursor control to act as a virtual tutor that can see and interact with your desktop. Users can deploy the project locally by configuring a Cloudflare Worker proxy and building the Swift-based application via Xcode.
Waza provides a collection of Claude Code skills designed to translate essential engineering habits into executable AI workflows. By focusing on specific, high-impact techniques rather than bloated configurations, it helps developers maintain high standards for design, debugging, and documentation. These skills are built from real-world project data to ensure Claude operates with the precision and intentionality of an experienced engineer.
ANOLISA is an evolution of Anolis OS designed specifically to support AI agent workloads at the server-side operating system level. The project provides a comprehensive suite of components including an AI-powered terminal, security kernels, and observability tools. Users can easily integrate these features into their systems through standard RPM package installations.
Open CoDesign is an open-source, desktop-native application that allows users to transform prompts into polished prototypes and design artifacts locally. It supports a wide range of AI models through a bring-your-own-key approach, eliminating reliance on cloud-only subscriptions. The tool features interactive editing, responsive previews, and multi-format exports to streamline professional design workflows.
Claudian is an Obsidian plugin that integrates AI coding agents like Claude Code and Codex directly into your vault. It transforms your vault into an active working directory where agents can read, write, search, and execute bash commands. Users can interact with these agents through a chat sidebar, inline editing, and support for Model Context Protocol servers.
LLM Wiki is a cross-platform desktop application that transforms your documents into an organized, interlinked knowledge base using an incremental LLM-driven pipeline. It features a sophisticated two-step ingestion process, a persistent knowledge graph, and deep research capabilities to maintain and expand your personal library. The system ensures high-quality output through source traceability, human-in-the-loop review, and seamless integration with tools like Obsidian.
VibeVoice is a family of open-source voice AI models that utilizes continuous speech tokenizers and next-token diffusion to achieve high-fidelity audio processing. The framework includes advanced tools for long-form speech recognition and real-time streaming text-to-speech generation. These models are designed for research purposes to advance collaboration and innovation within the speech synthesis community.
NeuTTS is a collection of open-source, on-device text-to-speech models designed for real-time performance and high-quality voice synthesis. The framework utilizes lightweight LLM backbones and a neural audio codec to enable instant voice cloning with as little as three seconds of audio. These models are optimized for deployment on mobile and embedded devices, supporting multiple languages including English, Spanish, German, and French.
Pairec is a Go-based web framework designed to accelerate the development of online recommendation services. It utilizes JSON-based configurations to streamline the setup and deployment of complex recommendation logic. The framework includes various built-in model functionalities to simplify the creation of efficient recommendation systems.
This repository provides a curated list of LLM API providers that offer permanent free tiers for text inference. It categorizes services into direct provider APIs and third-party inference platforms, detailing model capabilities, context windows, and rate limits. The collection serves as a comprehensive resource for developers seeking cost-effective access to various large language models.
get-shit-done is a spec-driven development system designed to maintain high code quality by preventing context rot in AI coding assistants. It orchestrates subagents to handle project planning, research, and execution while maintaining clean git history and atomic commits. The system provides a structured workflow for developers to build complex features consistently without the overhead of enterprise project management.
QMD is an on-device search engine that indexes markdown notes, documentation, and transcripts for efficient local retrieval. It utilizes a hybrid approach combining BM25 full-text search, vector semantic search, and LLM-based re-ranking to deliver high-quality results. The tool is designed for agentic workflows, offering both a command-line interface and an MCP server for seamless integration with AI agents.
Paperclip is an open-source platform that provides a Node.js server and React UI to orchestrate teams of AI agents as a cohesive business entity. It functions like a task manager, offering features such as org charts, budget enforcement, and goal alignment to manage autonomous operations. Users can integrate their own agents to run businesses 24/7 while maintaining oversight through a centralized dashboard.
This project successfully restored the complete source code of Claude Code version 2.1.88 by parsing legacy source map files from the npm package. Developers can use this to deeply study the CLI tool's command system, the terminal UI built with React and Ink, and the implementation of the MCP protocol. This project aims to provide a reference for learning and analyzing the internal architecture of Claude Code, intended solely for technical research and archiving.
MiroFish is a next-generation AI prediction engine based on multi-agent technology that constructs high-fidelity digital parallel worlds by extracting real-world seed information. Users can perform simulations within this sandbox by injecting variables, thereby precisely deducing future trajectories. The platform aims to provide decision-makers with a zero-risk testing laboratory while offering individual users a creative simulation space.
Dothething is a local AI agent that autonomously handles complex tasks like research, browser automation, and code execution. It plans its own work, manages tools, and can be extended with custom skills or MCP servers. The system supports persistent sessions, cost tracking, and orchestrator mode for managing multiple parallel agents.
Scout is an open-source intelligence agent designed to navigate and synthesize information from fragmented company sources like Slack, Drive, and CRM systems. It functions as a central brain that builds its own wiki and CRM by learning from user interactions and context providers. The system utilizes sub-agents to manage source-specific quirks, ensuring efficient data retrieval and persistent memory for organizational knowledge.
CLIProxyAPI is a versatile proxy server that provides OpenAI, Gemini, and Claude-compatible API interfaces for various command-line tools. It supports OAuth authentication for major AI services, enabling users to manage multiple accounts with round-robin load balancing. The project also includes a reusable Go SDK and extensive support for IDE extensions and AI coding assistants.
Google AI Edge Gallery is a mobile application designed to run powerful open-source Large Language Models directly on your device. It offers a fully offline and private environment for users to experience advanced generative AI capabilities, including the latest Gemma 4 family. The app provides a comprehensive suite of tools for model management, benchmarking, and interactive AI features.
Verified Agent Identity is a decentralized toolkit designed for AI agents to create, manage, and verify decentralized identities using the iden3 protocol. It enables secure human-to-agent linking through cryptographic signatures and supports robust identity management features. The system ensures security by storing sensitive cryptographic material outside the agent's workspace with optional AES-256-GCM encryption.
Toprank is a Claude Code plugin that provides AI agents with direct access to Google Search Console and Google Ads for data-driven optimization. It enables users to perform automated audits, identify wasted ad spend, and implement technical SEO improvements through simple CLI commands. The tool also supports cross-model reviews via Gemini and integrates with various CMS platforms to streamline content and performance management.
The last30days tool is an AI agent-led search engine that synthesizes real-time data from social media, developer platforms, and prediction markets to provide current insights. By bridging disconnected platforms like Reddit, X, GitHub, and YouTube, the agent scores information based on actual human engagement rather than traditional SEO metrics. It functions as an expert research assistant that delivers concise, evidence-based briefs on any topic, person, or company from the past month.
vLLM Kunlun is a community-maintained hardware plugin that enables the seamless execution of vLLM on Kunlun XPU hardware. It utilizes a hardware-pluggable interface to decouple the integration process, ensuring compatibility with a wide range of open-source models. The project supports various architectures including Transformer-based, Mixture-of-Expert, and multi-modal LLMs on the Kunlun3 P800 platform.
This repository contains the source files for the official PaddlePaddle documentation platform. It organizes content into specific directories for API references, user guides, and tutorials to support developers. The project also provides CI scripts and build instructions to facilitate local documentation generation and community contributions.
Beads is a distributed, Dolt-powered issue tracking system designed to provide persistent, structured memory for AI coding agents. It utilizes a dependency-aware graph structure to help agents manage complex, long-horizon tasks without losing critical context. The tool offers flexible storage modes and integrates seamlessly into development workflows with or without Git.
Open Generative AI is a free, open-source platform providing an unrestricted alternative to commercial AI media tools. It supports over 200 state-of-the-art models for image, video, and lip-sync generation without content filters or subscription fees. Users can access these capabilities through a web-based interface or a desktop application that supports both local and remote inference.
This comprehensive guide provides a detailed walkthrough of the Hermes Agent framework developed by Nous Research. It covers core mechanisms like the self-improving learning loop, memory systems, and automated skill evolution across seventeen chapters. The book serves as a practical resource for developers and AI enthusiasts looking to implement and customize their own intelligent agents.
Ghost Pepper is a privacy-focused macOS application that provides local speech-to-text transcription without relying on cloud APIs. Users simply hold the Control key to record audio, which is then transcribed and automatically pasted into any text field. The app utilizes local models for both speech recognition and text cleanup to ensure that no data ever leaves the user's machine.
This project provides a reusable template for reverse-engineering websites into modern Next.js codebases using AI coding agents. It automates the process by extracting design tokens, assets, and component specifications to reconstruct sections in parallel. The template supports various AI platforms and includes built-in tools for assembly and visual quality assurance.
OfficeCLI is an open-source command-line tool that enables AI agents to create, read, and modify Microsoft Office documents without requiring local Office installations. It features a three-layer architecture that allows for simple semantic views, structured element manipulation, and direct XML access. The tool supports seamless integration with AI coding agents through a built-in MCP server and automatic skill configuration.
Ralph is an autonomous AI agent loop that repeatedly executes coding tasks using tools like Amp or Claude Code until all project requirements are met. Each iteration operates within a fresh context, maintaining project state through git history, progress logs, and a structured JSON task list. The system ensures continuous progress by breaking down large features into manageable user stories that are verified through automated quality checks.
OpenCode is a free, open-source AI practical course designed for beginners, aiming to help users master methods to improve work efficiency using AI within 4 hours. The tutorial provides in-depth Chinese content and supports direct connection to mainstream domestic models without complex network configurations. The course covers five stages from quick start to deep customization, and provides rich practical projects and Prompt templates for learners to use.
PaddleCustomDevice is the custom hardware integration solution provided by the PaddlePaddle framework. Through standardized interface design, this project enables developers to integrate various third-party hardware backends into the PaddlePaddle ecosystem. It currently covers support for mainstream hardware platforms including Ascend, Cambricon, Intel GPU, and Apple MPS.
agent-browser is a high-performance browser automation command-line tool built with Rust, specifically designed for AI agents. It supports web interaction, element localization, and state management through simple commands, eliminating the need for complex Playwright or Node.js environments. The tool provides extensive session persistence, authentication management, and debugging features to ensure that AI agents can operate safely and efficiently.
Shimmy is a lightweight, single-binary server that provides a 100% OpenAI-compatible API for running GGUF models locally. It features zero-configuration model discovery, automatic GPU backend detection, and advanced CPU/GPU hybrid processing for large models. Designed for privacy and performance, it allows developers to integrate local LLMs into existing tools without code changes.
Feynman is an open-source AI research agent designed to assist with complex tasks like literature reviews, paper auditing, and experiment replication. The platform utilizes a multi-agent system to gather evidence, perform simulated peer reviews, and draft structured research findings. Users can interact with the tool via a terminal interface, supporting both local and cloud-based execution environments.
DeepTutor is an agent-native platform designed to provide personalized, intelligent tutoring through a unified chat workspace and multi-agent architecture. It features advanced capabilities like a Book Engine for interactive learning, an AI Co-Writer, and persistent memory to tailor the experience to individual user profiles. Users can deploy the system easily via a guided CLI setup or Docker, supporting a wide range of LLM and embedding providers.
The Claude Cookbooks provide a comprehensive collection of code snippets and guides to help developers integrate Claude into their own applications. The repository covers a wide range of topics including tool use, multimodal capabilities, and advanced techniques like prompt caching. These resources are designed to be easily adaptable for various programming languages and project requirements.
T3 Code provides a minimal web-based graphical user interface designed specifically for interacting with coding agents. The platform currently supports integration with Codex and Claude, with plans to expand support for additional providers in the future. Users can access the tool via a desktop application or run it directly using npx for quick deployment.
This plugin allows Claude Code users to invoke Codex directly within their workflow for code reviews or task delegation. Users can execute read-only reviews, adversarial reviews, and background task management through a series of slash commands. The tool leverages the locally installed Codex CLI and existing authentication configurations, ensuring seamless integration with the user's current development environment.
This tool is officially maintained by Paddle and aims to achieve efficient automated migration from PyTorch code to PaddlePaddle code. It supports one-click conversion of over 1,600 PyTorch APIs and 200 torchvision APIs, maintaining an average conversion rate of over 95% in tests. The conversion process is operated via the command line, preserves the style and structure of the original code, and provides detailed conversion logs and summaries.
k-skill is a collection of automation tools designed for AI agents to perform various tasks related to popular Korean services like SRT, KTX, Coupang, and KakaoTalk. It supports integration with major coding agents and allows users to execute tasks without needing additional client API layers. Users can get started by installing the full suite and configuring their credentials through the provided setup tool.
The AI Hedge Fund is an educational proof-of-concept project designed to explore how artificial intelligence can be utilized for making trading decisions. It employs a multi-agent system that simulates various famous investment strategies and analytical approaches to evaluate stocks. The system is strictly for research purposes and does not execute real-world financial trades.
llmfit is a terminal-based utility that analyzes your system's hardware to identify which large language models will run effectively on your specific configuration. It provides an interactive TUI and CLI to score models based on quality, speed, and memory fit while supporting various backends like Ollama, llama.cpp, and MLX. Users can also perform hardware simulations to test how different model configurations would perform on target system specifications.
This project provides a structured set of guidelines designed to improve LLM coding behavior by addressing common pitfalls like overcomplication and making wrong assumptions. It implements four core principles—Think Before Coding, Simplicity First, Surgical Changes, and Goal-Driven Execution—to ensure more precise and verifiable code generation. Users can integrate these rules into their development workflow via a Claude Code plugin, a CLAUDE.md file, or Cursor project rules.
Claude Ads is a comprehensive audit and optimization tool designed as a skill for Claude Code to improve paid advertising performance. It provides over 250 audit checks across major platforms like Google, Meta, LinkedIn, and TikTok using parallel subagent delegation. Users can generate professional PDF reports, perform PPC financial modeling, and access industry-specific strategic planning templates.
This project contains the complete leaked source code for Anthropic's official Claude Code CLI tool, which was discovered on March 31, 2026, via map files in the npm package. The repository provides the original source code built with TypeScript and Bun, accompanied by detailed architectural documentation and exploration guides. Users can interactively delve into the tool's internal implementation and design patterns through the built-in MCP server.
Obsidian Mind is an Obsidian knowledge base template designed specifically for Claude Code, aiming to solve the problem of AI long-term memory loss through automated conversation hooks and structured storage. It ensures that Claude has access to the complete context at the start of every session by automatically linking work notes, decision records, meeting minutes, and performance evidence into a knowledge graph. The system supports daily workflow management through natural language interaction and can automatically generate performance evaluation briefs and project summaries.
The PaddlePaddle community serves as a central hub for developers to contribute to the framework through code improvements, documentation, and presentations. It provides structured governance, specialized working groups, and various mentorship programs to support active participation. Contributors are recognized through official certifications, release notes, and inclusion in the project's authorship records.
Kronos is an open-source decoder-only foundation model specifically designed to analyze and forecast financial K-line sequences. It utilizes a two-stage framework that quantizes multi-dimensional market data into hierarchical tokens before processing them through an autoregressive Transformer. The project provides a comprehensive suite of pre-trained models and tools for both direct forecasting and domain-specific fine-tuning.
AI Engineering from Scratch is a comprehensive 320-hour curriculum that guides students from fundamental linear algebra to building autonomous agent swarms. The course emphasizes an AI-native learning approach where students use AI coding agents to test their knowledge and build reusable tools throughout 20 distinct phases. By working across Python, TypeScript, Rust, and Julia, learners develop a professional portfolio of prompts, skills, and agents that can be deployed in real-world environments.
AI Marketing Skills is an open-source project designed for marketing and sales teams, providing a series of ready-to-run automated workflows and scripts. These tools are designed to integrate with Claude Code or other AI coding agents, optimizing business processes through expert panels, scoring algorithms, and automated pipelines. The project covers professional skills in various fields, ranging from growth experiments and sales lead mining to content operations and financial analysis.
TradingAgents is a multi-agent based LLM financial trading framework designed to simulate the operational processes of real trading firms. The framework deploys specialized agents, including fundamental, sentiment, news, and technical analysis, to collaboratively evaluate market conditions and formulate trading strategies. The system is built using LangGraph, supports various mainstream LLM providers, and offers an interactive command-line interface as well as a Python development API.
ACE-Step UI provides a professional, Spotify-inspired interface for the open-source ACE-Step 1.5 AI music generation model. It allows users to generate high-quality songs, instrumentals, and lyrics entirely locally without subscription fees or cloud restrictions. The platform includes advanced tools for audio editing, stem extraction, and batch processing to give creators full control over their music production.
oh-my-codex is a workflow enhancement layer designed to improve the functionality and consistency of the OpenAI Codex CLI. It provides specialized roles, reusable skills, and structured project state management to streamline complex development tasks. The tool is optimized for macOS and Linux environments, offering advanced features like team-based parallel execution and persistent completion loops.
This repository provides a comprehensive educational framework for building agent harnesses, which are the essential environments that allow AI models to perceive and act. It argues that true agency is learned by models during training, while the developer's role is to construct the tools, knowledge, and context management systems that enable these models to function. Through twelve progressive sessions, users learn to build robust, scalable agent architectures by reverse-engineering the principles behind Claude Code.
Oh My OpenCode is an open-source agent tool designed to break free from single-model lock-in and achieve efficient development by orchestrating multiple AI models. It enables parallel task processing and automated execution through the introduction of Discipline Agents and the ultrawork command, eliminating the need for manual model switching. The tool significantly improves code modification accuracy and the development experience through its Hash-Anchored Edit Tool and deep initialization features.
Humanizer is a specialized skill for Claude Code and OpenCode designed to remove common markers of AI-generated text. It identifies and corrects 29 distinct patterns, such as significance inflation and excessive hedging, to produce more natural writing. Users can also provide personal writing samples to calibrate the tool to match their unique voice and rhythm.
Marketing Skills for AI Agents is a comprehensive collection of markdown-based workflows designed to provide AI coding assistants with specialized marketing expertise. These skills cover a wide range of domains including conversion optimization, SEO, copywriting, and growth engineering to help agents execute complex marketing tasks. By referencing a shared product-marketing-context, the skills work together to ensure consistent and strategic outputs across all marketing activities.
TranslateBooksWithLLMs is a versatile tool designed to translate books, subtitles, and documents of any length using various local or cloud-based AI models. It features an intelligent chunking system that preserves original formatting, styles, and structure while allowing users to resume interrupted tasks via automatic checkpoints. The software supports multiple file formats including EPUB, SRT, DOCX, and TXT, offering both a user-friendly web interface and a robust command-line tool.
Claude Code Templates provides a comprehensive collection of agents, custom commands, and integrations to optimize your Anthropic Claude Code workflow. Users can browse and install over 100 components through an interactive web interface or via command-line tools. The project also includes advanced utilities for session analytics, real-time conversation monitoring, and system health diagnostics.
This repository provides comprehensive best practices and implementation guides for leveraging Claude Code in agentic engineering workflows. It details core concepts such as subagents, commands, skills, and orchestration patterns to optimize development tasks. Developers can explore advanced features like cloud-based routines, ultrareview, and automated testing to enhance their coding productivity.
Taste Skill provides a collection of specialized instructions designed to improve the visual quality and design output of AI coding agents. The toolkit includes various skills for tasks ranging from general frontend generation to specific styles like minimalist or brutalist design. These framework-agnostic files can be easily integrated into major AI coding agents to ensure premium, professional-grade interface results.
Claude How To provides a structured, visual learning path to help developers master the full capabilities of Claude Code. The guide includes ten tutorial modules, copy-paste templates, and interactive quizzes to bridge the gap between basic usage and advanced automation. It enables users to build complex workflows by combining slash commands, memory, subagents, and MCP servers effectively.