davebcn87

pi-autoresearch

AI🌱 NEW PROJECT BOOST#AI Agent #Automation#Optimization#Benchmarking

115

// summary

pi-autoresearch is an extension for the pi AI coding agent that enables autonomous optimization loops by testing, benchmarking, and refining code changes. It supports various optimization targets such as test speed, bundle size, and LLM training metrics through a persistent session workflow. The tool includes a live dashboard, confidence scoring to filter out noise, and the ability to finalize experiments into clean, reviewable branches.

// technical analysis

pi-autoresearch is an extension for the pi AI coding agent that facilitates autonomous optimization loops by iteratively testing, benchmarking, and refining code based on specific performance metrics. Its architecture separates domain-agnostic infrastructure from domain-specific skills, allowing the agent to maintain state across restarts via persistent log files and session documentation. This design solves the problem of manual, repetitive benchmarking by automating the 'try-measure-keep' cycle, though it requires careful management of API token usage and benchmark noise to ensure reliable results.

// key highlights

Enables autonomous optimization loops for various targets like test speed, bundle size, and LLM training metrics.

Maintains session persistence through autoresearch.jsonl and autoresearch.md, allowing the agent to resume work after restarts or context resets.

Provides a confidence scoring system using Median Absolute Deviation to help distinguish genuine performance improvements from benchmark noise.

Includes a finalize feature that organizes noisy experimental branches into clean, independent, and reviewable logical changesets.

Supports optional backpressure checks via shell scripts to ensure that performance optimizations do not compromise code correctness.

Offers a real-time dashboard and status widget with keyboard shortcuts for monitoring progress and managing experiment states.

// use cases

Automated performance optimization for test speed, build times, and bundle size

Autonomous LLM training loop management with metric tracking

Systematic benchmarking and regression testing with automated branch finalization

// getting started

To begin, install the extension by running 'pi install https://github.com/davebcn87/pi-autoresearch' in your terminal. Once installed, initiate a session by running the '/skill:autoresearch-create' command, which will guide you through configuring your optimization goal, metrics, and target files. You can then monitor the autonomous loop via the provided dashboard or by using the '/autoresearch' command set.