AgentCI

CI/CD for AI agents

Event

Codex Community Hackathon - Pune

Award

Winner — Codex

Top 10

Links

Repository

github.com/ChiragArora31/AgentCI

Website

agentci.vercel.app

Demo video

drive.google.com/file/d/1biJjhSuJdaj0EZ2HXcE_w6DCJ1cgctPy/view?usp=sharing

Team

2 members

CH
Chirag Arora
Owner
SU
Subhav Goyal

Overview

AgentCI is a CI/CD platform for AI agents that catches behavior regressions before they reach production. When a team changes a prompt, model, retrieval setup, or tool configuration, AgentCI runs realistic evaluation scenarios, captures full agent traces, checks correctness, grounding, abstention, access control, safety, latency, and cost, then compares the candidate against the production baseline. If the agent becomes inaccurate, unsafe, slow, or leaks restricted information, AgentCI blocks the release and shows exactly why through evidence, grader findings, and root-cause summaries.