Back
Explore EventsExplore ProjectsMy Projects
AgentCI — screenshot 1
AgentCI — screenshot 2
AgentCI — screenshot 3
AgentCI — screenshot 4

AgentCI

CI/CD for AI agents

Codex Community Hackathon - Pune

Links

Repository

github.com/ChiragArora31/AgentCI

Website

agentci.vercel.app

Demo video

drive.google.com/file/d/1biJjhSuJdaj0EZ2HXcE_w6DCJ1cgctPy/view?usp=sharing

Team

1 member
  • CH

    Chirag Arora

    Owner

Overview

AgentCI is a CI/CD platform for AI agents that catches behavior regressions before they reach production. When a team changes a prompt, model, retrieval setup, or tool configuration, AgentCI runs realistic evaluation scenarios, captures full agent traces, checks correctness, grounding, abstention, access control, safety, latency, and cost, then compares the candidate against the production baseline. If the agent becomes inaccurate, unsafe, slow, or leaks restricted information, AgentCI blocks the release and shows exactly why through evidence, grader findings, and root-cause summaries.

ExploreProjectsMine