← Back to AI Insights
Gemini Executive Synthesis

An open-source version control system (VCS) specifically for AI agents, currently supporting Claude Code, providing capabilities like action history, rollback (/rewind), compaction (/compact), and forensic analysis (bisect).

Technical Positioning
A fundamentally missing version control system for AI agents; provides core capabilities similar to Git for code, but for agent actions.
SaaS Insight & Market Implications
This project addresses a critical operational gap in AI agent development: the lack of robust version control and auditability. As AI agents gain autonomy, understanding their decision-making, tracking changes, and debugging their actions becomes paramount. A 'Git for AI Agents' provides essential capabilities like action history, rollback, and forensic analysis (bisect), mirroring established software development practices. This directly mitigates risks associated with opaque agent behavior, improving reliability, accountability, and developer confidence. It is a foundational tool for managing the lifecycle of complex agentic systems, enabling systematic debugging and iterative improvement, which is crucial for enterprise adoption of AI agents.
Proprietary Technical Taxonomy
AI agents VCS rewind compact bisect Claude Code open source

Raw Developer Origin & Technical Request

Source Icon Hacker News May 8, 2026
Show HN: Git for AI Agents

hi guys.
been working on something i think is fundamentally missing in today's workflow with ai agents.vcs.i find myself struggling with questions that agents can't answer like "why did you do it?", "when did u delete this folder? why?", etc. or trying to /rewind (after a /compact...) or basically `bisect` to find when and why something was done by the agent in the current / previous session.just like git did for code, i think we are the same core capabilities with ai agentsso...i developed an open source solution for that (currently supporting claude code)would love to get feedback, contribution or maybe other ideas or solutions you find for those problems.

Developer Debate & Comments

c7b • May 9, 2026
Great idea! I was really baffled when I found out that that's not how agents, or also Chat-based IDEs like Cursor, work by default (guess how).
clutter55561 • May 8, 2026
I think this is very interesting but you need a better slogan.Many people here made comments such as “why do I need another SVC since agents are pretty good with git”, which means they barely read your blurb and did not understand your project.
lifis • May 8, 2026
This seems easily solved with a tool use hook that calls git add .; git commit a -m "", specifying an alternate .git directory if desired
hombre_fatal • May 8, 2026
Everything but trivial changes should go through a prompt -> plan -> impl phase where you revise a concrete plan file until it's ready for impl.Now impl is just a derivation of the plan, and the plan gets checked in with the same commit so that you can see the why, the intent, the objective, the research that informed the decisions.Much simpler, and a much, much more effective process than prompt -> impl.
bel8 • May 8, 2026
I found LLMs to be really smart with command-line git.This week I told DeepSeek v4 Flash (max variant) to scavenge for all changes and additions of a specific feature of the project and build a report of the feature timeline with example code and rationale behind the changes.It fired a ton of read-only git commands (isolated inside Docker) and came up with a neat markdown report of the feature from inception to current state.If DS4 Flash can do it, for SOTA LLMs like GPT 5.5 and Opus it should be a walk in the park.That said I don't let LLMs commit. I like to take a close look at every change before committing. Early changes are cheaper.
dimgl • May 8, 2026
Hey, that's cool. Does this support conversation lookups? Like, "find this conversation we talked about yesterday"? I built a similar tool to this, although Regent seems much more elegant: https://github.com/divmgl/clancey/
Zambyte • May 8, 2026
People in this thread seem to be too focused on the agent creating a git log. This seems to be solving a different problem than that does.When you're interacting with agents, multiple prompts may reasonable culminate in a single commit. It may be useful to track or undo things between commits - at the prompt level. I personally have a workflow when I use Jujutsu (jj) for git already, and this slotted in very nicely to solve this problem. The auto-committing in jj makes it very easy and natural to compare diffs between prompts, and undo specific chunks or restore previous states without making a new commit every prompt. I only finish a commit, giving it a message and advancing the branch, once I've iteratively dialed in the changes I want.I probably won't use this tool since I already have a flow that works for me, but maybe this will help people see why such a tool can be helpful.Edit: fixed typo
j-pb • May 8, 2026
Very cool approach! We build something super similar, also going for content addressed storage and compare&swap as fundamental primitives.Also commit dag based, but we also wrote this whole knowledge graph / triple-store CRDT data format on top.[1]We also have p2p syncing of the history so you can use it to track your local work but also to have your agents coordinate within your team.We had our agents build their own tools on top of that substrate, that way we're vendor independent, this stuff works everywhere from claude web, to self hosted openclaw, you only need to tell your agent to use the faculties.Because the substrate takes care of everything, every new faculty you write on top of that inherits all of the same properties.1: https://github.com/triblespace/triblespace-rs2: https://github.com/triblespace/faculties
sudb • May 8, 2026
I think the idea of tracking intent in git commits is a great idea but it feels to me like this might be reducible to some prompts/extending git/pre-commit hooks?
tfrancisl • May 8, 2026
Just use git. If your agent (especially claude) doesnt seem to know how, there are skills and hooks and other options to make it work. My 2c.

Engagement Signals

95
Upvotes
46
Comments

Cross-Market Term Frequency

Quantifies the cross-market adoption of foundational terms like Claude Code and AI agents by tracking occurrence frequency across active SaaS architectures and enterprise developer debates.

Macro Market Trends

Correlated public search velocity for adjacent technologies.

Ai Agents Compact Cnns Compaction