karpathy/autoresearch
AI agents running research on single-GPU nanochat training automatically
Product Positioning & Context
AI Executive Synthesis
AI agents running research *automatically* to discover new architectures. The question challenges the guarantee of novelty.
This issue directly questions the core value proposition of 'autoresearch': how adding agents *guarantees* novel architectures. It surfaces a fundamental developer concern about the actual innovation output of multi-agent systems: there is no clear, demonstrable mechanism linking agent deployment to guaranteed novel outcomes, as opposed to mere optimization or iteration. The market implication is that AI agent platforms need a stronger, evidence-based narrative around their capacity for true innovation and discovery, beyond efficiency gains; this in turn suggests demand for agent designs that explicitly target and measure architectural novelty.
Active Developer Issues (GitHub)
Logged: Mar 8, 2026
Community Voice & Feedback
you just added a readme, maybe @karpathy can chime in
I was able to get Codex to loop, where it has `agent_loop.sh` for the while loop, `monitor_loop.sh` to monitor the agent, and `watchdog_loop.sh` to restart the agent loop.
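The watchdog half of the setup described above could be sketched roughly as follows; the file names, the PID-file convention, and the check interval are all assumptions for illustration, not code from the comment:

```shell
# Hypothetical sketch of a watchdog_loop.sh that restarts agent_loop.sh
# whenever it dies. All names and intervals here are assumptions.

AGENT_CMD=${AGENT_CMD:-./agent_loop.sh}  # the loop to keep alive
PID_FILE=${PID_FILE:-agent.pid}
CHECK_INTERVAL=${CHECK_INTERVAL:-10}     # seconds between liveness checks

agent_is_alive() {
  # kill -0 probes the process without sending it a signal
  [ -f "$PID_FILE" ] && kill -0 "$(cat "$PID_FILE")" 2>/dev/null
}

start_agent() {
  $AGENT_CMD & echo $! > "$PID_FILE"
}

watchdog_loop() {
  while true; do
    agent_is_alive || start_agent
    sleep "$CHECK_INTERVAL"
  done
}
```

A monitor script could reuse `agent_is_alive` to report status without restarting anything, which keeps the restart policy in one place.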
I think you can achieve a model-agnostic version of what you're looking for by using Pi (pi.dev: https://github.com/badlogic/pi-mono/) and combining it with the Interactive Shell extension (https://github.com/nicobailon/pi-interactive-shell), which can handle long-running looping behavior with the ability for both human and agent to monitor and interrupt/interact.
That way you can use one agent harness framework but change the models, have the models compete, collaborate, review, etc.
Codex/Claude subscriptions work in Pi via OAuth.
I ran into the same issue while using Codex. It seems to be related to the OpenAI API (or the model itself). I tried integrating GPT-5.4 into Claude Code, but it still wouldn't work continuously.
I'm having exactly this issue, with Codex using GPT-5.4.
I ended up having to run it in a `while` loop
```bash
while true; do
  codex exec --dangerously-bypass-approvals-and-sandbox "have a look at program.md and kick off a new experiment loop" 2>&1 | tee -a agent.log
  sleep 1
done
```
then I can search for "have a look at program.md" in agent.log to see it getting restarted.
But then I lose the interactivity of Codex.
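The log search mentioned above can be scripted: assuming the prompt string appears in the tee'd log exactly once per loop iteration (as in the while loop shown), `grep -c` counts how often the agent was (re)started.

```shell
# Count agent-loop (re)starts by counting occurrences of the prompt
# string in the appended log. Assumes one occurrence per iteration.
count_restarts() {
  grep -c "have a look at program.md" "$1"
}
```

This is only a heuristic: if the model ever echoes the prompt back into the log, the count will overshoot.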
experiencing this with 5.4?
In https://github.com/karpathy/autoresearch/pull/70 we can also do these manually, like the novelty verification part you're referring to. Seems to be an infinite loop.
Currently I can only talk to the experiments I made in the fork (https://github.com/mkemka/autoresearch/blob/master/spiritualguidance.md). There are two competing agents that argue and generate a combined directive, which is used to alter program.md for the next run. The history is stored in spiritualguidance.md and used as working memory. So to actually measure the utility, I would need to see whether this approach actually produces novelty or variance of ideas, and whether in the long term the loss is lower compared to a single agent.
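The two-agent debate loop described above could be sketched roughly like this; the `AGENT` command, the personas, the prompts, and the `ROUND:` log format are all assumptions for illustration, not code from the fork — any CLI that maps a prompt to text would do.

```shell
# Hypothetical sketch of the debate loop: two agents with different
# backgrounds each propose a change, a third call merges them, and the
# merged directive becomes the plan for the next run.
AGENT=${AGENT:-codex exec}  # assumed: a CLI mapping prompt -> text

debate_round() {
  a=$($AGENT "As a conservative researcher, propose one change to program.md")
  b=$($AGENT "As a contrarian researcher, propose one change to program.md")
  merged=$($AGENT "Combine these into one directive: $a / $b")
  printf 'ROUND: %s\n' "$merged" >> spiritualguidance.md  # working memory
  printf '%s\n' "$merged" > program.md                    # plan for next run
}
```

Appending every round to spiritualguidance.md while overwriting program.md matches the description above: the history accumulates, but only the latest combined directive drives the next experiment.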
so how do you measure the utility of novelty?
One approach I am experimenting with is to have two sub-agents with different backgrounds debate the best strategy to adopt. This doesn't guarantee a new architecture but adds novelty.
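One rough proxy for the "variance of ideas" question above is the fraction of distinct directives in the debate history; this is only a sketch, and the `ROUND:`-prefixed log format is an assumption about how that history is stored.

```shell
# Hypothetical "variance of ideas" proxy: distinct directives over total
# directives logged so far. A ratio near 1 means the agents keep producing
# new ideas; near 1/N means they keep repeating themselves.
diversity() {
  total=$(grep -c '^ROUND:' "$1")
  distinct=$(grep '^ROUND:' "$1" | sort -u | wc -l | tr -d ' ')
  echo "$distinct/$total distinct directives"
}
```

Exact string matching is crude; paraphrased directives would count as distinct, so a real metric would likely need embedding distance or human review on top of this.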
Related Early-Stage Discoveries
Discovery Source
GitHub Open Source. Aggregated via automated community intelligence tracking.
Tech Stack Dependencies
No direct open-source NPM package mentions detected in the product documentation.
Media Traction & Mentions
Deep Research & Science
No peer-reviewed scientific literature directly matched to this product's architecture.
Market Trends