Show HN: How LLMs Work – Interactive visual guide based on Karpathy's lecture

Rating: 4.5 (49 reviews)

An interactive, visual, and revisitable guide based on a prominent lecture, generated by an LLM.

Traction Score: 207

Discussions: 49

Launch Date: Apr 24, 2026

Product Positioning & Context

AI Executive Synthesis

This addresses the growing need for accessible, high-quality educational content on complex AI topics. The use of Claude Code to generate the site highlights a trend in content creation: leveraging AI for rapid development of educational tools. While not a direct B2B SaaS product, it demonstrates the potential for AI-assisted content generation in corporate training or developer onboarding. The 'revisit this content time to time' aspect suggests a demand for durable, digestible learning resources amidst rapid technological change. This model could be scaled for enterprise learning platforms, offering customized, interactive modules on emerging tech and reducing reliance on traditional, static documentation. The pain point is the difficulty of grasping complex AI concepts; the solution is an interactive, AI-generated guide.

All content is based on Andrej Karpathy's "Intro to Large Language Models" lecture (youtube.com/watch?v=7xTGNNLPyMI). I downloaded the transcript and used Claude Code to generate the entire interactive site from it — single HTML file. I find it useful to revisit this content time to time.

Deep-Dive FAQs

What is How LLMs Work – Interactive visual guide based on Karpathy's lecture?

How LLMs Work – Interactive visual guide based on Karpathy's lecture is analyzed by our AI as: An interactive, visual, and revisitable guide based on a prominent lecture, generated by an LLM. This addresses the growing need for accessible, high-quality educational content on complex AI topics. The use of Claude Code to generate the site ...

Where did How LLMs Work – Interactive visual guide based on Karpathy's lecture originate?

Data for How LLMs Work – Interactive visual guide based on Karpathy's lecture was aggregated directly from the Hacker News community ecosystem, representing raw developer and early-adopter sentiment.

When was How LLMs Work – Interactive visual guide based on Karpathy's lecture publicly launched?

The initial public indexing or launch date for How LLMs Work – Interactive visual guide based on Karpathy's lecture within our tracked developer communities was recorded on April 24, 2026.

How popular is How LLMs Work – Interactive visual guide based on Karpathy's lecture?

How LLMs Work – Interactive visual guide based on Karpathy's lecture has achieved measurable traction, logging a traction score of 207 and 49 recorded discussions or engagements.

Which technical categories define How LLMs Work – Interactive visual guide based on Karpathy's lecture?

Based on metadata extraction, How LLMs Work – Interactive visual guide based on Karpathy's lecture is categorized under topics such as: LLMs, Andrej Karpathy's 'Intro to Large Language Models' lecture, transcript, Claude Code.

What are some commercial alternatives to How LLMs Work – Interactive visual guide based on Karpathy's lecture?

Our semantic intelligence engine identifies potential commercial alternatives in the SaaS space, such as Osaurus, which offers overlapping value propositions.

How does the creator describe How LLMs Work – Interactive visual guide based on Karpathy's lecture?

The original author or development team describes the product as follows: "All content is based on Andrej Karpathy's "Intro to Large Language Models" lecture (youtube.com/watch?v=7xTGNNLPyMI). I downloaded the transcript and used Claude Code to generate the entire interac..."

Community Voice & Feedback

siva7 • Apr 24, 2026

> WITHOUT RAG
> "I don't have reliable information about a colony called Ares Base. As of my training cutoff, no such Mars colony has been established..."

Oh we must have lived in a parallel universe then if this is a "without rag" textbook example.

jasonjmcghee • Apr 24, 2026

Highly recommend instead reading the human created "The Illustrated GPT-2" by Jay Alammar - https://jalammar.github.io/illustrated-gpt2/ And his similar work. He also has a free course on "how llms work".

ynarwal__ • Apr 24, 2026

I disagree with some comments saying it's not worth reading since it's generated by an LLM, even though I made it clear that I had downloaded the transcript. LLMs are exceptionally good at generating accurate information if the information is directly loaded into the context window.

vova_hn2 • Apr 24, 2026

I think that the BPE visualization is slightly misleading, because it seems to imply that the "old" (smaller) tokens are thrown away and replaced with longer tokens, which is not the case.

In fact, it is a purely additive process: we iteratively add the most frequent pairs to the set, until we reach the desired total number of tokens. But we never remove tokens, we keep everything, including the initial 256 tokens, representing bytes.

This ensures that the model is capable of producing every possible unicode sequence (in fact, I think that it is capable of producing every possible byte sequence, but bytes that are not valid unicode are filtered during sampling).

Edit #1: also, this page entirely skips the attention mechanism, which is, in my opinion, both the most interesting part and the part that is hardest to understand (I can't say that I fully understand it, to me it is just some linear algebra matrix multiplication magic).
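
The additive process described in this comment can be sketched roughly as follows. This is a toy byte-level BPE written for illustration only, not the guide's code or any real tokenizer's implementation; the helper names (`merge_pair`, `train_bpe`, `encode`, `decode`) and the training string are assumptions made up for this sketch.

```python
from collections import Counter


def merge_pair(ids, pair, new_id):
    """Replace each non-overlapping occurrence of `pair` in `ids` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text, num_merges):
    """Toy trainer (hypothetical helper): the vocabulary only ever grows."""
    ids = list(text.encode("utf-8"))
    vocab = {i: bytes([i]) for i in range(256)}  # the initial 256 byte tokens
    merges = {}
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair = max(pairs, key=pairs.get)  # most frequent adjacent pair
        new_id = len(vocab)
        vocab[new_id] = vocab[pair[0]] + vocab[pair[1]]  # add, never remove
        merges[pair] = new_id
        ids = merge_pair(ids, pair, new_id)
    return vocab, merges


def encode(text, merges):
    """Apply the learned merges in the order they were learned."""
    ids = list(text.encode("utf-8"))
    for pair, new_id in merges.items():
        ids = merge_pair(ids, pair, new_id)
    return ids


def decode(ids, vocab):
    """Concatenate token bytes; invalid UTF-8 is replaced rather than raised."""
    return b"".join(vocab[i] for i in ids).decode("utf-8", errors="replace")


vocab, merges = train_bpe("low lower lowest low low", num_merges=5)
```

After training, the vocabulary holds the 256 byte tokens plus one entry per merge, so text made of characters never seen in training (for example `encode("xyz", merges)`) still encodes, just as raw byte tokens.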

thesz • Apr 24, 2026

The page does a very poor job tokenizing the phrase "Noinceolik fiyulnabmed fyvaproldge" into "Noinceolik fiyulnabm ed fyvaproldge", factoring out only the "ed" suffix. As if made-up words such as "noinceolik" are so common they are part of a 100K token vocabulary.

The actual application of the GPT-5 tokenizer at [1] to my made-up phrase results in 14 tokens; only two of them are four characters long, and there are tokens containing spaces.

[1] https://gpt-tokenizer.dev/

I will read along, though.

ynarwal__ • Apr 24, 2026

Update:
The "single hard drive" claim was wrong and I've corrected it to "roughly 10 consumer hard drives" (44TB ÷ ~4TB = ~11). Attribution to Karpathy is now a direct link. Added a caveat under the stats noting these are representative 2024-era figures — the exact numbers shift with every release and that's somewhat the point.
Also did a few iterations on the visual redesign (linked in the header as v2) with a proper top navigation bar, after a few people found the dot nav hard to use and the UI was jumping.

Also, I have not fact-checked everything, but I have read it and it seems to be aligned with what is described in the lecture.

lateral_cloud • Apr 24, 2026

This is completely AI generated... don't bother reading.

PetitPrince • Apr 24, 2026

Have you reread what was produced by Claude Code before publishing? This thing in one of the first paragraphs jumps out:

> you end up with about 44 terabytes — roughly what fits on a single hard drive

No normal person would think that 44 TB is a usual hard drive size (I don't think it even exists? 32TB seems to be the max at my retailer of choice). I don't think it's wrong per se to use an LLM to produce cool visualizations, but this lack of proofreading doesn't inspire confidence (especially since the 44TB is displayed prominently in a different color).

gushogg-blake • Apr 24, 2026

I haven't found an explanation yet that answers a couple of seemingly basic questions about LLMs:

What does the input side of the neural network look like? Is it enough bits to represent N tokens where N is the context size? How does it handle inputs that are shorter than the context size?

I think embedding is one of the more interesting concepts behind LLMs but most pages treat it as a side note. How does embedding treat tokens that can have vastly different meanings in different contexts - if the word "bank" were a single token, for example, how does embedding account for the fact that it can mean river bank or money bank? Do the elements of the vector point in both directions? And how exactly does embedding interact with the training and inference processes - does inference generate updated embeddings at any point or are they fixed at training time?

(Training vs inference time is another thing explanations are usually frustratingly vague on)
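
One way to make the "bank" question concrete is the toy sketch below. It assumes the usual transformer setup, in which the input embedding is a lookup table learned during training and left unchanged at inference, and context-dependent meaning is computed by later layers. Everything here is hypothetical: the four-word vocabulary, the random vectors, and `mix_with_context`, which is a crude stand-in for attention rather than the real mechanism.

```python
import random

random.seed(0)

# Hypothetical 4-token vocabulary and 4-dimensional vectors, for illustration.
VOCAB = {"the": 0, "river": 1, "bank": 2, "money": 3}
DIM = 4

# Stand-in for a trained embedding table: one fixed row per token id.
EMBEDDINGS = [[random.uniform(-1, 1) for _ in range(DIM)] for _ in VOCAB]


def embed(tokens):
    """Look up one static vector per token; output length equals input length."""
    return [EMBEDDINGS[VOCAB[t]] for t in tokens]


def mix_with_context(vectors):
    """Toy stand-in for attention: blend each vector with the sequence mean."""
    n = len(vectors)
    mean = [sum(v[i] for v in vectors) / n for i in range(DIM)]
    return [[(x + m) / 2 for x, m in zip(v, mean)] for v in vectors]


river = embed(["river", "bank"])
money = embed(["money", "bank"])

# The static lookup gives "bank" the same vector in both sentences...
same_static = river[1] == money[1]
# ...but after mixing with its neighbours the two "bank" vectors differ.
same_contextual = mix_with_context(river)[1] == mix_with_context(money)[1]
```

Here `same_static` is true and `same_contextual` is false. A two-token input simply yields two vectors; nothing in this sketch requires filling a fixed-size window.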

lukeholder • Apr 24, 2026

Page keeps annoyingly scroll-jumping a few pixels on iOS safari

Discovery Source

Hacker News

Aggregated via automated community intelligence tracking.
