Illustration for "The Feynman Technique Prompt: How to Make AI Explain Anything in 4 Layers of Depth" — a guide on prompt engineering and Feynman Technique | Applied AI Hub

The Feynman Technique Prompt: How to Make AI Explain Anything in 4 Layers of Depth

By blobxiaoyao Updated: Jun 10, 2026
prompt engineeringFeynman TechniqueAI learningLLMChatGPTClaudedeep learning promptsexplanation prompts
Key Takeaways / TL;DR
  • A copy-paste prompt framework that forces ChatGPT or Claude to break down any concept into four escalating levels of clarity — from a 5-year-old's analogy to an expert's edge-case teardown — so you actually retain what you learn.

You asked ChatGPT to explain self-attention. It gave you a paragraph that sounded right. You nodded. Two hours later, you couldn’t explain it back to yourself.

That’s not a knowledge gap. That’s a prompting gap. You asked the model to explain, but you never told it how deep to go — or at which level you actually needed the explanation. So it defaulted to a single-register answer that optimized for sounding helpful rather than for making you understand.

The Feynman Technique fixes this — not by asking the AI to “explain simply,” but by forcing it to explain the same concept at four distinct cognitive levels, each one exposing a different layer of the idea. The result isn’t one explanation. It’s four passes over the same territory, each at a different altitude. And the gaps between those layers are where genuine understanding forms.

What the Feynman Technique Actually Is

Richard Feynman — Nobel laureate in physics, legendary teacher — had a learning method that was disarmingly straightforward. Pick a concept. Try to explain it as if teaching someone who knows nothing about it. When you hit a wall, go back to the source material. Simplify again. Repeat until the explanation is clean. As documented by Farnam Street, the power of this method lies in its brutality: it reveals exactly where your understanding is performative rather than structural.

Feynman didn’t actually name this “technique” — others formalized the four-step process based on his documented habits and teaching philosophy. But the underlying principle is real and measurable: if you can’t explain something without jargon, you don’t own the concept. You’re renting it.

What makes this powerful as a prompting pattern is that LLMs are extraordinarily good at shifting registers on demand. A model can explain the same concept at the level of a bedtime story, a college lecture, a research seminar, and a one-liner — if you explicitly ask it to. The problem is that almost nobody does.

The Prompt: Four Layers, One Concept

Here’s the template. Copy it. Replace the bracketed concept with whatever you’re trying to learn. (You can paste this directly into the Task field of Prompt Scaffold to assemble it with a Role and Context before running it.)

Use the Feynman Technique to break down this concept for me: [YOUR CONCEPT HERE]

Provide four levels of explanation:

1. **For a 5-year-old**: Use a vivid, everyday analogy. Zero jargon. 
   Make it feel like a bedtime story.

2. **For a curious tech enthusiast**: Introduce the core mechanism. 
   Explain how it actually works, not just what it does. Use precise 
   but accessible language.

3. **For a domain expert**: Full technical teardown. Use exact 
   terminology, discuss boundary conditions, failure modes, and 
   known limitations. Don't simplify — stress-test.

4. **One-sentence distillation**: Capture the irreducible core of 
   the concept in a single sentence. If this sentence doesn't hold 
   up without the other three levels, rewrite it until it does.

That’s the skeleton. It works on GPT-4o, Claude, Gemini, or any capable model. The four-layer structure isn’t decorative — each level serves a distinct cognitive function.

Why Four Levels, Not One

A single explanation, no matter how good, operates at one cognitive altitude. It either oversimplifies (and you feel you understood something you didn’t) or overcomplicates (and you disengage before the insight lands).

The four-level structure does something different. Each layer tests a different dimension of the concept:

Level 1 (5-year-old) tests whether the concept has an intuitive core. If you can’t anchor it to a concrete, physical analogy, there may be a foundational piece you’re skipping over. Some concepts genuinely resist analogy — and discovering that is itself useful information.

Level 2 (tech enthusiast) tests mechanism. This is where “what it does” shifts to “how it works.” The model has to explain causality, not just output. This level catches the most common failure mode in AI-generated explanations: descriptions that are technically accurate but mechanically empty.

Level 3 (expert) stress-tests the boundaries. Where does this concept break? What are the known edge cases? What do practitioners argue about? If the model’s expert-level explanation is just a longer version of Level 2 with more jargon, the concept wasn’t properly decomposed. Push back.

Level 4 (one-sentence) is the compression test. After three passes, can the model — and by extension, you — reduce the entire thing to a single load-bearing sentence? This is where Feynman’s actual philosophy lives. The sentence should feel inevitable, not clever.

Author’s Note: I’ve found Level 4 to be the most diagnostic. If the one-sentence summary is vague (“X is a way of doing Y more efficiently”), the model hasn’t actually distilled anything. Ask it to rewrite Level 4 without using any word that appeared in Levels 1–3. That constraint forces genuine compression rather than summary.

A Live Example: Self-Attention in Transformers

Let’s run the prompt on a concept that trips up a lot of people: the self-attention mechanism in the Transformer architecture.

Level 1 — For a 5-year-old:

Imagine you’re in a classroom and the teacher asks a question. Instead of just listening to the kid next to you, you get to look around the whole room and decide which kids’ answers are most helpful for yours. Some kids are super relevant — maybe they said something earlier that connects to your idea. Others, not so much. Self-attention is like being able to pick who to pay attention to before you raise your hand.

Level 2 — For a curious tech enthusiast:

In a neural network that processes sequences (like sentences), each word needs context from the other words to be understood correctly. Self-attention computes a relevance score between every pair of words in the input. It does this by projecting each word into three vectors — a Query, a Key, and a Value — then using dot-product similarity between Queries and Keys to determine how much each word should attend to every other word. The weighted sum of Values becomes the new, context-enriched representation of each word. This happens in parallel, not sequentially, which is why Transformers are faster to train than RNNs.

Level 3 — For a domain expert:

Self-attention computes scaled dot-product attention, formalized as:

Attention(Q,K,V)=softmax(QKTdk)V\text{Attention}(Q, K, V) = \text{softmax}\left(\frac{QK^T}{\sqrt{d_k}}\right)V

The scaling factor dk\sqrt{d_k} prevents the dot products from growing large enough to push softmax into saturation, which would collapse gradients. Multi-head attention runs hh parallel attention functions with independent learned projections, capturing different relational subspaces. A known limitation: self-attention is O(n2)O(n^2) in sequence length for both compute and memory, which makes it prohibitively expensive for very long sequences without approximations (sparse attention, linear attention, or sliding-window variants like those in Longformer). The position-agnostic nature of the mechanism also means it requires explicit positional encoding — the architecture has no inherent notion of token order.

This is the level where, if you were reading it yourself, you’d know immediately whether you actually understand the concept or were just comfortable with the Level 2 version. That discomfort is the learning signal.

Level 4 — One-sentence distillation:

Self-attention lets every element in a sequence dynamically decide how much to weight every other element, replacing fixed-order processing with learned, context-dependent relevance.

Where This Prompt Fits in a Learning Workflow

The Feynman prompt is not a replacement for reading primary sources. It’s a diagnostic tool. You use it to find out which layer of a concept you actually own and which layers you’re faking.

A practical workflow looks like this:

  1. Read the source material (paper, textbook chapter, documentation).
  2. Run the Feynman prompt on the core concept.
  3. Read each level. At the level where you start nodding along without being able to rephrase the explanation in your own words — that’s your gap.
  4. Go back to the source and attack that specific layer.

This is faster than re-reading an entire paper. It’s more targeted than asking the model open-ended questions. And it builds the kind of layered understanding that survives past the end of the study session.

If you’re using this to build up a personal reference system — a prompt library of concepts you’ve broken down and internalized — the Prompt Vault is built for exactly this. It runs 100% locally in your browser — no server, no account, no data leaving your machine. When you’re running deep-learning prompts that map your personal knowledge gaps and intellectual trajectory, that privacy guarantee matters. Save your Feynman breakdowns with the original concept as the title, and you’ve got a searchable second brain of multi-layered explanations you can revisit when the knowledge starts to fade.

Customizing the Layers for Different Domains

The four-level template works out of the box for technical concepts. For non-technical domains, the layer labels need adjustment, but the principle is identical: escalate the cognitive demand.

For business/strategy concepts:

  • Level 1: Explain it to someone with no business background
  • Level 2: Explain it to a smart generalist who reads the Financial Times
  • Level 3: Explain it to a board member who will poke holes in every assumption
  • Level 4: One-sentence version a CEO would use in an earnings call

For legal concepts:

  • Level 1: Plain-language summary for a non-lawyer client
  • Level 2: The mechanism — how the legal principle actually functions in practice
  • Level 3: Case law nuances, jurisdictional variations, common misapplications
  • Level 4: The principle distilled to its doctrinal essence

The structural point: each level should force the model to demonstrate a different kind of understanding, not just a longer or shorter version of the same explanation.

Common Mistakes That Kill the Feynman Prompt

Mistake 1: Not specifying the concept precisely enough. “Explain machine learning” is too broad. “Explain gradient descent in the context of training a neural network” is scoped enough to produce useful layer differentiation. Broad concepts generate broad explanations at every level.

Mistake 2: Accepting Level 1 as understanding. The analogy level is seductive. It feels like understanding because it’s concrete and emotionally satisfying. But analogies are lossy compressions — they always drop information. If you can only explain the concept at Level 1, you don’t understand it. You understand a story about it.

Mistake 3: Skipping the expert level. Level 3 is where the model (and you) get tested. If you skip it because it feels intimidating, you’re using the Feynman Technique as a comfort tool rather than a learning tool. The discomfort is the signal. Sit with it.

Mistake 4: Treating Level 4 as a summary. The one-sentence distillation is not a summary of the other three levels. It’s a standalone statement that should hold up without the other levels. If removing Levels 1–3 makes Level 4 meaningless, the distillation failed. Here’s the difference:

Failed Level 4 (just a recap): As seen above, Q, K, and V vectors help the model pay attention to different kids in the classroom. — This borrows the Level 1 analogy and the Level 3 notation. Strip away the other levels and this sentence is incoherent.

Real Level 4 (standalone compression): Self-attention lets every element in a sequence dynamically decide how much to weight every other element, replacing fixed-order processing with learned, context-dependent relevance. — No dependency on any prior level. It stands on its own.

This is the hardest part to get right, and it’s worth iterating on.

Making the Model Work Harder: Advanced Variations

Once you’re comfortable with the base template, there are two extensions worth testing.

The Adversarial Follow-Up

After the model generates all four levels, add this:

Now identify the weakest explanation among the four levels. 
Which level would a genuine expert find most misleading or 
incomplete? Rewrite only that level.

This forces the model into self-critique mode — the same principle behind the Recursive Reflection framework, where a Draft → Critique → Rewrite loop consistently produces higher-quality output than a single-pass generation. The model evaluates its own output against expert standards and patches the weakest point.

The Cross-Domain Bridge

Now explain the same concept using an analogy from a completely 
different field: [e.g., music theory, cooking, urban planning]. 
The analogy must be structurally accurate, not just superficially 
similar.

Cross-domain analogies test whether the model (and you) have grasped the structure of the concept or just its surface features. A structurally accurate analogy from an unrelated field is the strongest indicator of genuine understanding.

Why This Works Better Than “Explain Simply”

Most people prompt AI for explanations with some variant of “explain this simply” or “ELI5.” Those prompts produce a single-altitude response. The model picks a register and stays there. You get one explanation, at one level, and no mechanism for discovering what you missed.

The Feynman prompt works better for two mechanical reasons.

First, it forces the model to traverse the full depth of its knowledge about the concept. Each level activates a different subset of the training data — children’s educational content, technical blog posts, academic papers, textbook summaries. A single “explain simply” prompt samples from one region. Four levels sample from four distinct regions. The resulting output contains more information, organized in a way that mirrors how understanding actually develops.

Second, the structure gives you a diagnostic tool. When you read the four levels, you’ll notice where the model’s explanation clicks and where it goes fuzzy. That fuzziness maps to your own knowledge gaps with surprising accuracy — because if the concept were well-understood, you’d recognize a vague explanation immediately.

The Latent-Space Mechanics Behind Four-Level Prompting

There’s a deeper, more technical reason this pattern is effective — and it has to do with how LLMs represent knowledge internally.

An LLM’s training corpus spans a vast spectrum of registers: from children’s picture books to arXiv papers on quantum field theory. Each register occupies a distinct region in the model’s latent space — the high-dimensional vector space where the model encodes meaning. A generic “explain simply” prompt activates a single region and samples from one narrow probability distribution. The output is coherent but shallow.

The four-level Feynman prompt does something fundamentally different. It forces the model to perform what amounts to multi-distribution sampling across the latent space — projecting the same semantic vector (the concept) through four distinct decoder paths, each calibrated to a different knowledge distribution. Level 1 samples from the region associated with simplified, analogical content. Level 3 samples from the region associated with peer-reviewed, technically precise content. The model isn’t just rephrasing — it’s traversing different manifolds of its learned representation for the same underlying concept.

The practical consequence: if the model produces nearly identical output for Level 2 and Level 3, that’s a signal that the concept’s expert-level representation in the training data is thin, or that your concept scoping is too broad. The four levels function as a probe into the model’s own knowledge density — which, by extension, probes yours.

This connects directly to what makes well-structured prompts work in general. As covered in The Anatomy of a Perfect Prompt, the quality of an AI output is determined by how precisely you constrain the model’s generation space. The Feynman prompt constrains along a specific axis — cognitive depth — that most prompts never touch.

Building a Feynman Prompt Workflow With Prompt Scaffold

If you want to systematize this approach — especially if you’re using it regularly for study or content creation — structuring the prompt in a dedicated builder before pasting it into your AI of choice helps avoid the most common errors: forgetting a level, being too vague on the concept, or not specifying the audience precisely enough for each layer.

The Prompt Scaffold tool is designed for exactly this kind of structured prompt assembly. You define the Role (Feynman-style educator), the Task (four-level explanation), the Context (your specific concept and domain), and the Format (the four numbered layers). The live preview shows you the assembled prompt before you run it — which catches structural gaps that are easy to miss when typing directly into a chat window.

The Retention Test

Here’s a way to verify whether the Feynman prompt actually helped you learn, not just read.

Twenty-four hours after running the prompt, try to reproduce Level 2 (the mechanism) and Level 4 (the distillation) from memory. Don’t look at the model’s output. Write them yourself.

If Level 4 comes back immediately but Level 2 is hazy, your understanding is top-level only. You memorized the conclusion but lost the mechanism.

If Level 2 comes back in detail but Level 4 doesn’t, you understood the parts but never synthesized the whole.

If both come back — the mechanism and the essence — the concept is yours. That’s the entire point.

The Feynman Technique was never about making hard things sound easy. It was about discovering the specific point where your understanding stops being real. The prompt just gives you a machine to run that test as many times as you need, on any concept, in under sixty seconds.

Frequently Asked Questions

What is the Feynman Technique Prompt?

It’s a structured prompt template that applies Richard Feynman’s learning method to AI interactions. Instead of asking a model for a single explanation, you instruct it to explain the same concept at four escalating levels of depth — from a child-friendly analogy to an expert-level teardown to a one-sentence distillation. The multi-layer structure exposes knowledge gaps that a single-register explanation would hide.

How do you write a Feynman Technique prompt for ChatGPT or Claude?

Start with the template in this article: specify your concept, then request four levels of explanation — 5-year-old, tech enthusiast, domain expert, and one-sentence essence. The key is scoping the concept tightly (“gradient descent in neural network training” rather than “machine learning”) and placing the level definitions explicitly in the prompt so the model knows what cognitive altitude each layer requires.

Why does “Explain Like I’m 5” (ELI5) fail for learning complex topics?

ELI5 produces one explanation at one cognitive altitude. It optimizes for accessibility but sacrifices mechanism, boundary conditions, and compression. You walk away feeling like you understood — but that feeling comes from the analogy, not from the concept. The Feynman prompt fixes this by requiring the model to also explain how (Level 2), where it breaks (Level 3), and what it reduces to (Level 4). The gaps between those layers are where real learning happens.

Can I use this prompt for non-technical subjects?

Yes. The four-level structure works for any domain — adjust the audience labels to match the field. For legal concepts: layperson → practitioner → litigator → doctrinal essence. For business strategy: non-business reader → FT-reading generalist → skeptical board member → CEO earnings-call one-liner. The principle is identical: escalate cognitive demand across levels.

How many tokens does the Feynman prompt cost?

The template itself is roughly 120 tokens. The model’s four-level response typically runs 600–1,200 tokens depending on concept complexity. On GPT-4o at current pricing, a single Feynman run costs less than $0.01. If you’re running it at scale — say, building a structured knowledge base across dozens of concepts — model the cost before committing.

Related reading:

  • The Anatomy of a Perfect Prompt — The six structural components that determine whether any prompt — including the Feynman template — produces precise output
  • Beyond One-Shot: The Recursive Reflection Framework — How the Draft → Critique → Rewrite loop can improve the quality of your Feynman explanations after initial generation
  • Prompt Scaffold — Assemble structured prompts with dedicated fields for Role, Task, Context, and Format — ideal for building reusable Feynman templates
  • Prompt Vault — Save your Feynman breakdowns as reusable, searchable prompt assets in a privacy-first local manager