9 Things People Get Wrong With /grill-me and /grill-with-docs
The /grill-with-docs skill has become a popular replacement for plan mode in agent workflows. However, many people struggle with using it effectively. The skill works by asking you relentless questions until you reach a shared understanding, but this requires good planning skills on your part.
These skills aren't meant to replace engineers - they're designed to aid you. The goal of this guide is to help you master them by understanding the nine most common failure modes.
Understanding Questions: Low vs High Fidelity
When you enter a grilling session, your goal is to answer questions about the thing you're going to build. But not all questions are equal.
| Question Type | Definition | Example | Grillable? |
|---|---|---|---|
| Low Fidelity | Questions that don't require detailed prototypes or images to answer | "What URL should this route live on?" | ✓ Yes |
| High Fidelity | Questions that need zoomed-in, detailed images or prototypes to answer | "How should this UI feel when we're using it?" | ✗ No |
Form field layout is a good example of a high-fidelity question. Should you split form fields across multiple pages, or use one enormous form? You really need to see a prototype or build the whole thing to answer that properly.
Handling Ungrillable Questions
The first major failure mode is trying to answer high-fidelity questions during a grilling session.
When you hit an ungrillable question, something that needs higher fidelity to understand, use the handoff pattern:
- Continue grilling low-fidelity questions in your first session (the blue session)
- When you hit a high-fidelity question, hand off to a prototyping session
- Build or prototype in that separate session to understand the question better
- Hand back off to your original grilling session to continue with grillable questions
This pattern looks like: grill → prototype → grill again. It lets you answer high-fidelity questions without breaking your grilling flow.
Choosing the Right Scope
Scope, how large a thing you're grilling, is absolutely critical.
If your scope is too large, two problems emerge:
Problem 1: Hidden high-fidelity questions It's always easier to build off something you know works rather than endlessly planning into the future. When people try to schedule days of tasks for their AI to work on, they often end up with poor results because they're not building on a solid foundation they understand.
Problem 2: Context window limits You might start with a nearly empty context window, but as you keep grilling and grilling, you'll hit the model's "dumb zone" - around 120k tokens for most state-of-the-art models. Once you pass this threshold, the model's attention relationships get strained and it starts making worse decisions.
Breaking Down Large Scopes
Instead of grilling on one massive scope, ask the agent upfront to break it down:
- Start with a large scope
- Have the agent break it into smaller, grillable chunks
- Grill on each smaller scope individually
- Answer all the questions across those separate sessions
This keeps you comfortably within the "smart zone" and avoids hitting the context window wall mid-session.
Being Active, Not Passive
Many long grilling sessions fail because people are too passive with their agent. Remember: it's a conversation, not an interview.
The agent asks questions, but you need to:
- Figure out where you're going
- Understand the scope
- Keep things on track
- Steer the conversation actively
If you're too passive, the agent will bombard you with 540 questions and explode the scope with requests about things that are way too low-fidelity.
But there's a balance. Being too active means grilling endlessly on low-fidelity details when you should actually be writing code. If you're just planning and planning without building, you're over-grilling.
Find the middle ground: Be actively involved in steering the session while knowing when to stop planning and start implementing.
Preserving Your Design Decisions
This is a critical but often overlooked failure mode. When you grill, you create an incredibly valuable artifact: a context window full of design decisions.
By the time you finish grilling, you've made hundreds of tokens worth of choices about how your system should work. This is pure gold.
Your options:
- If you have context budget left: Implement directly in the same session without handing off
- If you're running out of context: Create a PRD (product requirements document) using the
/2PRDskill as a handoff artifact
Do not clear the context and start fresh just to write a PRD. That's throwing away all your design work. Every decision in that grilling session has value and should either become code or be documented in a handoff artifact.
Using Smart Models for Grilling
Using a "dumb" model for grilling is a common mistake. Here's why it matters:
When you grill, you rely on two sources of knowledge:
| Knowledge Type | Source | Reliability | When It Matters |
|---|---|---|---|
| Contextual | Files you pass, prompts, tool results | Very reliable | Implementation phase |
| Parametric | The model's training data and parameters | Less reliable, but creative | Grilling phase |
During grilling, you're relying on the model's parametric knowledge - its innate understanding of systems and applications to suggest things you might not have considered. If you had thought of them, you would have passed them in as context.
A dumb model won't give you good ideas. You need a model with lots of parameters - usually the big frontier models - to get creative suggestions and thought-provoking questions.
However, for implementation, you can use a smaller model since most information there is contextual (your detailed plan, the codebase, etc.).
Running Parallel Grilling Sessions
Finally, one simple but powerful technique: grill multiple sessions in parallel.
Here's how it works:
- You're grilling in session A
- The agent asks you a question
- You answer it
- While the agent is thinking, switch to session B
- Answer the question there
- Session A is ready; switch back
- Repeat
This is just like managing two Slack threads at once. You're not actually context-switching heavily - you're managing separate conversations.
Benefits:
- You double your throughput
- You get more planning done in less time
- You can keep multiple design decisions flowing
Most people max out at two sessions comfortably. If one session is doing a long-running task like research, you might handle three. As you get better at grilling, you can increase parallelism.
Key Takeaways
- Grilling is about questions. Low-fidelity questions are grillable; high-fidelity ones require prototyping
- Scope matters. Choose smaller scopes to avoid blowing through your context window
- Be active. Guide the conversation while knowing when to stop planning and start coding
- Preserve value. Every decision you make in grilling should be recorded somewhere
- Use smart models. You need parametric knowledge to get creative suggestions
- Run in parallel. Once you understand what each session does, flip between them efficiently
The better you understand these failure modes, the more effectively you can use grilling sessions to design before you code.