
Context Engineering - Making Every Token Count

Addy Osmani
September 09, 2025


Slides from my talk at O'Reilly AI CodeCon

Video: https://www.youtube.com/watch?v=zMM5zqesL1g

How do you get the best out of AI systems when every token in the context window matters? In this talk I break down context engineering - the art and science of filling an AI’s limited memory with the right mix of instructions, data, and history so it can perform at its best.

You'll learn:

- What tokens and context windows really are, and why they matter 
- Why prompt engineering often fails without strong context management 
- Practical strategies to avoid vague, hallucinated, or poisoned responses 
- Patterns for context management - Write, Select, Compress, Isolate
- How modern AI tools like Cursor and Cline optimize context automatically 
- Actionable tips for AI-assisted coding: from error logs and design docs to database schemas and PR feedback

Whether you’re building with coding agents, debugging with AI, or designing smarter prompts, this talk will help you make every token count.


Transcript

  1. CONTEXT WINDOWS ARE LIKE LIMITED RAM. Curation of what fits into RAM is analogous to "context engineering". (ANALOGY)
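
To make the RAM analogy concrete, here is a minimal sketch of checking how much of a fixed token budget a piece of context consumes. It assumes the tiktoken library as the tokenizer; the 128,000-token budget and the file name are illustrative, not any specific model's window.

```python
# A context window is a fixed token budget, much like RAM.
# tiktoken is used as one example tokenizer; the budget and file name
# below are illustrative assumptions, not tied to a particular model.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

def tokens_used(text: str) -> int:
    """Count how many tokens a piece of context will consume."""
    return len(enc.encode(text))

CONTEXT_BUDGET = 128_000  # assumed window size

context = open("design_doc.md").read()  # illustrative file
used = tokens_used(context)
print(f"{used} / {CONTEXT_BUDGET} tokens ({used / CONTEXT_BUDGET:.1%} of the window)")
```
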
  2. PROMPT ENGINEERING: Clear instructions for models so they can accomplish a task. https://www.youtube.com/watch?v=ysPbXH0LpIE (PRELUDE)
  3. CONTEXT ENGINEERING MEANS PROVIDING AN AI WITH ALL THE INFORMATION AND TOOLS IT NEEDS TO SUCCESSFULLY COMPLETE A TASK – NOT JUST A CLEVERLY WORDED PROMPT. (DEFINITION)
  4. TOO LITTLE CONTEXT: VAGUE OR HALLUCINATED RESPONSES. TOO MUCH CONTEXT: DISTRACTED, UNABLE TO FIND RELEVANT INFO, OVER-INDEXING ON PATTERNS. BAD CONTEXT: POISONING, TRUSTING INCORRECT STATEMENTS OVER TRAINING.
  7. SELECT CONTEXT • Retrieve relevant tools • Retrieve from scratchpad • Retrieve long-term memory • Retrieve relevant knowledge. @aifolksorg
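
A minimal sketch of the select pattern, assuming naive keyword-overlap scoring; real tools typically rank candidates with embeddings or a code-aware index rather than this toy scorer.

```python
# "Select context": retrieve only the snippets most relevant to the task
# instead of pasting everything into the prompt. The scoring below is
# naive keyword overlap, used purely for illustration.
def relevance(snippet: str, query: str) -> int:
    terms = set(query.lower().split())
    return sum(term in snippet.lower() for term in terms)

def select_context(snippets: list[str], query: str, top_k: int = 3) -> list[str]:
    """Return the top_k snippets that best match the query."""
    return sorted(snippets, key=lambda s: relevance(s, query), reverse=True)[:top_k]

docs = ["def checkout(cart): ...", "README: project setup", "def apply_discount(total): ..."]
print(select_context(docs, "fix the discount bug in checkout", top_k=2))
```
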
  8. COMPRESS CONTEXT • Summarize context to retain relevant tokens • Trim to remove irrelevant tokens @aifolksorg
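
A minimal sketch of the compress pattern, assuming a hypothetical summarize_with_llm() helper in place of a real model call: older turns are summarized so their relevant tokens survive, and recent turns are kept verbatim.

```python
# "Compress context": summarize older conversation turns and trim the
# rest. summarize_with_llm() is a hypothetical helper standing in for
# whatever summarization call your stack provides.
def compress_history(turns: list[str], keep_recent: int = 4) -> list[str]:
    older, recent = turns[:-keep_recent], turns[-keep_recent:]
    if not older:
        return recent  # nothing old enough to compress yet
    summary = summarize_with_llm("\n".join(older))  # hypothetical helper
    return [f"Summary of earlier conversation: {summary}"] + recent
```
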
  9. ISOLATE CONTEXT • Partition context in state • Hold in environment/sandbox • Partition across multi-agents @aifolksorg
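
A minimal sketch of the isolate pattern, assuming a hypothetical run_subagent() helper: each partition of the work is handled by a separate call whose context holds only its own files, so no single window has to carry everything.

```python
# "Isolate context": partition work so each sub-task gets its own,
# smaller context instead of one shared window. run_subagent() is a
# hypothetical helper and the file names are illustrative.
partitions = {
    "frontend": ["src/ui/App.tsx", "src/ui/styles.css"],
    "backend": ["api/server.py", "api/models.py"],
}

results = {
    name: run_subagent(task="review these files for bugs", files=files)  # hypothetical
    for name, files in partitions.items()
}
```
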
  10. CONTEXT ENGINEERING FOR AI CODERS
      • Be precise: Vague requests lead to vague answers. The more specific you are, the better your results will be.
      • Provide relevant code: Share the specific files, folders, or code snippets that are central to your request.
      • Include design documents: Paste or attach sections from relevant design docs to give the AI the bigger picture.
      • Share full error logs: For debugging, always provide the complete error message and any relevant logs or stack traces.
      • Show database schemas: When working with databases, a screenshot of the schema helps the AI generate accurate code for data interaction.
      • Use PR feedback: Comments from a pull request make for context-rich prompts.
      • Give examples: Show an example of what you want the final output to look like.
      • State your constraints: Clearly list any requirements, such as libraries to use, patterns to follow, or things to avoid.
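
Putting several of these tips together, a minimal sketch of assembling a context-rich prompt that bundles the task, the relevant code, the full error log, constraints, and an example of the desired output; the file names and constraints are illustrative placeholders.

```python
# Assemble a context-rich prompt for an AI coding assistant.
# All file names and constraints below are illustrative.
relevant_code = open("src/checkout.py").read()
error_log = open("pytest_failure.log").read()

prompt = f"""Task: fix the failing checkout test.

Relevant code (src/checkout.py):
{relevant_code}

Full error log:
{error_log}

Constraints:
- Use the existing payments client; do not add new dependencies.
- Follow the repository's existing typing conventions.

Desired output: a unified diff that touches only src/checkout.py."""
```
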