Log in or create a free Rosenverse account to watch this video.
Log in Create free account100s of community videos are available to free members. Conference talks are generally available to Gold members.
Building impactful AI products for design and product leaders, Part 3: Understand AI architectures: RAG, Agents, Oh My!
This video is featured in the AI and UX playlist.
Summary
Agents, RAG, Memory, Vector Databases, Tool Use, MCP! The sector is rapidly evolving lots of architectures to make AI products more helpful and impactful. Peter van Dijck of Simply Put will share a simple framework that helps you understand how to think about these architectures, how to plan around them, and how to work with engineering teams on them. None of this is rocket science, but the acronyms are many. You don’t need to write code, but you do need to understand what is going on in these systems. You will learn how to think about and understand these complex-seeming architectures, and how to think about new ones as they come out.
Key Insights
-
•
Large language models are stateless and rely entirely on the provided context window for each response.
-
•
The context window is essentially a text file aggregating system instructions, user queries, documents, and other relevant data.
-
•
All modern AI techniques (retrieval, augmented generation, tool use) aim to improve the quality and relevance of this context window.
-
•
Tool use allows the model to autonomously decide when to invoke external APIs or services based on the input query.
-
•
Agent models enhance tool use by planning and executing multiple tool calls in an iterative, reasoning loop until a task is complete.
-
•
Post-training with billions of examples significantly improves models' abilities in reasoning, tool use, and planning.
-
•
Designing AI products should begin with user needs and the necessary context rather than starting with complex agent architectures.
-
•
Prompt structure, including semantic content and organization (e.g., XML tags), helps the model parse context effectively but is flexible.
-
•
Token limits constrain the context window size; modern models like Google’s can handle up to a million tokens, enabling very large context inputs.
-
•
User-specific data (e.g., PTO policy, employee info) can be integrated into the context window dynamically through backend queries to provide accurate personalized responses.
Notable Quotes
"Models are stateless; they have no memory and forget everything after each response."
"Context design and context engineering mean figuring out and building what needs to go into that text file sent to the model."
"All the complicated-sounding techniques are just ways to put the relevant text into the context window."
"Think of the context window like an intern's briefing document: would the intern be able to answer the question with this information?"
"Tool use lets the model decide itself when to call external APIs or services to get needed information."
"An agent is a model using tools in a loop, making plans, reasoning, and calling tools until it’s done."
"Post-training on billions of examples is like training a dog over and over until it gets really good at reasoning and tool use."
"You never start AI product design from the technology itself; you start from the user outcomes and retro-engineer the needed context."
"Language is hard, and we use anthropomorphic words like reasoning and thinking to describe what the model does technically."
"If you understand how engineers think about these models, you won’t be scared of concepts like synthetic data or tool use."
Or choose a question:
More Videos
"Empathy means constantly going back to the people using the tools and hearing their feedback."
Jon Fukuda Amy Evans Ignacio Martinez Joe MeersmanThe Big Question about Innovation: A Panel Discussion
September 25, 2024
"I have never found a product that can’t be made accessible."
Sam ProulxAccessibility: An Opportunity to Innovate
March 9, 2022
"Design and research people must report to leaders who understand their functions, or else they get assigned irrelevant tasks like social coordinator."
Anna Avrekh Amy Jiménez Márquez Morgan C. Ramsey Catarina TsangDiversity In and For Design: Building Conscious Diversity in Design and Research
June 9, 2021
"The biggest thing for me is that bots bring our human values to AI and help curb abuses."
Greg NudelmanDesigning Conversational Interfaces
November 14, 2019
"The components you design in Sketch or XD come with matching coded components in the app builder, so what you design is what you get in code."
George Abraham Stefan IvanovDesign Systems To-Go: Reimagining Developer Handoff, and Introducing App Builder (Part 2)
October 1, 2021
"Synthetic data created by AI—like fake personas and journeys—is super derivative and often not insightful."
Shipra KayanMake your research synthesis speedy and more collaborative using a canvas
January 24, 2025
"The experience of using a screen reader is probably 10 times faster as you become more expert with it."
Sam ProulxDesigning For Screen Readers: Understanding the Mental Models and Techniques of Real Users
December 10, 2021
"I spent too much time trying to change company culture and should have focused more on just getting things done."
Shipra KayanHow we Built a VoC (Voice of the Customer) Practice at Upwork from the Ground Up
September 30, 2021
"Physical therapists see their hands as a golden resource and tend to resist technology that might replace that."
Dane DeSutterKeeping the Body in Mind: What Gestures and Embodied Actions Tell You That Users May Not
March 26, 2024