Scale Smart: AI-Powered Content Organization Strategies
Summary
Keeping large content repositories organized is an ongoing challenge. There's always new stuff coming in, and taxonomies evolve over time. Resource-strapped teams seldom have opportunities to re-categorize older content. It's a task well-suited for generative AI. Large language models have powerful capabilities that can help teams keep content organized at scale. Using LLMs in this capacity can lead to better user experiences and free team members to focus on more valuable efforts. This presentation explores two approaches for using LLMs to organize content at scale: 1) re-categorizing content using existing categories and 2) developing new categories from existing content. Both will be shown as proofs of concept alongside feasible next steps.
Key Insights
-
•
Maintaining organized large content repositories is often deprioritized despite evolving product portfolios, leading to reduced findability.
-
•
Search alone is insufficient for large repositories because users often lack context about what content exists.
-
•
Evolving taxonomies require periodic retagging of older content to keep it relevant and findable.
-
•
AI, specifically large language models like GPT-4, can automate taxonomy maintenance tasks significantly faster than humans.
-
•
Human oversight is critical to review AI-suggested tags to prevent hallucinations and ensure quality.
-
•
Developing new taxonomies requires analyzing content holistically, which exceeds typical LLM context windows.
-
•
Embedding databases help chunk and relate content statistically for clustering similar topics at scale.
-
•
Graph Retrieval Augmented Generation (Graph RAG) integrates knowledge graphs with LLMs to increase precision by leveraging semantic relationships.
-
•
RAG enables querying domain-specific content unseen by generic LLM training data, supporting bespoke AI assistants.
-
•
AI tools enable higher speed and new methods for content work but require new workflows and ongoing experimentation.
Notable Quotes
"Keeping large content repositories organized is an ongoing challenge that often gets deprioritized."
"Search alone doesn’t cut it, particularly when people don’t know what they’re looking for or lack context."
"If organizations don’t keep content organized, it becomes less usable and less useful over time."
"Large language models can help organize content faster and at much larger scale than people can."
"I used GPT-4 to retag 1,200 blog posts, reducing an estimated 10–12 hours of tedious manual work to about a third of that time."
"I made sure the LLM would only use terms from my predefined taxonomy, but it still introduced some of its own tags."
"Working with these tools requires new workflows and checkpoints to validate and tweak the AI's output."
"Graph RAG uses knowledge graphs instead of just plain text snippets, which really improves precision."
"I see these AI tools as augmenting human work, letting us focus on what matters most, not replacing humans."
"The best way to understand these technologies is to learn them hands-on, especially if your work involves language or taxonomies."
Or choose a question:
More Videos
"Desktop screen readers can run at speeds up to 800 words per minute because users get used to synthetic speech."
Sam ProulxEverything You Ever Wanted to Know About Screen Readers
June 11, 2021
"If you are a cohort participant and don’t see your private Slack channel, please let us know so we can guide you."
Bria AlexanderOpening Remarks
November 17, 2022
"Build relationships before you need them. You can’t create them when the house is on fire."
Corey Nelson Amy SanteeLayoffs
November 15, 2022
"Edgy is like a Rosetta Stone for enterprises, expressing the same thing in languages designers, strategists, and architects use."
Milan GuentherA Shared Language for Co-Creating Ambitious Endeavours
June 6, 2023
"People will talk to customers whether you want them to or not. The question is how to make it a better experience."
Erin May Roberta Dombrowski Laura Oxenfeld Brooke HintonDistributed, Democratized, Decentralized: Finding a Research Model to Support Your Org
March 10, 2022
"When blind users choose between Android and iPhone, they weigh trade-offs between stability and customizability."
Sam ProulxUnderstanding Screen Readers on Mobile: How And Why to Learn from Native Users
June 6, 2023
"We used AI tools like NotebookLM and Gemini primarily to shortcut getting up to speed and managing transcripts, without focusing on specific tools themselves."
Mujtaba HameedThe new horizon of ethnography: using AI to unlock the full potential of in-person research
March 11, 2026
"Futures thinking is not about predicting the future, but about being smarter about anticipating risks and consequences of our actions today."
Ilana LipsettAnticipating Risk, Regulating Tech: A Playbook for Ethical Technology Governance
December 10, 2021
"Accessibility is innovation and the kinds of features people with disabilities need are incredible conveniences for the rest of us."
Samuel ProulxFrom Standards to Innovation: Why Inclusive Design Wins
September 10, 2025
Latest Books All books
Dig deeper with the Rosenbot
What strategies help UX teams translate usability findings into stakeholder-relevant outcomes like cost savings or risk reduction?
How do I map and balance stakeholders effectively in complex healthcare product efforts?
When is it best to use interviews versus surveys during product and UX research?