Rosenverse

This video is only accessible to Gold members. Log in or register for a free Gold Trial Account to watch.

Log in Register

Most conference talks are accessible to Gold members, while community videos are generally available to all logged-in members.

[Demo] How to re-categorize content at scale using LLMs

Gold
Wednesday, June 5, 2024 • Designing with AI 2024
Share the love for this talk
[Demo] How to re-categorize content at scale using LLMs
Speakers: Jorge Arango
Link:

Summary

Large Language Models (LLMs) are to language as spreadsheets are to numbers: tools for modeling, exploration, and development. Among their many capabilities, LLMs can alleviate chores related to the design and implementation of information architectures. But doing so requires venturing beyond chat-based interfaces. In this brief demonstration, we'll see how to use OpenAI's API and a few open source command line tools to re-categorize content in a 1,000+ page website. The techniques demonstrated can be extended to other common content organization tasks.

Key Insights

  • Manual retagging of 1,200 blog posts would take about 10 hours, but leveraging GPT-4 reduced active human time to about 2 hours.

  • Using GPT-4 via command line and shell scripts enables automated tagging outside typical chat interfaces.

  • An organically grown taxonomy over 20 years contained unclear acronyms and inconsistent tag forms that GPT initially struggled with.

  • Cleaning and standardizing the taxonomy before prompting GPT is critical for effective AI assistance.

  • A review step of AI-suggested tags in CSV format allows human correction to avoid hallucinations entering production.

  • GPT-4 can propose new and useful tags outside the original taxonomy, enriching content classification.

  • The four-step GRU framework (Gather, Review, Update, Wrap up) balances automation with human oversight.

  • Storing blog content as markdown files simplifies integrating AI workflows via scripting and file manipulation.

  • The approach is adaptable and scalable to other CMS platforms by replacing scripting with API calls.

  • Taxonomies should use clear, unambiguous terms to improve both human and AI understanding.

Notable Quotes

"Some of the older content has discoverability problems, which is typical with blogs."

"Doing this tagging manually would have taken me around 10 hours of mind-numbing work."

"I’m actually using GPT-4, but not via the chat interface—I'm calling it from the Mac’s command line."

"I had to clean the taxonomy up because GPT wouldn’t know what to do with acronyms like TAOI."

"I save the proposed tags to a CSV file so I can preview and edit them before applying the changes."

"A middle review step prevents hallucinations from making it into the production site."

"GPT-4 functioned as an assistant not just in retagging but also in improving the taxonomy itself."

"The entire process took about three hours from start to finish, about a fifth of the manual time."

"Use clear and obvious terms in taxonomies—unusual acronyms won’t make sense to GPT or others."

"You need to review proposed changes before committing them to production, otherwise errors sneak in."

Ask the Rosenbot
Ariel Kennan
Civic Design in 2022
2022 • Civic Design Community
Prayag Narula
HCI 2.0: Humanity Deserves the Attention that UX Research has to Offer
2023 • Advancing Research 2023
Gold
Ryan Matthew
Bridging Design and Code: AI-Powered Design System Integration
2025 • DesignOps Summit 2025
Gold
Joshua Graves
We Need To Talk: Managing Ludicrous Requests at Work (Part 3 of 3)
2025 • Rosenfeld Community
Christian Madsbjerg
Influencing Strategy
2020 • Advancing Research 2020
Gold
Eduardo Ortiz
Theme 3 Intro
2025 • Advancing Research 2025
Gold
Panel Discussion: Communicating the Value of DesignOps
2018 • DesignOps Summit 2018
Gold
Bria Alexander
Opening Remarks
2021 • DesignOps Summit 2021
Gold
Deanna Zandt
The Unspoken Complexity of “Self-Care” with Deanna Zandt
2022 • Civic Design Community
Nathan Shedroff
Redefining Value: Bridging the Innovation Culture Divide
2015 • Enterprise UX 2015
Gold
Hugh Dubberly
Problems with Problems: Reconsidering the Frame of Designing as Problem-Solving
2019 • Enterprise Community
Angy Peterson
More Than Technology: Personalized Public Sector Experiences
2021 • Civic Design 2021
Gold
Harry Brignull
Beyond Clicks and Tricks: Why deceptive design has grown into a regulatory faultline
2026 • Rosenfeld Community
Marjorie Stainback
Transforming Strategic Research Capacity through Democratization
2019 • DesignOps Summit 2019
Gold
Marc Majers
Interrupted UX - Add A Dose of Reality To Usability Testing
2022 • Advancing Research 2022
Gold
Surya Vanka
Unleashing Swarm Creativity to Solve Enterprise Challenges
2021 • Design at Scale 2021
Gold

More Videos

Jim Kalbach

"There are no mistakes in jazz, just missed opportunities."

Jim Kalbach

Jazz Improvisation as a Model for Team Collaboration

November 6, 2017

Louis Rosenfeld

"A really strong book has to be designed as a journey with a consistent voice guiding the reader."

Louis Rosenfeld

Coffee with Lou: Should You Write a (UX) Book?

March 7, 2024

Catt Small

"Keeping your craft sharp, learning new tools like auto layout in Figma, and challenging yourself help maintain relevance."

Catt Small Micah Bennett Brian Carr Jessica Harllee

What's Next for ICs: Exploring Staff and Principal Designer Roles

February 22, 2024

Marieke McCloskey

"We can’t be there for every behavior after the nudge, nor do we necessarily want to be."

Marieke McCloskey

User Science: Product Analytics & User Research

March 11, 2021

Llewyn Paine

"User recordings are your most valuable asset but have become riskier due to biometric privacy laws."

Llewyn Paine

[Demo] Deploying AI doppelgangers to de-identify user research recordings

June 5, 2024

Joshua Noble

"Designers have a hard time interpreting econometrics-driven quantitative research without bridging approaches."

Joshua Noble

Casual Inference

October 6, 2023

Sara Logel

"We need to think about who we’re sharing with, how they might react, and what motivates them."

Sara Logel

Your Colleagues are Your Users Too

March 29, 2023

Bria Alexander

"If something makes you feel unwelcome, the code of conduct explains how to engage with staff to resolve issues."

Bria Alexander Louis Rosenfeld

Welcome

January 8, 2024

Sam Proulx

"If you have to learn a workaround, you want to learn it once and reuse it again and again."

Sam Proulx

Online Shopping: Designing an Accessible Experience

June 7, 2023