This video is only accessible to Gold members. Log in or register for a free Gold Trial Account to watch.

Log in Register

Most conference talks are accessible to Gold members, while community videos are generally available to all logged-in members.

[Demo] How to re-categorize content at scale using LLMs

Gold

Wednesday, June 5, 2024 • Designing with AI 2024

Jorge Arango

Jorge Arango

Author of Living in Information and Duly Noted

Summary

Large Language Models (LLMs) are to language as spreadsheets are to numbers: tools for modeling, exploration, and development. Among their many capabilities, LLMs can alleviate chores related to the design and implementation of information architectures. But doing so requires venturing beyond chat-based interfaces. In this brief demonstration, we'll see how to use OpenAI's API and a few open source command line tools to re-categorize content in a 1,000+ page website. The techniques demonstrated can be extended to other common content organization tasks.

Key Insights

•

Manual retagging of 1,200 blog posts would take about 10 hours, but leveraging GPT-4 reduced active human time to about 2 hours.
•

Using GPT-4 via command line and shell scripts enables automated tagging outside typical chat interfaces.
•

An organically grown taxonomy over 20 years contained unclear acronyms and inconsistent tag forms that GPT initially struggled with.
•

Cleaning and standardizing the taxonomy before prompting GPT is critical for effective AI assistance.
•

A review step of AI-suggested tags in CSV format allows human correction to avoid hallucinations entering production.
•

GPT-4 can propose new and useful tags outside the original taxonomy, enriching content classification.
•

The four-step GRU framework (Gather, Review, Update, Wrap up) balances automation with human oversight.
•

Storing blog content as markdown files simplifies integrating AI workflows via scripting and file manipulation.
•

The approach is adaptable and scalable to other CMS platforms by replacing scripting with API calls.
•

Taxonomies should use clear, unambiguous terms to improve both human and AI understanding.

Notable Quotes

"Some of the older content has discoverability problems, which is typical with blogs."

"Doing this tagging manually would have taken me around 10 hours of mind-numbing work."

"I’m actually using GPT-4, but not via the chat interface—I'm calling it from the Mac’s command line."

"I had to clean the taxonomy up because GPT wouldn’t know what to do with acronyms like TAOI."

"I save the proposed tags to a CSV file so I can preview and edit them before applying the changes."

"A middle review step prevents hallucinations from making it into the production site."

"GPT-4 functioned as an assistant not just in retagging but also in improving the taxonomy itself."

"The entire process took about three hours from start to finish, about a fifth of the manual time."

"Use clear and obvious terms in taxonomies—unusual acronyms won’t make sense to GPT or others."

"You need to review proposed changes before committing them to production, otherwise errors sneak in."

Previous video

Next video

Ask the Rosenbot

Or choose a question:

How can GPT-4 be integrated into a static site workflow for content retagging?

What steps can I take to prepare an inconsistent taxonomy so GPT can tag content effectively?

How can I use command line tools to send markdown file content to GPT-4 for tagging?

What is a good workflow to review and approve AI-generated tag changes before applying them?

Can LLMs contribute to improving a taxonomy, not just tagging content?

Alberto Ferreira

Making it Count: Developing a custom digital metric framework that works

2021 • QuantQual Interest Group

Adam Thomas

Survival Metrics – Making Change in a Fast, Data-Informed, and Politically Safe Way

2022 • Design in Product 2022

Lada Gorlenko

Theme 1: Discussion

2024 • Enterprise Experience 2020

Clara Kliman-Silver

UX Futures: The Role of Artificial Intelligence in Design

2023 • Enterprise UX 2023

Shan Shen

Translating UX Terms into Business Contexts

2023 • Design in Product 2023

Pippa Lomas

Paving the Path for Neurodiversity in Design

2023 • DesignOps Summit 2023

Gabriela Barneva

Operationalizing Inclusive Design in Design Ops

2025 • DesignOps Summit 2025

Dane DeSutter

Keeping the Body in Mind: What Gestures and Embodied Actions Tell You That Users May Not

2024 • Advancing Research 2024

Meredith Black

Scaling Design Culture

2017 • DesignOps Summit 2017

Patrizia Bertini

Designing Within the Lines: How the EU AI Act Can Spark Better AI Innovation

2025 • DesignOps Community

Andrew Custage

The Digital Journey: Research on Consumer Frustration and Loyalty

2023 • Advancing Research 2023

Dem Gerolemou

Climate technology fundamentals

2024 • Climate UX Interest Group

Sandra Camacho

Creating More Bias-Proof Designs

2025 • Rosenfeld Community

Kaaren Hanson

Stop Talking, Start Doing

2017 • Enterprise Experience 2017

Zen Ren

Taking Inspiration from Instructional Design for Research

2022 • Advancing Research 2022

Irina Tikhonova

Small Wins, Big Impact: Leveraging and Elevating User Engagement

2021 • Civic Design 2021

More Videos

Sam Proulx

"Testing with the tab key is important, but it has nothing to do with a screen reader user’s actual experience."

Everything You Ever Wanted to Know About Screen Readers

June 11, 2021

Bria Alexander

"You can access the digital swag bag by scanning the QR code or visiting fld.me/cd2022 for cool sponsor offers."

Opening Remarks

November 17, 2022

Corey Nelson

"Keep your LinkedIn profile updated and compelling so you’re findable for passive job opportunities."

Corey Nelson Amy Santee

Layoffs

November 15, 2022

Milan Guenther

"Shared language lets different roles venture out of their comfort zones and collaborate on changing the enterprise system."

A Shared Language for Co-Creating Ambitious Endeavours

June 6, 2023

Erin May

"Without releasing control, democratizing research won’t scale. We have to empower people even if some things won’t be perfect."

Erin May Roberta Dombrowski Laura Oxenfeld Brooke Hinton

Distributed, Democratized, Decentralized: Finding a Research Model to Support Your Org

March 10, 2022

Sam Proulx

"Simulation or personas can never replace testing with real users to build empathy and get accurate results."

Understanding Screen Readers on Mobile: How And Why to Learn from Native Users

June 6, 2023

Mujtaba Hameed

"AI can’t yet fully replace bespoke and careful proposal building, though it helps junior researchers draft the first outlines."

The new horizon of ethnography: using AI to unlock the full potential of in-person research

March 11, 2026

Ilana Lipsett

"We decided these would be the social norms now, and we just went for it."

Anticipating Risk, Regulating Tech: A Playbook for Ethical Technology Governance

December 10, 2021

Samuel Proulx

"When we build accessibility into an environment, especially if we do it subtly, it becomes the new normal."

From Standards to Innovation: Why Inclusive Design Wins

September 10, 2025

Latest Books All books

Sentient Design

Sentient Design

Crafting Intelligent Interfaces with AI

By Josh Clark, Veronika Kindred

June 2026

Designing Assistant Technology

Designing Assistant Technology

AI That Makes Us Smarter

By Christopher Noessel

March 2026

The Staff Designer

The Staff Designer

Grow, Influence, and Lead as an Individual Contributor

By Catt Small

December 2025

Design for Privacy

Design for Privacy

Keeping Personal Information Private

By Robert Stribley

November 2025

Service Design (2nd edition)

Service Design (2nd edition)

From Insight to Implementation

By Lavrans Løvlie, Ben Reason, Andy Polaine

October 2025

The Game Development Strategy Guide

The Game Development Strategy Guide

Crafting Modern Video Games That Thrive

By Cheryl Platz

September 2025

Stop Wasting Research

Stop Wasting Research

Maximize the Product Impact of Your Organization's Customer Insights

By Jake Burghardt

June 2025

We Need to Talk

We Need to Talk

A Survival Guide for Tough Conversations

By Joshua Graves

April 2025

Human-Centered Security

Human-Centered Security

How to Design Systems That Are Both Safe and Usable

December 2024

The Design Conductors

The Design Conductors

Your Essential Guide to Design Operations

October 2024

Research That Scales

Research That Scales

The Research Operations Handbook

By Kate Towsey

September 2024

The User Experience Team of One (2nd Edition)

The User Experience Team of One (2nd Edition)

A Research and Design Survival Guide

By Leah Buley, Joe Natoli

August 2024

Design for Impact

Design for Impact

Your Guide to Designing Effective Product Experiments

By Erin Weigel

June 2024

Managing Priorities

Managing Priorities

How to Create Better Plans and Make Smarter Decisions

By Harry Max

May 2024

Duly Noted

Duly Noted

Extend Your Mind through Connected Notes

By Jorge Arango

January 2024

Dig deeper with the Rosenbot

Why is prior authorization such a persistent problem and how can patient-centered solutions be designed around it?

Is Rosenbot free to use for exploring UX and product design content?

What are the key psychological benefits of incorporating play in team workshops?