Praveen Juge

Written on

Designing an AI Metadata Pipeline for a Personal Knowledge Hub

How Teak uses a simple, predictable AI pipeline to quietly make sense of all your saved links, screenshots, notes, and files.

Teak is a personal knowledge hub. You save links, screenshots, notes, files. Teak’s job is to quietly make sense of all that. (Teak)

The way it does that is through metadata.

Not the glamorous kind. The boring kind that makes everything else possible.


Metadata as a promise

For every card in Teak, I treat metadata like a promise:

That promise is written down as a schema:

Same fields. Same types. Same meaning.

The AI model doesn’t get to invent new shapes. It only gets to fill in the blanks.


The simple pipeline

Most AI products hide behind magic. Teak is more like a kitchen.

For each thing you save, the pipeline is:

  1. Scrape Grab what we can without AI: title tags, HTML, file info.

  2. Enrich Clean it up. Normalize URLs. Strip noise. Prepare a clear input.

  3. AI Send a single, boring request: “Given this content, return JSON in this format. Nothing else.”

  4. Post-process Check the JSON. Trim it. Sanity-check tags. Throw away junk.

  5. Card UI Show a card that feels “smart”, but is really just consistent.

From the user’s point of view: a rough card appears → a few seconds later, it sharpens into something useful.


Making AI boring on purpose

The hard part is not getting AI to be clever. It’s getting it to be predictable.

So I give it constraints:

Then I add guardrails:

The result is an AI system that feels calm:


In the end, “AI metadata pipeline” sounds complex. But the idea is simple: