EPISODE 01 · What is ai agent · 2026.05.17

The word 「Agent」 means
three different things to three different people.

This is a talk about AI Agents.
More accurately — it's a talk about not getting fooled by the phrase "AI Agent."

AGENT 101 SERIES · 90 MIN · NO-BULLSHIT EDITION
§ 01 · Start with the word

"AI Agent" means three completely different things in three different mouths

Wittgenstein: the meaning of a word is its use in language. When the same word means completely different things to different people, the word starts spinning empty — you think you're discussing one thing, but actually you're talking past each other about three.

  • USE · 01 · The people selling tools

    Agent = my product

    "We just launched a new AI Agent" — translated, that means "I have a new version I want to sell you." In this context, Agent is marketing speak, interchangeable with "AI Assistant" / "Copilot" / "intelligent agent" — synonyms with zero new information.

  • USE · 02 · The people writing think-pieces

    Agent = an era label

    "We're entering the Agent era" — this kind of usage exists to make you feel like you're missing out. But the people saying it can't actually tell you where the "Agent era" stops and the "not Agent era" begins. Delete the word "Agent" from these sentences and the meaning doesn't change.

  • USE · 03 · The people actually using them

    Agent = LLM × tools × loop

    This is the only use where deleting the word "Agent" loses information. It refers to a program that can plan its own steps, call tools, and decide the next move based on results. This kind of Agent is fundamentally different from a Chatbot — it can act, not just talk.

§ 02 · Audit your goal

"I want to use AI to boost productivity" is an empty phrase

This sentence sounds obviously correct, but it fails the three simplest checks. Take it apart and you'll see — most people aren't actually "failing to use AI." They're chasing a goal that can never be verified in the first place.

  • USAGE TEST · 01

    Who gets to say "productivity went up"?

    You? Your boss? Some KPI? If you can't name them, there is no one to accept the goal. A goal with no acceptance criteria is just a note you wrote to comfort yourself.

  • USAGE TEST · 02

    Can you point at a specific moment and say "this — this is what the Agent did for me"?

    "I save 5 hours a week" — saved where, exactly? "I handle email faster" — how much faster, on what kind of email? If you can't point to a specific action, a specific timestamp, and an output you can put on a screen — that "boost" is just a feeling.

  • USAGE TEST · 03

    A week in, nothing's happening — is the problem the AI, or you?

    This is the critical one. If "use AI to boost productivity" isn't working, can you tell at a glance whether it's the Agent failing, the prompt being wrong, or you not sticking with it? If you can't tell, this was never an Agent task. It's a psychological task wearing an Agent's costume.

See the difference? The first is a wish. The second is an executable protocol — specific action, specific tool, specific time, specific failure signal. Every "Agent application" we cover today must be rewritable into the second form. If it can't be, skip it.

§ 03 · After unpacking the word

The difference between Chatbot and Agent is the difference between "talk" and "do"

The first two sections cleared the noise. This one is the only hard concept you have to remember in these 90 minutes: Chatbots only talk, Agents do. One line of difference — but two completely different kinds of leverage.

CHATBOT · the one you already know

Talks. Doesn't act.

You ask, it answers. Conversation over — every actual action is still on you.

  • Can only answer from training data
  • Can't call any tools
  • Doesn't remember what you said last time
  • Can't decide what comes next
AGENT · the one you're installing tonight

Talks, and actually does the work.

You give it a goal. It breaks down steps, calls tools, reads docs, edits files, sends emails — comes back only when the work is done.

  • Lives on the live internet, can pull current data
  • Can call Gmail / Calendar / databases / browsers
  • Has project-level memory, remembers your preferences
  • Can decide for itself what to do next
§ 04 · How an Agent actually acts

The four-step loop: Perceive → Plan → Act → Reflect

Every Agent behavior that looks magical is just this loop turning. Understand these four steps and you stop falling for "AI is so magical" — and you can instantly tell whether a task belongs to an Agent and when it's guaranteed to fail.

AGENT LLM CORE PERCEIVE 01 PLAN 02 ACT 03 REFLECT 04
  • 01

    PerceiveREAD INPUT

    Read your goal, the relevant context, and the list of available tools. Like a new hire on day one — first figure out what the job is and what resources exist.

  • 02

    PlanDECOMPOSE

    Break the goal into a sequence of small steps and decide which one to do first, with which tool. A good model writes a to-do list before it starts moving.

  • 03

    ActCALL TOOLS

    Call tools: send email, check calendar, read docs, run SQL, click buttons on a webpage. This step was completely impossible in the Chatbot era.

  • 04

    ReflectSELF-CHECK

    Look at the result. Decide whether it's enough, whether it's right, and whether to stop, retry, or try a different angle. This step is what lets an Agent self-correct.

§ 05 · What four kinds of people actually do

Not "what your job can do with AI" — just what runs end-to-end tonight

The four groups below are the people who actually showed up at last hackathon. For each one, we only cover one mainline — one specific action, one specific stack, one result you can check by 8 AM tomorrow. Everything else, you go figure out at home. Listening to a talk doesn't solve that.

DALLAS · HACKATHON · LAST EDITION · N=32

This is who was sitting next to you last time

Four job categories covered 100% of the room — today's Agent cases follow the same ratio.

32 PEOPLE
  • AI & Data Science 46.9%
  • Software Engineering 21.9%
  • Operations & Business 15.6%
  • Product & Design 15.6%
  • CASE · 01 · AI & DATA SCIENCE · 46.9% 5 analysts → 1

    You already know how to use AI. But you're playing with it, not selling with it.

    Contrarian take: AI / data people fall into "tool worship" the easiest — study new models, tune new prompts, ship new demos. None of that produces money. What produces money is wiring AI into the explore → experiment → production pipeline and crushing every manual step out of it.

    WORKFLOWMonday auto-brief (production-grade ETL + LLM)
    01TRIGGERcron · every Monday 08:00 (n8n / Airflow)
    02FETCHSnowflake / BigQuery · pull 7 core KPIs
    03DETECTthreshold alerts + 4-week YoY / WoW
    04ANALYZEClaude · write insights + Plotly charts
    05DELIVERmarkdown → Slack #leadership / email
    SETUPonce · ROI4-6h saved per week · ad-hoc requests 1 day → 10 min
    Claude Code Snowflake / BigQuery n8n / Airflow LangGraph Claude Agent SDK
  • CASE · 02 · SOFTWARE ENGINEERING · 21.9% 1 ticket/week → 1 ticket/day

    This isn't a tool upgrade. It's a role replacement.

    Contrarian take: people still relying on Copilot autocomplete are being eaten by people who hand whole tickets to Claude Code. "Can write code" is no longer the core skill — "can decompose a task and hand it to an Agent" is.

    WORKFLOWIssue → PR fully automated (pick one to run this week)
    01INTAKEread Linear / Jira issue + comment context
    02PLANClaude Code · draft implementation + impact analysis
    03CODEimplementation + unit tests in an isolated worktree
    04VERIFYruns pytest / vitest itself · until all green
    05PRpush branch + write description + self-review the diff
    06REVIEWyou only check: is the spec right, is the diff clean
    SETUPonce · ROIin 3 weeks your role shifts from "typing" → "reviewer + architect"
    Claude Code Cursor Devin Aider Linear / Jira
  • CASE · 03 · OPERATIONS & BUSINESS · 15.6% 1.5h/day saved · inbox anxiety gone

    Email isn't anxiety — it's the work you never turned into SOP.

    Contrarian take: "ops/business work is just messy" isn't fate — it's the result of never turning anything into SOP. If you can't SOP 50 emails, you have to pay AI to do it. Handling them by hand every day is spending your most expensive resource (attention) on the cheapest possible work.

    WORKFLOWDaily email → decision list (with auto-draft replies)
    01TRIGGER07:30 daily · n8n / Zapier
    02PULLGmail Connector · unread mail in last 24h
    03CLASSIFYClaude · 5 buckets (decide / reply / archive / subscription / junk)
    04DRAFTstandard replies drafted · 1-click for you to send
    05RANK"needs decision" ranked by SLA + customer priority
    06PUSHlist → iOS Reminders + Slack DM
    SETUP30 min · ROI1.5h/day saved · inbox anxiety gone by day 3
    Claude + Gmail Apollo + Clay n8n / Zapier Custom GPTs
  • CASE · 04 · PRODUCT & DESIGN · 15.6% Product loop 2 weeks → 5 days

    A PRD that takes two weeks to write isn't a PRD problem — it's a you problem.

    Contrarian take: the bottleneck in product and design has never been "no tools." The process never changed — interviews finished but nothing transcribed, research dragging two weeks, three rounds of design that still won't converge. Agents won't make decisions for you, but they will compress every "waiting" step out of the loop.

    WORKFLOWInterview → PRD converged in one week
    01RECORDuser interviews · Granola / Fireflies recording
    02EXTRACTClaude · transcribe + pull "core pain in 3 sentences"
    03CLUSTERacross N interviews · theme clustering + frequency rank
    04RESEARCHPerplexity Deep Research · competitor scan in 1h
    05PRDClaude drafts PRD · you edit 2 things
    06DESIGNv0 / Figma AI · 5 layout variants · team converges via 30-min vote
    SETUPone-time template · ROIone product loop: 2 weeks → 5 days
    Claude Perplexity Figma AI v0 Granola / Fireflies
  • —— HIGH-VALUE SCENARIOS, REGARDLESS OF JOB ——

    The four above are by job. The five below are by scenario.

    The first half answers "who are you, what can you do with an Agent."
    The second half answers "which specific scenarios make money for anyone who runs them" — the 5 lines a single person is most likely to turn into commercial results in 2026: quant trading, e-commerce, creator-led media, AI image/video, SEO/GEO.

  • CASE · 05 · QUANT TRADING 8h watching screens → 15 min deciding

    90% of retail traders fail on execution, not strategy.

    Contrarian take: you think you lose money because you can't find a good strategy — actually, you lose because you can't sit down at 9 AM every day and review the same 7 indicators. The point of an Agent isn't "predict the market" — it's to turn the "watch → judge → risk-check" loop into SOP so your strategy actually gets executed.

    WORKFLOWPre-market auto-screening + risk check (decision-support, not auto-trade)
    01TRIGGER08:30 ET · 1h before open
    02FETCHPolygon / yfinance · positions + watchlist
    03EVENTBenzinga API · earnings / catalysts / risks
    04SIGNALrun preset strategies (dual MA / RSI / MACD / custom)
    05RISKcheck position size / stop-loss / correlation
    06OUTPUT"3 candidates today + reasoning + suggested size" → phone push
    SETUP2 days · ROI15 min decision per day · no more being held hostage by market mood
    ⚠ This workflow only produces signals. It does not place orders. All investment decisions and risk are yours.
    Polygon yfinance Backtrader Claude n8n Benzinga API
  • CASE · 06 · E-COMMERCE OPS 5 → 50 SKUs launched per month

    Product selection isn't about staring at data — it's about collapsing "data → decision → listing" into one chain.

    Contrarian take: a one-person shop hits a ceiling not because of the product — but because "find product → list product" is entirely manual. SOP that chain — same number of people who can manage 5 SKUs can manage 50.

    WORKFLOWProduct selection → listing (Amazon / Shopify / Temu)
    01SCRAPEApify · pull BestSeller Top 100 + reviews
    02PARSEextract price / rating / review count / launch date
    03SENTIMENTClaude · mine reviews for "pain points / selling points / defects"
    04SCOREscore (margin potential × differentiation room × sourcing difficulty)
    05SELECTTop 5 candidates + ROI estimate + risk flags
    06LISTINGauto-generate title / description / keywords / A+ content / hero image brief
    07PUBLISHShopify API / Amazon SP-API · one-click listing
    SETUP1 week · ROIproduct research 1 day → 30 min · 5 → 50 SKUs per month
    Helium 10 Apify Claude Shopify API Amazon SP-API
  • CASE · 07 · CREATOR OPS 2 posts/week → 8 posts/week

    You're not unable to write hits — you just never industrialized "raw material → angle → hit."

    Contrarian take: 99% of creators are stuck waiting for inspiration. Inspiration isn't waited for — it's a production-line byproduct. Turn "hit-post library → angle generation → writing → one draft across platforms" into a process, and output jumps from 2 to 8 posts a week, with steadier quality.

    WORKFLOWTopic factory (Xiaohongshu / Twitter / YouTube / WeChat)
    01INGESTscreenshot 10 competitor hits daily → Claude Project
    02DECONSTRUCTClaude · extract title formulas / hooks / emotional triggers
    03IDEATEevery Monday · output 7 candidate titles + outlines
    04DRAFTClaude writes first draft · tuned on your voice corpus
    05ADAPTone draft, every platform · auto-adjust length / tone / image notes
    06SCHEDULEBuffer / Hypefury · auto-schedule + analytics back into the loop
    SETUP2 weeks training voice corpus · ROI4× output · single creator monthly revenue 5k → 30k without overtime
    Claude Perplexity Buffer / Hypefury CapCut Notion
  • CASE · 08 · GENERATIVE MEDIA · AI IMAGE / VIDEO single image 30min → 2min · single video 4h → 30min

    Knowing Midjourney doesn't mean you can produce usable assets.

    Contrarian take: 90% of AI creators are still "trying prompts" — that's artisanal work. Anyone making money from this has a fixed pipeline: theme brief → styled prompts → batch generation → auto-filter → post-process → archive. Without the pipeline, even the most expensive model is just a toy.

    WORKFLOWCommercial asset batch production (e-commerce heroes / short video / marketing assets)
    01BRIEFClaude · write theme brief (scene / style / use / constraints)
    02PROMPTbatch-generate 20 variants in MJ / Sora syntax
    03RENDERMJ API / ComfyUI · run 100 images / Runway runs 20 clips
    04SCOREGPT-4V / CLIP · score on "commercial usability"
    05SELECTkeep top 5% · auto-cull bad fingers / mangled text
    06POLISHPhotoshop AI / Topaz · sharpen / fix artifacts
    07ARCHIVENotion / Airtable · tag and archive (style / client / use case)
    SETUP1 week to establish standards · ROIsingle image 30min → 2min · one person's output = a 3-person team before
    Midjourney Sora ComfyUI Runway Topaz GPT-4V
  • CASE · 09 · SEO / GEO · OMNI-CHANNEL SEARCH VISIBILITY "AI citation rate" — from invisible to weekly measured

    SEO isn't dead. Manual SEO is dead.

    Contrarian take: anyone still shouting "SEO is dead" in 2026 is really saying "my old SEO playbook is dead." The truth — Google rankings still matter, and getting cited by ChatGPT / Perplexity / Claude / Gemini matters just as much. But running both by hand is no longer possible. GEO (Generative Engine Optimization) isn't a new concept — it's a new metric you have to start measuring. The metric you can't see is the metric you've already lost.

    WORKFLOWOmni-channel search visibility (Google + AI engines)
    01TRACKpull keyword rankings weekly · Ahrefs / SEMrush API
    02QUERYrun 50 preset queries against ChatGPT / Perplexity / Claude / Gemini
    03EXTRACTparse answers · pull cited domains + context + citation style
    04GAPcompare to competitors · list queries where they get cited and you don't
    05AUDITanalyze cited pages · Schema / fact density / token-friendliness
    06GENERATEClaude writes AI-friendly versions · explicit fact statements + quotable data points
    07PUBLISHpush to CMS (Webflow / Next.js / WordPress) + Schema.org markup
    08LOOPnext week back to 01 · watch citation rate + tune strategy
    SETUP1 week monitoring + 1 week content template · ROIfrom "no idea who AI is recommending" → "weekly citation report"
    💡 Most companies still aren't measuring GEO — your window to claim AI citation slots is roughly 12-18 months.
    Profound AthenaHQ Ahrefs / SEMrush Claude Perplexity API Schema.org Screaming Frog
§ 06 · Market observation

Five AI businesses — none of which is actually about AI

The most profitable businesses in the AI era are often not AI applications themselves — they're "AI anxiety" repackaged and sold to people who aren't actually using AI. The five patterns below ran most reliably between 2024 and 2026. Understand the business logic — and you can choose: be the seller, don't be the buyer, or just go build something real that earns money directly. All three are valid.

  • MARKET · 01 · FOMO CONTENT

    The "AI next-wave" content creators

    Main customer: people afraid of missing the next internet. Conversion is driven by FOMO — a stable, reliable psychological motive. One observation worth holding onto: people who post "the next AI wave" articles almost never make money from that "wave" itself — their income comes from spreading the content. If an opportunity needs viral articles to spread, it's probably not a real opportunity. Real opportunities are usually being quietly worked on by sellers who don't want you to know.

  • MARKET · 02 · ANXIETY PLACEBO

    The "1000 prompts bundle" wholesalers

    They're not selling prompts — they're selling "I'm using AI" as a feeling. But prompts are highly personal — prompts written for someone else's workflow work at less than 5% efficiency in yours. This market persists because the act of buying briefly relieves the anxiety of "I'm not using AI." Once you see this, you understand why "$199 for 1000 prompts" will always have buyers.

  • MARKET · 03 · INFORMATION ARBITRAGE

    "AI coach / AI consultant" services

    The fastest-growing service category of the past 18 months — fundamentally an information arbitrage on the AI literacy gap. If a business can only survive while "people don't understand AI," it's a window business — it depreciates automatically as AI becomes mainstream. Nothing wrong with running it. Just be clear: you're earning window-period money, not long-term value money.

  • MARKET · 04 · ZERO-FRICTION PROMISE

    The "3 minutes to set up" / "one-click solution" AI tools

    "Zero friction" is effective marketing speak — but every Agent that runs in production has friction: connectors to configure, prompts to tune, output formats to adjust. That friction is your moat. If a tool claims "zero learning required," it means your competitors also "needed zero learning" — and your advantage evaporates. The ceiling on these tools is usually "as good as everyone else."

  • MARKET · 05 · CERTAINTY PREMIUM

    "The complete AI methodology" paid courses

    They sell certainty, not knowledge. In uncertain times, certainty itself has price. But AI applications are highly nonlinear — the same method in different contexts produces completely different results. Methods that actually work are almost always discovered by iterating in your own business; someone else's "complete methodology" is either out of date or too abstract to apply.

§ 07 · You can run these tonight

Three Agents — pick one and run it end-to-end tonight

Everything above is theory until one Agent actually runs. The three below are zero-code, same-day effect, installable before you sleep — pick one, finish it, and by 8 AM tomorrow you'll see the first result.

DEMO · 01 15min setup · 0min daily use

Gmail → today's to-do list

In the time it takes to drink one coffee, let an Agent read the last 24 hours of email and output "what you have to do today." From now on, opening the inbox stops being anxious — you read the list, not the emails.

  1. Get Claude Pro, enable Gmail Connector in settings
  2. Create a Project, name it "Daily Email Assistant"
  3. Paste the prompt template into Project Instructions (handed out live)
  4. Set a phone alarm for 8 AM to remind you to run it once
  5. After a week, come back and tune the prompt — the more you use it, the better it knows you
claude · daily email assistant
YOUrun today's email
CLAUDEreading the last 24h…
GMAILread 47 messages
▌ Must do today (by priority)
· Reply to David — contract terms (waiting 18h)
· Confirm Thursday 2pm TI team meeting (needs reply)
· Approve Sarah's expense report $842
▌ Auto-archived 38 notifications / subscriptions
▌ Drafted 6 replies, ready for you to send
DEMO · 02 5min setup · 30min saved per meeting

Meeting notes → action items

You walk out of a meeting and nobody remembers who owns what or when it's due. Let an Agent listen the whole time, and 30 seconds after the meeting ends it produces a structured note — decisions, action items, owners, deadlines, all clear.

  1. Install Granola or Fireflies (Mac / Windows / phone)
  2. Before each meeting, hit "record" — that's all you do
  3. When the meeting ends, transcript + summary + action items auto-generate
  4. Drop the summary into Claude and let it write a "meeting follow-up email" to send to the group
  5. Push action items into your Todoist / Notion / Linear
granola · q2 product review
▌ Key decisions
· No new features in Q2, all-in on performance
· North America pricing — plan B approved
▌ Action items
· Mike · ship performance baseline report · 5/22
· Lisa · brief sales team on new pricing · 5/19
· Jack · draft Q2 OKRs · 5/20
▌ Open questions
· Adjust Europe pricing in parallel? (next meeting)
✓ Follow-up email auto-drafted, ready to send
DEMO · 03 2min setup · one day saved per report

Market research → a complete report

The old way: spend an afternoon on Google, read 20 articles, build a table by hand. Now: one prompt, the Agent searches the web, reads, compares, summarizes itself — and 30 minutes later you have a 5,000-word report with citations.

  1. Open Perplexity Pro, switch to "Deep Research" mode
  2. Paste the prompt template (handed out live; preview below)
  3. Wait 15-30 min — the Agent runs through 30+ pages and 5+ data sources on its own
  4. Export as Markdown or PDF
  5. Drop it into Claude for the "boss-view summary" — three paragraphs covering the core
perplexity · deep research
YOUDallas EV charging market, last 90 days
▶ Researching…
🔍 Searched 34 pages
📄 Read 11 reports
📊 Compared 7 company filings
▌ Report outline
1. Market size and growth (with charts)
2. Top 5 players and market share
3. Policy and subsidy changes
4. 3 opportunity windows
5. 2 risk signals
✓ 5,243 words · 47 citations · done
§ 08 · The next 90 days

12 weeks, no-bullshit edition

This 90-day path doesn't promise you'll get great at AI — it gives you four specific actions, each with an explicit failure signal. If a week's failure signal lights up, go back to the previous week. Don't keep moving forward.

  • WEEK 1 – 2 · Rewire your reflex

    Any question — ask AI first

    Use ChatGPT or Claude Pro 30 minutes a day, every day — any scenario, any difficulty. Every "I want to search / write a paragraph / sort out my thinking" moment, your first move is to open AI. Failure signal: by the end of week 2, you still open Google more often than AI. If it fails: change Google's keyboard shortcut.

  • WEEK 3 – 4 · SOP-ify one thing

    Pick a task you do 3+ times a week and turn it into a Project

    Weekly status reports, replies to standard email types, daily standup notes, sorting product feedback — pick one, build it as a Claude Project / Custom GPT, write the Instructions. Failure signal: by the end of week 4, you're still rewriting the prompt from scratch every time. If it fails: the task you picked is too messy to SOP — pick a more specific one.

  • WEEK 5 – 8 · Cross tools

    Open up Connectors. Let the Agent act.

    Turn on the official connectors for Gmail / Calendar / Drive / Notion. This is the moment you graduate from "AI user" to "Agent user." Failure signal: connectors are on, but "let the Agent handle this" never shows up in your actual daily workflow. If it fails: go back to weeks 3-4 — your SOP didn't lock in.

  • WEEK 9 – 12 · Design the system

    Turn core workflows into executable protocols

    Pick 2-3 core workflows. Write them as SOPs. Let the Agent run them — you only do high-value judgment and final review. Failure signal: by week 12, you're still doing typing work all day and haven't carved out time for system design. If it fails: the problem isn't the Agent — it's that you never separated "doing the work" from "designing the system."

In 90 days you should be able to answer this question: "In the past week, what specific thing did an Agent do for me?" — if you can answer, with screenshots and outputs, the path worked. If you can't, you spent 90 days "using AI" without "installing an Agent."

Real Agent Use Cases

After the talk, the important thing doesn't happen on its own.

90 minutes, one set of notes. But notes are not action.
"Execution is always undervalued." If you don't run one of these Agents end-to-end when you get home tonight, you won't tomorrow either. That's not a pep talk. It's an observation.