Skip to content
EN DE

ChatGPT Agent Mode: AI on the Web

L4 Lesson 3 of 5 — AI as Coworker
1
2
3
4
5

A Different Approach: Cloud Instead of Desktop

Section titled “A Different Approach: Cloud Instead of Desktop”

Claude Cowork works on your computer — with your local files. ChatGPT Agent Mode OpenAI's agentic feature in ChatGPT that combines a visual browser, code execution, file editing, and app integrations. The agent works in the cloud and can independently execute multi-step tasks on the web. works in the cloud — with its own virtual browser that can navigate websites, fill out forms, and complete online tasks.

Both solve the same problem: AI that doesn’t just answer, but acts. But their strengths lie in different places.

Agent Mode unifies three capabilities that OpenAI originally developed separately:

ComponentWhat It DoesOriginally
Visual BrowserNavigates websites, clicks, scrolls, fills formsOperator (Jan 2025)
Deep ResearchSearches the web for 5–30 minutes, analyzes hundreds of sourcesDeep Research (Feb 2025)
ChatGPT CoreConversation, code execution, file editingChatGPT
ToolFunction
Visual BrowserInteracts with websites via GUI (clicking, scrolling, typing)
Text BrowserSimpler, reasoning-based web queries
TerminalCode execution (Python), data analysis
File SystemCreate spreadsheets, presentations, documents
AppsGmail, Google Calendar, Drive, GitHub, Slack, Notion, HubSpot, and more
  • “Check my calendar and brief me on upcoming meetings based on recent news about the participants.”
  • “Research the three largest competitors in our market and create a comparison slide deck.”
  • “Find the cheapest flights to Berlin next week and create a comparison table.”

Deep Research deserves special attention because it solves a task knowledge workers face daily: Comprehensive research with sources.

  • Independently searches the web for 5–30 minutes
  • Analyzes hundreds of sources
  • Creates structured reports with citations
  • Can narrow scope to specific websites
  • Can work with uploaded files
PlanDeep Research Queries/Month
Free5
Plus ($20)25
Pro ($200)250

Limitation: Occasional hallucinations. Always spot-check sources.

ChatGPT can plan recurring tasks:

  • One-time, daily, weekly, monthly, or yearly
  • Maximum: 10 active tasks at once
  • Works on web, desktop, and mobile

Examples:

  • “Every Friday: Weekend plan based on weather and location”
  • “Every Monday: Summary of my GitHub activity”
  • “Daily at 7 AM: Industry news briefing”
PlanPriceAgent ModeDeep Research
Free$0No5/month
Plus$20/monthYes (40 msg/month)25/month
Pro$200/monthYes (400 msg/month)250/month
Team$25–30/user/monthYes25/month

Important: Only initial requests count toward the quota. Follow-up questions and clarifications within a task don’t.

  • Web automation — the biggest difference from Cowork: Agent Mode can navigate websites, fill forms, conduct interactive web research
  • Cross-platform — web, desktop (Mac + Windows), iOS, Android
  • Deep Research — one of the best available solutions for comprehensive web research
  • App integrations — Gmail, Calendar, Drive, Slack, Notion, GitHub, and many more
  • No local file access — everything runs in the cloud. You can upload files, but Agent Mode can’t access your local folders
  • Anti-bot blocks — many websites (Amazon, LinkedIn, social media) block the visual browser
  • CAPTCHAs — needs your help with “I’m not a robot”
  • Speed — multi-step tasks take 1–10 minutes as each page is processed individually
  • Complex authentication — MFA, SSO redirects, and OAuth are problematic
  • Audit gap — agent actions are indistinguishable from user actions in logs
AspectClaude CoworkChatGPT Agent Mode
FocusDesktop and local filesWeb and cloud
Local file accessYes (folder sandbox)No
Web browsingChrome extensionDedicated visual browser
PlatformsmacOS + Windows desktopWeb, desktop, iOS, Android
Sub-agentsYes (parallel)No
Deep ResearchWeb search with summarization5–30 min autonomous research
Scheduled tasksYes (requires app open)Yes (up to 10 active)
Entry price$20/month$20/month
Best forDocument creation, file workWeb automation, research

Cowork when you:

  • Work with local files (organizing PDFs, Excel analysis, presentations)
  • Need cross-app workflows (Excel → PowerPoint)
  • Primarily work at your desktop

Agent Mode when you:

  • Have web-based tasks (research, bookings, forms)
  • Need Deep Research for comprehensive analysis
  • Want to work mobile or cross-platform

Both when you have different task types — which is the case for most knowledge workers.

Activate Agent Mode in ChatGPT (Tools dropdown or /agent). Set a multi-step web task: “Find the 3 most-cited studies on [your field] from 2025 and summarize each in 3 sentences.”

Start a Deep Research query on a topic you need to research anyway. Compare the output with what you would have found in 30 minutes on your own.

Take a task and complete it with both Cowork and Agent Mode. Document: Which was faster? Which produced higher quality? Which was more convenient?

Claude Cowork and ChatGPT Agent Mode represent two philosophies: desktop-first vs. cloud-first. Both are improving, both will converge. For you as a knowledge worker, the most important skill isn’t mastering one tool perfectly — it’s knowing when each is the right choice.

In the next lesson, you’ll tackle perhaps the most important skill when working with AI agents: Trust Calibration — when you can trust, when you must verify, and how to calibrate that balance over time.

Part of AI Learning — free courses from prompt to production. Jan on LinkedIn