AI Tool Selection in the Agentic Era¶
Category: guide Last updated: 2026-04-07 Status: complete
Summary¶
A practical framework for selecting which AI tools to use, based on Ethan Mollick's February 2026 guide. The key insight is that AI now consists of three separable layers — Models, Apps, and Harnesses — and understanding each layer is necessary to get meaningful results. The shift from conversational AI to agentic AI (systems that autonomously complete multi-step tasks) represents the most significant change since ChatGPT's launch.
Details¶
The Three-Layer Framework¶
Understanding AI tools requires distinguishing between three components:
Models are the underlying AI brains — the core intelligence that determines reasoning ability, writing quality, coding capability, and image processing. As of early 2026, the three leading frontier models are:
| Model | Provider | Notable Strength |
|---|---|---|
| GPT-5.2 / GPT-5.3 Thinking | OpenAI | Complex analytical and statistical work; Extended and Heavy variants for increasing difficulty |
| Claude Opus 4.6 | Anthropic | Writing quality; extended thinking feature; limited image generation |
| Gemini 3 Pro | Competitive with other frontiers; Deep Think variant for hardest problems; weaker harness ecosystem |
Apps are the products users interact with directly — primarily the main chat interfaces (chatgpt.com, claude.ai, gemini.google.com) plus specialized tools like concepts/claude-code and OpenAI Codex.
Harnesses are the environments that give AI tools, autonomous action, and multi-step task capability. The same underlying model produces dramatically different results depending on its harness. This is why the "which AI is best?" question is increasingly inadequate — the harness often matters more than the model.
Why Model Selection Within Apps Matters¶
A critical and non-obvious finding: default model selections within apps are often not the best available option. Apps frequently route users to weaker variants by default. Manually selecting advanced models (e.g., choosing GPT-5.2 Thinking Heavy instead of the default) yields substantially better results.
Action required: When opening any AI app, explicitly select the most capable model rather than accepting the pre-selected default.
Specialized Applications Worth Knowing¶
For Technical Users¶
concepts/claude-code — Anthropic's agentic development environment. Provides access to a virtual computer, web browser, and code terminal. Enables autonomous software development from research through testing. Best for: software development, complex automation, data processing.
OpenAI Codex — Similar to Claude Code; provides terminal and codebase access. Best for: software development, code-heavy tasks.
For Non-Technical Users¶
Claude Cowork — Desktop application from Anthropic for non-technical work. Operates in an isolated virtual machine for security. Organizes documents, extracts data, drafts summaries. Best for: document management, data extraction, writing tasks.
Claude for Excel / Claude for PowerPoint — Spreadsheet and presentation extensions. The Excel integration is described as particularly impactful for analysts. Best for: spreadsheet automation, presentation creation.
OpenClaw — Open-source local agent with web browsing, file management, and email capabilities. Free but carries significant security risks due to broad computer access. Best for: users who understand the security tradeoffs and want local control.
For Research and Knowledge Work¶
NotebookLM (Google) — Accepts documents, videos, websites, and files as input. Generates interactive knowledge bases, slideshows, mind maps, and AI-generated podcasts that discuss source material. Has a free tier. Best for: research synthesis, literature review, learning from documents.
Recommendations by User Level¶
For beginners: 1. Pick one system (ChatGPT, Claude, or Gemini) 2. Subscribe at $20/month 3. Always manually select the most advanced available model 4. Learn its capabilities before branching out
For proficient users: 1. Use NotebookLM for research synthesis (free tier available) 2. Use Claude Code or Cowork for advanced automation 3. Try Excel/PowerPoint plugins for office productivity 4. Experiment with OpenAI Codex for code-heavy tasks
The Agentic Shift¶
The fundamental change described by Mollick: AI has moved from conversation to action. Rather than chatting about a task, you can now assign it — the AI uses tools, makes decisions, and returns with completed work.
This shifts the relevant questions from "which AI writes best?" to "which AI-plus-harness combination accomplishes this task best?" The harness determines what tools the model can use and how autonomously it can act.
See also concepts/agentic-engineering for the engineering-focused perspective on this shift, and concepts/ai-inflection-point for the November 2025 threshold where agentic reliability crossed into production-viable territory.
Key Claims & Data Points¶
- The three leading models as of early 2026: GPT-5.2/5.3, Claude Opus 4.6, Gemini 3 Pro — [source: a-guide-to-which-ai-to-use-in-the-agentic-era]
- Default model selections in apps are often weaker variants; manual selection substantially improves results — [source: a-guide-to-which-ai-to-use-in-the-agentic-era]
- Gemini 3 Pro has a weaker harness/application ecosystem despite a competitive base model — [source: a-guide-to-which-ai-to-use-in-the-agentic-era]
- The agentic shift represents "the most significant change since ChatGPT's launch" — [source: a-guide-to-which-ai-to-use-in-the-agentic-era]
- NotebookLM (Google) has a free tier and generates AI podcasts from source material — [source: a-guide-to-which-ai-to-use-in-the-agentic-era]
Open Questions¶
- How rapidly are these model rankings changing? The article is from Feb 2026 and the landscape shifts monthly.
- What is the precise definition and scope of "Claude Cowork" — is it a separate product or a mode within Claude?
- How does NotebookLM's knowledge base quality compare to alternatives like the concepts/llm-knowledge-base pattern?
- Does "harness" in Mollick's framework map directly to "harness engineering" in the OpenAI Codex team's usage, or are these different concepts?
Related Articles¶
- concepts/claude-code
- concepts/agentic-engineering
- concepts/ai-inflection-point
- concepts/harness-engineering
- concepts/openclaw-security
- entities/ethan-mollick
Sources¶
- A Guide to Which AI to Use in the Agentic Era — Ethan Mollick, One Useful Thing, February 18 2026; framework of Models/Apps/Harnesses; beginner and expert recommendations