Cursor AI IDE tips, tricks & best practices - Keyboard shortcuts, Composer mode, .cursorrules examples, and Reddit community wisdom

murataslan1/cursor-ai-tips

🚀 Cursor AI Tips & Tricks

License: MIT

The ultimate guide to mastering Cursor AI IDE

Keyboard shortcuts, Composer workflows, best practices for 2026, and benchmarks for GPT-5.3 Codex vs Claude Opus 4.6 vs Composer 1.5

⚠️ Critical Bug • 🆕 Cursor 2.4 • ⚔️ 2026 Models • 🎭 PlayWhite • Shortcuts • Composer • MCP • Enterprise • 📅 Weekly Updates


Caution

🚨 CRITICAL: Revert Bug Active in Cursor 2.4.x

A new file locking conflict between Agent Review Tab and the editor causes code changes to silently revert. The AI writes changes to disk (visible in git diff), but the IDE cache doesn't update, so you see old code while new code is committed.

Workaround: Close the Agent Review Tab before using "Fix in Chat". Disable "Format On Save" and Prettier. Use defensive commits. Read the Revert Bug Guide →


Why Cursor?

💰 December 2025: Anysphere (Cursor's parent company) valued at $29.3 billion after Series D funding, reflecting the explosive adoption of AI-assisted development.

Cursor is not just VS Code with AI - it's a fork that integrates LLMs directly into the rendering pipeline. This enables:

  • Cursor Tab: Multi-line predictions (not just single line like Copilot)
  • Composer: Autonomous multi-file editing agent
  • Shadow Workspace: Background indexing for semantic search
  • Native Diffs: Inline green/red diff visualization

🆕 What's New in Cursor 2.0/2.1

Plan Mode (2.1)

Strategic thinking before coding. AI analyzes, plans, asks clarifying questions - then executes.

"Plan Mode forces 'measure twice, cut once' philosophy"

Instant Grep (2.1)

Millisecond codebase search. No more context hallucinations - agent verifies every reference instantly.

Multi-Agent Interface (2.0)

Run parallel agents using git worktrees:

  • Agent 1: Refactoring component
  • Agent 2: Writing tests
  • No file lock conflicts

Composer Improvements

  • 4x faster than general LLMs for diff-edit loops
  • Codebase-wide semantic search
  • Implicit checkpoints for instant rollback

→ Full 2.0/2.1 Guide

⚠️ Deprecated Features

Some features were removed in 2.0/2.1:

| Removed | Replacement |
| --- | --- |
| Interpreter Mode | Agent + Terminal |
| @Web, @Definitions | Auto-context |
| Reapply Button | Checkpoints |
| .cursorrules | .mdc files |
| Fast Request Packs | Usage-based pricing |

→ Full Deprecated Features Guide

Lovable + Cursor Workflow

Popular "vibe coding" pattern for rapid prototyping:

1. Lovable → Design UI visually, connect to GitHub
2. Clone → Pull repo to local
3. Cursor → Add backend logic, APIs, complex features
4. Push → Sync back to Lovable

Users report building full SaaS products in 4 days using this hybrid approach.

→ Full Lovable + Cursor Guide


🆕 What's New in Cursor 2.4 (February 2026)

🎯 "The Agentic Engineering Release" - Cursor 2.4 marks the shift from "AI assistant" to "autonomous engineer" with Long-Running Agents, Subagents, and Composer 1.5.

Long-Running Agents (Research Preview)

Available for Ultra, Teams & Enterprise users since Feb 12, 2026. Agents can now plan and execute tasks for hours without human intervention.

  • Plan-First Architecture: Agents perform full dependency analysis before making any changes - no more "chat and fix" loops
  • Cross-Module Awareness: Agents scan all inter-module relationships to predict side-effects before editing
  • PR-Quality Output: Tests, docs, and refactoring delivered as a single atomic operation - fewer follow-up fixes

Subagents (Specialized Workers)

The main agent decomposes tasks into specialized subagents:

Main Agent
├── Terminal Subagent → Runs commands
├── Docs Subagent → Scans documentation
├── Test Subagent → Runs & writes tests
└── Refactor Subagent → Code changes

Parallel processing significantly reduces development time for complex features.

Composer 1.5

A new distilled model optimized for code synthesis, positioned as a cost-effective alternative to Claude Opus 4.6:

| Aspect | Composer 1.5 | Claude Opus 4.6 |
| --- | --- | --- |
| Speed | 30% faster | Baseline |
| Refactoring | Excellent | Excellent |
| Abstract Planning | Good | Superior |
| Cost | 50% discount | $$$ |

⚠️ "Auto-Mode Anxiety": Users report confusion about how credits are consumed in Auto mode. API costs vs. subscription pool credits lack transparency.

Revert Bug (v2.4.x) - Still Active

A file locking conflict between Agent Review Tab and the editor window:

| Symptom | Cause |
| --- | --- |
| Code appears then reverts | Race condition between Review Tab (read-only) and Chat agent (write) |
| git diff shows changes but editor shows old code | IDE cache not synced with disk |

Workaround: Close Agent Review Tab before "Fix in Chat". Disable Format On Save.

→ Full 2.4 Guide



⚔️ 2026 Model Landscape

The AI coding landscape shifted dramatically in February 2026. "Agentic Engineering" replaces "Vibe Coding": engineers now orchestrate agents rather than write code.

| Tier | Model | Best For | Cost |
| --- | --- | --- | --- |
| Senior Architect | Claude Opus 4.6 | Deep reasoning, multi-file refactoring, legacy migration (SWE-bench: 80.8%) | $$$ |
| 10x Implementer | GPT-5.3 Codex | Speed, one-shot features, 30% faster than Opus | $$ |
| Cost-Effective | Composer 1.5 | Code synthesis, refactoring (distilled, 50% cheaper) | $ |
| Disruptor | DeepSeek V3.2 / V4 | Budget, local/privacy, open-source (1/50th cost of GPT-5.1) | ¢ |
| Daily Driver | Gemini 3 Pro | Routine changes, codebase indexing | $ |

→ Full 2026 Model Guide | → 2026 Best Practices Rules


🎭 "PlayWhite" Workflow (New Trend)

The viral Playwright + MCP workflow for self-healing tests.

  • What: Connect Playwright to Cursor via MCP.
  • Why: The agent runs the tests → a test fails → the agent fixes the code → the agent verifies the fix.
  • Result: Test-Driven Development on Autopilot.

→ Full "PlayWhite" Guide


🆕 What's New in Cursor 2.3 (December 2025)

🎯 "The Stability Release" - Cursor 2.3 focuses entirely on fixing the "Agent Hang" and "Zombie Revert" bugs that plagued version 2.2.

Key Fixes

  • Agent Stability: Fixed issues where Composer would freeze mid-generation.
  • Layout Controls: New panel positioning system is now production-ready.
  • Diff View: Critical fixes to the diff application logic.

Recommendation: Upgrade to 2.3. The release focuses specifically on fixing stability issues like the "Agent Hang" and diff application bugs from 2.2.

Process Separation (The Big Change)

Extensions now run in an isolated process. If an extension crashes, AI keeps working:

Before: Extension crash → Everything freezes
After:  Extension crash → AI continues working ✅

This is critical for enterprise users with large codebases.

Layout Customization Engine

Four preset layouts with ⌘+⌥+⇥ (Mac) / Ctrl+Alt+Tab (Win):

| Mode | Description | Best For |
| --- | --- | --- |
| Agent | 50/50 Chat + Editor | Pair programming with AI |
| Editor | Maximized editor | Deep focus |
| Zen | Hidden chrome | Complex algorithms |
| Browser | Split with Chromium | Frontend dev |

Enterprise Features

  • Service Accounts - Headless CI/CD automation
  • SOC 2 Certified - Enterprise compliance ready
  • Enforcement Hooks - Block sensitive data in prompts
  • Linux Sandboxing - Container-friendly deployments

→ Full 2.3 Guide | → Enterprise Features


🚨 Critical: Revert Bug (v2.4.x)

⚠️ DATA LOSS WARNING: Active in Cursor 2.4.x. A file locking conflict between Agent Review Tab and editor.

What Happens

  • AI writes changes to disk (git diff confirms), but IDE cache shows old code
  • "Fix in Chat" while Agent Review Tab is open causes a race condition, and the new code gets overwritten
  • Risk of committing stale code without realizing

Safety Protocol (February 2026)

| ❌ DO NOT | ✅ SAFE TO DO |
| --- | --- |
| Use "Fix in Chat" with Review Tab open | Close Review Tab first |
| Trust editor view without git diff | Verify with git diff after every agent op |
| Use Format On Save with agents | Disable Prettier/formatters during agent sessions |
| Run multiple agents on same files | Use sequential agents or separate branches |

Defensive Commit (Critical)

# BEFORE every agent operation:
git add -A && git commit -m "pre-agent-$(date +%s)"
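
The "verify on disk" rule from the safety protocol can also be scripted. A minimal Python sketch, assuming you can obtain the editor's current buffer as a string (how you get it is up to you); `view_is_stale` and `demo` are hypothetical helper names, not Cursor APIs:

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_text(text: str) -> str:
    """Hash text the same way for both the buffer and the file."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def view_is_stale(path: Path, editor_text: str) -> bool:
    """Return True when the editor's copy differs from what is on disk."""
    disk_text = path.read_text(encoding="utf-8")
    return sha256_text(disk_text) != sha256_text(editor_text)


def demo() -> tuple[bool, bool]:
    """Write a file, then compare it with an outdated and an up-to-date buffer."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "demo.py"
        path.write_text("x = 2\n", encoding="utf-8")
        return view_is_stale(path, "x = 1\n"), view_is_stale(path, "x = 2\n")
```

A stale result means the editor view should not be trusted until the file is reloaded; `git diff` remains the authoritative check.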

→ Full Revert Bug Guide | → Windows Terminal Fixes


🧠 Gemini 3 Pro Guide

🏆 HLE (Humanity's Last Exam) Benchmark Leader: 37.5% - Best reasoning performance ever recorded.

Model Comparison (February 2026)

| Model | SWE-bench | Best For | Cost |
| --- | --- | --- | --- |
| Claude Opus 4.6 | 80.8% | Deep reasoning, multi-file refactoring, legacy migration | $$$ |
| GPT-5.3 Codex | High | Speed (30% faster than Opus), one-shot features | $$ |
| Composer 1.5 | - | Code synthesis, refactoring (distilled, 50% cheaper) | $ |
| Gemini 3 Pro | - | Architecture, reasoning, codebase indexing | Free (Beta) |
| DeepSeek V3.2 | - | Budget, local/privacy, open-source (1/50th cost) | ¢ |

New Model Strategy (Feb 2026)

Architecture Planning  → Claude Opus 4.6 (Best reasoning, 1M context)
Daily Implementation   → Composer 1.5 (Fast, 50% cheaper)
Speed-Critical Tasks   → GPT-5.3 Codex (30% faster execution)
Legacy Refactoring     → Claude Opus 4.6 (91% legacy migration success)
Local/Privacy          → DeepSeek V3.2 via Ollama (local-first)
Budget Bulk Work       → DeepSeek V3.2 (50x cheaper)

Configure Gemini for Codebase Indexing

Settings → Codebase Indexing → Model → Gemini 3 Pro

→ Full Gemini 3 Pro Guide


🆕 What's New in Cursor 2.2

⚠️ WARNING: Cursor 2.2 has critical bugs. See Cursor 2.2 Bugs before using.

Debug Mode (2.2)

The agent instruments your code with logging, you trigger the bug, and the agent analyzes the runtime data to debug empirically.

1. Describe bug → 2. Agent adds logging → 3. YOU trigger bug → 4. Agent analyzes → 5. Fix proposed

Visual Editor (2.2)

Bidirectional DOM ↔ Source Code editing. Select elements in browser, modify via GUI, changes write to source files.

Multi-Agent Judging (2.2)

Multiple agents solve your prompt in parallel. "Judge" agent picks the best solution. Increases token cost but improves quality.

⚠️ Critical Bugs in 2.2

| Bug | Impact | Workaround |
| --- | --- | --- |
| Revert Broken | Data loss! | Git commit before every agent call |
| Visual Editor Loop | Infinite re-apply | Avoid 'Visual' tab |
| WSL Terminal | Agent can't run commands | Enable Legacy Terminal |

→ Full 2.2 Features Guide | → 2.2 Bugs & Workarounds


⌨️ Keyboard Shortcuts

The Command Hierarchy

| Command | Mac | Windows/Linux | Scope | When to Use |
| --- | --- | --- | --- | --- |
| Inline Edit | Cmd + K | Ctrl + K | Single file | Quick fixes, rename, split function |
| Chat | Cmd + L | Ctrl + L | Conversational | Explain code, debug, explore |
| Composer | Cmd + I | Ctrl + I | Multi-file | Refactoring, new features |
| Composer Full | Cmd + Shift + I | Ctrl + Shift + I | Multi-file | Large refactors, review diffs |
| Add to Context | Cmd + Shift + L | Ctrl + Shift + L | Selection | Add selected code to chat |
| Terminal AI | Cmd + K (in terminal) | Ctrl + K | Shell | Generate shell commands |

Quick Reference

Cmd + K  → "Fix this type error"
Cmd + L  → "Explain how auth works"
Cmd + I  → "Refactor to use Axios instead of Fetch"

Pro Tip: Use Cmd + K for local scope, Cmd + I for global scope. Don't use Cmd + K for multi-file tasks.

→ Full Shortcuts Guide | → Development Workflows


🤖 Composer Mode

Composer is Cursor's killer feature - an autonomous agent that can plan and execute multi-file edits.

Normal vs Agent Mode

| Mode | Description | Risk Level |
| --- | --- | --- |
| Normal | Proposes edits, you click "Accept" | Safe |
| Agent | Creates/deletes files, runs terminal | ⚠️ Use with caution |

Checkpoints (Time Travel)

Composer creates snapshots at each step. If the AI breaks something:

  1. Click previous Checkpoint
  2. Workspace reverts instantly
  3. Try different approach

Effective Prompting

❌ Bad: "Add a login page"

✅ Good:

Implement a login route:
- @user_model.ts @auth_service.ts @routes.json
- Use Zod for validation
- Match error format in @errors.ts
- Create unit test in tests/auth/

→ Full Composer Guide


📎 Context Management

The AI is only as good as the context you provide.

@ Symbol Reference

| Symbol | What it does | Best for |
| --- | --- | --- |
| @Files | Full file content | Active editing |
| @Folders | File tree + summaries | Architecture questions |
| @Codebase | Semantic RAG search | "Where is X used?" |
| @Docs | External documentation | Third-party APIs |
| @Git | Git history/diff | Commit messages, history |
| @Web | Web search | Current info |

Pro Tips

@Codebase is probabilistic - if you call it "Login" 
but code says "SessionCreation", RAG may fail.

Use explicit @Files for critical tasks.

Notepads (Persistent Context)

Create a current_task_spec Notepad with:

  • PRD / Requirements
  • Design constraints
  • Architecture decisions

Reference with @current_task_spec in every new chat.

→ Full Context Guide | → Security Best Practices


📋 .cursorrules

System prompts that customize AI behavior per project.

Basic Setup

Create .cursorrules in project root:

You are an expert TypeScript engineer.
Use functional components with Hooks.
Use Tailwind CSS for styling.
Never use CSS modules or styled-components.
Every function needs a unit test.

Advanced: .mdc Files

New system uses .cursor/rules/*.mdc with glob patterns:

---
description: "React Component Rules"
globs: ["src/**/*.tsx"]
alwaysApply: false
---

Use shadcn/ui for primitives.
Components in src/ui must be presentational only.
Business logic goes in src/services.
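
To see which files a `globs` entry like the one above would select, here is a small Python sketch of common glob semantics (`**/` spans any number of directories, `*` stays inside one path segment). How Cursor evaluates globs internally is not described in this guide, so treat `glob_to_regex` and `rule_applies` as illustrative helpers only:

```python
import re


def glob_to_regex(pattern: str) -> re.Pattern:
    """Translate a glob into a regex: '**/' spans directories, '*' stays within one."""
    parts = []
    i = 0
    while i < len(pattern):
        if pattern.startswith("**/", i):
            parts.append(r"(?:[^/]+/)*")  # zero or more directory levels
            i += 3
        elif pattern[i] == "*":
            parts.append(r"[^/]*")  # anything except a path separator
            i += 1
        else:
            parts.append(re.escape(pattern[i]))  # literal character
            i += 1
    return re.compile("".join(parts) + r"\Z")


def rule_applies(path: str, globs: list[str]) -> bool:
    """True if the path matches at least one glob of the rule."""
    return any(glob_to_regex(g).match(path) for g in globs)
```

With the rule above, `rule_applies("src/ui/Button.tsx", ["src/**/*.tsx"])` is true, while a path outside `src/` or with a different extension is not matched.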

Essential Rules

| Category | Example Rule |
| --- | --- |
| Tech Stack | "Use Tailwind. Never styled-components." |
| Architecture | "Services in src/services, UI in src/ui" |
| Anti-Lazy | "Output FULL file. No placeholders. No //...existing code" |
| Testing | "Every function needs unit test in tests/" |

The Anti-Lazy Prompt (Reddit Gold)

You are an expert engineer.
You DO NOT use placeholders.
You output the FULL content of the file every time.
You are not lazy.

→ Full .cursorrules Guide | → .mdc Examples


🧠 Model Selection (February 2026 Update)

Latest Models Comparison

| Model | Best For | Context | Speed | Cost |
| --- | --- | --- | --- | --- |
| Claude Opus 4.6 | Deep reasoning, legacy migration, multi-file refactoring | 1M | Medium | $$$ |
| GPT-5.3 Codex | Speed, one-shot features, tool orchestration | ~1M | Fast | $$ |
| Composer 1.5 | Code synthesis, refactoring (distilled) | - | Fast | $ |
| Gemini 3 Pro | Visuals, massive context, codebase indexing | 2M | Fast | $ |
| DeepSeek V3.2 | Budget, local/privacy, open-source | Large | Medium | ¢ |
| DeepSeek V4 (coming soon) | Expected to rival Opus 4.6 in reasoning | TBD | TBD | ¢ |

🆕 Claude Opus 4.6 (Feb 2026): SWE-bench 80.8%, 1M context with "Adaptive Thinking", 99.3% tool orchestration score. Dominant in telecom benchmarks.

🆕 GPT-5.3 Codex (Feb 2026): 30% faster than Opus 4.6, excellent one-shot feature implementation.

🆕 Composer 1.5 (Feb 2026): Cursor's own distilled model. 50% cheaper, optimized for code synthesis.

Model Personalities

| Model | "Vibe" |
| --- | --- |
| Claude Opus 4.6 | Senior Architect (Unmatched reasoning) |
| GPT-5.3 Codex | 10x Implementer (Speed demon) |
| Composer 1.5 | Efficient Specialist (Cost-effective) |
| Gemini 3 Pro | Creative Designer |
| DeepSeek V3.2 | The Disruptor (Open-source champion) |

The Plan-Act Pattern (Updated for 2.4)

1. PLAN (Claude Opus 4.6 - Long-Running Agent):
   "Analyze request. Scan all dependencies. Create plan.md"

2. CRITIQUE (Gemini 3 Pro - optional):
   "Review plan.md for efficiency gaps"

3. EXECUTE (Composer 1.5 / GPT-5.3 Codex - Subagents):
   "Implement Step 1 of plan.md. Run tests. Deliver as atomic PR."

Cost Optimization (Feb 2026)

Pro Plan ($20/mo):
├── Credit pool system (fast vs slow)
├── "Auto" mode switches models by task complexity
├── ⚠️ "Auto-Mode Anxiety" - unclear credit consumption
└── Set per-user spending limits (Enterprise)

Strategy:
├── Daily work → Composer 1.5 (50% cheaper, fast)
├── Heavy refactoring → Claude Opus 4.6 (BYOK)
├── Speed-critical → GPT-5.3 Codex
├── Budget/privacy → DeepSeek V3.2 via Ollama (local)
└── Hard bugs → Claude Opus 4.6 (deep reasoning)

Warning: Don't switch models mid-conversation. It breaks the "train of thought."

→ Full Model Guide


🔌 MCP Integration

Model Context Protocol lets Cursor connect to databases, GitHub, and browsers.

Quick Setup

Create .cursor/mcp.json in the project root:

{
  "mcpServers": {
    "postgres": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-postgres", "${env:DATABASE_URL}"]
    }
  }
}
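
Before committing a config like this, it can help to check what command it resolves to. The Python sketch below parses the JSON, verifies that each server has a `command`, and expands `${env:NAME}` placeholders from a dictionary. It is a stand-in for whatever Cursor does at load time, not a description of it; `expand_env`, `resolve_servers`, and the sample connection string are assumptions made for illustration:

```python
import json
import re

ENV_REF = re.compile(r"\$\{env:([A-Za-z_][A-Za-z0-9_]*)\}")

EXAMPLE = """
{
  "mcpServers": {
    "postgres": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-postgres", "${env:DATABASE_URL}"]
    }
  }
}
"""


def expand_env(value: str, env: dict[str, str]) -> str:
    """Replace ${env:NAME} with env[NAME]; raise if NAME is missing."""
    def lookup(match: re.Match) -> str:
        name = match.group(1)
        if name not in env:
            raise KeyError(f"missing environment variable: {name}")
        return env[name]
    return ENV_REF.sub(lookup, value)


def resolve_servers(config_text: str, env: dict[str, str]) -> dict[str, list[str]]:
    """Return the full argv for each server after checking required keys."""
    servers = json.loads(config_text)["mcpServers"]
    resolved = {}
    for name, spec in servers.items():
        if "command" not in spec:
            raise ValueError(f"server {name!r} has no 'command'")
        args = [expand_env(arg, env) for arg in spec.get("args", [])]
        resolved[name] = [spec["command"], *args]
    return resolved
```

Calling `resolve_servers(EXAMPLE, {"DATABASE_URL": "postgres://localhost/dev"})` returns the argv list for the `postgres` server; a missing variable raises `KeyError` instead of passing an empty string through.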

Popular Servers

| Server | Use Case |
| --- | --- |
| server-postgres | Query database schema |
| github-mcp-server | Manage PRs/Issues |
| server-puppeteer | Browser automation |
| @playwright/mcp | E2E testing, self-healing tests |
| @sentry/mcp-server | Production error debugging |

Playwright Integration (Self-Healing Tests)

The most transformative MCP application:

1. Agent runs E2E test
2. Test fails (selector changed)
3. Agent analyzes DOM
4. Agent fixes test automatically
5. Agent verifies fix
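
The loop above, reduced to a toy Python model. There is no browser here: the DOM is a list of `Element` records, and "healing" means looking the element up again by its stable `data-testid`. All names are made up for illustration; a real agent would drive Playwright through MCP instead:

```python
from dataclasses import dataclass


@dataclass
class Element:
    selector: str  # current CSS selector in the page
    test_id: str   # stable data-testid attribute


# Hypothetical page state: the button's id changed from #submit to #submit-v2.
DOM = [Element(selector="#submit-v2", test_id="submit")]


def run_test(selector: str, dom: list[Element]) -> bool:
    """Stand-in for an E2E step: passes only if the selector exists."""
    return any(el.selector == selector for el in dom)


def heal(selector: str, test_id: str, dom: list[Element], max_attempts: int = 2) -> str:
    """Run the step; on failure, re-resolve the selector by test id and re-run."""
    for _ in range(max_attempts):
        if run_test(selector, dom):
            return selector  # verified against the current DOM
        matches = [el.selector for el in dom if el.test_id == test_id]
        if not matches:
            break
        selector = matches[0]
    raise LookupError(f"could not heal selector for test id {test_id!r}")
```

The returned selector has always passed `run_test`, which mirrors step 5: a fix only counts once the test is re-run.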

→ Full MCP Guide


🔒 Security Concerns

⚠️ As AI agents gain autonomy, security risks increase exponentially.

The Agentic Attack Surface

The ability of an Agent to execute shell commands and connect to the internet (via MCP) introduces new risks:

| Risk | Description |
| --- | --- |
| Prompt Injection | Malicious READMEs or docs that trick the AI |
| Credential Exfiltration | Agent accessing environment variables |
| MCP Exploits | Malicious MCP servers hijacking machines |
| YOLO Mode Dangers | Auto-execution of destructive commands |

Security Best Practices

✅ Review all terminal commands before approval
✅ Use read-only MCP configurations
✅ Audit .mdc files in cloned repositories
✅ Never approve `env` or `printenv` commands
✅ Set strict YOLO mode restrictions
❌ Don't trust arbitrary @Docs sources
❌ Don't enable unrestricted shell access
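
One way to back up the `env`/`printenv` rule with tooling is a pre-approval check. A rough Python sketch, assuming commands arrive as a single shell string; it only inspects the first word of each command in a pipeline or chain, so treat it as a first filter and not a sandbox. `BLOCKED` and `needs_rejection` are hypothetical names:

```python
import shlex

BLOCKED = {"env", "printenv"}  # the two commands named in the checklist
SEPARATORS = set("|&;()")      # characters that start a new command


def needs_rejection(command_line: str) -> bool:
    """True if any command in the line (split on |, &&, ;) is blocked."""
    lexer = shlex.shlex(command_line, posix=True, punctuation_chars=True)
    at_command_start = True
    for token in lexer:
        if set(token) <= SEPARATORS:
            at_command_start = True  # next word is a new command
        elif at_command_start:
            # Compare the executable name without any directory prefix.
            if token.rsplit("/", 1)[-1] in BLOCKED:
                return True
            at_command_start = False
    return False
```

For example, `needs_rejection("npm test && env")` is true, while `needs_rejection("echo env")` is false because `env` is only an argument there.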

The "Black Box" Risk

⚠️ "Vibe Coding" can create unmaintainable code

If you don't understand the AI-generated code:
- You can't debug it when AI fails
- It becomes "Legacy Code" immediately
- Security vulnerabilities go unnoticed

→ Full Security Guide


🔧 Troubleshooting

Quick Fixes

| Problem | Solution |
| --- | --- |
| "Connection Failed" | New chat (Cmd+L), disable HTTP/2 |
| "Stuck Generating" | New Composer (Cmd+N) |
| Files deleted by Agent | Use checkpoint to restore |
| Rules ignored | Restart Cursor |
| High token usage | Set API spending limits |

The "Single Purpose Composer" Rule

Don't reuse Composer windows. One task = one Composer. Prevents context pollution.

Always Commit Before Agent

git add -A && git commit -m "checkpoint"

→ Full Troubleshooting Guide


🔥 Reddit Community Wisdom

Tips from r/cursor power users:

The "Fresh Chat" Rule

If debugging exceeds ~20 messages, context is polluted. Start NEW chat with summary.

The "Delete Bug"

In Agent mode, Cursor sometimes deletes and recreates files instead of editing. Always commit before Agent sessions.

Screenshot Debugging

Paste UI bug screenshots directly into chat. Vision models diagnose CSS issues better than text descriptions.

Cost Control

Set hard limits in OpenAI/Anthropic dashboard. Runaway Agent loops can drain your credit card.

Model Switching

Don't switch from GPT-4o to Claude mid-task. Stick to one model per conversation.

→ Full Reddit Tips


🔄 Workflows

Research-First Protocol

1. Discovery: "Map usage of User component" (no code yet)
2. Plan: "Propose refactor plan, list affected files"
3. Critique: Review plan, challenge assumptions
4. Execute: Open Composer, paste approved plan
5. Audit: Review git diff line-by-line

TDD with AI

1. "Write Vitest test for compound interest calculator"
2. Review test logic
3. "Now write function to pass this test"
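
The guide's example uses Vitest; the same test-first flow looks like this in Python, sketched with plain assertions. The formula A = P(1 + r/n)^(nt) is the standard compound interest formula; the function and parameter names are assumptions:

```python
import math


def compound_interest(
    principal: float, annual_rate: float, years: float, periods_per_year: int = 1
) -> float:
    """Final balance: principal * (1 + r/n) ** (n * t)."""
    if periods_per_year < 1:
        raise ValueError("periods_per_year must be at least 1")
    growth = 1 + annual_rate / periods_per_year
    return principal * growth ** (periods_per_year * years)


def test_compound_interest() -> None:
    # Steps 1-2: these assertions come first and are reviewed by hand.
    assert math.isclose(compound_interest(1000, 0.05, 1), 1050.0)
    assert math.isclose(compound_interest(1000, 0.10, 2), 1210.0)
    # Zero interest leaves the principal unchanged.
    assert compound_interest(500, 0.0, 10) == 500
```

Reviewing the expected values in the test before asking for the implementation is the point of step 2: the test, not the model, defines what "correct" means.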

Debug with AI

1. Command fails in terminal
2. Click "Debug with AI" button
3. AI gets full error context automatically
4. Don't manually copy-paste errors

→ Full Workflows Guide


⚔️ Cursor vs Competitors (February 2026)

| Feature | Cursor 2.4 | Google Antigravity | Windsurf (Wave 14) | GitHub Copilot |
| --- | --- | --- | --- | --- |
| Philosophy | Agentic Engineering | Full Autonomy | Parallel Agents | Assistance |
| Architecture | Fork (Native) | New IDE (Cloud) | Fork (Cognition) | Extension |
| Multi-File | Long-Running Agents + Subagents ⭐ | Agents | Cascade + Parallel Agents | Limited |
| Model Choice | All models + Composer 1.5 | Gemini + Anthropic (capped 40%) | Proprietary | OpenAI only |
| Special | Plan-First, Long-Running Agents | Mission Control, Artifacts | Plan Mode → Auto Code Mode | Enterprise SSO |
| Price | $20/mo | Free Preview (⚠️ quota issues) | $15/mo (⚠️ rate limits) | $10/mo |
| Status | ✅ Market leader | ⚠️ Trust crisis | ⚠️ Acquired by Cognition | Stable |

Cursor vs Google Antigravity (Feb 2026 Update)

| Aspect | Cursor | Antigravity |
| --- | --- | --- |
| Control | Developer is pilot | Developer is mission controller |
| Verification | Git diffs, code review | Artifacts (screenshots, logs) |
| Best For | Precision, existing codebases | Greenfield, rapid prototyping ⚠️ Quota crisis |
| Lock-in | Model agnostic | Google ecosystem |
| Stability | ✅ Reliable (revert bug aside) | ❌ Silent throttling, 4-10 day cooldowns |
| Feb 2026 Issue | Revert Bug (workaround available) | Quota crisis: Promised 5hr refresh, actual 4-10 day cooldowns. Mass refund requests. |

⚠️ Antigravity Trust Crisis: Google silently changed quota policies. Users report 4-10 day cooldowns instead of the promised 5-hour refresh. Claude Opus 4.6 usage capped at 40% of pool. Agents hallucinate and enter recursive optimization loops in v2.1.4.

Windsurf: Acquired by Cognition (Feb 2026)

The biggest drama of the week:

OpenAI bid $3B for Windsurf → Microsoft vetoed (conflict with GitHub Copilot)
   ↓
Google hired CEO + 40 core engineers for $2.4B ("reverse acquihire")
   ↓
Cognition (Devin) acquired remaining Windsurf brand & product

Windsurf Wave 14 Features:

  • Parallel Agents: Unlike Cursor's subagents, Windsurf agents work on separate git branches simultaneously (frontend + backend in parallel)
  • Plan → Code Auto-Transition: Plan Mode automatically switches to Code Mode after approval, which means less friction
  • ⚠️ "Blocked" Phenomenon: Despite 85% credits remaining, users get blocked due to burst token consumption from parallel agents

The "Initiative Gap" (Updated)

| Aspect | Cursor 2.4 | Windsurf (Wave 14) |
| --- | --- | --- |
| Autonomy | Long-Running Agents (hours) | Parallel Agents (multi-branch) |
| Shell Commands | Requires approval | Often auto-executes |
| Reliability | ✅ "Gets the job done" | ⚠️ "Tech demo": great output but rate-limited |
| User Feeling | Controlled, professional | "Magical" but unpredictable billing |

"Windsurf produces high-quality code, but a tool that stops mid-task is unacceptable for production environments."

Verdict: Cursor remains the professional's safe harbor despite the revert bug. Antigravity is in crisis mode. Windsurf has the best parallel vision but is hobbled by economics. DeepSeek V4 (coming soon) could disrupt everything for open-source advocates.

→ Full Comparison | → Cursor vs Windsurf Deep Dive


💡 From Vibe Coding to Agentic Engineering

🆕 February 2026: The era of "Vibe Coding" is officially over. Welcome to "Agentic Engineering", where engineers orchestrate agents rather than write code.

The Paradigm Shift

2025 "Vibe Coding":    Developer describes intent → AI implements → Developer reviews
2026 "Agentic Engineering": Developer designs architecture → Agents execute autonomously → Developer supervises

The New Role: Agent Orchestrator

Engineers no longer deal with syntax. Their job is now:

  • System Architecture - Designing the blueprint agents follow
  • Data Modeling - Defining schemas and relationships
  • Rule Codification - Writing .cursorrules, CLAUDE.md to constrain agents
  • Quality Supervision - Reviewing agent output, not writing code

"99% of code is written by agents. Humans provide high-level oversight and quality control." (Community consensus, Feb 2026)

The Cultural Divide: "Shadow Engineers" vs "Vibe Coders" vs "Agentic Engineers"

| Type | Description | Strength | Risk |
| --- | --- | --- | --- |
| Agentic Engineer (new) | Designs systems, orchestrates agents, codifies rules | Full control, scalable | Requires deep architectural knowledge |
| Shadow Engineer | Manages AI agents, writes PRDs, verifies output | Maintainable code | Slower iteration |
| Vibe Coder | Relies entirely on natural language | Rapid MVPs | "Black box" code, debugging walls |

Context Rot (New Phenomenon)

As Long-Running Agents work for hours, their mental model drifts from reality:

Agent starts → Modifies files → Mental model diverges from disk state
   ↓
Agent hallucinates → Calls non-existent functions → Corrupts codebase

Mitigation: Force periodic re-indexing. Use .cursorrules to mandate git diff verification at each step.
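
A cheap drift check is to scan agent-written code for calls to names that do not exist. The sketch below does this for Python source with the standard `ast` module. It is deliberately crude (one file at a time, plain function calls only, no attribute calls) and is meant as an illustration, not a replacement for a type checker; `undefined_calls` and the sample source are hypothetical:

```python
import ast
import builtins

SAMPLE = '''
import json

def load(path):
    return json.loads(open(path).read())

def main():
    data = load("cfg.json")
    return normalize_config(data)
'''


def undefined_calls(source: str) -> set[str]:
    """Names called as plain functions that are not defined, imported, or built in."""
    tree = ast.parse(source)
    known = set(dir(builtins))
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            known.add(node.name)
        elif isinstance(node, (ast.Import, ast.ImportFrom)):
            known.update((alias.asname or alias.name).split(".")[0] for alias in node.names)
        elif isinstance(node, ast.arg):
            known.add(node.arg)  # parameters may be callables
        elif isinstance(node, ast.Name) and isinstance(node.ctx, ast.Store):
            known.add(node.id)  # assigned variables
    called = {
        node.func.id
        for node in ast.walk(tree)
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Name)
    }
    return called - known
```

On `SAMPLE`, the only name reported is `normalize_config`, the function the hypothetical agent assumed but never wrote.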

Essential "Constitution" Files

| File | Purpose |
| --- | --- |
| .cursorrules / .mdc | Project rules, tech stack, code style |
| CLAUDE.md | Agent behavioral constraints |
| MCP: Context7 | Real-time documentation access |
| MCP: Playwright | Self-healing E2E tests |

Success Stories

  • Tradofire: Solo developer shipped complex crypto trading app
  • Enterprise ERP: Full-scale systems built in weeks using TaskMaster workflow

The TaskMaster Workflow

1. Generate detailed PRD (Product Requirement Document)
2. Feed PRD to task management system
3. Agent parses PRD into individual tickets
4. Execute tickets one by one
5. Human review at each milestone

→ Full Vibe Coding Guide


🧬 GPT-5.1 Codex Guide

The deployment of GPT-5.1 Codex (December 2025) introduced new capabilities and quirks.

The "Stupidity" Paradox

The new "smart" model sometimes behaves "stupidly" due to over-reasoning and safety alignment:

  • Over-analyzes simple requests
  • Hallucinated constraints (e.g., insisting VPS is required)
  • Conservative refusals on legitimate security utilities

Model Arbitrage Strategy

Complex Architecture Planning → GPT-5.1 Codex Max (expensive)
Implementation Details       → Gemini 3 Pro or Claude Sonnet (cheaper)

Key Insight

Use Composer over raw chat: Cursor's agent harness improves model behavior significantly.

→ Full GPT-5.1 Codex Guide


🚀 GPT-5.2 Guide

Released December 11, 2025 as OpenAI's "Code Red" response to competitors.

Key Specs

| Variant | Context | Output | Best For |
| --- | --- | --- | --- |
| GPT-5.2 Instant | 128K | 16K | Quick edits |
| GPT-5.2 Thinking | 200K | 32K | Complex reasoning |
| GPT-5.2 Pro | 400K | 128K | Massive refactors |

Benchmarks

  • AIME 2025: 100% (math reasoning)
  • SWE-Bench Pro: 55.6% (can solve majority of mid-level tickets)
  • Tool Reliability: 98.7%

Pricing

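| Tier | Input (USD per 1M tokens) | Output (USD per 1M tokens) |
| --- | --- | --- |
| Standard | $1.75 | $14.00 |
| Cached | $0.175 | $14.00 |

90% discount on cached inputs, which makes it ideal for IDE usage where the same context is resent often.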
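
As a worked example of the table above, this Python sketch prices a single request. The token counts and the 90% cache-hit share are made-up inputs; only the per-million prices come from the table:

```python
INPUT_PER_M = 1.75          # USD per 1M uncached input tokens
CACHED_INPUT_PER_M = 0.175  # USD per 1M cached input tokens
OUTPUT_PER_M = 14.00        # USD per 1M output tokens


def request_cost(input_tokens: int, output_tokens: int, cached_fraction: float = 0.0) -> float:
    """Cost in USD of one request, given the share of input served from cache."""
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    total = fresh * INPUT_PER_M + cached * CACHED_INPUT_PER_M + output_tokens * OUTPUT_PER_M
    return total / 1_000_000
```

Under these assumptions, a 100,000-token prompt with 2,000 output tokens costs about $0.203 uncached and about $0.061 when 90% of the input is served from cache. Output tokens are billed at the same rate either way, so the saving comes entirely from the input side.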

→ Full GPT-5.2 Guide


🎯 Confidence Scoring (Anti-Hallucination)

A powerful technique to combat AI hallucinations:

"Fix this only if you are 100% confident. Tell me your confidence score."

This prompt pushes the model out of its default "helpful assistant" persona and asks for a self-assessment. The score is self-reported, not a calibrated probability, but requesting it nudges the model to:

  1. Re-evaluate its own logic
  2. Search for actual evidence
  3. Admit uncertainty instead of hallucinating
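
If you script around this prompt, you can gate changes on the reported score. A minimal Python sketch, assuming the reply contains text such as "Confidence: 85%"; the reply format, `parse_confidence`, and `should_apply` are assumptions, and the number is the model's own claim rather than a measured probability:

```python
import re
from typing import Optional

SCORE = re.compile(r"confidence[^0-9]{0,20}(\d{1,3})\s*%", re.IGNORECASE)


def parse_confidence(reply: str) -> Optional[int]:
    """Extract a self-reported percentage such as 'Confidence: 85%'."""
    match = SCORE.search(reply)
    if match is None:
        return None
    return min(int(match.group(1)), 100)  # clamp nonsense like 250%


def should_apply(reply: str, threshold: int = 100) -> bool:
    """Apply the fix only when a score is present and meets the threshold."""
    score = parse_confidence(reply)
    return score is not None and score >= threshold
```

A reply with no score is treated the same as a low score: the change is not applied.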

→ Full Confidence Scoring Guide


🐛 Known Bugs (Feb 2026)

CRITICAL: Revert Bug (v2.4.x)

Agent Review Tab file locking conflict causes silent code reversion. Close Review Tab before "Fix in Chat".

# Always verify after agent operations:
git diff
ls -l <filename>   # shows the last-modified time

NEW: Context Rot

Long-Running Agents lose sync between their mental model and disk state over time. Agents start calling non-existent functions.

Mitigation: Force periodic re-indexing. Keep agent sessions under 2 hours or add re-indexing checkpoints.

Other Issues

| Bug | Severity | Status |
| --- | --- | --- |
| Revert Bug (Agent Review Tab conflict) | CRITICAL | Open, workaround available |
| Context Rot (Long-Running Agents) | HIGH | New, mitigation: re-indexing |
| Auto-Mode credit transparency | MEDIUM | Open ("Auto-Mode Anxiety") |
| Plan mode not writing files | CRITICAL | Open (since Dec 2025) |
| Context Decay | MEDIUM | Workaround: Session Reset |

→ Full Known Bugs Guide


🌐 Google Antigravity (⚠️ Trust Crisis - Feb 2026)

⚠️ February 2026: Antigravity experienced a catastrophic trust failure. Quota policies silently changed, agents enter infinite loops, and the Windsurf talent acquisition signals Google's desperation.

The Quota Crisis

| Promised | Reality (Feb 2026) |
| --- | --- |
| 5-hour refresh cycle | 4-10 day cooldowns |
| Generous Pro/Ultra limits | ~90% service reduction |
| Transparent billing | Silent throttling, no notification |

Users describe this as a "bait-and-switch"; mass refund requests have been filed.

Claude Opus 4.6 Infrastructure Collapse

Users massively preferred Opus 4.6 over Google's own Gemini models, causing:

  • Unexpected compute costs for Google's backend
  • Anthropic model usage capped at 40% of total pool
  • Forced redirection to Gemini 3 Pro (users call it "lobotomized")

v2.1.4 "Logic Patch" - Made Things Worse

| Bug | Description |
| --- | --- |
| Hallucination Loops | Agents wrongly assume models lack vision, autonomously switch models mid-task, lose all context |
| Recursive Optimization | Agents enter infinite loops "optimizing" their own code, delete human-readable code as "unnecessary complexity" |
| File System Corruption | Irreversible file damage from recursive optimization agents |

Who Should Still Use Antigravity?

✅ Free-tier experimentation only
✅ Non-critical greenfield projects
❌ Production codebases
❌ Anything requiring reliable uptime
❌ Claude Opus 4.6 heavy usage

→ Full Google Antigravity Guide


📜 Advanced .cursorrules

Sophisticated patterns from production teams:

The "Shout" Protocol

If existing code is altered, warn by shouting:
❗️SHOUT WITH LARGE LETTERS❗️
"WARNING: Modified existing function in file.ts"

"Dumb" Component Enforcement

Presentation components must include "Dumb" in filename:
- UserProfileCardDumb.vue
- ProductListDumb.tsx

Anti-Flake Testing

NEVER use page.waitForTimeout(5000)
ALWAYS use built-in auto-wait mechanisms
Target elements using data-testid attributes
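
The first rule is easy to enforce mechanically. A small Python sketch that flags `waitForTimeout` calls in test source; the sample test text and the helper name `find_fixed_waits` are illustrative:

```python
import re

FIXED_WAIT = re.compile(r"\bwaitForTimeout\s*\(")

# Hypothetical Playwright test body used only to exercise the check.
SAMPLE = """\
await page.goto('/login');
await page.waitForTimeout(5000);
await page.getByTestId('submit').click();
"""


def find_fixed_waits(test_source: str) -> list[int]:
    """Return 1-based line numbers that call waitForTimeout."""
    return [
        number
        for number, line in enumerate(test_source.splitlines(), start=1)
        if FIXED_WAIT.search(line)
    ]
```

Run over `SAMPLE`, the check reports line 2; the `getByTestId` line passes because it relies on auto-waiting and a `data-testid` target.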

→ Full Advanced .cursorrules Guide


📋 Strategic Recommendations (Feb 2026)

Based on community intelligence and the Agentic Engineering paradigm shift:

  1. Embrace Agentic Engineering: Shift from "code writing" to "agent orchestration" - define architecture, not syntax
  2. Close Review Tab Before "Fix in Chat": Critical workaround for the v2.4.x Revert Bug
  3. Use Composer 1.5 as Daily Driver: 50% cheaper, optimized for code synthesis
  4. Codify Your Project Constitution: .cursorrules, CLAUDE.md, and .mdc files are now mandatory because agents need rules
  5. Mitigate Context Rot: Force re-indexing every 2 hours during Long-Running Agent sessions
  6. Defensive Commits: Always git commit -m "checkpoint" before every agent operation
  7. Avoid Google Antigravity for Production: Trust crisis - silent throttling, unreliable quotas
  8. Watch DeepSeek V4: The next potential disruptor for local-first, privacy-focused development
  9. Adopt MCP Standards: Context7 (docs) + Playwright (tests) are essential for any Agentic Engineering setup
  10. Verify with git diff: Never trust the editor view alone; always verify agent changes on disk

🚀 Quick Start Path

Beginner (Week 1)

  1. Learn Cmd + K for inline edits
  2. Learn Cmd + L for chat
  3. Use @Files to add context

Intermediate (Week 2-3)

  1. Master Cmd + I Composer
  2. Create .cursorrules
  3. Use @Codebase for exploration

Advanced (Month 2+)

  1. Agent Mode with checkpoints
  2. Custom .mdc rules per file type
  3. Research-First protocol
  4. BYOK API for heavy tasks

📁 Configuration Files (Copy-Paste Ready)

Ready-to-use configuration files for optimal Cursor setup:

MCP Configuration

// .cursor/mcp.json
{
  "mcpServers": {
    "playwright": {
      "command": "npx",
      "args": ["-y", "@playwright/mcp@latest"]
    }
  }
}

Defensive Settings

// .cursor/settings.json
{
  "agents": {
    "auto_apply_changes": false,
    "legacy_terminal_tool": true,
    "max_turns_per_session": 40
  },
  "models": {
    "codebase_indexing": "gemini-3-pro"
  }
}

→ Full MCP Config | → Full Settings | → React Rules | → Test Rules


🎭 Playwright Self-Healing Tests

Tests that automatically fix themselves when selectors change:

1. Agent runs E2E test
2. Test fails (selector changed)
3. Agent analyzes DOM
4. Agent updates selector
5. Test passes ✅

Quick Setup

# 1. Create .cursor/mcp.json (see above)
# 2. Create .cursor/rules/testing.mdc
# 3. Enable auto-tools in settings

→ Full Playwright MCP Guide


📚 Resources


🤝 Contributing

Found a tip? Share it!

  1. Fork this repo
  2. Add your tip to relevant file
  3. Include source (Reddit, Twitter, etc.)
  4. Open PR

⭐ Star this repo if it helped you!

Made with πŸ’™ by Murat Aslan

Last updated: February 14, 2026
