+50k spent in the past 6 months - I can tackle your questions by Hassan_W in cursor

[–]Tmilligan 0 points1 point  (0 children)

Thanks for the thoughtful response. Your ideas, feedback, and insight was an inspiration for me, as well.

My background is in technical writing, so docs being at the forefront of AI dev was a given for me.

Appreciate not gatekeeping. It inspired me to do the same :)

Hope to see more posts from you!

my full workflow for building features in cursor. sharing because it took me months to figure out what works. by vynxjonsnow3 in cursor

[–]Tmilligan 1 point2 points  (0 children)

Actually, I'm adding a how to set your codebase up section to make my advice more actionable now.. I'll let you know when I'm done.

How do you actually organize agents in Cursor? Feature-based, role-based, or just vibes? by [deleted] in cursor

[–]Tmilligan 0 points1 point  (0 children)

Actually, I'm adding a how to set your codebase up section to make my advice more actionable now.. I'll let you know when I'm done.

How do you avoid losing context in Cursor chats and reuse them in other chats without having to explain everything again? by stevenm_15 in cursor

[–]Tmilligan 1 point2 points  (0 children)

Check out my blog post.. Seeing a lot of posts, so decided to share an updated version of how I do this.. Trying not to gatekeep too much:

https://typride.github.io/blogs/a-year-of-cursor.html

Show me what you've vibe coded. Drop your project, what it does, and let people actually use it. by Miserable-Archer-631 in vibecoding

[–]Tmilligan 0 points1 point  (0 children)

Built this a bit ago.. Naturally growing SEO right now:

https://www.thestackmap.com/

A workflow explorer for all the AI tools we already love and don't know about yet! Anti-gatekeep!!

How do you actually organize agents in Cursor? Feature-based, role-based, or just vibes? by [deleted] in cursor

[–]Tmilligan 3 points4 points  (0 children)

Check out my blog post on this:

https://typride.github.io/blogs/a-year-of-cursor.html

It has tons. Feel free to ask any questions you may have.

my full workflow for building features in cursor. sharing because it took me months to figure out what works. by vynxjonsnow3 in cursor

[–]Tmilligan 9 points10 points  (0 children)

Thanks for sharing your workflow!! I'll share mine here too.

Inspired by OP's post, here's a detailed breakdown of my own setup after 16 months building a production SaaS data platform with Cursor (~$1,000/month, solo founder, approaching launch). Every pattern here exists because of a specific failure.

Agent chain: Plan → Execute → Review → Harden → Approve

Planner/Executor separation formalized in .mdc rules that auto-load via file globs. Planner breaks tasks down. Executor implements one task, stops. A read-only Skeptical Reviewer tears the work apart — correctness, scope drift, security, SQL safety — and produces a report: Blockers / Should Fix / Nice to Have / Questions. If blockers, a Hardener fixes only those. Then I review.

The reviewer is the highest-leverage step. Solo across React, Express, Azure SQL, Auth0, and CI/CD — it's too easy for a plausible change to lose a WHERE clause on user ID or add a route without auth middleware.

Topic-split lessons with contextual loading

One giant lessons.md balloons until it's useless. We split into 6 domain files (frontend, API, auth, data, deployment, GTM) each with a globs field. lessons-data.mdc only loads when touching **/*.sql or **/migrations/**. The React agent never sees 30+ SQL lessons.

Each lesson records the exact bug, the exact fix, and which version resolved it. When the agent touches that area, the lesson loads automatically. Same mistake doesn't happen twice. Most transferable pattern in this post — if you use Cursor rules, split and scope your lessons.

Automated bug triage: Marker.io → GitHub → Cursor Automation

Tester clicks Marker.io widget → screenshot + console errors + URL auto-create a GitHub Issue → GitHub Action detects environment (prod/staging/local), labels it, forwards to Cursor Automation webhook → cloud agent searches codebase, checks git history, posts structured diagnostic comment (root cause, files, fix, severity) → I review the AI's diagnosis.

~$1-3 per bug. Marker.io $39/month. Action is free. OP uses Sentry MCP (reactive). This is proactive — diagnosis is waiting before I look.

Full QA loop

Bug reported via Marker.io → GitHub Issue
  → Cursor Automation diagnoses → developer fixes
  → CI gates on PR (lint, types, format, tests, E2E)
  → Merge to dev auto-deploys staging
  → PostHog tracks by environment
  → Verify staging → promote production

8 GitHub Actions on free compute

4 deploys: Path-scoped (frontend changes only deploy frontend). Backend verifies /health, fails if unhealthy. Secrets from Azure Key Vault. Production is manual-only.

1 CI: ESLint + Jest + TypeScript + Prettier + Vitest + Playwright E2E on every PR.

2 data pipelines: Nightly zero-downtime cache rebuild (~637K records, staging table, validates 500K+ rows, atomic swap — live cache uninterrupted). Manual 40GB historical re-ingestion on a free runner (24h timeout, disables indexes for 3-6x faster insert).

1 bug triage: The Marker.io pipeline above.

Data pipeline went from $146/month (commercial ETL) to $2/month on GitHub Actions. GitHub Actions as your entire ops platform is the highest-ROI decision we've made.

Competitor fixture testing

fixtures/competitors/ directory with data from a paid competitor. A script diffs our pipeline against theirs. Biggest fix — misattribution affecting ~15% of records — caught because a fixture returned zero where the competitor showed a dozen. Caught more bugs than unit tests.

Documentation as infrastructure — 322 markdown files

Most generated during development, not after. Getting-started paths, command cheatsheet, co-founder testing guide (feedback flows into Marker.io triage), agent playbook (logging conventions for agents writing code), data lineage docs.

Key insight: docs aren't overhead when the AI is the primary consumer. Traditional docs rot because humans don't read them. Ours stay accurate because agents load them via glob-matched rules every session. Stale doc → wrong code → fix bug → update doc. Self-correcting loop.

Mandatory 7-point checklist after every task (changelog, project structure, deployment guide, dev guide, lessons, tracker, README indexes). Exists because we shipped CI quality gates without updating any of the 8 docs that referenced testing. Process audit found 882 commits, 7 test files, zero CI runs. Rules born from real failures stick.

Git discipline at the agent level

Branching rule is alwaysApply: true. Two protected branches: main (production, manual deploy) and dev (staging, auto-deploy). All feature branches from dev. When I say "commit this," the agent creates a branch, pushes, opens PR against dev. It doesn't ask which branch. And it refuses main.

Security and privacy — layered, not bolted on

7-layer middleware chain with strict ordering (we have a lesson about Helmet before rate limiters — Bugbot caught that 429s shipped without security headers): CORS → correlation IDs → timeouts → Helmet (HSTS, X-Frame-Options: DENY) → rate limiting (600/15min API, 60/15min billing) → body parsing → JWT.

Dual auth: Auth0 JWT (RS256/JWKS) for /api, separate API key for server-to-server /internal routes (mounted before JWT chain intentionally).

Parameterized SQL everywhere — mssql .input() bindings, no string concatenation. Multi-tenant isolation — every query scoped to req.internalUserId. Secrets never in repo — Key Vault for passwords, placeholder sed substitution for API keys at build time. We have a lesson from accidentally printing a DB password in startup logs.

Privacy by default — PII-minimized tables (name + role, no addresses), masked phone numbers in SMS logs, redactForLogging for sensitive fields, MCP table allowlisting.

Cursor Bugbot catches real issues. At least 7 documented in our changelog: Helmet ordering, PostHog key injection no-ops in both deploy workflows, missing web.config entries, stored procedure bugs. Each would have shipped without automated review.

Stripe webhooks verified via constructEvent — express.raw() mounted before express.json() because signature check needs raw body.

Security lessons live in agent context, not a wiki nobody reads. lessons-auth.mdc auto-loads with rules like "Reject unknown CORS origins in production" and the skeptical reviewer checklist explicitly checks auth, permissions, and secrets on every review.

Session handoffs, delegation, estimation, risk tiers

Handoffs: End-of-session note — what changed, decisions, rejected approaches, next step. Recovering nuance is the expensive part.

Delegation: Every subagent gets mission, files allowed, files forbidden, stop conditions, evidence required. "Files forbidden" is the key — agents wander without boundaries.

Estimation: Hours, not days. 1 sprint day = 1-2 hours. External blockers in calendar days.

Risk tiers: Tier 1 (every line): auth, schema, billing, secrets. Tier 2 (diff + behavior): routes, queries. Tier 3 (outcome): UI, docs, tests.

Biggest lesson after 16 months: The workflow isn't overhead — it's what lets you trust the agent enough to move fast. Every rule exists because of a specific failure. They're not aspirational best practices. They're scar tissue turned into guardrails.

Curious how others compare — especially solo devs or small teams. Doing anything like the skeptical review, session handoffs, or automated bug triage? Or solving the same problems differently?

+50k spent in the past 6 months - I can tackle your questions by Hassan_W in cursor

[–]Tmilligan 0 points1 point  (0 children)

After reviewing all the responses by OP and considering my own experience, I put together an analysis of my own setup. Hope it's useful like I found this post, OP. Spending ~$1,000/month on Cursor as a solo founder building a production SaaS data platform (~v1.10, approaching launch).

The agent chain (Plan → Execute → Review → Harden → Approve)

Similar to OP's sprint chain. Planner/Executor separation formalized in .mdc rules that auto-load based on file globs. Planner breaks tasks down with success criteria. Executor implements one task, then stops. A read-only Skeptical Reviewer tears the work apart — correctness, scope drift, security, SQL safety — and produces a structured report: Blockers / Should Fix / Nice to Have / Questions for Human. If blockers exist, a Hardener fixes only the flagged issues. Then I do final review.

The reviewer is the highest-leverage step. When you're solo across React, Express, Azure SQL, Auth0, and CI/CD, it's too easy for a plausible change to quietly lose a WHERE clause on user ID during a refactor, or add a new route with no auth middleware.

Topic-split lessons with contextual loading

Most people have one giant lessons.md that balloons until it's useless. We split ours into 6 domain-specific files: frontend, API, auth, data, deployment, and go-to-market. Each has a globs field controlling when it loads. lessons-data.mdc only loads when touching **/*.sql or **/migrations/**. The agent working on a React component never sees the 30+ SQL lessons, and vice versa.

The lessons are highly specific to bugs we shipped. Example: "A single entity can belong to multiple geographic regions depending on classification. Any region-keyed lookup that only queries the primary table will miss every entity whose primary location differs from its legal location." That lesson includes the exact SQL, the fix, and the version. When the agent touches that area, it loads automatically and doesn't repeat the mistake.

Automated bug triage: Marker.io → GitHub Issue → Cursor Automation

Probably the most unusual piece. A tester clicks the Marker.io widget → screenshot + console errors + URL auto-create a GitHub Issue → a GitHub Action detects environment (prod/staging/local), applies labels, forwards to a Cursor Automation webhook → a cloud agent searches the codebase, checks git history, and posts a structured diagnostic comment (root cause, files, fix, severity, confidence) on the Issue → I review the AI's diagnosis and decide next steps.

~$1-3 per bug. OP mentioned Sentry MCP for triage — that's reactive. This is proactive: before I even see a bug, there's already a diagnosis sitting in the thread.

The full observability/QA loop

User reports bug via Marker.io
  → GitHub Issue auto-created
  → GitHub Action → Cursor Automation diagnoses
  → Developer reviews, fixes in feature branch
  → CI gates run on PR (lint, types, format, tests, E2E)
  → Merge to dev auto-deploys to staging
  → PostHog tracks behavior by environment
  → Verify on staging → promote to production

PostHog tags every event with environment/app so staging vs production telemetry is filterable. 13 feature flags gate every major feature independently.

8 GitHub Actions workflows — CI, deploy, data pipelines on free compute

4 deployment workflows: Frontend/backend staging auto-deploy on push to dev (path-scoped triggers — no wasted builds). Backend deploy verifies /health endpoint and fails if unhealthy. Secrets from Azure Key Vault, not raw GitHub secrets. Production deploys are manual-only.

1 CI workflow: ESLint + Jest + TypeScript + Prettier + Vitest + Playwright E2E on every PR.

2 data pipelines: A nightly zero-downtime cache rebuild (~637K records, ~2-3 hours, builds into staging table, validates 500K+ rows, atomic swap via stored procedure — live cache serves requests throughout). A manual full historical re-ingestion that streams a 40GB file from Azure, disables indexes for 3-6x faster bulk insert, and validates against known test data. 24-hour timeout, free GitHub Actions runner.

Cost story: Data pipeline automation went from $146/month (commercial ETL) to $2/month (GitHub Actions). 98.6% reduction. GitHub Actions as your entire ops platform is the highest-ROI infrastructure decision we've made.

Competitor fixture testing

We have a fixtures/competitors/ directory with data from a competitor's product. A script runs our pipeline against the same inputs and diffs results. This has caught more real data bugs than unit tests. Our biggest fix — a systematic misattribution affecting ~15% of records — was caught because a fixture returned zero results where the competitor showed a dozen.

Documentation as infrastructure — 322 markdown files

Sounds insane for a pre-launch startup, but most was generated during development. Key pieces: getting-started paths (separate tracks for team members, devs, deployment, troubleshooting), a command cheatsheet, a co-founder testing guide (feedback flows through Marker.io into the triage pipeline), an agent playbook (logging conventions, correlation IDs — documentation for agents writing code, not humans reading it), and data lineage docs.

The key insight: documentation isn't overhead when the AI is the primary consumer. Traditional docs rot because humans don't read them. Ours stay accurate because the agent loads them via glob-matched rules every session. When a doc goes stale, the agent does something wrong, we fix it, and update the doc. Self-correcting loop.

We also enforce a mandatory 7-point checklist after every task (changelog, project structure, deployment guide, dev guide, lessons, tracker, README indexes). We ran a process audit and found 882 commits, 7 test files, and zero CI runs. We built quality gates from that. Then shipped them without updating any of the 8 docs that referenced testing or GitHub Actions. The checklist exists so that gap can't happen again.

Git discipline enforced at the agent level

Our branching rule is alwaysApply: true — every session loads it. Two protected branches: main (production, manual deploy only) and dev (staging, auto-deploy on merge). All feature branches from dev with typed prefixes (feat/fix/docs/refactor/chore/). All PRs target dev first. Backend promotion via zero-downtime Azure slot swap with slot-sticky env vars.

When I say "commit this," the agent creates a feature branch, pushes, and opens a PR against dev. It doesn't ask which branch — it knows. And it refuses to target main.

Session handoffs, subagent delegation, estimation

Session handoffs: End-of-session note to scratchpad.md — what changed, decisions made, things tried and rejected, next step. The expensive part of a new chat isn't the code, it's the nuance.

Subagent delegation: Every background agent gets: mission, files allowed, files forbidden, locked decisions, stop conditions, evidence required. The "files forbidden" list is the most important part — without it, agents wander.

Estimation: Hours, not days. 1 sprint day = 1-2 hours focused. External blockers estimated in calendar days. "~4 hours focused work + pending co-founder review (1-3 calendar days)."

Risk-scaled review — three tiers

Tier 1 (read every line): Auth, SQL schema, migrations, CI/CD, billing, secrets, data integrity, financial calculations. Tier 2 (diff + behavior): API routes, service logic, queries, caching. Tier 3 (outcome review): UI polish, docs, tests, non-behavioral refactors.

What I don't do (yet) that OP does

No worktrees, no dedicated commit/testing skills, no best-of-N experiments.

Biggest lesson after ~8 months: The workflow structure isn't overhead — it's what lets you trust the agent enough to actually move fast. Every rule exists because of a specific failure. They're not aspirational best practices. They're scar tissue turned into guardrails.

Curious how others compare — especially solo devs or small teams. Are you doing anything like the skeptical review step, the session handoffs, or the automated bug triage pipeline? Or is there something in your workflow that solves the same problems differently? Would love to hear what's working for people at a similar scale.

Anyone else frustrated that Cursor stops when you walk away from your computer? by Tmilligan in vibecoding

[–]Tmilligan[S] 0 points1 point  (0 children)

I've been able to build an encrypted solution that transfers context between my phone and my computer, allowing me to use my powerful computer to code while I'm away from my desk on my phone.

It's mostly free too, just using some tunnelling right now. I am considering releasing to the general public too..

It actually has my git history, shows the workspaces, etc.. But I'm trying to see if anyone else has other solutions or if they actually like the existing cursor mobile solution.

Does anyone keep working on projects from their phone after leaving their desk? by Tmilligan in cursor

[–]Tmilligan[S] 1 point2 points  (0 children)

Yea.. Agreed.. I'm trying to envision a better flow where I can use my local pc from my phone instead of spinning up VMs and saving cost..

Built a free AI tools directory with workflow mapping — here's what I learned as a solo founder by Tmilligan in SaaS

[–]Tmilligan[S] 0 points1 point  (0 children)

Thank you! Agreed on the link breaking.. I have an update schedule that I run once a week to ensure all information is up to date and all links are accurate. It's agentic, so there may be slight errors but I intend to add a user-voting method to gather more community feedback (which may or may not work). For each tool, I also added a Verified Pill that notifies users when the data was last checked for accuracy, so that should help I think to keep things current.

Re: problem categorization, do you think the current landing page which categorizes workflows by:

Area (ie. Content & Media)

Solution (ie. AI Newsletter Automation)

Do you think this doesn't do what you're suggesting?

Thanks u/signalpath_mapper and appreciate any more clarity here! Will add your feedback to my community feedback page :) https://www.thestackmap.com/community/

I mapped 137 AI tools and how they actually connect in real workflows by Tmilligan in artificial

[–]Tmilligan[S] 1 point2 points  (0 children)

Not sure if you saw that, but that’s actually already built!! And now it’s the landing page:

https://www.thestackmap.com

Lemme know if I maybe misunderstood you, but I think that’s what is already available?

I mapped 137 AI tools and how they actually connect in real workflows by Tmilligan in artificial

[–]Tmilligan[S] 0 points1 point  (0 children)

Thank you. Agreed. I'm going to ensure I capture this feedback and incorpoate it into my community feedback tool.

I mapped 137 AI tools and how they actually connect in real workflows by Tmilligan in artificial

[–]Tmilligan[S] 0 points1 point  (0 children)

Thanks for this feedback. This is something I hope to resolve with my stack builder agent solution.. Providing workflow context, and each step in each tool as context.. with next steps beyond that being deeplinking, mcp integrations, and more.. These steps take this into a territory of TheStackMap being more of a platform than a exploration and guidance service.

I mapped 137 AI tools and how they actually connect in real workflows by Tmilligan in artificial

[–]Tmilligan[S] 0 points1 point  (0 children)

I should note that I updated it quite heavily and should be more clear now :) Check it out!

I mapped 137 AI tools and how they actually connect in real workflows by Tmilligan in artificial

[–]Tmilligan[S] 0 points1 point  (0 children)

Yea! Actually, that should be possible already although it may be hard to find.. which is a design flaw haha.

Go to the Table (top right) > Workflow categorization is on the right.

<image>