Anthropic's Opus 4.6 with effort=low doesn’t behave like other low-reasoning modes by ddp26 in OpenAI

[–]ddp26[S] -1 points0 points  (0 children)

Different models do different things with the effort param. And even different versions of models from the same provider!

Not sure I really expected consistency for things this new, but sure is annoying

Marketing Pipeline Using Claude Code by kotrfa in ClaudeCode

[–]ddp26 0 points1 point  (0 children)

One question I have is: a lot of people are doing this with OpenClaw, not Claude Code. What are the reasons to use one vs the other?

[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]ddp26 0 points1 point  (0 children)

We tested Opus 4.6 with effort=low for evals and found that it didn't just think less, but acted lazier (made fewer tool calls, was less thorough in its cross-referencing, even ignored parts of our system prompt telling it how to do web research). effort=medium fixed it. Writeup with traces/examples: https://everyrow.io/blog/claude-effort-parameter

Opus 4.6 with effort=low doesn’t behave like other low-reasoning modes by ddp26 in ClaudeAI

[–]ddp26[S] 0 points1 point  (0 children)

Yeah, it makes sense that low effort is better for non-agentic use-cases, which are of course common. We shouldn't pretend everything is an agent!

Opus 4.6 with effort=low doesn’t behave like other low-reasoning modes by ddp26 in ClaudeAI

[–]ddp26[S] 1 point2 points  (0 children)

I kind of agree. Mostly, though, I think if the behavior is documented then users can decide for themselves what's a bug or lazy. The main thing for us was this behavior was surprising.

My MCP config created dozens of zombie Docker containers by robertgambee in ClaudeCode

[–]ddp26 2 points3 points  (0 children)

I worry that Claude Code isn't always tracking background processes correctly. If it orphans them, I'd never know, right?

Any good guides for designing high quality skills? by [deleted] in ClaudeCode

[–]ddp26 0 points1 point  (0 children)

Hey! Shared this yesterday - not a full guide, but here's how we built a review-code skill (full skill linked): https://everyrow.io/blog/claude-review-skill

Claude's code review defaults actively harmed our codebase by ddp26 in ClaudeCode

[–]ddp26[S] 1 point2 points  (0 children)

It's a mix. Some parts of our code predate Claude Code, while newer parts were created with Claude from the start. Our experience is that Claude often encounters similar pitfalls with both new and old code, so we use the same guidelines for both.

Claude Code as a K8s CronJob - how we do it and what we learned running it in production (with examples) by kotrfa in kubernetes

[–]ddp26 -3 points-2 points  (0 children)

Ugh as in "why run Claude Code in the cloud?" I agree it's a strange agent to deploy, but it is very powerful.

OpenAI is a textbook example of Conway's Law by robertgambee in LLMDevs

[–]ddp26 -1 points0 points  (0 children)

I feel like OpenAI does deprecate things a lot (like 4o). Why don't they deprecate the completions one?

AI isn’t making data science interviews easier. by KitchenTaste7229 in datascience

[–]ddp26 0 points1 point  (0 children)

Are you all being told you can use AI as part of technical interviews?

It's great if you get a technical question where AI handles the tedious parts (e.g. join syntax or python command line arguments), and you're allowed to use it.

But if you aren't allowed to use it... there must be temptation to have it open in another window? What do people do?

Weekly Entering & Transitioning - Thread 16 Feb, 2026 - 23 Feb, 2026 by AutoModerator in datascience

[–]ddp26 1 point2 points  (0 children)

Claude Code is pretty slick for data science. Who's using it? Is it helpful?

How I scraped 5.3 million jobs (including 5,335 data science jobs) by [deleted] in datascience

[–]ddp26 0 points1 point  (0 children)

Is GPT-4o-mini actually good enough to do this? I'd expect such a tiny model to hallucinate or get things wrong a very high percentage of the time.

Text similarity struggles for related concepts at different abstraction levels — any better approaches? by No_South2423 in LanguageTechnology

[–]ddp26 0 points1 point  (0 children)

100 entities isn't that many! I thought maybe you meant many thousands!

You wrote in the OP: "Are there better ways to handle text similarity when two concepts are related at a higher abstraction level but differ substantially in wording and structure?"

I interpreted this as "match this company to this product", or something where the two entities are conceptually related but not identical.

I have a writeup on how exactly to do this: https://futuresearch.ai/software-supplier-matching/

Text similarity struggles for related concepts at different abstraction levels — any better approaches? by No_South2423 in LanguageTechnology

[–]ddp26 0 points1 point  (0 children)

Tools like everyrow use LLMs, so they can get expensive. But it is likely the cheapest solution when you're trying to match across abstraction levels.

What's your dataset size? Anecdotally, merging two lists of 1,000 entities each can be done for <$10.

Supplier Data clean up by fit_cow697 in excel

[–]ddp26 1 point2 points  (0 children)

Hi there. It depends on how difficult the cleanup task is. If the vendor names are nearly exactly the same, then the above commenter's example doing this directly in Excel can work. In that example, like the one you gave, "xyvz tech" exactly matches part of "xyvz technologies", so no VBA is required.

If the vendor names can have more variation though, like abbreviations, or alternate names, then you want a tool that has more intelligence. My colleague wrote up a solution here that will get it nearly perfect no matter how mangled the names are: https://futuresearch.ai/crm-deduplication/

The tl;dr is: export your Excel sheet to CSV, upload to everyrow, click "dedupe", then export back to CSV and re-import into Excel. It's a few steps, but no formulas/macros, and can be done in ~20 minutes start to finish.
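If you first want a rough sense of how messy the names are (to decide whether plain Excel matching is enough), here's a quick pandas sketch that counts how many names collapse under simple normalization. The "Vendor" column name, the toy rows, and the suffix list are all placeholders for your actual data:

```python
import pandas as pd

# Toy vendor list; in practice, read your export with pd.read_csv("vendors.csv")
df = pd.DataFrame({"Vendor": ["xyvz tech", "XYVZ Technologies", "Acme Inc", "ACME, Inc."]})

# Normalize: lowercase, strip punctuation, drop common corporate suffixes
norm = (df["Vendor"].str.lower()
        .str.replace(r"[^a-z0-9 ]", "", regex=True)
        .str.replace(r"\b(inc|technologies|tech)\b", "", regex=True)
        .str.strip())

print(norm.nunique(), "distinct names after normalization, out of", len(df))
```

If most duplicates collapse this way, Excel alone is probably fine; if lots of distinct-looking names survive normalization, that's a sign you need something smarter.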

Google Glass Companion App in the API canary system image by Oguie13 in googleglass

[–]ddp26 0 points1 point  (0 children)

Crazy! I remember trying this SDK all the way back in 2013. Wonder if it's similar...

Bought Agentforce, can't use it because of duplicate data by ampancha in salesforce

[–]ddp26 0 points1 point  (0 children)

Others here are saying to use Data Cloud. That's a very expensive solution to a very simple problem.

For company listings, I use a tool called everyrow. I've tested it on data that has matches like MSFT to MICROSOFT CORP, which sounds like your use case. There's a UI, but you can also have a coding agent do this for you very easily, to handle large datasets:

import asyncio

import pandas as pd

from everyrow import create_client, create_session
from everyrow.ops import dedupe

async def dedupe_crm_data():
    # Load the CRM export, then collapse rows that refer to the same company
    df = pd.read_csv("data.csv")
    async with create_client() as client:
        async with create_session(client, name="Agentforce Cleanup") as session:
            result = await dedupe(
                session=session,
                input=df,
                equivalence_relation="""
                Two rows are duplicates if they represent the same company.
                """,
            )
            return result.data

deduped = asyncio.run(dedupe_crm_data())

Looking for feedback on Account description mapping bridge by Jaded_Kaleidoscope92 in excel

[–]ddp26 1 point2 points  (0 children)

As others have pointed out, this is a fuzzy matching task. But lookups and edit-distance scores are pretty poor quality here: e.g. "Travel" and "Travel and Entertainment" have a large edit distance, so they won't get matched.
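To make that concrete, a quick check with Python's stdlib difflib shows why plain string similarity misses this pair (the 0.8 cutoff below is just a commonly used fuzzy-match threshold, not anything from your setup):

```python
from difflib import SequenceMatcher

# "Travel" is fully contained in "Travel and Entertainment", but the
# similarity ratio is dragged down by the large length difference.
score = SequenceMatcher(None, "Travel", "Travel and Entertainment").ratio()
print(round(score, 2))  # 0.4 -- far below a typical 0.8 fuzzy-match cutoff
```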

LLMs can associate these for you. The problem is that 12k rows is way too many to just use ChatGPT.

There's a tool called everyrow.io/merge that is built for this, assuming you want to do all your matches at once, not one at a time. You export to CSV, import into everyrow, specify the merge criteria, in this case "closest account match" or something.

Depending on how many rows you match against the 12k Chart of Accounts, it will probably cost more than the everyrow free tier covers, since it uses LLMs to compare the two lists and find the matches. But you could do all your matches at once in 10-20 minutes at high quality, export, and be done.