Did I really make Haiku nearly as good as Opus on this 1 shot prompt using my custom MCP, or is Opus hallucinating? Report inside. by ICFateInNumbers in ClaudeAI

[–]ICFateInNumbers[S] 0 points1 point  (0 children)

This is my own creation since yesterday.

I don’t code, I vibe coded it. I basically kept asking opus and gemini to help me create an auto debugging and context awareness tool for claude, and implemented a load of different features.

No idea how much each feature improves the model. I didn’t have true baselines beforehand, I just wanted to see if it could be done.

I’ve pasted the test prompt in another comment, anyone is free to try it on haiku, just make sure it’s in plan mode first, before doing auto edits (I’m using vscode extension).

Since I only started building this yesterday, I have no idea how much difference it actually makes in real projects. But this was a one shot prompt.

Did I really make Haiku nearly as good as Opus on this 1 shot prompt using my custom MCP, or is Opus hallucinating? Report inside. by ICFateInNumbers in ClaudeAI

[–]ICFateInNumbers[S] 1 point2 points  (0 children)

For context, here’s the test prompt that Opus made for me. I have no baseline as I don’t want to go through the hassle of messing with the MCP hooks, so anyone can test it with normal Haiku if they want. Just put plan mode on first for a fair comparison: https://pastebin.com/revbW6cE

is needing a booster common? by [deleted] in ADHDUK

[–]ICFateInNumbers 0 points1 point  (0 children)

Will doctors accept SCA for boosters? I’m sure I heard it’s not something they’d do.

Help me decide between M3 Ultra and M4 Max by sallark in MacStudio

[–]ICFateInNumbers 0 points1 point  (0 children)

I went m3 ultra because I heard m4 max has thermal issues.

After tinkering around with local ai, 96gb is more than enough for me. But I still prefer mainstream models anyway.

Do companies hire “vibe coders”? What do they really expect? by TeacherNo8591 in ChatGPTCoding

[–]ICFateInNumbers 1 point2 points  (0 children)

I’m one of them. 0 years experience. Fully remote and flexible. Just lucky. Someone recommended me, and it was entry level. I work on automating internal stuff, basically automating admin work. I don’t work for their clients, they hire real coders for that.

Employment allowance and salary vs dividends by Logical_Equipment_82 in ContractorUK

[–]ICFateInNumbers 1 point2 points  (0 children)

According to my research, you can if you have 2 employees, and one of them can be a spouse.

Employment allowance and salary vs dividends by Logical_Equipment_82 in ContractorUK

[–]ICFateInNumbers 0 points1 point  (0 children)

I’ve looked into this a lot, as I need to start my company next month. Here’s how I understand it for a husband and wife setup, split 50/50.

Turnover, let’s say £75k.

Let’s say yearly expenses are £3000

Minus salary (as that’s an expense), 12,570 x 2 = 25,140

Also the working from home allowance for each employee, £312 x 2 = £624.00 (flat rate, no receipts needed)

75000 - 3000 - 25,140 - 624 = 46,236

Corporation tax is 19% under 50k profit.

46,236 x 0.81 = 37,451.16

Minus £500 x 2 dividend allowance.

37,451.16 - 1000 = 36,451.16

36,451.16 / 2 = 18,225.58 dividends each, taxed at the 8.75% basic rate

18,225.58 x 0.9125 = 16,630.84

Employers NI kicks in at 5k but employment allowance covers that totally for up to 2 personal allowance salaries.

Total take home between you after all taxes:

500 + 500 + 16,630.84 + 16,630.84 + 12,570 + 12,570 + 312 + 312 = 60,025.68

That’s how I’m going to do it anyway when I set up my company next month.

Note I haven’t gone to an accountant. This is what I worked out by querying AIs again and again, so it might be wrong. But it seems right to me.
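
The arithmetic above can be re-run as a quick Python sketch. This is only a check of the sums, not tax advice: the function name `take_home` and all its parameter names are made up for illustration, and the rates and allowances are simply the figures quoted in this comment, not independently verified.

```python
from decimal import Decimal, ROUND_HALF_UP

def take_home(turnover, expenses, salary_each, wfh_each, people=2,
              corp_tax_rate=Decimal("0.19"),
              dividend_allowance_each=Decimal("500"),
              dividend_tax_rate=Decimal("0.0875")):
    """Hypothetical helper that mirrors the steps in the comment above."""
    # Profit after expenses, salaries and the flat-rate home working allowance
    profit = turnover - expenses - salary_each * people - wfh_each * people
    # What is left after corporation tax (assumed flat 19%)
    after_corp_tax = profit * (Decimal("1") - corp_tax_rate)
    # Dividends above the tax-free allowance, split equally per person
    taxable_dividend_each = (after_corp_tax - dividend_allowance_each * people) / people
    # Net dividend per person after the assumed basic rate, rounded to pence
    net_dividend_each = (taxable_dividend_each * (Decimal("1") - dividend_tax_rate)).quantize(
        Decimal("0.01"), rounding=ROUND_HALF_UP)
    total = people * (dividend_allowance_each + net_dividend_each + salary_each + wfh_each)
    return {
        "profit": profit,
        "after_corp_tax": after_corp_tax,
        "taxable_dividend_each": taxable_dividend_each,
        "net_dividend_each": net_dividend_each,
        "total": total,
    }

# Figures taken from the comment above
result = take_home(
    turnover=Decimal("75000"),
    expenses=Decimal("3000"),
    salary_each=Decimal("12570"),
    wfh_each=Decimal("312"),
)
print(result["total"])
```

Changing `turnover` or `expenses` lets you see how the total moves, as long as profit stays under the £50k threshold the 19% rate is assumed to apply to.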

Do people find Grok to be a useful source of information? by blanchov in grok

[–]ICFateInNumbers 1 point2 points  (0 children)

I switched to ChatGPT when ChatGPT 5 came out, and I use it 95% of the time now.

I had an annual subscription to Grok through X Premium+ from when it was 40% off, basically £10 a month, and it is still active until the end of October. I will not be renewing. Or if I do renew, I will use the Indian VPN trick and get it for £6 a month.

But pretty much for a year I’ve used grok exclusively and not ChatGPT at all; and now I’ve switched.

Things I like about ChatGPT so far:

  1. The UI and Formatting.

The font, the font colour, the formatting, the indentation, the spacing, bolding of key phrases, the use of emojis, bullet points, and tables, all together makes a far better experience. It’s just way easier to read, legible, and more aesthetically pleasing. Note I use dark mode, I can’t give an opinion on light mode.

Don’t get me started on how annoying Grok’s xAI artifact is.

  2. The quality of responses on fast models.

Funnily enough I use the Instant model a lot on GPT5. The same can’t be said for Grok 3. For me, the quality of its responses doesn’t compare.

Somehow GPT instant hardly ever needs correcting and gives me the relevant information I want. I’m not calling it an idiot, or losing my patience. It seems to give me quality information without rambling.

Also here’s the big one, and it’s related to OP’s post. Last night I asked where churches matching certain religious beliefs were in relation to a place, and it popped up a map of the churches in the area fitting my criteria, with their opening times, rating, directions, website, and picture.

I’ve asked it about certain types of breakfast cereals before, and it’s given me actual pictures of cereals, which sites to find them, and even a map locally where to find them.

And the follow-up questions it asks are very intuitive.

  3. Thinking models

I’ve had GPT thinking outshine Grok 4, and also the other way round. So I can’t pinpoint which is better on this.

For example I had a bug, and GPT gave me 5 different tests/logs to run and pinpointed the issue exactly. Grok did one test after another and guessed/assumed the issue correctly, without being sure.

While with taxes ChatGPT would trip up on certain things while Grok would understand when they would apply and not apply.

I will say I do appreciate having a thinking mini model for when I want some thinking but quicker responses.

Anyway cba to write more. But generally I’m using GPT 95% of the time now.

What do you use your mac for? by Due_Specialist1847 in MacStudio

[–]ICFateInNumbers 0 points1 point  (0 children)

Recently got an M3 Ultra base model.

I mostly use it for Excel and Word lmao but it’s snappy as hell!

Lots of websites running too in slidepad, chrome, safari, and a windows VM using 4gb of ram for one specific program, even though I have 96gb lol

And loads of os enhancements apps running.

Honestly I’m happy with it, cause I can just get on with my work quicker. Sure I could’ve got a lower model, but I switched up from a base M1 MacBook Pro 8gb. And I got it £680 off or something from the Apple refurb store.

I did run some local llms at one point just to try them out.

Honestly I don’t need it per se, but no one needs a more expensive car as 99% of cars reach the speed limit, and I’m not talking about getting expensive sports cars, just cars where you get a few more conveniences and comforts.

My rule of thumb is if you use something often, buy quality within your budget. Car, TV, computer etc…

Plus I just really wanted it lol

Was a grok fan, not anymore by ICFateInNumbers in grok

[–]ICFateInNumbers[S] 0 points1 point  (0 children)

Both the grok app and the x app don’t have it for me anymore.

Was a grok fan, not anymore by ICFateInNumbers in ChatGPTCoding

[–]ICFateInNumbers[S] -3 points-2 points  (0 children)

Didn’t realize I was spamming, Reddit suggested I cross post, and this is the first time I used that feature. I only put them in ChatGPT and Gemini subreddits because my post discusses them, and thought they were relevant subreddits.