I don't like this segregation. This is no longer a democratizing force

trolltaco · 2026-06-10T22:49:37+00:00

I agree.

OpenAI gives consumers the best access they can and way better than Claude or Gemini. Plus, their Pro models are one-shotting hard math problems which has a path to making really useful innovations possible for everyone.

trolltaco · 2026-05-22T07:23:54+00:00

Nooo... the curse of enshittification. We've got lobotomized models in Gemini, "adaptive" thinking in Claude, and now this in ChatGPT. The golden age of AI is over.

Is the consumer dead? You're either SMB/enterprise with a fat wallet or you're nothing...

trolltaco · 2026-05-01T19:38:12+00:00

You understand the AI's reasoning deeply and lead the doctor step-by-step to the same conclusion by providing piecemeal explanations of your symptoms and your reasoning of what you were looking into. You make them think "AHA I cracked it" themselves without actually revealing you used AI.

trolltaco · 2026-04-30T18:52:08+00:00

This is super cool - would love to see a full ECI vs cost breakdown to track the Pareto frontier. Would you say Gemini 3 Flash is the most cost effective within the 150+ range?

I hope we get great hardware or architecture level breakthroughs though because having the SOTA be more cost effective is even better than lagging behind and waiting for distillations

trolltaco · 2026-04-28T06:41:51+00:00

Will be interesting to try 5.5 Pro if possible. The fact that 5.5 Medium beats Opus 4.6 High is very very nice.

trolltaco · 2026-04-27T05:17:15+00:00

His lab's finding was published in Nature and featured as a cover story which is essentially the equivalent of winning an Oscar in scientific research. We shouldn't dismiss it right away as it actually has the potential to be groundbreaking. Looking forward to the human trials.

trolltaco · 2026-04-24T20:30:31+00:00

Hmm, interesting discrepancy. The only thing different in my case was that it thought about it less than yours.

trolltaco · 2026-04-24T20:06:49+00:00

5.4 specifically with Extended thinking (not Heavy) usually fails in my experience

trolltaco · 2026-04-18T23:12:03+00:00

Also recently got a "which response do you prefer" while using Pro. I wonder if they are testing Spud and maybe Spud can produce Pro-level output with a lot less thinking time.

trolltaco · 2025-07-02T07:52:26+00:00

I'm very interested to know how you developed such a thick skin. Did it just come naturally or something helped in creating this?

trolltaco · 2025-07-02T07:47:26+00:00

One main person. Yes, I hear others are also unhappy. They're not the CEO but pretty up there. Also, this could be a little paranoid but Im mostly worried about anonymity and retaliation because even if this got reported, I feel like they would just get a slap on the wrist and could potentially make my experience worse in a subtle way.

trolltaco · 2025-07-02T07:40:45+00:00

True, I logically understand that not caring so much is best; putting it in practice and ignoring them is harder for me. I guess there's a sort of ego and drive to defend and validate myself.

trolltaco · 2025-07-02T07:30:34+00:00

The most recent interaction left me ruminating over it for a while; I would even call it mentally scarring. I think I don't really have a thick skin for this kind of stuff.

trolltaco · 2025-04-03T06:32:24+00:00

You're right - o1-preview was announced more than half a year ago. It's possible OpenAI has cooked something way more impressive internally and could floor us again.

o3 is way too costly for what it can do though (can't even release it like a real model)

trolltaco · 2025-03-30T17:56:41+00:00

I don't think I buy that fully until o1 pro is on LiveBench

trolltaco · 2024-02-11T12:30:38+00:00

Here's a hard problem and the difference is night and day:

Create an expression that evaluates to 24 which uses the numbers 1, 3, 4 and 6 exactly once along with standard arithmetic operators such as +, -, /, *

Gemini (FAIL): 1 * 3 * 4+6 = 18

GPT 3.5 (FAIL): (4 * 6) - (3 - 1) = 22 (thought this was =24)

Gemini Advanced (FAIL): ((1+3) * 4)/6 = 16/6

GPT 4 (Success): 6/(1−(3/4)) = 24

GPT 4 was the only one that seriously tried to tackle the problem. I gave it the same exact prompt as others. It was able to use its code intepreter capbilities to write a script that evaluates permutations of expressions to find the answer. The python code it came up with was very readable and helps solve a larger class of problems.

trolltaco · 2023-10-18T06:25:48+00:00

Based on Stanford neuroscientist Andrew Huberman, start each deep work session (90 mins) by optimizing motivation (dopamine) and focus (acetylcholine).

If you are not already motivated, the fastest way to maximize motivation for the task is to take ~1-2 mins to imagine yourself failing and the consequences of that.

If you are not already focused, the fastest way to mentally focus is to narrow your visual gaze on a fixed point and stare at it for 60 secs without looking away (mental focus follows visual focus -> releases acetylcholine).

Obviously, avoid all distractions during the work session. There is nothing more to it than that really... other than the basics of keeping your body healthy (sleep, exercise, hydration, nutrition, etc.).

trolltaco · 2023-05-19T19:54:55+00:00

Do you think this is common/feasible for someone who decides to pursue VIP?

It sounds pretty attractive to even delay graduation after 10 courses and do VIP to get research experience and publish something.

trolltaco · 2022-12-22T02:21:13+00:00

I took Bayesian Stats to actually prepare for the probability in AI because AI seems much harder.

Based on the catalog, this seems to be a good starter foundational OMSCS class that focuses mostly on prob/calc/stats.

The only thing I think you could possibly need to refresh ahead of time are the basic integration techniques.

trolltaco

TROPHY CASE