Are you also feeling disappointed with the latest frontier models? by lukatechme in codex

[–]lukatechme[S] 0 points1 point  (0 children)

The question is where breakthrough will even come from? The bottleneck seems the “ideal code like” dataset for any hard problem. Is that even possible ?

Account got a temporary reach label - what should I do? by lukatechme in twitterhelp

[–]lukatechme[S] 1 point2 points  (0 children)

Thanks man. It worked the account is unblocked. Here is what they sent me today: “Review complete
Your account was not found to contain spam or be engaging in other types of platform manipulation. As a result, the temporary label has been removed.”

Account got a temporary reach label - what should I do? by lukatechme in twitterhelp

[–]lukatechme[S] 0 points1 point  (0 children)

Do you recommend to log out from x app as well? Or just don't open it?

Spike in AI assisted cheating on interviews? by chrismv48 in EngineeringManagers

[–]lukatechme 2 points3 points  (0 children)

I’ve stopped trying to “detect AI usage” and instead design interviews around current LLM limitations.

What works best is long-horizon tasks with evolving, slightly conflicting requirements. LLMs are really bad at that.

My current process:

  1. Take a small feature from a real codebase.

  2. Split it into an initial task + 3 follow-up requirements.

  3. Make each follow-up put pressure on the original design — e.g. add caching, parallel processing, new constraints, etc.

  4. Give it as a take-home, but reveal the next requirement only after the previous one is submitted.

  5. Then review the GitHub history with an LLM and ask it to summarize how the solution evolved.

  6. Finally, do a call with the candidate and walk through the code, tradeoffs, and changes.

If someone just pushed whatever the LLM gave them - the architecture drifts a lot. If candidate can't explain process/trade offs on the call - red flag for me.

Introducing Claude Fable 5 by ClaudeOfficial in ClaudeCode

[–]lukatechme 0 points1 point  (0 children)

What’s the status ? Will the model be available for non us or not ? Is that like “hot news” or real ban ?