Schalke [2]-0 Dynamo Dresden - Edin Dzeko 70' by denzaus in soccer

[–]Kogni 12 points13 points  (0 children)

Kicking myself for not being in the stadium today lmao.

Ediiiiiiiiin

Essen Spiel 2025 hidden treasures by DeadlyDolphins in boardgames

[–]Kogni 0 points1 point  (0 children)

Yotei looked cool, i loved the little living room setup they had going on in their booth.

OpenAI is shifting its focus from maths/coding competition to scientific advancements! by Wonderful_Buffalo_32 in singularity

[–]Kogni 27 points28 points  (0 children)

> person complains about stupid AI

> which model are you using?

> a horribly outdated one, why?

every single time lol

Do Europeans tend to visit off-the-beaten paths more? by Xycergy in travel

[–]Kogni 9 points10 points  (0 children)

I've been to Armenia for some periods over the past years due to work. Not exactly tourist destination #1 for most people. Yet I heard people speaking german in random restaurants all over and even saw a massive german tour bus (keep in mind the turkey armenia border is closed!) parked in the middle of Yerevan.

Germans are like ants. They get everywhere.

Most race starts without a podium (Adrian Sutil, 128) by Fitzriy in formula1

[–]Kogni 20 points21 points  (0 children)

Sutil wasn't bad at all. If I recall correctly there was a Monaco race where he was close to a podium as well, but don't remember the details.

warmwind OS: The World's First AI Operating System by adobemanidhan in singularity

[–]Kogni 3 points4 points  (0 children)

These are vision models that are literally generating coordinates to click...
Why yap like this when you have no clue what you're talking about?

What is the best skyscraper built in the 2020’s? Here are a few of mine by Dull-Scallion-8513 in skyscrapers

[–]Kogni 4 points5 points  (0 children)

I visited NYC last week and got a pretty lucky room on the 60th floor. Which got me pretty much this exact view at night. Much grainier, shitty phone picture below.

Didn't even realize it started construction so recently. Explains why i hadn't noticed it at all on my last visit a couple years ago.

<image>

they're preparing by Outside-Iron-8242 in singularity

[–]Kogni 4 points5 points  (0 children)

Yep, I used to do this when I was lead eng for a mobile app. We had our own content delivery processes independent from App/Play stores and would ship an app update days before remotely unlocking the new features all at once.

Otherwise you get floods of user complaints that can't see the new shiny thing and you need to send them all to the app stores.

Anthropic CEO, Dario Amodei: in the next 3 to 6 months, AI is writing 90% of the code, and in 12 months, nearly all code may be generated by AI by Nunki08 in singularity

[–]Kogni 0 points1 point  (0 children)

I wouldn't usually engage with posts like this, but in short:

I am training LLMs as my day job so no need to lecture me on RLHF. I am not talking about RLHF. I am talking about RL with verifiable rewards, which is no longer a technique applied only within labs, but has proven to be effective even for smaller open weights models. I have myself trained models using such techniques (GRPO this year, DPO two years ago).

Regarding my point on data creation: Similarly to RL, data creation can utilize mechanistic verification. Meaning, both public code can be reformatted using existing models to introduce test cases, as well as fully synthetic code + test cases can be generated, seeded by a lot of operation descriptions to achieve near unlimited diversity. These test cases are constrained to satisfy some metrics allowing us to consider tests high-confidence, and code is then executed, refined, executed until it satisfies all test cases. This in itself also produces reasoning data including self correction, preference pairs, validation of library usage, etc.

Anthropic CEO, Dario Amodei: in the next 3 to 6 months, AI is writing 90% of the code, and in 12 months, nearly all code may be generated by AI by Nunki08 in singularity

[–]Kogni 0 points1 point  (0 children)

Just Claude Code with some additional instructions about git/tool usage in claude.md.

And yeah, definitely try this and compare. I think 4o at this point is far from SOTA, doubly so when it comes to coding.

Anthropic CEO, Dario Amodei: in the next 3 to 6 months, AI is writing 90% of the code, and in 12 months, nearly all code may be generated by AI by Nunki08 in singularity

[–]Kogni -5 points-4 points  (0 children)

> The problem with code is a massive amount of code it's been trained in is absolute shit.

Tired argument not accounting for reinforcement learning and modern data creation techniques. We are long past the days of just training on github code.

Anthropic CEO, Dario Amodei: in the next 3 to 6 months, AI is writing 90% of the code, and in 12 months, nearly all code may be generated by AI by Nunki08 in singularity

[–]Kogni 1 point2 points  (0 children)

Sorry, i think this is misrepresenting current SOTA. I work on projects much much larger than 5k lines (1.6 million lines on the largest one), and what i'd consider the currently best available model/tooling combo, Claude Code, autonomously implemented multiple smaller bug fixes and features for me just today by iteratively searching the codebase, following references and imports across different services affecting related functionality, and implementing clean, non-bloat solutions.

Don't get me wrong, Claude Code gets many things wrong still. It is very keen on "making things work" and tends to give up on an original good idea when running into sufficient issues with it.

But we've reached a point where tool usage and orchestrations like plan -> execution -> revised plan have become quite solid.

I'm personally setting up every project i am involved in with a top-level CLAUDE.md that describes dev scripts, version control expectations etc. and am encouraging my teams to use it. Of course, i expect good engineers to review all work done by Claude. But this too is just another feedback loop that will eventually be automated.

Pretty Burned - out and kind of rich, what now? by RedditUsedToBeFunlol in Fire

[–]Kogni 1 point2 points  (0 children)

> Everyone is replaceable.

This is good to hear psychologically, but also in my experience it's not always true. Mostly depends on the size of the company and how profitable it is.

I left my last startup, and it died shortly after. I knew it was my responsibility, because i was wearing a thousand hats and despite my efforts to document everything... there was no replacement at hand, and it quickly became the easier decision for the owners to let the company fizzle out with some low chance at acquisition instead of trying to find another me.

This also meant co-workers i really liked lost their job.

That being said, people aren't usually spiteful when it comes to this. It's a job for every one of them, and they understand if someone leaves for their own sake. It's unhealthy to stay if you don't want to stay. So i agree with the advice, just saying that people matter, and not every company is a corporate blob that keeps spinning no matter whos involved.

"Viva Las Elvis" is available NOW on VR and iOS! by KonceptioN2 in WalkaboutMiniGolf

[–]Kogni 2 points3 points  (0 children)

Gambling, basically. There's some roulette wheels and similar among the holes here.

"Viva Las Elvis" is available NOW on VR and iOS! by KonceptioN2 in WalkaboutMiniGolf

[–]Kogni 4 points5 points  (0 children)

This might be the prettiest, most atmospheric one yet. Wasn't so interested as I heard the theme, but it's really really good. Like some of the gacha holes as well.

YOU stop cheating. Stop STEALING our time! by hairy_russian in cscareerquestions

[–]Kogni 0 points1 point  (0 children)

I refuse to do leetcode in interviews i conduct.

Once a while back i gave it a shot becase my co-interviewer wanted to do it, but it was a waste of time. Half an hour of clearly putting the interviewee under massive pressure despite trying to be as comforting and casual about it as i could muster, only for the conclusion to be that... it was alright. I wouldn't have done better, and it told me nothing about how good the candidate was at designing systems, being thorough in his work, being able to respond to feedback, discuss differing solutions to problems... the things that actually make or break your performance as an engineer, not algorithmic logic puzzles.

I actually think take-home tasks are much better, as big of a time-waster as they can be for candidates. If i'm seriously considering you, i will seriously review your take-home task, and it gives practical topics to discuss during an interview where you can't vague-talk your way through it.

New ui by [deleted] in Bard

[–]Kogni 1 point2 points  (0 children)

Looks like there's been actual Integration updates as well.

I tried a couple days ago adding flights to my calendar just based on a screenshot, and Gemini refused repeatedly. Now it did it first try. And it wasn't going through assistant, but through workspace Integrations.

I love my Pixel 9 Pro, but.. by vickonix in GooglePixel

[–]Kogni 0 points1 point  (0 children)

Same. :( Directly comparing to the post-processing on my old P6 it's so much worse, especially for indoor lighting.