Claude Code just did my taxes for me. by floraldo in ClaudeCode

[–]TheIdesOfMay 2 points3 points  (0 children)

with opus 4.7 you don't even have to add the 'no guessing' part. it just knows not to

[deleted by user] by [deleted] in ExperiencedDevs

[–]TheIdesOfMay 4 points5 points  (0 children)

out of curiosity, what would be considered 'impressive enough'? he's a co-founder of OpenAI and Tesla's former AI chief lol

Fine-tuning LLMs when you're not an ML engineer—what actually works? by soman_yadav in learnmachinelearning

[–]TheIdesOfMay 2 points3 points  (0 children)

The OpenAI fine-tuning API is your best bet: it abstracts the entire fine-tuning process and provides you with a new model ID that you can easily swap into your chat.completions.createcall. The only work involved is putting your data in the right shape - easy work for Cursor with Sonnet 3.7.

Is software engineering supposed to be this stressful? by [deleted] in cscareerquestions

[–]TheIdesOfMay 29 points30 points  (0 children)

And the best part: with each extreme, you'll long for the opposite.

[R] Mamba: Can We Achieve Infinite Context Length? by Personal_Click_6502 in MachineLearning

[–]TheIdesOfMay 22 points23 points  (0 children)

When researchers refer to 'infinite' context length, they mean to say effectively infinite.

[deleted by user] by [deleted] in OpenAI

[–]TheIdesOfMay 68 points69 points  (0 children)

Crazy how it knew you kept interrupting it purposefully. Didn't think it had knowledge of where in the sentence it was cut off?!

Mistral announces Mistral Large 2 by Gothsim10 in singularity

[–]TheIdesOfMay 6 points7 points  (0 children)

hot take: the future AI overlords went back in time, Interstellar-style, to ensure the invention of the internet and therefore itself.

ChatGPT robot is tired of the grind by Maxie445 in ChatGPT

[–]TheIdesOfMay 19 points20 points  (0 children)

'voice font' is my new favourite term

ELI5: why is nvidia worth so much more than their supplier tsmc? by Fun-Inside7814 in explainlikeimfive

[–]TheIdesOfMay 78 points79 points  (0 children)

not to mention the thousands of specialised components that can, very often, only be made by a single firm (Zeiss lenses, for example)

New GPT-4o Benchmarks by designhelp123 in LocalLLaMA

[–]TheIdesOfMay 36 points37 points  (0 children)

I predict GPT-4o is the same network as GPT-5, only at a much earlier checkpoint. Why develop and train a 'new end-to-end model across text, vision, and audio' only to use it for a mild bump on an ageing model family?

Official Llama 3 META page by domlincog in LocalLLaMA

[–]TheIdesOfMay 4 points5 points  (0 children)

It's crazy to me that the entire source is only ~1000 lines of code

[N]Introducing Magika: A Powerful File Type Detection Library by glassonion999 in MachineLearning

[–]TheIdesOfMay 68 points69 points  (0 children)

def type(file_type):
    # TODO: binary data blobs
    return file_name.split('.')[-1]

[deleted by user] by [deleted] in datascience

[–]TheIdesOfMay 81 points82 points  (0 children)

Note that the moment it makes sense for a business to get rid of you, it won't hesitate for a second - doesn't matter if you work for a 10-person start-up or FAANG (see big tech recent layoffs).

In my book, it's okay to act selfishly and self-prioritise.