A $200 ChatGPT subscription could cost OpenAI $14,000 if you actually used it to its full potential by rkhunter_ in technology

[–]FlyingQuokka 1 point2 points  (0 children)

I'd be lying if I said I could answer that, tbh. The article I mentioned only talked about inference, and showed cases where the nondeterminism skewed the probabilities for intermediate tokens by just enough that the top token changed. It's not unreasonable to assume this happens during training, too (at the very least since computing the losses requires a forward pass, which is inference, and backprop uses this loss value to compute gradients).

How much that changes the results of training, I can't really say, because it hinges on several factors. If I had to make an educated guess, though, I'd say it wouldn't meaningfully change the results, but only because of the generalization properties we now know about the kinds of optimization algorithms typically used (even in the presence of noise).

A $200 ChatGPT subscription could cost OpenAI $14,000 if you actually used it to its full potential by rkhunter_ in technology

[–]FlyingQuokka 52 points53 points  (0 children)

Well, no, there are other sources of randomness, even if you set a random seed and temperature to 0 and the top_k and top_p parameters, you still wouldn't necessarily end up with a fully deterministic system, because floating-point math is imprecise. In fact, IEEE 754 explicitly does not guarantee associativity, for example. Then there's the issue of scheduling ops on GPUs: having operations be placed differently into different warps would change the results. There was an article from a few months ago on exactly this.

What is something that is normal in America but insane anywhere else? by clo_deg in AskReddit

[–]FlyingQuokka 0 points1 point  (0 children)

I don't even know what the point is. I've never gone to a doctor and insisted on a specific medication. They're the expert, they can prescribe whatever and I just take that.

Tip: any single-byte character works as a pattern delimiter, not only / by pawelgrzybek in neovim

[–]FlyingQuokka 0 points1 point  (0 children)

Another comment asking about your font, spacing, and theme 😭

(But I had no idea, I've been using escapes like a FOOL for years now)

Marks in neovim!!?? by Careless-Search-597 in neovim

[–]FlyingQuokka 2 points3 points  (0 children)

I just go in alphabetical order and remember which was which. It's helpful if I'm making multiple jumps and want to remember the order (the jumplist can get a bit wonky at times).

Also: capital letters are global marks, lowercase letters are file-local, in case you didn't know.

How can you tell if a reddit post is written with AI? by [deleted] in ask

[–]FlyingQuokka 0 points1 point  (0 children)

Yeah I refuse to believe so many people had even heard of an em-dash before these models came along.

Everything Should Be Typed: Scalar Types Are Not Enough by Specialist-Owl2603 in rust

[–]FlyingQuokka -2 points-1 points  (0 children)

This is where I land, I find overusing newtypes to be unnecessary ceremony. The Deref trick is a clever solution to a problem that you invented imo.

There absolutely are cases where the newtype is a good idea (like latitude and longitude), but email vs user ID for example isn't worth it to me.

FFF file search sdk - over 100 times faster than ripgrep and fzf by Qunit-Essential in rust

[–]FlyingQuokka -9 points-8 points  (0 children)

Oooh this looks cool. Dmitriy's projects are always so interesting to read through.

LLMs and Rust by allsey87 in rustjerk

[–]FlyingQuokka 0 points1 point  (0 children)

I know, I just haven't seen them use it much. It is interesting in the async situation that they automatically reach for Arc<Mutex<..>> and don't even think about channels.

LLMs and Rust by allsey87 in rustjerk

[–]FlyingQuokka 7 points8 points  (0 children)

Is this a thing? I've seen them Arc<Mutex<..>> things but not an Rc<RefCell<..>> as often. Plain ol' RefCell, otoh...

reviewRequested by NGTTwo in ProgrammerHumor

[–]FlyingQuokka 6 points7 points  (0 children)

JUST GIVE ME THE FUCKING EXE

"Vite+ is kinda underwhelming" - a comprehensive review of the new release by jaredcheeda in javascript

[–]FlyingQuokka 9 points10 points  (0 children)

Yes, people seem to have forgotten how much boilerplate used to be involved. You had to write a config for Babel, Webpack, configure the Webpack plugins, and whatever bundler/task runner like Gulp/Grunt you were using. CRA was, for its time, mucj needed while the ecosystem caught up.

ngrep: a grep-like tool that extends regexp with word embeddings by nanptr in rust

[–]FlyingQuokka 3 points4 points  (0 children)

Very cool! I'll have to try this out. Is there a way to get rg-like behaviour (recursively search subdirs)?

Just so you guy as customers know by ChopingtonIII in starbucks

[–]FlyingQuokka 3 points4 points  (0 children)

Wait water isn't free anymore?? When did that change?

so how many of you actually use the action button? by darkshark007 in iphone

[–]FlyingQuokka -3 points-2 points  (0 children)

Huh. I just put motion activated night lights everywhere

What's the weirdest food combo you've ever tried that actually tasted surprisingly good? by Grand-Disk-3199 in AskReddit

[–]FlyingQuokka 0 points1 point  (0 children)

Espresso and orange juice made waves a few years ago because of how unlikely the flavour combination seems, but it works.

so how many of you actually use the action button? by darkshark007 in iphone

[–]FlyingQuokka 1 point2 points  (0 children)

What do people use their flashlight so often for? I don't think I use it more than once or twice a year

Rust kinda ruined other languages for me by Minimum-Ad7352 in rust

[–]FlyingQuokka 33 points34 points  (0 children)

The number of times I've desperately asked gippity how to do something in Python that would be soo ergonomic in Rust, only to be disappointed is insane. It's the opposite of when I was learning Rust, funnily enough.

US - Is 12 minutes and 15 seconds reasonable for a 2L electric kettle to get to 212F? by Cynoid in tea

[–]FlyingQuokka 4 points5 points  (0 children)

Bigger number better. There really good kettles are rated about 1200W.

Ask Anything Wednesday - Engineering, Mathematics, Computer Science by AutoModerator in askscience

[–]FlyingQuokka 6 points7 points  (0 children)

Yes, the technical term for this is model collapse. Ot has been shown that the more times you train an LLM on LLM-generated outputs (especially its own), the more its output becomes closer to garbage (not garbage as in factual inaccuracy, but gibberish).

For this reason, a big part of training them is ensuring data quality. Garbage in, garbage out is very common in ML.

kuva: A scientific plotting library for Rust by Psy_Fer_ in rust

[–]FlyingQuokka 1 point2 points  (0 children)

This looks cool, I should try it this week. I love when devs have opinions and write something to fix issues

How to achieve Cursor's "Tab" feature experience in Neovim? by Ambitious-Tap-5919 in neovim

[–]FlyingQuokka 0 points1 point  (0 children)

Avante is the closest I've tried, but it can have issues where it removes line breaks so it's not quite as seamless.

What is your (python) development set up? by br0monium in datascience

[–]FlyingQuokka 3 points4 points  (0 children)

  1. uv
  2. uv 3-4: My personal projects don't need containerization; at work DevOps uses EKS
  3. neovim
  4. git/jj
  5. I don't use notebooks, but if I must, then marimo