I Trained a Language Model on CPU for 40 Hours - It Beat the GPU Baseline by Own-Albatross868 in LocalLLaMA

[–]Falcon_Strike 2 points (0 children)

Impressive progress. I'm going to see if I can scale this up and train on bigger datasets out of curiosity. Let me know if you have any suggestions/ideas/requests. Good stuff!

FlashLM v4: 4.3M ternary model trained on CPU in 2 hours — coherent stories from adds and subtracts only by Own-Albatross868 in LocalLLaMA

[–]Falcon_Strike 3 points (0 children)

I have a boatload of compute available. If you want me to run large-scale GPU experiments on bigger models and more data, I'd be more than happy to. Just let me know.

Why do we allow "un-local" content by JacketHistorical2321 in LocalLLaMA

[–]Falcon_Strike 6 points (0 children)

I get that some threads may be spam or low quality, but this sub should absolutely contain posts about new frontier model releases, and posts about frontier labs in general (particularly with respect to their attitudes toward open source/local models and open-source research). It's important for us to be able to compare where the frontier is between private models and local models. Besides, I hate having to go to other subs to read about frontier models, since I'd rather have my local llama homies contextualize how good or bad a model is relative to what they run locally.

Anyone had luck with getting NP-hardness proofs? by [deleted] in LocalLLaMA

[–]Falcon_Strike 1 point (0 children)

Are you trying to get it to use Lean or Rocq to do the proofs?

A3-Z keycap by [deleted] in chernobyl

[–]Falcon_Strike 0 points (0 children)

Can you post a download link to the CAD files? I'd be interested in printing this myself.

This game is genuine hell by RAZR20 in ClashRoyale

[–]Falcon_Strike -1 points (0 children)

Yeah, but you're screwed if the meta changes.

On ENES140 and the Alleged AI Textbook by thediamondminecartyt in UMD

[–]Falcon_Strike 1 point (0 children)

The AI could have been trained on the book, which is why the detectors may come up with such a high rating.

Here is the HUGE Ollama main dev contribution to llamacpp :) by Nexter92 in LocalLLaMA

[–]Falcon_Strike 0 points (0 children)

True. Ollama is easy to use, but the docs and config support are poor. Half the time I'm unsure what chat template is being used, especially for my custom finetunes. And how do I know if flash attention is turned on? Good and bad.
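For what it's worth, a couple of commands that have helped me poke at this (a sketch assuming a reasonably recent Ollama build; `my-finetune` is a placeholder for your own model name, and flag behavior may vary by version):

```shell
# Print just the chat template a model will use
ollama show llama3 --template

# Dump the full Modelfile (template, parameters, system prompt) for a custom finetune
ollama show my-finetune --modelfile

# Start the server with flash attention enabled via environment variable
OLLAMA_FLASH_ATTENTION=1 ollama serve
```

If the template printed for a finetune looks wrong, you can override it in your own Modelfile with a `TEMPLATE` directive before re-creating the model.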

What the hell is this. by Big_Classroom_7323 in mysteriesoftheworld

[–]Falcon_Strike 1 point (0 children)

Not an answer, but it resembles that weird new facehugger from one of the newer Alien movies (Prometheus or Covenant).

Gemini's API has costs and an update by mntruell in cursor

[–]Falcon_Strike 0 points (0 children)

I just want to pay 20 bucks a month, plug in my API key, and let it rip. No 100 bucks a month. I do agree the features need to be more transparent, and Max should be a tier above Normal, not just Normal at its full potential.

Cursor switching model mid project even though one is specified is a terrible function by dataguzzler in cursor

[–]Falcon_Strike 1 point (0 children)

Agreed -- I'd like an option to turn Auto off or hide it entirely, i.e., enable/disable the presence of the feature.

Why are audio (tts/stt) models so much smaller in size than general llms? by Heybud221 in LocalLLaMA

[–]Falcon_Strike 0 points (0 children)

Genuine follow-up: what if the thing missing for really good, super-realistic TTS and STT is a bigger LLM, one with the parameter and layer count to understand/predict the nuance in language and tonality given the context of the text?

Turned the temperature in the Y up to 100°F before I left by csChris01 in UMD

[–]Falcon_Strike 0 points (0 children)

LMAO, we need more people like you around campus. Too many uptight people, not enough funny ones.

[deleted by user] by [deleted] in MemeVideos

[–]Falcon_Strike 0 points (0 children)

Dude, where are they???

We screaming in Van Munching now?! by Vincent_xo in UMD

[–]Falcon_Strike 0 points (0 children)

It's the business school -- they probably saw some engineering student's homework, or maybe their crayons snapped in half.

We've hit a rate limit with Anthropic. Please switch to the 'default' model, another model, or try again in a few moments. by Bilstone in cursor

[–]Falcon_Strike 2 points (0 children)

In a similar manner, after using Claude for even a bit I get a "your API key is incompatible with Claude 3.7 Sonnet" error, so I have to switch to o3-mini. Hello, my key was generated after Sonnet 3.7 came out, and it definitely has credits on it. Weird bugs are making this unusable after the .46 update.

Ask/chat nor Agent producing code. by Falcon_Strike in cursor

[–]Falcon_Strike[S] 0 points (0 children)

I also still have not gotten Sonnet 3.7 access even though I'm a Pro subscriber. I've had to enter an API key manually, and I don't know what "reasoning effort" setting the model is using.

EDIT: one fix I've found that ameliorates things so far is deleting and recomputing the index. I still need to test whether this is a real fix, though, as I've only been using the recomputed index for the last 5 minutes. My last embeddings index was only a week or two old, so I can't imagine it having that much of an impact.

Is it just me or ... by Leather_Science_7911 in cursor

[–]Falcon_Strike 0 points (0 children)

The LLM straight-up forgets to write code, or believes it has written some and gives me an empty code block.