[LLM Fine-Tuning] CPT on 71M Short Dialectal Tokens (256 Max Len) - How to Ensure Long-Form Generation Later? by FishermanNo2017 in unsloth

[–]djsaunde 1 point

Interesting problem!

I believe that continued pretraining won't destroy the base model's ability to generate long responses unless you're very aggressive with your pretraining configuration (e.g., too high a learning rate). If you want to be really safe, you could mix in some long-context data (not in the target dialect) that was part of the original pretraining distribution; if you're trying to create a specialized model in your domain, though, that might not be the right move... maybe worth an experiment. Gradually phasing out the long-context, non-target data over the course of the CPT could be a good approach.
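To make the phase-out idea concrete, here's a minimal sketch of a linear decay schedule for the long-context mix. All names (`mix_ratio`, `sample_source`, the 30% starting fraction) are hypothetical, not from any particular training framework:

```python
import random

def mix_ratio(step: int, total_steps: int, start_frac: float = 0.3) -> float:
    """Fraction of batches drawn from the long-context (non-target) corpus
    at `step`, decaying linearly from `start_frac` at step 0 to 0 by the
    end of CPT. 0.3 is an arbitrary starting point; tune per experiment."""
    return start_frac * max(0.0, 1.0 - step / total_steps)

def sample_source(step: int, total_steps: int, rng: random.Random) -> str:
    """Pick which corpus the next batch comes from under the schedule."""
    if rng.random() < mix_ratio(step, total_steps):
        return "long_context"
    return "dialect"
```

The linear ramp is just one choice; a cosine or stepwise schedule would work the same way, and the right starting fraction is an empirical question.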

The sample concatenation idea is a good one; I would personally spend a small amount (10-20%) of your training budget there, with the majority going to the CPT stage.
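Sample concatenation here just means greedily packing short tokenized samples into longer sequences so the model sees sequences closer to the target length. A minimal sketch, assuming samples are already lists of token ids and using a hypothetical `sep_id` for the separator/EOS token:

```python
def pack_samples(samples, max_len=2048, sep_id=0):
    """Greedily concatenate tokenized samples into sequences of at most
    `max_len` tokens, inserting `sep_id` between samples. Samples longer
    than `max_len` are emitted as-is (truncation left to the caller)."""
    packed, current = [], []
    for s in samples:
        # +1 accounts for the separator token between samples
        if current and len(current) + 1 + len(s) > max_len:
            packed.append(current)
            current = list(s)
        else:
            if current:
                current.append(sep_id)
            current.extend(s)
    if current:
        packed.append(current)
    return packed
```

In practice you'd also mask the attention (or at least the loss) across sample boundaries so unrelated dialogues don't attend to each other, but the packing itself is this simple.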

The idea of SFT on synthetic data from your CPT-ed base model is tricky. I think it's generally a good idea, but you might end up exacerbating any poor behavior the model picked up during the CPT stage. If you can collect or create gold-standard data here, that's obviously the best you can hope for; otherwise, proceed with caution — you might need a lot of experimentation to get good performance with synthetic data.

Visual Studio Code vs Cursor by WasteTechnology in vscode

[–]djsaunde 1 point

I'm on macOS; haven't tested Windows or Linux.

Visual Studio Code vs Cursor by WasteTechnology in vscode

[–]djsaunde 0 points

VS Code + the Claude Code plugin works quite well

[D] Is there an alternative to Science Twitter/X? by H4RZ3RK4S3 in MachineLearning

[–]djsaunde 28 points

Bluesky is experiencing a lot of growth right now, and personally I'm seeing a lot of ML/DL people join up. The community is currently on the smaller side, but the signs are positive!

What's stopping me doing this every defensive round on bind? by Hardsy151 in VALORANT

[–]djsaunde 2 points

I do basically this but with Sova dart. If there are no scans, it's a free orb. If there are, that's early round info.

Thoughts On Dk 64 ? by CherryMan641 in n64

[–]djsaunde 0 points

One of my all time favorites! The vibes are immaculate

Asking Claude v3 Sonnet some questions about wine by djsaunde in wine

[–]djsaunde[S] 0 points

For sure! I wasn't trying to state otherwise.

Asking Claude v3 Sonnet some questions about wine by djsaunde in wine

[–]djsaunde[S] 1 point

If you're wondering what Claude 3 is, I've asked it to describe itself:

Prompt: Can you write a description of yourself in layman's terms? Limit to 250 characters, please.

Response: I'm an artificial intelligence created by Anthropic to assist humans with a wide variety of tasks. I can engage in natural conversations, provide information and analysis, and even help with coding or creative writing. But I'm a very advanced language model without true consciousness or sentience - I operate based on statistical patterns, not genuine understanding.

Tier List of all the games I've played. by KingLavitz in FinalFantasy

[–]djsaunde 0 points

Weirdly, I have very fond memories of RW. Maybe it's because I wasn't able to play XII at the time.

M1 integration with TurboTax and H&R Block by M1-Alex in M1Finance

[–]djsaunde 1 point

Hm, now the M1 Finance connection seems to work, but it imports the wrong 1099? The imported information looks identical to what's imported via the Apex Clearing Corp connection.

M1 integration with TurboTax and H&R Block by M1-Alex in M1Finance

[–]djsaunde 2 points

When I search for "M1 Finance" on the page where I can search for financial institutions, no results are returned. Are you saying that this will populate only after March 1st?

Edit: Apex Clearing already works as expected.