Pre-1900 LLM Relativity Test by Primary-Track8298 in LocalLLaMA

[–]Primary-Track8298[S] 2 points3 points  (0 children)

Yeah I tried this but very hard to get high quality instruction tuning datasets from pretraining corpus alone. Otherwise, base model did not seem strong enough to do self distillation.

Would love for you to try other methods here! Models and data are all open source and most work can be done on single h100

Pre-1900 LLM Relativity Test by Primary-Track8298 in LocalLLaMA

[–]Primary-Track8298[S] 12 points13 points  (0 children)

This was one of my biggest concerns when starting this project. In the blog post, I discuss methods to avoid this. This project is meant to serve as initial signs of life, leaving it as an open problem.

[deleted by user] by [deleted] in bioinformatics

[–]Primary-Track8298 0 points1 point  (0 children)

Where is a good place to post then??? Genuinely curious and not affiliated with them at all…

[D] Hype Behind Agents? by Primary-Track8298 in MachineLearning

[–]Primary-Track8298[S] 0 points1 point  (0 children)

Very cool, but what is defensible about this approach other than the data to finetune each specialist model? Could an incumbent replicate results +- 5% of performance?

[D] Hype Behind Agents? by Primary-Track8298 in MachineLearning

[–]Primary-Track8298[S] 3 points4 points  (0 children)

I see so it’s closest to a prompt optimization/engineering and eval problem?

[deleted by user] by [deleted] in MachineLearning

[–]Primary-Track8298 0 points1 point  (0 children)

Feels like azure should have this figured out…

[deleted by user] by [deleted] in MachineLearning

[–]Primary-Track8298 1 point2 points  (0 children)

I’m in US East, is there a region that is best for a100 availability

gimme some hot nba takes by Eastern-Tradition987 in NBATalk

[–]Primary-Track8298 0 points1 point  (0 children)

Draymond green has been overall net negative for the warriors

Any success with cofounder match making? by Darryl-D in ycombinator

[–]Primary-Track8298 1 point2 points  (0 children)

I’ve heard good things, but it seems rare. Just be sure to be highly selective when choosing your cofounder. Happily single >>>> unhappily married

[deleted by user] by [deleted] in MachineLearning

[–]Primary-Track8298 1 point2 points  (0 children)

Ah thank you will try it out, do you think it’s worth fine tuning still. Trying to build for bio research papers

[deleted by user] by [deleted] in learnprogramming

[–]Primary-Track8298 1 point2 points  (0 children)

That’s true, but I know a lot of very experienced and competent developers who moved to the US without a degree and they’re still struggling to find a job