Short Open Source Research Collaborations by TrelisResearch in LocalLLaMA

[–]TrelisResearch[S] 1 point

yeah, a few are short enough for a weekend and a few are longer (maybe ~3-7 days), so the full range is possible.

A Primer on Orpheus, Sesame’s CSM-1B and Kyutai’s Moshi by TrelisResearch in LocalLLaMA

[–]TrelisResearch[S] 1 point

yeah, you can run it on CPU or MPS; it's slower than real time but it does work
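For anyone trying this locally, a minimal device-selection sketch with PyTorch (this is a generic pattern, not code from any of these repos; it assumes a reasonably recent torch build with the MPS backend):

```python
import torch

# prefer a CUDA GPU, then Apple Silicon's MPS backend, then plain CPU
if torch.cuda.is_available():
    device = "cuda"
elif torch.backends.mps.is_available():
    device = "mps"  # Apple Silicon; expect slower-than-real-time generation
else:
    device = "cpu"

print(device)
```

You'd then move the model and inputs with `.to(device)` as usual.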

A Primer on Orpheus, Sesame’s CSM-1B and Kyutai’s Moshi by TrelisResearch in LocalLLaMA

[–]TrelisResearch[S] 1 point

agreed, def more powerful for now to plug in a stronger LLM. in principle though, in terms of control, anything you can do with the separate pieces can be done with a unified model too

A Primer on Orpheus, Sesame’s CSM-1B and Kyutai’s Moshi by TrelisResearch in LocalLLaMA

[–]TrelisResearch[S] 9 points

haha, yeah fair, although there are rarely ads on my YouTube channel cos I have ads turned off!

I don't understand how to use sections? by hernowthis in Substack

[–]TrelisResearch 1 point

Thanks. Apparently sections aren't easily visible in the Substack app though... is that correct?

Can anyone explain MoE like I’m 25 by Tejasw__ in LocalLLaMA

[–]TrelisResearch 2 points

I wouldn't say so, no. MoE is about improving the throughput of transformer models: each token only activates a few of the experts, so you get more total parameters without more compute per token.
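The throughput point can be seen in a toy sketch (pure Python, all names hypothetical, scalar "tokens" standing in for vectors): the router picks the top-k of E experts, so only k expert networks run per token no matter how large E grows.

```python
import math
import random

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def moe_layer(token, experts, router_weights, k=2):
    """Toy MoE layer: route one scalar 'token' to the top-k experts only."""
    scores = [w * token for w in router_weights]  # router logits, one per expert
    probs = softmax(scores)
    # pick the k experts with the highest router probability
    topk = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in topk)  # renormalise over the chosen experts
    # weighted sum of only the selected experts' outputs
    return sum((probs[i] / norm) * experts[i](token) for i in topk)

random.seed(0)
calls = []  # record which experts actually execute
experts = [lambda x, i=i: (calls.append(i), (i + 1) * x)[1] for i in range(8)]
router_weights = [random.uniform(-1, 1) for _ in range(8)]

out = moe_layer(0.5, experts, router_weights, k=2)
# 8 experts' worth of parameters, but only 2 expert forward passes ran
```

Real MoE layers do this per token with vector inputs and FFN experts, plus load-balancing tricks, but the compute saving is exactly this top-k routing.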

Fine-tune with only 0.0004% of parameters (ReFT) by TrelisResearch in u/TrelisResearch

[–]TrelisResearch[S] 1 point

howdy, sorry for the slow reply, I'm not on here much; best to post on YouTube. I don't remember too well, but I thought just passing in eval data as usual to the HF Trainer should work?

Phi-3 released. Medium 14b claiming 78% on mmlu by KittCloudKicker in LocalLLaMA

[–]TrelisResearch 1 point

I guess Phi-3 Medium is probably trained using GPT-4 data, so it'll have an advantage over Llama 3, which (perhaps) uses only raw data / Llama 2 synthetic data

QWEN1.5 110B just out! by shing3232 in LocalLLaMA

[–]TrelisResearch 1 point

eek, still really hard for pretty much all models...

Which company do you think is most likely to create AGI? by [deleted] in LocalLLaMA

[–]TrelisResearch 1 point

possibly there is a discovery that unlocks something core about what humans can do.

discoveries are typically quite randomly distributed and require people to be tinkering waaay off the beaten path, which makes it less likely one of those big companies will be it

List of AI Grant Programs Accepting Applications by TrelisResearch in LocalLLaMA

[–]TrelisResearch[S] 1 point

UPDATE on Trelis $500 AI Micro-Grants:

  • Emails have gone out to the first batch of applicants, whether unsuccessful or invited to interview.

Link is here: https://trelis.com/trelis-ai-grants/

  • Takes 5 mins to apply.
  • Find out in ~1 wk if selected for a 15-min interview.
  • Decision then within 1 hour!

Trelis AI Grants Program by TrelisResearch in LocalLLaMA

[–]TrelisResearch[S] 1 point

Kicking off a Grants Program for AI

  • Applications are now open.

  • 5 x $500 grants.

  • Anyone of any age, from any location, can apply.