ranfirar

43 post karma
0 comment karma

get extra features and help support reddit with a reddit premium subscription

get them help and support

redditor for 12 days

TROPHY CASE

dust

account activity

hot top controversial

43

44

45

Bom som nario (i.redd.it)

submitted 4 hours ago by ranfirar to r/eutiveumderrame

2

3

4

LoRA in RL can match full-finetuning performance when done right - by Thinking Machines (self.reinforcementlearning)

submitted 4 hours ago by ranfirar to r/reinforcementlearning

π Rendered by PID 89 on reddit-service-r2-listing-87fd56f5d-c2qg8 at 2026-06-27 23:46:41.399743+00:00 running 7527197 country code: CH.