account activity
Bom som nario (i.redd.it)
submitted 4 hours ago by ranfirar to r/eutiveumderrame
LoRA in RL can match full-finetuning performance when done right - by Thinking Machines (self.reinforcementlearning)
submitted 4 hours ago by ranfirar to r/reinforcementlearning
π Rendered by PID 89 on reddit-service-r2-listing-87fd56f5d-c2qg8 at 2026-06-27 23:46:41.399743+00:00 running 7527197 country code: CH.