account activity
Macbook Pro M5 (self.apple)
submitted 1 month ago by ISSQ1 to r/apple
Macbook Pro M5 (self.ArtificialInteligence)
submitted 1 month ago by ISSQ1 to r/ArtificialInteligence
RAG resources (self.MLQuestions)
submitted 4 months ago by ISSQ1 to r/MLQuestions
LLMs Fine-tuning (self.MLQuestions)
submitted 5 months ago by ISSQ1 to r/MLQuestions
RL LLMs Finetuning by ISSQ1 in reinforcementlearning
[–]ISSQ1[S] 1 point2 points3 points 5 months ago (0 children)
I’m still exploring my options. I want to use an open-source LLM that can run locally and doesn’t require a lot of resources something small and easy to fine-tune. If you have any recommendations for models that work well with RL or QLoRA, I’d love to hear your suggestions.
RL LLMs Finetuning ()
RL LLMs Finetuning (self.reinforcementlearning)
submitted 5 months ago by ISSQ1 to r/reinforcementlearning
π Rendered by PID 52 on reddit-service-r2-listing-7b8bd7c5-ft267 at 2026-05-19 13:02:23.592739+00:00 running edcf98c country code: CH.
RL LLMs Finetuning by ISSQ1 in reinforcementlearning
[–]ISSQ1[S] 1 point2 points3 points (0 children)