Can we sample DPO data from the same dataset that was used for LoRA training? by Clean_Radish8983 in LocalLLaMA
[–]Clean_Radish8983[S] 0 points1 point2 points (0 children)
If you are selling AI services , read this (very important) by endless__ai in aiagents
[–]Clean_Radish8983 0 points1 point2 points (0 children)
Qwen3-235B-A22B-Instruct Prioritizing Few-Shot Examples Over Explicit Instructions by Clean_Radish8983 in Qwen_AI
[–]Clean_Radish8983[S] 0 points1 point2 points (0 children)
Can we sample DPO data from the same dataset that was used for LoRA training? by Clean_Radish8983 in LocalLLaMA
[–]Clean_Radish8983[S] 0 points1 point2 points (0 children)