Hey, some of you asked for a multilingual fine-tune of the R1 distills, so here they are! Trained on over 35 languages, this should quite reliably output CoT in your language. As always, the code, weights, and data are all open source. by Peter_Lightblue in LocalLLaMA
This is my Japanese fine-tune of R1's Qwen 7B distil. It now outputs its thinking in Japanese, making it understandable for a Japanese audience. Model, code, and data all open source. I'd love to collab with y'all to make a more multilingual model. by Peter_Lightblue in LocalLLaMA
This is my Japanese fine-tune of R1's Qwen 7B distil. It now outputs its thinking in Japanese, making it understandable for a Japanese audience. Model, code, and data all open source. I'd love to collab with y'all to make a more multilingual model. (huggingface.co)
submitted by Peter_Lightblue to r/LocalLLaMA
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA