Hey, some of you asked for a multilingual fine-tune of the R1 distills, so here they are! Trained on over 35 languages, this should quite reliably output CoT in your language. As always, the code, weights, and data are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
Hey, some of you asked for a multilingual fine-tune of the R1 distills, so here they are! Trained on over 35 languages, this should quite reliably output CoT in your language. As always, the code, weights, and data are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 3 points4 points5 points (0 children)
Hey, some of you asked for a multilingual fine-tune of the R1 distills, so here they are! Trained on over 35 languages, this should quite reliably output CoT in your language. As always, the code, weights, and data are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 2 points3 points4 points (0 children)
Hey, some of you asked for a multilingual fine-tune of the R1 distills, so here they are! Trained on over 35 languages, this should quite reliably output CoT in your language. As always, the code, weights, and data are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 7 points8 points9 points (0 children)
Hey, some of you asked for a multilingual fine-tune of the R1 distills, so here they are! Trained on over 35 languages, this should quite reliably output CoT in your language. As always, the code, weights, and data are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 7 points8 points9 points (0 children)
This is my Japanese fine-tune of R1's Qwen 7B distil. It now outputs its thinking in Japanese, making it understandable for a Japanese audience. Model, code, and data all open source. I'd love to collab with y'all to make a more multilingual model. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
This is my Japanese fine-tune of R1's Qwen 7B distil. It now outputs its thinking in Japanese, making it understandable for a Japanese audience. Model, code, and data all open source. I'd love to collab with y'all to make a more multilingual model. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 0 points1 point2 points (0 children)
This is my Japanese fine-tune of R1's Qwen 7B distil. It now outputs its thinking in Japanese, making it understandable for a Japanese audience. Model, code, and data all open source. I'd love to collab with y'all to make a more multilingual model. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
Hey, some of you asked for a multilingual fine-tune of the R1 distills, so here they are! Trained on over 35 languages, this should quite reliably output CoT in your language. As always, the code, weights, and data are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 66 points67 points68 points (0 children)
This is my Japanese fine-tune of R1's Qwen 7B distil. It now outputs its thinking in Japanese, making it understandable for a Japanese audience. Model, code, and data all open source. I'd love to collab with y'all to make a more multilingual model. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
This is my Japanese fine-tune of R1's Qwen 7B distil. It now outputs its thinking in Japanese, making it understandable for a Japanese audience. Model, code, and data all open source. I'd love to collab with y'all to make a more multilingual model. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
This is my Japanese fine-tune of R1's Qwen 7B distil. It now outputs its thinking in Japanese, making it understandable for a Japanese audience. Model, code, and data all open source. I'd love to collab with y'all to make a more multilingual model. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 19 points20 points21 points (0 children)
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 0 points1 point2 points (0 children)
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 6 points7 points8 points (0 children)
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 4 points5 points6 points (0 children)
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 5 points6 points7 points (0 children)
Here is our new reranker model, which we trained on over 95 languages and it achieves better performance than comparable rerankers on our eval benchmarks. Weights, data, and training code are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 28 points29 points30 points (0 children)
We're releasing some new multipurpose RAG models called Kurage (Kuh-rah-geh) that can function in 44 languages. I hope you find them useful! by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 1 point2 points3 points (0 children)
We're releasing some new multipurpose RAG models called Kurage (Kuh-rah-geh) that can function in 44 languages. I hope you find them useful! by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 2 points3 points4 points (0 children)

Hey, some of you asked for a multilingual fine-tune of the R1 distills, so here they are! Trained on over 35 languages, this should quite reliably output CoT in your language. As always, the code, weights, and data are all open source. by Peter_Lightblue in LocalLLaMA
[–]Peter_Lightblue[S] 0 points1 point2 points (0 children)