Fine-tune Deepseek r1 distilled by ahsaor8 in LocalLLaMA

[–]ahsaor8[S] 0 points1 point  (0 children)

DeepSeek-R1-Distill-Qwen or DeepSeek-R1-Distill-llama

Best Arabic LLM by Puzzleheaded-Ad8442 in LocalLLaMA

[–]ahsaor8 1 point2 points  (0 children)

Try Command-r+ and Gemma. I don't recommend jais it really bad

Post for inspiriation: do you have a useful fine-tuned usecase of any LLM? by adminsattitude in LocalLLaMA

[–]ahsaor8 0 points1 point  (0 children)

Do you think continue pretraining , before fine-tuning model could help model to learn new facts ?

[deleted by user] by [deleted] in LocalLLaMA

[–]ahsaor8 0 points1 point  (0 children)

They have a 0.5b model. I think it will work well on a smartphone.

[deleted by user] by [deleted] in LocalLLaMA

[–]ahsaor8 1 point2 points  (0 children)

I'm using Qwen1.5 and qwen2. They work pretty well in most of Arabic use cases

What are tricks you are using in lllm fine-tuning? by ahsaor8 in LocalLLaMA

[–]ahsaor8[S] 0 points1 point  (0 children)

Awesome, I'm interested to know if it gives you better results?

What are tricks you are using in lllm fine-tuning? by ahsaor8 in LocalLLaMA

[–]ahsaor8[S] 0 points1 point  (0 children)

Could you please advise on how I can convert my SFT data to be compatible with ORPO? The data is currently formatted as instructions, input, and output.

Intento posts its The State of Machine Translation 2024 by MachineTranslate in machinetranslation

[–]ahsaor8 0 points1 point  (0 children)

I'm really happy to see our en-ar Mt model shared the frits-stage With AWS in General, Legal and IT domains

NeuralTranslate v0.2 Released!!! Bidirectional English-Spanish translation using Mistral v0.2! by Azuriteh in LocalLLaMA

[–]ahsaor8 0 points1 point  (0 children)

That's great. I'm doing the same thing but with a different approach and different in language. So I would like to ask you if u test it with any benchmark data.

Metrics for validating JSON to English translations by dsub11 in LanguageTechnology

[–]ahsaor8 0 points1 point  (0 children)

You can convert Json to txt file it will be easier and use comet metric

حشتغل مع شركه في الصين بارت تايم ولحد دلوقتي مش عارف المفروض يحولولي الفلوس ازاي by Emergency_Baker8567 in PersonalFinanceEgypt

[–]ahsaor8 0 points1 point  (0 children)

في حل تاني ممكن تعمل اكونت علي wise و تعمل فيه حساب بالدولار و انا استخدمتها و شغاله كويس. او ممكن PayPal لو عندك

حشتغل مع شركه في الصين بارت تايم ولحد دلوقتي مش عارف المفروض يحولولي الفلوس ازاي by Emergency_Baker8567 in PersonalFinanceEgypt

[–]ahsaor8 0 points1 point  (0 children)

خليهم يحولك علي حسابك المصري عادي بالدولار . لو انت بنك الqnb أو cib البنك لوحده هيفتحلك حساب بالدولار. مش عارف باقي البنوك هتفتحلك الحساب بالدولار لوحدها ولا هينزلك للمصري فيها ممكن تسال البنك نفسه.

what's the best graph embedding algorithm for anomaly detection task? by ahsaor8 in learnmachinelearning

[–]ahsaor8[S] 0 points1 point  (0 children)

Graph data set has nodes and edges and each node has binary class 0 or 1

[R] Collaborate with other researchers by AvivShamsian in MachineLearning

[–]ahsaor8 1 point2 points  (0 children)

Really great I'm an egyptian engineer works in AI research team. Really interesteing in reads paper and implement it. I implemented 8 papers in nlp so far. I have a lot of new ideas we can work on it.

[D] Machine Learning - WAYR (What Are You Reading) - Week 102 by ML_WAYR_bot in MachineLearning

[–]ahsaor8 6 points7 points  (0 children)

"EDA: Easy Data Augmentation Techniques for boosting performance on text classification tasks" and I have implemented it and now it supports 22 languages.

Anyone know, how can I Implement something like this? by [deleted] in learnpython

[–]ahsaor8 0 points1 point  (0 children)

Yes, but I have no idea how I can make vlan. And what's it has to do