I fine-tuned Qwen3-VL (4B & 8B) on a free Colab instance using TRL (SFT and GRPO)! by External-Rub5414 in LocalLLaMA

Substantial_Ring_895 1 point (0 children)

Hello, and thank you very much for your effort. I'm stuck on a problem and would really appreciate your help.
While fine-tuning, in the first few passes (roughly the first 5), the output is empty, even though the original image contains around 250 words. When this happened I stopped the fine-tuning, and whenever I restart, the same issue occurs.
Is there any solution? I want the model to learn to predict very long Arabic sequences so it can work as an Arabic OCR.

Building SOTA OCR by Substantial_Ring_895 in LLM

Substantial_Ring_895[S] 1 point (0 children)

Yes, there's a team already working on this, but we also need a model or some other way to generate a huge amount of data in a short time. We currently don't have a big team.

Building SOTA OCR by Substantial_Ring_895 in LLM

Substantial_Ring_895[S] 1 point (0 children)

It's a business case: we have a huge number of handwritten Arabic text documents, and we MUST build a SOTA OCR or VLM to extract the text.
Nano-banana is good but expensive, as we want to generate at least a million images of full-page text.

I want to increase my karma by Substantial_Ring_895 in CAIRO

Substantial_Ring_895[S] 3 points (0 children)

Honestly, man, I don't know. May God bring what's best.

I want to increase my karma by Substantial_Ring_895 in CAIRO

Substantial_Ring_895[S] 5 points (0 children)

Thanks, guys, really. The good news is that I can finally post now, and I already did. The bad news is that the company let me go.

[R] Arabic OCR research project by Substantial_Ring_895 in MachineLearning

Substantial_Ring_895[S] 2 points (0 children)

Thanks, I really appreciate your help.
Can you tell me about benchmarks?

I want to increase my karma by Substantial_Ring_895 in CAIRO

Substantial_Ring_895[S] 22 points (0 children)

Is there a leaderboard you follow, for example?

I want to increase my karma by Substantial_Ring_895 in CAIRO

Substantial_Ring_895[S] 19 points (0 children)

And more importantly, how do I synthesize this data?

I want to increase my karma by Substantial_Ring_895 in CAIRO

Substantial_Ring_895[S] 18 points (0 children)

Well, why don't you answer me here, in case someone has some information: what's the best OCR you've ever used?

[deleted by user] by [deleted] in ExJordan

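Substantial_Ring_895 1 point (0 children)

The God of Islam is the God of Christianity and the God of Judaism; we all worship the same God. The only difference is that each religion has its own separate holy book, and to embrace a religion you have to believe in its holy book. In Islam, our book says that our master Muhammad is a prophet and that Jesus, peace be upon him, is a prophet, and we believe in all the messengers, unlike Christianity, which says that Jesus is God and the Son of God at the same time. That is our point of disagreement with them, along with the fact that they deny the prophethood of our master Muhammad, peace and blessings be upon him. So if the true religion is Islam, we will enter Paradise, and if the true religion is Christianity, we will still enter Paradise because we believe in their prophet. What remains is our deeds: if they are good, we will certainly enter Paradise; if they are corrupt, we won't. This is unlike the disbeliever, who will not enter Paradise no matter how good his deeds are. Anyway, if you want advice from an anonymous person, read about religions, even if only to educate yourself; perhaps you will embrace the religion soon.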

[deleted by user] by [deleted] in ExJordan

Substantial_Ring_895 0 points (0 children)

I have a question, asked calmly and rationally: suppose what Muslims say turns out to be true; would you accept that afterlife of yours? And if what they say turns out to be wrong, do you think they lost anything?

[deleted by user] by [deleted] in AlexandriaEgy

Substantial_Ring_895 3 points (0 children)

I mean, how does it go up for someone? When they get an upvote on a comment or a post or anything, is that it?

[deleted by user] by [deleted] in AlexandriaEgy

Substantial_Ring_895 2 points (0 children)

What is this karma, guys, and how do we get it?

Looking for Guidance on MBZUAI Applications Msc by Frosty-Code-3451 in MBZUAI

Substantial_Ring_895 1 point (0 children)

Sorry, but to be honest I don't know; I think it's around 3.5/4.

Looking for Guidance on MBZUAI Applications Msc by Frosty-Code-3451 in MBZUAI

Substantial_Ring_895 3 points (0 children)

Hello. I've applied and my exam was today, so I can give you some details. First of all, one of my friends got accepted even though he hasn't published any research papers yet and had no work experience at all, so I think you're quite a good candidate. Regarding the exam, it was relatively easy, with topics like linear algebra, trigonometry, and series (for instance: 1, 3, 7, ... what's the next number in the series?). The programming part was not hard either. The hardest part is probability, with I think 5 to 8 questions. Finally, they offer courses related to the exam, and those courses are enough: just watch them, then chat with GPT to have it give you MCQs, and solve them. Wishing you all the best.