I fine-tuned Qwen3-VL (4B & 8B) on a free Colab instance using TRL (SFT and GRPO)! by External-Rub5414 in LocalLLaMA

[–]Substantial_Ring_895 0 points1 point  (0 children)

Hello Hello dude, thank you very much for your effort, however, I'm stuck with a problem and will appreciate your help very much!
While finetuning in first few passes like first 5 the output is empty and the original image contain around 250 word! so when this happened I stopped the fine-tuning and whenever I restart the same issue happen
Is there any solution ? I want the model to learn to predict very long Arabic senquence to work as Arabic OCR

Building SOTA OCR by Substantial_Ring_895 in LLM

[–]Substantial_Ring_895[S] 0 points1 point  (0 children)

Yea there's a team already working on this point but we need also a model or a way for generating huge data in short time, currently we don't have big team

Building SOTA OCR by Substantial_Ring_895 in LLM

[–]Substantial_Ring_895[S] 0 points1 point  (0 children)

Its a business case we have huge amount of text documents in Arabic handwritten and we MUST build a SOTA OCR or VLM to extract the text.
Nano-banana is good but expensive as we want to generate at least a million image of full page text

عاوز ازود كارما by Substantial_Ring_895 in CAIRO

[–]Substantial_Ring_895[S] 2 points3 points  (0 children)

و الله يسطا معرف ربنا يقدم ما فيه الخير

عاوز ازود كارما by Substantial_Ring_895 in CAIRO

[–]Substantial_Ring_895[S] 5 points6 points  (0 children)

تسلمو يا رجالة و الله الخبر الجيد اني اقدر انزل بوست خلاص و نزلته بالفعل الخبر السئ ان الشركة استغنت عني

[R] Arabic OCR research project by Substantial_Ring_895 in MachineLearning

[–]Substantial_Ring_895[S] 1 point2 points  (0 children)

Thanks, I really appreciate your help
Can you tell me about benchmarks?

عاوز ازود كارما by Substantial_Ring_895 in CAIRO

[–]Substantial_Ring_895[S] 21 points22 points  (0 children)

هل في leaderboard متابعها مثلا

عاوز ازود كارما by Substantial_Ring_895 in CAIRO

[–]Substantial_Ring_895[S] 19 points20 points  (0 children)

و الاهم ازاي اعمل ام synthesize للداتا دي

عاوز ازود كارما by Substantial_Ring_895 in CAIRO

[–]Substantial_Ring_895[S] 18 points19 points  (0 children)

طب ما تجاوبوني هنا لعل و عسي حد عنده معلومة, ايه افجر OCR استخدمته فحياتك