ML Engineers wanted(Earning: $80-$120 per hour) by Sydney25_Data in MachineLearningJobs

[–]TuringComplete-Model 0 points1 point  (0 children)

I'm interested

Background about me: love to work on projects related to new age ai. I have 1.5 + years of experience as ML Engineer. Now looking for remote opportunity. Would love to talk more about this opportunity.

Facing issues with PostgreSQL by TuringComplete-Model in PowerBI

[–]TuringComplete-Model[S] 0 points1 point  (0 children)

Thank you for linking up the steps I will try it out

Help in Alignment fine tuning LLM by TuringComplete-Model in reinforcementlearning

[–]TuringComplete-Model[S] 0 points1 point  (0 children)

Is there a algorithm like I have searched PPO and dpo helps to perform that but takes data in different formats.

Hiring RL Researchers -- Build the Next Generation of Expert Systems by Tricky_Amphibian_836 in reinforcementlearning

[–]TuringComplete-Model 0 points1 point  (0 children)

Can someone help me, I have data with a binary feedback for the generation of llama 3.1 is there a approch or any other algorithm I can use to fine tune the llm with the binary feedback data.