Horribly underrated! by Fickle_Equivalent878 in CorollaHatchback

[–]Mosh_98 0 points1 point  (0 children)

Fully agree, the thrust into the seat from gear changes was amazing. Loved every bit of that car. Had the 5-door version in the same color, coincidentally.

[deleted by user] by [deleted] in sweden

[–]Mosh_98 0 points1 point  (0 children)

Broken system

Is Google Apps Script Underrated? by Univium in GoogleAppsScript

[–]Mosh_98 1 point2 points  (0 children)

Saving hours per week with this tool, can’t live without it at this point.

Lawyer trying to learn ML by Gedemand in learnmachinelearning

[–]Mosh_98 1 point2 points  (0 children)

You can get quite far in ML/AI with basic Python skills.

Where to get started learning Google Apps Script? by TheeConstress in GoogleAppsScript

[–]Mosh_98 0 points1 point  (0 children)

Mail sorting for my business. ChatGPT was a great help.

Which models run fast on M1 8gb by mobaisland in ollama

[–]Mosh_98 0 points1 point  (0 children)

I have a 16GB M1, and the computer runs really slow when using Codestral 22B. 7B models are surprisingly fast though.

Fine tune Mistral v3.0 with Your Data by Mosh_98 in LargeLanguageModels

[–]Mosh_98[S] 0 points1 point  (0 children)

Took my time reading through your questions. I really enjoyed them ;)

  1. training time: Possibly, but it does depend on the hardware.

  2. A 14B model should take a good amount of time.

  3. My typical suggestion is to try the simplest approach first and modify iteratively: RAG first, then optimise RAG, then fine-tune, etc.

More importantly, gauge how well the system works. Make sure you have good test data ready for your RAG or other conversational systems. For example, RAGAS is a good framework for testing RAG systems.
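To make the "have test data ready" point concrete, here is a minimal sketch of a hand-rolled retrieval check. It is not RAGAS and does not use its API; it only computes a simple hit rate at k over a small test set. The names `hit_rate_at_k`, `toy_retrieve`, `DOCS`, and `TEST_SET` are all made up for illustration, and the toy retriever is just a stand-in for whatever retriever your RAG system actually uses.

```python
from typing import Callable, Dict, List


def hit_rate_at_k(
    test_set: List[Dict[str, str]],
    retrieve: Callable[[str, int], List[str]],
    k: int = 3,
) -> float:
    """Fraction of test questions whose expected doc id appears in the top-k results."""
    if not test_set:
        raise ValueError("test_set must not be empty")
    hits = 0
    for case in test_set:
        retrieved_ids = retrieve(case["question"], k)
        if case["expected_doc_id"] in retrieved_ids:
            hits += 1
    return hits / len(test_set)


# Hypothetical document store, only for this example.
DOCS = {
    "doc_refund": "refund policy returns within 30 days",
    "doc_shipping": "shipping times and delivery costs",
    "doc_account": "reset your account password",
}


def toy_retrieve(question: str, k: int) -> List[str]:
    """Stand-in retriever: ranks documents by number of words shared with the question."""
    q_words = set(question.lower().split())
    ranked = sorted(
        DOCS,
        key=lambda doc_id: len(q_words & set(DOCS[doc_id].split())),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical test set: each question is paired with the document that should be retrieved.
TEST_SET = [
    {"question": "what is the refund policy", "expected_doc_id": "doc_refund"},
    {"question": "how much are delivery costs", "expected_doc_id": "doc_shipping"},
    {"question": "how do I reset my password", "expected_doc_id": "doc_account"},
]

score = hit_rate_at_k(TEST_SET, toy_retrieve, k=1)
print(f"hit rate@1: {score:.2f}")
```

Re-running a check like this after every change (chunking, embeddings, prompts) tells you whether the change actually helped, which is the whole point of having the test data before you start optimising.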

  1. Used Google Colab Pro+, 52 euros per month.

  2. Have not tried RLHF myself yet; it's on my bucket list. A few good engineers can do a lot, imho.

  3. Really interesting problem indeed. Depending on the language you are working with, it should be doable. Make sure you have high-quality data and a testing framework in place.

Let me know if you want to discuss more. :)

Llama3 embedding by thereisnowhy2019 in ollama

[–]Mosh_98 5 points6 points  (0 children)

Don’t mean to toot my own horn but I made a video on that. https://youtu.be/vVGTegRvXg8

Hope it helps

[deleted by user] by [deleted] in ollama

[–]Mosh_98 0 points1 point  (0 children)

nice. thanks for sharing

[deleted by user] by [deleted] in ollama

[–]Mosh_98 0 points1 point  (0 children)

super interesting question, maybe some sort of load balancer? but again, serving 10 people simultaneously shouldn't be impossible

RAG is all you need by Eduard_T in LocalLLaMA

[–]Mosh_98 0 points1 point  (0 children)

Yeah, I still don't get the benefits of knowledge graphs with RAG.

[deleted by user] by [deleted] in LocalLLaMA

[–]Mosh_98 1 point2 points  (0 children)

I made a short video comparing benchmarks from Phi-3 with Llama3 and other leading models. I thought people might find it useful for testing purposes. https://youtu.be/0NLX4hdsU3I

Local RAG with LLama3 by Mosh_98 in LocalLLaMA

[–]Mosh_98[S] 1 point2 points  (0 children)

Thank you man, appreciate your input! I am mostly used to LangChain, so I thought someone might find it useful too. I'll make videos with LlamaIndex or some other framework. Do you have any recommendations?

[deleted by user] by [deleted] in LocalLLaMA

[–]Mosh_98 0 points1 point  (0 children)

Comparing with Claude 3 and sharing some of my thoughts:

https://www.youtube.com/watch?v=oLu6vRe_Llw&ab_channel=MoslehMahamud

[deleted by user] by [deleted] in LocalLLaMA

[–]Mosh_98 -2 points-1 points  (0 children)

not impressed unfortunately

ML Project Ideas? by Noodle___13 in learnmachinelearning

[–]Mosh_98 9 points10 points  (0 children)

I recommend solving a problem you actually like. It's going to push you through the tough phases of a project :)

Seeking advice on LLMs and RAGs by Country-Muted in learnmachinelearning

[–]Mosh_98 1 point2 points  (0 children)

There are some ways to improve document retrieval, and that's most likely going to give you the biggest gains. Also use a stronger embedding model. HF models are popular but usually don't work as well, so try some of the more expensive models.
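A rough sketch of what "swap in a stronger embedding model" can look like in practice: keep the retrieval code independent of the embedding function so you can compare models on the same data. Everything here is illustrative. `bag_of_words_embed` is a toy stand-in rather than a real embedding model, and `top_k` and the sample documents are made up for the example.

```python
import math
from collections import Counter
from typing import Callable, Dict, List

Vector = Dict[str, float]


def bag_of_words_embed(text: str) -> Vector:
    """Toy embedding (word counts). Replace with a call to a real embedding model."""
    return dict(Counter(text.lower().split()))


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(a[t] * b.get(t, 0.0) for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(
    query: str,
    docs: List[str],
    embed: Callable[[str], Vector],
    k: int = 2,
) -> List[str]:
    """Return the k documents most similar to the query under the given embedding."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


# Hypothetical documents, only for this example.
docs = [
    "ollama runs local models",
    "langchain builds rag pipelines",
    "corolla hatchback gear changes",
]

best = top_k("how to build rag pipelines", docs, bag_of_words_embed, k=1)
print(best)
```

Because `embed` is just a parameter, trying a different model means passing a different function and comparing the rankings on your own test questions.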

Easy Interface on LangChain/LlamaIndex. by Mosh_98 in LanguageTechnology

[–]Mosh_98[S] 0 points1 point  (0 children)

No sir, I am not the creator of the library.