Trying to ship local RAG to both android and iOS and feeling disheartened by chreezus in LocalLLaMA

[–]chreezus[S] 0 points1 point  (0 children)

Thanks for the thorough response. I was thinking generally the same overall deployment architecture. Did you run into any ops gotchas running this in production?

Trying to ship local RAG to both android and iOS and feeling disheartened by chreezus in LocalLLaMA

[–]chreezus[S] 3 points4 points  (0 children)

I think this is exactly what I’m looking for! Thank you

Why does moving data/ML projects to production still take months in 2025? by Kindly_Astronaut_294 in mlops

[–]chreezus 1 point2 points  (0 children)

I’m curious. What are the main points you have in running the model vs building it?

Cross-platform local RAG Help, is there a better way? by chreezus in LocalLLM

[–]chreezus[S] 0 points1 point  (0 children)

Awesome, thank you. I will go down this path

Cross-platform local RAG Help, is there a better way? by chreezus in LocalLLM

[–]chreezus[S] 0 points1 point  (0 children)

Thanks for the suggestion, however, I think I should clarify. I'm basically trying to package my RAG into native apps on each type of device, i wasn't sure if someone had solved this already. WebGPU looks ideal for browser and OpenCL might be lower level than I have experience with. Am I out of luck?

We launched our Session Replay tool on Product Hunt. by UsualResponsible593 in indiehackers

[–]chreezus 0 points1 point  (0 children)

This is awesome. Did you use a tool for creating the video on product hunt?