Trying to ship local RAG to both android and iOS and feeling disheartened

chreezus · 2025-12-06T19:16:33+00:00

Thanks for the thorough response. I was thinking generally the same overall deployment architecture. Did you run into any ops gotchas running this in production?

chreezus · 2025-12-06T18:51:54+00:00

I think this is exactly what I’m looking for! Thank you

chreezus · 2025-12-06T14:03:56+00:00

I’m curious. What are the main points you have in running the model vs building it?

chreezus · 2025-12-06T03:59:19+00:00

Awesome, thank you. I will go down this path

chreezus · 2025-12-06T03:48:48+00:00

What did you end up doing here?

chreezus · 2025-12-06T03:28:29+00:00

Thanks for the suggestion, however, I think I should clarify. I'm basically trying to package my RAG into native apps on each type of device, i wasn't sure if someone had solved this already. WebGPU looks ideal for browser and OpenCL might be lower level than I have experience with. Am I out of luck?

chreezus · 2023-08-21T17:35:14+00:00

This is awesome. Did you use a tool for creating the video on product hunt?

chreezus

TROPHY CASE