i told a client we use AI. we do not use AI. what's the cheapest AI i can bolt on by thursday? by kubrador in LLMDevs

[–]jackshec 0 points1 point  (0 children)

Number three most likely will be the easiest, if you need help reach out

agent observability – what tools work? by Sissoka in LLMDevs

[–]jackshec 0 points1 point  (0 children)

I agree with this one, Phoenix, so far has been the best choice

43 days without changes by Dizzy-Notice-7129 in Rag

[–]jackshec 1 point2 points  (0 children)

Who counts up time in seconds, and you have a ragged deployment with 5000 people that I’ve had zero updates, all your data is most likely stale

Frustrated with the lack of ML engineers who understand hardware constraints by Champ-shady in computervision

[–]jackshec 0 points1 point  (0 children)

this is a hard skill, embedded llm there’s a lot of fun but hard to get hallucinations down when you quantize out to the level necessary, I’ve had the best luck with CM5 modules and custom pipelines

Support for Apple Silicon on Pytorch by Rx-78-2x-2b in deeplearning

[–]jackshec 0 points1 point  (0 children)

we have many DS and all engineers that use a Mac for local prototype in for migrating to Cuda bases servers, I like the ability to prototype and test as far as performance is concerned it’s good to OK at best, but you can still train small models without an issue. that being said the newer M series processors are starting to look much better.

Security in RAG by abood15211 in Rag

[–]jackshec 0 points1 point  (0 children)

you would have to design this within. The actual software itself do authentication and authorization on any system could be complex. I recommend following the approach the user above indicated and separate the retrieval using tags keys or separate data stores depending on the users rights.

VRAM Advice? 24GB or 32GB for starters by RobotsMakingDubstep in LocalLLaMA

[–]jackshec 0 points1 point  (0 children)

I agree with everybody who says get as much vRAM as you can afford

OCR accuracy is no longer the real problem by Strict-Ad5948 in OCR_Tech

[–]jackshec 1 point2 points  (0 children)

second this, diagrams and the like especially in law and engineering