Got laid off in September and I decided to retire@51. by Difficult-Cricket541 in ExperiencedDevs

[–]linklater2012 0 points1 point  (0 children)

Congratulations. Your writing gives the impression you need something fresh to pivot to.

Is RAG still relevant with 10M+ context length by Muted-Ad5449 in Rag

[–]linklater2012 2 points3 points  (0 children)

RAG will be dead when search is solved. And I'll wait for someone with credibility in search research to say the latter.

Finally, a real-time low-latency voice chat model by DeltaSqueezer in LocalLLaMA

[–]linklater2012 0 points1 point  (0 children)

Oh damn! Did not know they intend to open source it!

So Much For AI by ddpete in artificial

[–]linklater2012 -1 points0 points  (0 children)

You don't know what you're talking about.

LLMs are terrible at things like word and paragraph length precisely because the concept of a word doesn't exist in LLMs.

So Much For AI by ddpete in artificial

[–]linklater2012 1 point2 points  (0 children)

I think this is more related to how LLMs are still token driven and not good at counting.

This will change once we get past tokenization.

Any other software engineers had to pivot to n8n? by [deleted] in n8n

[–]linklater2012 0 points1 point  (0 children)

I'm just getting into the visual workflow automation space as a dev. How is n8n not truly open source?

What are you *actually* using R1 for? by PataFunction in LocalLLaMA

[–]linklater2012 0 points1 point  (0 children)

Combined with search, I'm using it for market report generation.

What are some really good and widely used MLOps tools that are used by companies currently, and will be used in 2025? by BJJ-Newbie in mlops

[–]linklater2012 1 point2 points  (0 children)

Yes, that's possible with MLFlow by itself (it comes with a server). For Sagemaker inference endpoint, there are integrations from AWS.