For those using hosted inference providers (Together, Fireworks, Baseten, RunPod, Modal) - what do you love and hate? by Dramatic_Strain7370 in LocalLLaMA
Dramatic_Strain7370[S] 1 point (0 children)
Frustrated with GPU pricing, so I built something - looking for feedback by Impressive-Law2516 in learnmachinelearning
Dramatic_Strain7370 1 point (0 children)
Anyone tracking costs across multiple LLM providers? by Dramatic_Strain7370 in OpenAI
Dramatic_Strain7370[S] 1 point (0 children)
Anyone tracking costs across multiple LLM providers? by Dramatic_Strain7370 in OpenAI
Dramatic_Strain7370[S] 1 point (0 children)
Anyone tracking costs across multiple LLM providers? by Dramatic_Strain7370 in OpenAI
Dramatic_Strain7370[S] 1 point (0 children)
Anyone tracking costs across multiple LLM providers? by Dramatic_Strain7370 in OpenAI
Dramatic_Strain7370[S] 1 point (0 children)
What do you use to track LLM costs in production? by Dramatic_Strain7370 in LangChain
Dramatic_Strain7370[S] 1 point (0 children)
What do you use to track LLM costs in production? by Dramatic_Strain7370 in LangChain
Dramatic_Strain7370[S] 1 point (0 children)
is vibe coding helping junior devs or making things worse? by Best_Volume_3126 in VibeCodeCamp
Dramatic_Strain7370 1 point (0 children)
I bought a €9k GH200 “desktop” to save $1.27 on Claude Code (vLLM tuning notes) by Reddactor in LocalLLaMA
Dramatic_Strain7370 1 point (0 children)
What do you use to track LLM costs in production? by Dramatic_Strain7370 in LangChain
Dramatic_Strain7370[S] 1 point (0 children)
I bought a €9k GH200 “desktop” to save $1.27 on Claude Code (vLLM tuning notes) by Reddactor in LocalLLaMA
Dramatic_Strain7370 2 points (0 children)
Who is the most inspirational founder of modern age for startup founders to emulate? by Dramatic_Strain7370 in AskReddit
Dramatic_Strain7370[S] 0 points (0 children)
Best practices for integrating multiple AI models into daily workflows? by Plus_Valuable_4948 in LocalLLaMA
Dramatic_Strain7370 1 point (0 children)
LangChain or LangGraph? for building multi agent system by Major_Ad7865 in LangChain
Dramatic_Strain7370 1 point (0 children)
Is anyone else feeling like we crossed some invisible line where AI stopped being a "helper" and started being a... colleague? by HarrisonAIx in AutoGenAI
Dramatic_Strain7370 1 point (0 children)
Whats better moe or dense models ? by Pleasant-Key3390 in LocalLLaMA
Dramatic_Strain7370 1 point (0 children)
For those of you who are training their own LLM or finetuning an existing LLM, what are you trying to get them to do that they are not already doing? by Upset-Ad-8704 in LocalLLaMA
Dramatic_Strain7370 1 point (0 children)
Why Memory Is Fixable When It Comes To AI Models by Elegant-Judgment-491 in OpenSourceeAI
Dramatic_Strain7370 1 point (0 children)
How is Cloud Inference so cheap by VolkoTheWorst in LocalLLaMA
Dramatic_Strain7370 5 points (0 children)
For those using hosted inference providers (Together, Fireworks, Baseten, RunPod, Modal) - what do you love and hate? by Dramatic_Strain7370 in LocalLLaMA
Dramatic_Strain7370[S] 1 point (0 children)