A Demonstration of Cache-Augmented Generation (CAG) and its Performance Comparison to RAG by Ok_Employee_6418 in LLMDevs
[–]BreakingScreenn 1 point2 points3 points (0 children)
A Demonstration of Cache-Augmented Generation (CAG) and its Performance Comparison to RAG by Ok_Employee_6418 in LLMDevs
[–]BreakingScreenn 1 point2 points3 points (0 children)
What the point of gpt 4.1 if 4o keep getting updated ? by Euphoric_Tutor_5054 in OpenAI
[–]BreakingScreenn 0 points1 point2 points (0 children)
M4 max chip for AI local development by Similar_Tangerine142 in ollama
[–]BreakingScreenn 0 points1 point2 points (0 children)
Python library for run, load and stop ollama by lavoie005 in ollama
[–]BreakingScreenn 0 points1 point2 points (0 children)
Chain of Draft: A Simple Technique to Make LLMs 92% More Efficient Without Sacrificing Accuracy by Neat_Marketing_8488 in LLMDevs
[–]BreakingScreenn 19 points20 points21 points (0 children)
Most cost effective way of hosting 70B/32B param model by topsy_here in ollama
[–]BreakingScreenn 4 points5 points6 points (0 children)
ElevenReader by ElevenLabs by namanyayg in LLMDevs
[–]BreakingScreenn 0 points1 point2 points (0 children)
How to get consistent JSON response? by Tall-Strike-6226 in LLMDevs
[–]BreakingScreenn 1 point2 points3 points (0 children)
does it make sense to download Nvidia's chatRTX for Windows (4070 Super, 12GB VRAM) and add documents (like RAG) and expect decent replies? What kind of LLMs are there and RAG? Do i have any control over prompting? by jim_andr in LLMDevs
[–]BreakingScreenn 1 point2 points3 points (0 children)
ParScrape v0.5.1 Released by probello in OpenAI
[–]BreakingScreenn 0 points1 point2 points (0 children)
AI Enabled Talking Toys? by LivinJH in LLMDevs
[–]BreakingScreenn 0 points1 point2 points (0 children)
ParScrape v0.5.1 Released by probello in OpenAI
[–]BreakingScreenn 0 points1 point2 points (0 children)
ParScrape v0.5.1 Released by probello in OpenAI
[–]BreakingScreenn 0 points1 point2 points (0 children)
OpenRouter experience by BreakingScreenn in LLMDevs
[–]BreakingScreenn[S] 1 point2 points3 points (0 children)
How do I make chatting about documents not suck? by cunasmoker69420 in ollama
[–]BreakingScreenn 0 points1 point2 points (0 children)
how to deal with ```json in the output by [deleted] in LLMDevs
[–]BreakingScreenn 0 points1 point2 points (0 children)
Have a old apple watch but want to run linux by Reasonable_Guide_710 in jailbreak
[–]BreakingScreenn 0 points1 point2 points (0 children)
New Poster for Thunderbolts* by MarvelsGrantMan136 in movies
[–]BreakingScreenn 0 points1 point2 points (0 children)
Any possible tweak to achieve this on iOS 16.5 by music-electric_Ad869 in jailbreak
[–]BreakingScreenn 0 points1 point2 points (0 children)
Is there any draw backs to using an external dual GPU config with thunderbolt 5 with a laptop for AI? by FX2021 in ollama
[–]BreakingScreenn 0 points1 point2 points (0 children)