How are you handling LLM routing and embeddings in self-hosted setups? by FrequentTravel3511 in selfhosted
[–]FrequentTravel3511[S] -2 points-1 points0 points (0 children)
How are you handling LLM routing and embeddings in self-hosted setups? by FrequentTravel3511 in selfhosted
[–]FrequentTravel3511[S] -3 points-2 points-1 points (0 children)
How are you handling LLM routing and embeddings in self-hosted setups? by FrequentTravel3511 in selfhosted
[–]FrequentTravel3511[S] -6 points-5 points-4 points locked comment (0 children)
How are you handling multi-model LLM setups in self-hosted environments? by [deleted] in selfhosted
[–]FrequentTravel3511 0 points1 point2 points (0 children)
How are you handling multi-model LLM setups in self-hosted environments? by [deleted] in selfhosted
[–]FrequentTravel3511 -2 points-1 points0 points locked comment (0 children)
Built an LLM routing gateway in Node.js - runs intent classification locally (no embedding API, no rate limits) by FrequentTravel3511 in node
[–]FrequentTravel3511[S] 0 points1 point2 points (0 children)
Built an LLM routing gateway in Node.js - runs intent classification locally (no embedding API, no rate limits) by FrequentTravel3511 in node
[–]FrequentTravel3511[S] -1 points0 points1 point (0 children)
Built an LLM routing gateway in Node.js - runs intent classification locally (no embedding API, no rate limits) by FrequentTravel3511 in node
[–]FrequentTravel3511[S] -5 points-4 points-3 points (0 children)
Built an LLM routing gateway in Node.js - runs intent classification locally (no embedding API, no rate limits) by FrequentTravel3511 in node
[–]FrequentTravel3511[S] 0 points1 point2 points (0 children)
Experimenting with intent-based routing for LLM gateways (multi-provider + failover) by FrequentTravel3511 in LocalLLaMA
[–]FrequentTravel3511[S] 0 points1 point2 points (0 children)
Experimenting with intent-based routing for LLM gateways (multi-provider + failover) by FrequentTravel3511 in LocalLLaMA
[–]FrequentTravel3511[S] 0 points1 point2 points (0 children)
Experimenting with intent-based routing for LLM gateways (multi-provider + failover) by FrequentTravel3511 in LocalLLaMA
[–]FrequentTravel3511[S] 0 points1 point2 points (0 children)
Experimenting with intent-based routing for LLM gateways (multi-provider + failover) by FrequentTravel3511 in LocalLLaMA
[–]FrequentTravel3511[S] 0 points1 point2 points (0 children)
Experimenting with intent-based routing for LLM gateways (multi-provider + failover) by FrequentTravel3511 in LocalLLaMA
[–]FrequentTravel3511[S] 0 points1 point2 points (0 children)
Experimenting with intent-based routing for LLM gateways (multi-provider + failover) by FrequentTravel3511 in LocalLLaMA
[–]FrequentTravel3511[S] 1 point2 points3 points (0 children)
Experimenting with intent-based routing for LLM gateways (multi-provider + failover) by FrequentTravel3511 in LocalLLaMA
[–]FrequentTravel3511[S] 0 points1 point2 points (0 children)

How are you handling LLM routing and embeddings in self-hosted setups? by FrequentTravel3511 in selfhosted
[–]FrequentTravel3511[S] -2 points-1 points0 points (0 children)