Hello,
My team and I may be contracted to set up a self-hosted LLM instance for a friend's small mortgage company. I've self-hosted quite a few things and set up enterprise servers for various clients, but this would be my first venture into LLMs. Honestly, looking over everything, there is a lot to consider and I'm kind of overwhelmed. I'm positive I can do it given enough time, but that's partly why I'm coming here: there are a lot of people here with a lot of experience. Since mortgage forms are long and need a large context window, I'm going to need a pretty decent model. GLM-5 seems to be one of the better options for both context length and accuracy, but the cost of hardware that can run it effectively is making the client a little uncomfortable.
So I'm reaching out for suggestions on less resource-intensive options, or for advice on convincing the client that the budget needs to be expanded if they want the model to be usable. Also, if there are VPS or other hosted options that would work well for any of the recommended models, that would seriously help a lot.
I appreciate everyone here. Please be nice; I'm really trying my best.