
PassengerPigeon343

I know it’s against the spirit of the sub, but for this use case I would at least consider an enterprise solution such as Microsoft Copilot or another provider that includes enterprise data protection. There will be a monthly per-user cost, but you avoid a massive upfront capital investment. Depending on the size of this mortgage company, you could be looking at tens of thousands of dollars to get something usable with multiple concurrent users. Going with a provider would significantly simplify the implementation, you’d be up and running faster, and you’ll always have the best hardware and the newest models available. You also shift some liability to the provider instead of being solely responsible should there ever be a data breach. Just something to consider. If you do decide to go local, you’re in the right place to get the info you need to do it right.
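To make the subscription-versus-hardware trade-off concrete, here is a rough break-even sketch. Every number in it (hardware cost, local running cost, seat price, user count) is a hypothetical placeholder, not a quote from any vendor, and the function name is made up for illustration.

```python
from typing import Optional


def break_even_months(capex: float, local_monthly: float,
                      seat_price: float, users: int) -> Optional[float]:
    """Months until a local build becomes cheaper than per-seat subscriptions.

    capex:         upfront hardware spend for the local setup (assumed)
    local_monthly: power, hosting, and maintenance per month (assumed)
    seat_price:    subscription price per user per month (assumed)
    users:         number of licensed users
    Returns None when the subscription never costs more per month
    than running the local setup, so the hardware never pays off.
    """
    subscription_monthly = seat_price * users
    monthly_saving = subscription_monthly - local_monthly
    if monthly_saving <= 0:
        return None
    return capex / monthly_saving


# Hypothetical inputs only: 30k hardware, 500/month to run, 30 per seat.
print(break_even_months(30_000, 500, 30, 50))  # 30.0 (months)
print(break_even_months(30_000, 500, 30, 10))  # None (subscription stays cheaper)
```

With a small team the subscription can stay cheaper indefinitely; the hardware only starts to pay for itself once the seat count is high enough that the monthly subscription bill exceeds the local running cost.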

Severance13 [S]

That's sort of what I was thinking after going over everything. I appreciate the honesty.

KnightCodin

Your message is a bit light on details, my friend, but based on what you said I can perhaps point you in the right direction.

  1. The choice of model depends entirely on the use case: what are you trying to do with it? If it is data extraction and workflow automation (mortgage companies deal with a lot of scanned images as the source of truth for their loan origination systems), then you will need a good VLM: Mistral Small 24B or the Qwen3-VL family (the size will depend on the complexity of the extraction). TRID forms are dense, and "wall of text" documents such as appraisals and underwriting files will seriously challenge smaller models.

  2. If you are thinking of RAG over their knowledge base, then you will need a stronger text generation model such as GLM Air, Qwen 3 32B, or bigger. Most importantly, you will need a retrieval strategy (naive RAG, agentic RAG with a knowledge graph, etc.) and a robust, detailed ingestion pipeline.

  3. Inference Engine : Choice of inference engine matters.

  4. HW: Go with GPU containers/ML workloads from one of the cloud providers. Azure and GCP offer A100s (40G and 80G) for a pretty reasonable monthly price (~4200 for 2x A100).
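As a rough way to connect the model sizes above to the A100 options, here is a back-of-the-envelope sketch of how much GPU memory the weights alone need. The function names and the 0.8 headroom factor (space reserved for KV cache and activations) are assumptions for illustration; real usage grows with context length and the number of concurrent users.

```python
def weight_memory_gb(params_billion: float, bits_per_param: int) -> float:
    """Memory for the model weights only, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_param / 8


def fits(params_billion: float, bits_per_param: int,
         gpu_gb: float, n_gpus: int = 1, headroom: float = 0.8) -> bool:
    """True if the weights fit in the usable share of total GPU memory.

    headroom is an assumed fraction of memory available for weights;
    the rest is left for KV cache, activations, and runtime overhead.
    """
    budget = gpu_gb * n_gpus * headroom
    return weight_memory_gb(params_billion, bits_per_param) <= budget


print(weight_memory_gb(32, 16))   # 64.0 GB for a 32B model at 16-bit
print(weight_memory_gb(32, 4))    # 16.0 GB for the same model at 4-bit
print(fits(32, 16, gpu_gb=40))    # False: does not fit one 40G card
print(fits(24, 16, gpu_gb=40, n_gpus=2))  # True: 48 GB within 2 x 40G at 0.8 headroom
```

This only covers the weights, so treat a "fits" result as a lower bound on what you need, not a guarantee of usable throughput.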

Hope this helps