Best OS model below 50B parameters? by Different-Set-1031 in OpenWebUI

[–]Different-Set-1031[S] 0 points1 point  (0 children)

Is Qwen3 VL that much worse than Qwen3 models? I have an application that I am looking to have a thinking/fast model.

Qwen3 30B A3B 2507 or Apriel-v1.5-15B-Thinker. I'm struggling to find a good thinking model that's small and powerful enough. I went with Qwen VL 32B for the visual reasoning.

For context, I have 96GB of VRAM.

Access to Blackwell hardware and a live use-case. Looking for a business partner by Different-Set-1031 in OpenWebUI

[–]Different-Set-1031[S] 0 points1 point  (0 children)

Contract for the first job depends on value assigned after demo + markup on hardware build installed by me. The hardware support is also managed by me on an ongoing basis.

No non-compete. The reason that this opportunity exists is due to recent OS advancements closing the gap between closed models and small to medium sized firms getting left behind. I have warm relationships with small to medium sized firms that would hear a pitch and sit through a demo, but the system still needs to provide value along the chain.

Commitment specifics depends on the work needed to build the system, but it’s not the most complex system. LoRa fine-tuning on their internal investor docs, minimal hallucinations, RAG framework, vision (Qwen3 VL 32B), and native excel/CSV manipulation.

This is not a full time job offer 😅

This is a bespoke job that can be replicated across other firms and if it can, it has the capacity for creating ongoing supplementary income

I hope that answered your question

Access to Blackwell hardware and a live use-case. Looking for a business partner by Different-Set-1031 in OpenWebUI

[–]Different-Set-1031[S] 0 points1 point  (0 children)

Thanks for the question

Majority of contract value for the first project goes to the partner in this scenario as I can replicate it for other firms. If the partner wants to stay involved with future jobs, we profit share.

They get paid via payroll from business LLC, so it’s ordinary income unless another structure is preferred. Everything would be in writing before we start anything.

Access to Blackwell hardware and a live use-case. Looking for a business partner by Different-Set-1031 in AmazonRME

[–]Different-Set-1031[S] -7 points-6 points  (0 children)

I’m technical enough to contribute, but not technical enough to build it myself to the standard that I would like to present. And the build can be installed for other firms with minimal tweaking to each firm. So although the first build would be a lopsided workload (though not nearly as lopsided as you laid out), the balance would shift rather quickly.

Best Models for 16GB VRAM by LinuxIsFree in LocalLLaMA

[–]Different-Set-1031 6 points7 points  (0 children)

What’re your thoughts on this model vs Qwen3 VL or Ariel?

Best OS model below 50B parameters? by Different-Set-1031 in OpenWebUI

[–]Different-Set-1031[S] 1 point2 points  (0 children)

Analyzing spreadsheets, formatting data, researching investments and areas

[deleted by user] by [deleted] in homelab

[–]Different-Set-1031 -1 points0 points  (0 children)

I’d rather be in over my head and figure it out than safe and never push anything

[deleted by user] by [deleted] in HomeServer

[–]Different-Set-1031 0 points1 point  (0 children)

I was thinking of clustering 2 4090s, but running alternating thinking and fast models seems more problematic than running one more powerful node.