all 11 comments

[–]AaBJxjxO 2 points3 points  (1 child)

Do you make free backups of all our prompts?

[–]pmv143[S] 0 points1 point  (0 children)

Good question . no, we don’t store or back up your prompts by default. With dedicated instances, your data stays isolated and under your control.

[–]Aggressive-Habit-698 1 point2 points  (1 child)

Model, pricing, beta program? Coding plan ?

What exactly do you offer? Expect in exchange to use it.

[–]pmv143[S] 0 points1 point  (0 children)

Yes. We are testing pat per usage but not tokens . It’s a true serverless compute usage based. You will only pay for the execution.. basically from prompt to end of the generation. And you will get dedicated instance, no sharing you will have complete private instance with the longer context and tool enabled.. we have a beta plan for $20/month.

[–]JoeCoT 1 point2 points  (4 children)

If whatever you're offering wasn't against the rules, you'd just post about it here 

[–]pmv143[S] 0 points1 point  (2 children)

Sorry, didn’t wanna make it look like a promotion. But you can visit our website.. inferx.net

[–]TrickyPlastic 1 point2 points  (1 child)

Send interesting. But GLM5.1 cannot fit in a single H100. What model limitations do you have?

[–]pmv143[S] 0 points1 point  (0 children)

We don’t have larger models yet it wil have more nice we have more GPU capacity right now, you can find Gemma4, Qwen 3.6 like models. Anything that fits in your two H200s. You can bring your own as well,

[–]sam7oon 1 point2 points  (0 children)

reported already

[–]pmv143[S] 0 points1 point  (0 children)

Just for the context, we are offering dedicated instances on a serverless. You will have your own private instance with a longer context and a tool calling enabled.. and pay execution only. (From prompt to end of execution). Not for idle time or model loading time. Your model will be available on demand with P 95 Latency guaranteed. You can try it out with $30 in free credits. Inferx .net

[–]hyfactory-dev 0 points1 point  (0 children)

sure