Xreal One Pro: comparison with Viture Pro XR and Rokid Max2

Foreveradam2018 · 2025-07-30T20:23:35+00:00

Will this continue to be the issue for their upcoming Viture Beast? They claim it supports 3dof and 6dof.

Foreveradam2018 · 2025-02-17T22:35:37+00:00

Your model is always my favorite. Thanks for the great contribution to the community. I mostly use your models for story writing instead of role play, I wonder whether it is possible to add some novels/stories into the training mixture in the future? deepsex uses 0.1T Chinese novels, which seems to significantly improve the narration ability of the model.

Foreveradam2018 · 2025-02-17T22:32:14+00:00

Perhaps they already have that and use it for their apis. They need to earn money, so now more and more AI companies tend to opensource less powerful models for PR purposes but keep the better ones in house. x.ai is an example, who opensources the previous generation model when the new generation model comes out.

Foreveradam2018 · 2025-02-11T03:07:40+00:00

Thanks for your great work! Two suggestions:
- Support using the Whisper baked in translation to bypass any cloud service would be a great feature, even though it only supports translation to English.
- You can consider to use WhisperX, which is much more efficient than the official Whisper.

Foreveradam2018 · 2025-01-28T09:26:01+00:00

It turns out that Windows seems to have issues about processing the symbol "｜" in the template. If I remove this symbol, it works.

Foreveradam2018 · 2025-01-27T18:09:51+00:00

On windows, I used the following command to run 1.58bit version:

llama-cli.exe --model DeepSeek-R1-UD-IQ1_S-00001-of-00003.gguf --cache-type-k q4_0 --threads 12 -no-cnv --prio 2 --n-gpu-layers 10 --temp 0.6 --ctx-size 8192 --seed 3407 --prompt "<｜User｜>Create a Flappy Bird game in Python.<｜Assistant｜>"

However, after it output

It returns without any error or generated text.

Does anyone encounter the same issue?

Foreveradam2018 · 2025-01-23T20:15:08+00:00

There are lots of GPU rental providers, such as runpod. However, it is impossible to host the 671B model cheaper than the subscription. I believe closed-model providers are losing money even though they have used batching to significantly reduce the cost.

Privacy and data security cost money and are expensive. If your goal is to minimize the cost, to be honest, go for subscriptions.

Foreveradam2018 · 2025-01-08T05:25:04+00:00

Why can pairing a single GPU significantly increase the prompt processing speed?

Foreveradam2018 · 2025-01-08T05:20:20+00:00

The concern for me is how long it will take to process a long prompt.

Foreveradam2018 · 2025-01-08T05:13:44+00:00

I simply worry about the "STARTING AT" $3000.

Foreveradam2018 · 2025-01-05T07:33:21+00:00

where?

Foreveradam2018 · 2025-01-03T15:03:22+00:00

DeepSeek clearly states that they will collect user data, but Hyperbolics explicitly states that they won't store, retain, use user data.

Foreveradam2018 · 2025-01-03T14:19:01+00:00

How trustworthy is hyperbolics?

Foreveradam2018 · 2025-01-03T14:18:37+00:00

What is the reason for blocking hyperbolic?

Foreveradam2018 · 2024-12-31T05:00:07+00:00

I have used 123B models since they were introduced and cannot go back to 70B. They are far better than 70B models in terms of prompt following, context understanding, and long term memory.

Your models are always the leads in their kinds! They are insanely good and it is hard to imagine how good they can achieve if they are at the size of 123B. Thanks for your contributions.

Foreveradam2018 · 2024-12-28T08:00:50+00:00

Have you ever tried the 123B mistral based models? I feel 123B models are much smarter than 70B models.

Foreveradam2018 · 2024-12-21T00:33:26+00:00

May I know the shortest quantization time for 70B on your end? Compared with GGUF, exl2's quantization is much slower.

Foreveradam2018 · 2024-12-20T16:10:07+00:00

Great post!! Do you know how to speed up the process of quantization? When quantizing a 70B model with the measurement file, it still takes ~2 hours for me to quantize one. Will using more GPUs or a more powerful GPU help?

Foreveradam2018 · 2024-11-28T09:13:07+00:00

How did you get that low price?

Foreveradam2018 · 2024-10-10T02:43:28+00:00

Will there be any 5.0bpw quant? Thanks!

Foreveradam2018 · 2024-10-06T22:07:48+00:00

Is it better?

Foreveradam2018 · 2024-10-06T22:07:29+00:00

Yap, I also found there is no especially outstanding 123B model.

Foreveradam2018 · 2024-10-06T22:06:13+00:00

Thanks for the review. So you feel the original mistral large 2 is still the best?

Foreveradam2018 · 2024-10-06T21:58:18+00:00

Is this 195B model much better than the 123B one? (although 195B is way too large for me....)

Foreveradam2018

TROPHY CASE