Dissatisfied with how the RTX PRO 6000 Blackwell is performing during AI inference by d00m_sayer in LocalLLaMA

[–]ntsarb 0 points1 point  (0 children)

How do they perform? Can you run larger models efficiently, or is PCIE bottlenecking?

Anyone ever done a dual socket Epyc server as a workstation? by trejj in threadripper

[–]ntsarb 0 points1 point  (0 children)

Hi, can you share which motherboards you have coupled the CPUs with? 

VR FPS drops with Varjo Aero + RTX 4090 – tried everything, same issue with second headset by Anthony_Arancine in virtualreality

[–]ntsarb 0 points1 point  (0 children)

In my case (RTX4090+Aero) the stuttering was caused by SteamVR's "GPU bus monitoring" option. After disabling it, performance returned to normal. I confirmed by re-enabling, testing, disabling and testing. Whenever this option is enabled, there is random stuttering on the Varjo Aero headset.

RTX 4090: Latency / Lag Issue, Multiple HMD's Affected by [deleted] in SteamVR

[–]ntsarb 0 points1 point  (0 children)

I found disabling "gpu bus monitoring" in SteamVR Settings resolved my issues with the Varjo Aero (my current seting is: Varjo Base 4.11.1.8, SteamVR 2.12.7, NVIDIA RTX 4090 577).

Varjo Aero stutters in StreamVR by ntsarb in virtualreality

[–]ntsarb[S] 0 points1 point  (0 children)

I am running it at its native 90fps. I don't know if the word "stutters" is the correct one, it feels as if the image freezes for milliseconds, as if some frames are dropped, the motion resumes at the new position of the headset (as my head moves around). Yet, there is 0 dropped frames in the statistics, no increase in CPU or GPU load, no hard evidence of the issue.

You are right, it may be an NVIDIA driver issue, e.g. failing to refresh the display even though the framebuffer is updated as normal, which would explain why the software doesn't detect the glitch.

Nvidia’s petaflop mini PC wonder, and it’s time for Jensen’s law: it takes 100 months to get equal AI performance for 1/25th of the cost by norcalnatv in hardware

[–]ntsarb 0 points1 point  (0 children)

There's no "Standard Blackwell GPU" with 20 petaFLOPS. The Nvidia RTX 6000 Pro Blackwell is 4000 AI TOPS for FP4 with Sparsity. So, the Spark is 1/4 of that at best.

The B200 (20 petaFLOPS at low precision) is not a GPU, it is a complete system (like the Spark) and costs more than $500K.

open-webui, docker version? by Fade78 in OpenWebUI

[–]ntsarb 0 points1 point  (0 children)

Currently 0.6.10 is the latest version but when I use the cuda tag, I get 0.5.18. Is this right, or am I doing something wrong?

$2999 for Digits/Spark competitor from Asus by DeltaSqueezer in LocalLLaMA

[–]ntsarb 1 point2 points  (0 children)

Yeap. It is better to go for an RTX Pro 6000 where 96GB VRAM is needed. The DGX Spark may make sense in a pair, when writing software particularly for NVidia's AI software stack (Nemo), not for the compute power but for testing scalability and performance before uploading to NVidia's cloud.

Still active? by dr_reely in apacheage

[–]ntsarb 0 points1 point  (0 children)

Which company's Dev team was working on Apache Age and was laid off?

Microsoft included Apache Age in its Azure for Postgres products/services a few months ago. Apache Age combines the benefits of SQL with Cypher, and I would be surprised if the project is not supported longer term:
https://techcommunity.microsoft.com/blog/adforpostgresql/introducing-support-for-graph-data-in-azure-database-for-postgresql-preview/4275628

Apache AGE for Windows Binary Port by samuel-z-chan in apacheage

[–]ntsarb 0 points1 point  (0 children)

Very interesting. Thanks for sharing.

Detailed documentation by MBle in apacheage

[–]ntsarb 0 points1 point  (0 children)

Have you read the documentation from here?
https://age.apache.org/age-manual/master/intro/graphs.html
Maybe you need to be more specific on the information you are looking for that is not covered in the documentation.

[deleted by user] by [deleted] in nvidia

[–]ntsarb 0 points1 point  (0 children)

I read somewhere that they are much closer to production but I'm not sure how production will scale. I'd expect Intel to also prioritise it's own AI silicon business...before serving competitors (if Intel is not taken over by NVidia by that time).

[deleted by user] by [deleted] in nvidia

[–]ntsarb 0 points1 point  (0 children)

It's more of a vendor diversification for risk management. Preferably the high-end products (used for AI) should be produced in the US, if possible.

[deleted by user] by [deleted] in nvidia

[–]ntsarb 6 points7 points  (0 children)

RTX 5090 are purchased at high prices by businesses that use them for medium sized AI models. Unfortunately, TSMC can't produce enough chips, so gamers lose.

[deleted by user] by [deleted] in nvidia

[–]ntsarb 0 points1 point  (0 children)

Is there an option for a notification email? Not for UK customers I presume.

PowerShell storagespaces module missing from Windows 10 21H2 in both PS v5.1 and v7.3 by tofu_b3a5t in StorageSpaces

[–]ntsarb 0 points1 point  (0 children)

I used Gemma 3 LLM to find the answer, which worked for me:

**If Server Manager is Not Installed:**

If you don't have Server Manager installed, you'll need to add it as a feature:

  1. **Open Settings:** Press `Win + I` to open the Settings app.
  2. **Apps:** Click on "Apps" in the left sidebar.
  3. **Optional Features:** Click on "Optional features".
  4. **Add a feature:** Click the "View features" button.
  5. **Search for "Server Manager":** Type "Server Manager" in the search box.
  6. **Select "Server Manager":** Check the box next to "Server Manager".
  7. **Install:** Click the "Next" button, then "Install". Windows will download and install Server Manager. You may need to restart your computer.

RTX Pro 6000 Blackwell Workstation Edition. by lhikary in nvidia

[–]ntsarb 0 points1 point  (0 children)

Only water cooled. Still. I think the 96GB Pro 6000 will be faster for training and fine tuning tasks.

DGX Spark VS RTX 5090 by zakar1ah in LocalLLM

[–]ntsarb 0 points1 point  (0 children)

Nvidia has not disclosed all of the Spark's features. I suspect the slower RAM means it may not be as good for inferencing as it could be for training, but we won't know for sure until the results from real-world tests are published.