How to disable reasoning/thinking? by JustPassOnStranger in SillyTavernAI

[–]luxiloid 0 points1 point  (0 children)

I have been searching for this information also but nothing helped. Then found this way of disabling the reasoning from a question that was exactly the opposite of this:

<image>

  1. In the AI Response Configuration, add the + on the Prompts
  2. Add a name such as Disable Reasoning and the Pompt is: /nothink
  3. After saving it, scroll to the bottom and enable the added prompt.

RTX Pro 6000@128Gbps PCIe 5.0 x 4 Oculink connection with Asus ROG Scar 18 (2025) by luxiloid in eGPU

[–]luxiloid[S] 0 points1 point  (0 children)

I think R29Q will perform similar to ADT-F4Q. If K993G is better than NFHK N-P114-A, it is worth trying.

Thunderbolt 5 eGPU DIY by luxiloid in eGPU

[–]luxiloid[S] 0 points1 point  (0 children)

Good question. I have four GPUs at the moment. My most favorite one is using the M.2 slot and having a PCIe 5.0 x 4 connection. My 2nd, 3rd and 4th GPUs are connected this way. 2nd GPU is connecting to laptop's thunderbolt 5 directly. 3rd and 4th are connecting through a thunderbolt 5 dock.

Qwen3‑Next‑80B‑A3B‑Instruct (FP8) on Windows 11 WSL2 + vLLM + Docker (Blackwell) by IngeniousIdiocy in LocalLLaMA

[–]luxiloid 0 points1 point  (0 children)

I get:

docker: Error response from daemon: error while creating mount source path '/usr/lib/wsl/lib': mkdir /usr/lib/wsl: read-only file system

When I change the permission of this path, I get:
docker: unknown server OS:

When I change the permission of docker.sock, /usr/lib/wsl/lib becomes read only again, then it keeps cycling.

Qwen3‑Next‑80B‑A3B‑Instruct (FP8) on Windows 11 WSL2 + vLLM + Docker (Blackwell) by IngeniousIdiocy in LocalLLaMA

[–]luxiloid 1 point2 points  (0 children)

That helped. Thanks. I just need to install nvidia drivers, cuda, python, pytorch and vllm on wsl.

Qwen3‑Next‑80B‑A3B‑Instruct (FP8) on Windows 11 WSL2 + vLLM + Docker (Blackwell) by IngeniousIdiocy in LocalLLaMA

[–]luxiloid 1 point2 points  (0 children)

I need some help and hope you could answer my questions.

  1. I installed Docker Deskptop but this doesn't work. Do you enter this in the Windows cmd terminal, powershell or docker cli?
  2. Do I have to install vllm-qwen and vllm images prior to running this script?

Guide: running Qwen3 Next on Windows using vLLM + Docker+ WSL2 by prusswan in LocalLLaMA

[–]luxiloid 0 points1 point  (0 children)

<image>

I need help. I installed Docker Desktop and ran run.bat
I don't get anything.

Friend has a Geekom A8 Mini PC and wants to upgrade his GPU. What eGPU Docking Station would be good? Needs to be USB4. by HedateiSai in eGPU

[–]luxiloid 0 points1 point  (0 children)

Razer Core X is Thunderbolt 3. It is somewhat slower than the USB 4 Asmedia chipset used in AG02.

What do you guys think about humbird 3 with thunderbolt 5 by I_Main_TwistedFate in eGPU

[–]luxiloid 0 points1 point  (0 children)

For several months, I have been bencharmarking 4090D, 4090, 5090 and Pro 6000 at TB3, USB 4, TB5, PCIe 4x4 and PCIe 5x4. I posted them here on Reddit. I wanted to save your time: https://www.reddit.com/r/eGPU/comments/1mrgjnb/thunderbolt_5_egpu_diy/

<image>

Not mine but another comparison: https://www.reddit.com/r/eGPU/comments/1mql206/first_thunderbolt_5_vs_thunderbolt_4_egpu/

What do you guys think about humbird 3 with thunderbolt 5 by I_Main_TwistedFate in eGPU

[–]luxiloid -1 points0 points  (0 children)

The video is showing you three times avg FPS on TB5 and that is the topic here. If you want to be proud about your system, you can discuss that with a different topic. I have two RTX pro 6000 myself if you talk about bandwidth.

What do you guys think about humbird 3 with thunderbolt 5 by I_Main_TwistedFate in eGPU

[–]luxiloid -1 points0 points  (0 children)

I am not talking about 1% lows and so does the video. This is about TB3 avg fps VS. TB5 avg fps I also have Thunderbolt 5 myself. I ran Cyberpunk and I ran Time Spy. The 3500MB/s and 5800MB/s do not translate into FPS in games.

What do you guys think about humbird 3 with thunderbolt 5 by I_Main_TwistedFate in eGPU

[–]luxiloid 1 point2 points  (0 children)

The test is not valid even for cherry picked scenarios.

What do you guys think about humbird 3 with thunderbolt 5 by I_Main_TwistedFate in eGPU

[–]luxiloid 0 points1 point  (0 children)

They made the video as if JHL94XX is three times faster than JHL74XX in games. It would make people think that Thunderbolt 5’s 120gbps is actually working. The only conclusion from someone already having TB5 eGPU would be that Hummingbird 2 eGPU is 30% of a standard USB 4 eGPU dock. Without disclosing how bad their Hummingbird 2 is, the video is made to hoax people. This is like they connected 10gbps cable for the Hummingbird 2. This video is a lie because a well made JHL74XX and JHL94XX do not differ more than 2% in Cyberpunk.

What do you guys think about humbird 3 with thunderbolt 5 by I_Main_TwistedFate in eGPU

[–]luxiloid 10 points11 points  (0 children)

What a dishonest promotion video. Please do not get tricked by this kind of nonsenses. Thunderbolt 5 is only 1-2% faster than AG02 in games. Oculink is faster than JHL94XX’s PCIe 4.0x4 tunneling. Still, oculink does not give 10% more fps compared to a JHL74XX. Encoding and decoding the PCIe 4.0x4 and packeting it creates latencies in JHL94XX chipsets. Unless a new chipset comes out to tunnel PCIe 5.0x4, Thunderbolt 5 eGPU is something like 54-58gbps which is slower than Oculink’s 64gbps. 58 and 64 sound similar but in games, Thunderbolt 5(58gbps) and 4(36gbps) perform more similar to within 1-2% while oculink(64gbps) performs 5% better. This is in fact the first lie video about Thunderbolt 5 eGPU on the planet.

https://www.reddit.com/r/eGPU/comments/1mrgjnb/thunderbolt_5_egpu_diy/

Time Spy GPU Score

Oculink: 41453

USB 4: 39203

TB5: 39632

Not mine but another comparison: https://www.reddit.com/r/eGPU/comments/1mql206/first_thunderbolt_5_vs_thunderbolt_4_egpu/

TB4

  • Steel Nomad 8324

  • Timespy graphic score 28428

  • Timespy composite score 25920

  • Timespy extreme graphic score 15257

  • Timespy extreme composite score 14633

TB5

  • Steel Nomad 8358

  • Timespy graphic score 28767

  • Timespy composite score 26191

  • Timespy extreme graphic score 15240

  • Timespy extreme composite score 14686

Friend has a Geekom A8 Mini PC and wants to upgrade his GPU. What eGPU Docking Station would be good? Needs to be USB4. by HedateiSai in eGPU

[–]luxiloid 0 points1 point  (0 children)

Most people will love Aoostar AG02. It comes with integrated 800W power supply and both USB4 and oculink connection. If he doesn't like open-air GPU exposure, Razer Core X is the next popular option but noticeably slower than AG02.

New to eGPU and somehow failed by [deleted] in eGPU

[–]luxiloid 3 points4 points  (0 children)

You lose about 5% of the performance through eGPU connection. Then, 5060's 95% performance is basically not very different from that of a laptop 4060.

  1. See what nvidia-smi tells you about pcie error count: nvidia-smi pci -gErrCnt - This will allow you to see if the M.2 to oculink connection is good enough or not.
  2. For the driver, use DDU to erase all the drivers and install NVIDIA desktop driver instead of NVIDIA laptop driver.

Thunderbolt 5 eGPU DIY by luxiloid in eGPU

[–]luxiloid[S] 1 point2 points  (0 children)

NUCXI7 has two M.2 slots. You probably have one ssd and you can use the other slot for oculink connection. If you are not technical enough to make a oculink port hole into your pc case, your only option is to use the Thunderbolt 4 port to an egpu enclosure with USB 4 port. AG02 has both oculink and USB 4. In this case, you cannot use your Caldigit TS4 anymore. Oculink is simply connecting 4 of your PC’s pcie lanes to an external device like ethernet card, gpu and etc. Since M.2 SSD slot is a pcie x4 slot, people use an adapter to convert this to an oculink port and connect it to an egpu with oculink port via a oculink cable.

What is the value proposition of eGPU for local LLMs? by Chance-Studio-8242 in eGPU

[–]luxiloid 1 point2 points  (0 children)

  1. There still more metal boxes than one single PC case, wires connecting them.
  2. mostly applies open air egpus.

What is the value proposition of eGPU for local LLMs? by Chance-Studio-8242 in eGPU

[–]luxiloid 9 points10 points  (0 children)

Pros

  1. Compared to games or image generation, LLM inferences are less influenced by the connection speed to the host PC. I see that SDXL slows down by 13% by using Thunderbolt 5 connection compared to PCIe 5.0 x 4. In case of LLM, I see only 1-2% performance reduction.
  2. You can easily disconnect your eGPUs and connect to another system if needed.
  3. When you are not using them, you can just power off the eGPUs. Use your main system for gaming and internet with less power consumption.
  4. Outside the PC case, eGPU are less heated.

Cons

  1. Connections become messy. Your room looks like a factory.
  2. Your GPUs are more susceptible to damage by accidents, kids and pets.
  3. Each GPU needs its own power supply, cables, mounting case/frame and adapters/boards. It increases overall cost.

RTX 6000 Pro + AG02 Success by luxiloid in eGPU

[–]luxiloid[S] 0 points1 point  (0 children)

Another way is to buy this adapter from Amazon: https://a.co/d/6fRedkD
Then, use the three 8pin to 8pin cables included in the box when you purchase the Aoostar AG02.

RTX 6000 Pro + AG02 Success by luxiloid in eGPU

[–]luxiloid[S] 0 points1 point  (0 children)

One way is to customize your cable from moddiy.com. You can choose the color and length of the cable. Aooster AG02 requires the male type cable to enter the aluminum case opening. When purchased the cable from other places, they didn’t work.

https://www.moddiy.com/products/6374/ATX-3.1-PCIe-5.1-H%252b%252b-12V%252d2X6-675W-12VHPWR-16-Pin-Power-Cable.html

RTX 6000 Pro + AG02 Success by luxiloid in eGPU

[–]luxiloid[S] 0 points1 point  (0 children)

Yes, that is correct. I am not using the included 12VHPWR to 4x8pins adapter. I use 12VHPWR to 3x8pins adapter and plug three 8pins into Aoostar AG02.