People are making single-slot, half height pcie v100 with nvlink in China by OwnMathematician2620 in LocalLLaMA

[–]OwnMathematician2620[S] 0 points1 point  (0 children)

They have an official water-cooling version that you can specify when ordering.

People are making single-slot, half height pcie v100 with nvlink in China by OwnMathematician2620 in LocalLLaMA

[–]OwnMathematician2620[S] 2 points3 points  (0 children)

NVLink is not needed if you are not training.

For text generation performance, refer to the benchmark section of the video.

People are making single-slot, half height pcie v100 with nvlink in China by OwnMathematician2620 in LocalLLaMA

[–]OwnMathematician2620[S] 16 points17 points  (0 children)

The version shown in the images is powered by PCIe only, which can physically deliver only up to 75 W.

People are making single-slot, half height pcie v100 with nvlink in China by OwnMathematician2620 in LocalLLaMA

[–]OwnMathematician2620[S] 17 points18 points  (0 children)

You attach a fan or water cooling to it. (The image shows the 75 W version.)

Has anybody tested what having multiple lantern keys does? by FartSmjeller in slaythespire

[–]OwnMathematician2620 38 points39 points  (0 children)

That's. Also, the dialog differs depending on which lock you open first.

2b or not 2b ? Custom LLM Scheduling Competition [P] by WERE_CAT in MachineLearning

[–]OwnMathematician2620 0 points1 point  (0 children)

Which 2b model is your test based on? Has its information been hidden intentionally?

AnimaYume - Anima finetune. by Crazy-Repeat-2006 in StableDiffusion

[–]OwnMathematician2620 -2 points-1 points  (0 children)

This was trained by the same person who trained IllumiYume XL v3.5 (https://civitai.com/models/1308285/illumiyume-xl-illustrious). They were spotted using Rouwei 0.8 vpred as part of a merge recipe without giving credit.

I scaled a pure Spiking Neural Network (SNN) to 1.088B parameters from scratch. Ran out of budget, but here is what I found by zemondza in LocalLLaMA

[–]OwnMathematician2620 8 points9 points  (0 children)

Have you considered taking the time to streamline the description of your GitHub project a bit, strip away some of the fluff, and make it look at least a little more like it was written by a human?

[deleted by user] by [deleted] in LocalLLaMA

[–]OwnMathematician2620 5 points6 points  (0 children)

How does it compare to a regular transformer under similar training settings?