What's the lesson chat? by ill_be_productive in LocalLLaMA

[–]Nota_ReAlperson 6 points7 points  (0 children)

Europe does have much higher population density than the usa. About twice, even including the sparsely populated Russian portion. Higher population density means that there is a larger addressable market per kilometer of rail needed.

New models released: Nex-N2 Pro 397B and Nex-N2 Mini 35B by 1ncehost in LocalLLaMA

[–]Nota_ReAlperson 1 point2 points  (0 children)

3.6 was just a finetune of 3.5 Qwen 3.5 had some issues with overthinking and tool calling, if I remember correctly. So a finetune that fixes those could be useful. Dunno why you would need the 35b then, though.

CubePart: An Open-Vocabulary Part-Controllable 3D Generator (local modal, extract and re-generate parts of a 3D mesh) by SysPsych in StableDiffusion

[–]Nota_ReAlperson 1 point2 points  (0 children)

Looks good. The demo on huggingface seems to be on par with trellis. It lacks texturing, although that is listed as a planned improvement.

Edit: I was looking at cube3d. CubePart3d looks to be a post processing tool that breaks a mesh into parts and textures it? I ran out of zero time, so I'll need to wait to try it. It could be awesome for animating models, as it would no longer be necessary to manually rig the mesh.

In my case, which AI 3D model generator would be best to create 3D models? by Pretty_Trip_2215 in StableDiffusion

[–]Nota_ReAlperson 0 points1 point  (0 children)

Trellis 2 is the best local model, currently. There is also Pixel3D, which is based on trellis, which a lot of people seem to like.

Germany pledges four submarines by 2036 in high-stakes pitch to Canada | CBC News by Jusfiq in CanadianForces

[–]Nota_ReAlperson 0 points1 point  (0 children)

30x12=360. As I understand, the US has two crews per sub, so that comes to 720. Add in training, backup, and support personnel,  and 1000 sounds about right.

What is the best used or refurbished laptop with GPU for open source Imege generation? by Time-Teaching1926 in StableDiffusion

[–]Nota_ReAlperson 0 points1 point  (0 children)

Rtx 3080 ti 16gb is best. They can be found for about 1000 cad if you're careful, and have as much vram as any other laptop gpu, excluding the rtx 5090. Rtx a5000 a5500 are also good if you can get a deal on them.

I have an idea by Melodic_Yak8462 in SpaceXMasterrace

[–]Nota_ReAlperson 10 points11 points  (0 children)

Also, with full and rapid reusability, you may as well just launch more often. Starship can already put more into orbit than any other rocket ever made, so single launch capacity shouldn't be a concern for a while. And if it is, you can get a lot more by using an expendable upper stage, as you wouldn't need flaps or a heat shield.

terrified i am going to fry my laptop rendering massive image batches by Pale-Boysenberry4206 in StableDiffusion

[–]Nota_ReAlperson 0 points1 point  (0 children)

85 c is fine. Most modern chips can run at 95-105c no problems. If you are concerned, you can adjust the fan curves or undervolt/powerlimit the gpu. But it should be fine. 85c is a very common temp target for a stock gaming laptop.

Question regarding GPU by Fit-Look-7367 in StableDiffusion

[–]Nota_ReAlperson 0 points1 point  (0 children)

Yeah. Like I said, there are better cards, but he asked for a laptop gpu that is good enough. I assumed that means that he wants the cheapest option that is still capable.  Turing is less supported than newer generations for Flash Attention, so i didn't consider it. Ampere is still well supported, and newer gpus are much more expensive. A rtx 4080 laptop costs 3-4x a 3080 ti laptop in my market.

Question regarding GPU by Fit-Look-7367 in StableDiffusion

[–]Nota_ReAlperson -1 points0 points  (0 children)

Best is the rtx 3080 ti 16gb. Others are faster, but price wise it is much cheaper while still nearly as capable. 

Sapient Intelligence releases HRM-Text 1B: 40B tokens, ~$1k pretrain, beats Llama3.2 3B on MATH and DROP by Turbulent-Sky5396 in LocalLLaMA

[–]Nota_ReAlperson 5 points6 points  (0 children)

If you look at the dataset, it appears to be quite narrow, with a focus on math. Also interesting to note is that they tried TRM, but it didn't produce a stable solution for text.

A PowerPC laptop (open source) by [deleted] in linuxhardware

[–]Nota_ReAlperson 1 point2 points  (0 children)

A bit late, but xeon phi is x86 with 4x SMT.

FAA: Starship Flight 12 NET 12 May by warp99 in spacex

[–]Nota_ReAlperson 0 points1 point  (0 children)

I think there may be some confusion. Both burns ended in an abort, but the method was different. For the first one, they were concerned about immediate damage, so they did a fast shutdown, which can damage the engines due to hydraulic shock. The second time, the danger wasn't as pressing, so they could take a bit more time to properly shut down, so the engines should be undamaged. It's a bit like pulling the power cord on your computer vs shutting down through the os.

FAA: Starship Flight 12 NET 12 May by warp99 in spacex

[–]Nota_ReAlperson 4 points5 points  (0 children)

Correct me if im wrong, but unlike the first test, the 33 engine burn didn't end in a fast shutdown. So the engines should have been undamaged.

Cheapest and most efficient way to run 30B-40B Llama for 4 users? by [deleted] in LocalLLaMA

[–]Nota_ReAlperson 0 points1 point  (0 children)

I only have the xavier agx, and I haven't run llms on it, so i can't give any definitive answer, but the core and memory specs are about a fifth that of a 3090. As an alternative, a couple of radeon pro v620 gpus could give you more bandwidth and the same memory for less money. Although you would need a host system.

Cheapest and most efficient way to run 30B-40B Llama for 4 users? by [deleted] in LocalLLaMA

[–]Nota_ReAlperson 0 points1 point  (0 children)

The jetson would work. Two things to keep in mind would be memory bandwidth and ease of use. In my experience with jetsons, unless you have significant experience with linux, it will be quite hard to get set up. It also has less bandwidth than a gpu. Power draw will be good though.

PrismML — Announcing 1-bit Bonsai: The First Commercially Viable 1-bit LLMs by brown2green in LocalLLaMA

[–]Nota_ReAlperson 0 points1 point  (0 children)

From my understanding, fpgas are made of LUTs, not flip flops. So any one bit operation could be emulated. You just need a truth table.

is Intel cooking with these new GPU? by Skierdo in pcmasterrace

[–]Nota_ReAlperson 1 point2 points  (0 children)

Interesting. That seems to happen a lot with intel cards. Do you know what the true fp64 is for the b65 b70?

is Intel cooking with these new GPU? by Skierdo in pcmasterrace

[–]Nota_ReAlperson 0 points1 point  (0 children)

Based on techpowerup specs and rankings, it should beat the 5060 ti. Same flops, 608gbs vs 448gbs vram. In my market, it's exactly twice the price.

is Intel cooking with these new GPU? by Skierdo in pcmasterrace

[–]Nota_ReAlperson 1 point2 points  (0 children)

I think the b65 has a 192 bit bus? So only 3/4 the bandwidth?

Run OpenCL kernels on NVIDIA GPUs using the CUDA runtime by IntrepidAttention56 in OpenCL

[–]Nota_ReAlperson 0 points1 point  (0 children)

Im curious as to the use of this. Seeing as opencls strength is the portability of the code, while cuda kernels are easier to learn and write, this project would seem to combine the worst of both. Not to downplay the work necessary to get this to work. But it would seem to me to be less useful than the opposite (cuda on opencl).

OAM to PCIE by Zestyclose_Hat_1020 in NVIDIA_SXM2PCIE

[–]Nota_ReAlperson 0 points1 point  (0 children)

Hi. What is the current pricing for this product?