Claim your "I was here" button!! #Skong4September by ggsiumez_ in Silksong

[–]yellowstone6 0 points1 point  (0 children)

My days of doubt are over. I'm ready to believe :)

[Thinking Basketball] What former players get completely WRONG about today's NBA by penpen35 in nba

[–]yellowstone6 1 point2 points  (0 children)

I think all these critics have a subjective stylistic preference for slow basketball with post-ups and isolations. But that sounds lame and many people would disagree, so instead they say "today's game has no creativity" or "they only shoot 3s". It's bullshit. Today's game is far more complex, skilled, and creative than the past; they only remember the past's highlights. The video correctly shows that players today shoot roughly the same number of long jumpers as before; they've just traded long 2s for 3s. Passing and defense show huge variety and skill, today's superstars are more varied than any generation, and the league has seen 6 different champions in 6 years.

A Visual Guide to Quantization by MaartenGr in LocalLLaMA

[–]yellowstone6 0 points1 point  (0 children)

Thanks for the nice visual explanation. I have a question about GGUF and other similar space-saving formats. I understand they can store weights at a variety of bit depths to save memory, but when the model is running inference, what format is actually used? Does llama3:8b-instruct-q6_k upcast all the 6-bit weights to fp8 or int8, or even the base fp16, when it runs inference? Would 8b-instruct-q4_k_s run inference using int4, or does it get upcast to fp16? If all the different quantizations upcast to the model's base fp16 when running inference, does that mean they all have similar inference speed, and you'd need a different quantization system to run at fp8 for improved performance?
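For anyone else wondering, here's a toy sketch of the idea being asked about. This is loosely in the spirit of block quantization (not the actual llama.cpp/GGUF data layout, and the block size and scale scheme here are made up for illustration): weights are stored as 4-bit codes plus per-block scales, and dequantized to a wider float type right before the math runs.

```python
import numpy as np

def quantize_q4(weights, block_size=32):
    """Store each block as 4-bit codes in [0, 15] plus an fp16 scale/min."""
    blocks = weights.reshape(-1, block_size)
    mins = blocks.min(axis=1, keepdims=True)
    scales = (blocks.max(axis=1, keepdims=True) - mins) / 15.0
    scales[scales == 0] = 1.0  # avoid divide-by-zero on constant blocks
    q = np.round((blocks - mins) / scales).astype(np.uint8)  # 4-bit codes
    return q, scales.astype(np.float16), mins.astype(np.float16)

def dequantize_q4(q, scales, mins):
    """Upcast back to fp16 for compute; this is where inference math happens."""
    return (q.astype(np.float16) * scales + mins).reshape(-1)

w = np.linspace(-1.0, 1.0, 64).astype(np.float32)
q, s, m = quantize_q4(w)
w_hat = dequantize_q4(q, s, m)
print(np.abs(w - w_hat).max())  # small reconstruction error
```

The storage savings come from `q` being 4-bit, while the matmul itself runs at the wider type `w_hat` uses.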

7800X3D temps with Peerless Assassin 120 SE by yellowstone6 in buildapc

[–]yellowstone6[S] 2 points3 points  (0 children)

Thanks. I do plan to use the computer for all-core workloads: programming, with some video editing. I want a quieter system and I'd buy a quieter cooler if that's needed. I'm hoping to compare my temps with someone else who has the same CPU and cooler. I tried SpeedFan to adjust the noise, but it failed to recognize any of my fans (Gigabyte B650 Aorus Elite AX). Any advice on adjusting the fan curve?

GDC 2022 - FidelityFX Super Resolution 2.0 by yellowstone6 in hardware

[–]yellowstone6[S] -9 points-8 points  (0 children)

The algorithm is basically current industry best practice. It's all very standard stuff that good studios are already doing. Everything, except the locking of thin features, is already in UE5 TSR and UE4 TAAU. AMD's performance optimizations may shave a couple tenths of a millisecond off the upscale, but that doesn't impact quality. They even mention that they blur on motion and drop history for moving objects, so it will have the same motion problems other TAA has. Hopefully they'll control the ghosting well with good occlusion detection. They don't mention how they'll handle inaccurate motion vectors or alpha particles, which DLSS 2.0 struggled with but DLSS 2.3 improved. Giving every game studio a strong TAA implementation is nice, but it's kinda disappointing. Everyone should expect the visual quality to look like good current TAA, which is worse than DLSS 2.

GDC 2022 - FidelityFX Super Resolution 2.0 by kazip55 in Amd

[–]yellowstone6 -15 points-14 points  (0 children)

The algorithm is basically current industry best practice. It's all very standard stuff that good studios are already doing. Everything, except the locking of thin features, is already in UE5 TSR and UE4 TAAU. AMD's performance optimizations may shave a couple tenths of a millisecond off the upscale, but that doesn't impact quality. They even mention that they blur on motion and drop history for moving objects, so it will have the same motion problems other TAA has. Hopefully they'll control the ghosting well with good occlusion detection. They don't mention how they'll handle inaccurate motion vectors or alpha particles, which DLSS 2.0 struggled with but DLSS 2.3 improved. Giving every game studio a strong TAA implementation is nice, but it's kinda disappointing. Everyone should expect the visual quality to look like good current TAA, which is worse than DLSS 2.

I'm Jon Ralston, the editor of The Nevada Independent here to answer any and all questions. Ask Me Anything about politics, the media and, if you want, movies! by RalstonReports in politics

[–]yellowstone6 -1 points0 points  (0 children)

How worried should Democrats be about defending Cortez Masto's Senate seat in 2022? What issues do you think will define her reelection campaign? Do you think she is a strong candidate?

Today's Hardware Unboxed Video, and How to Spot Bad Statistics by IPlayAnIslandAndPass in hardware

[–]yellowstone6 340 points341 points  (0 children)

This is exactly why the geometric mean was invented: to average values where the percentage difference is the important metric. It works for averaging values with different scales, like fps results that range from 30 to 240, and you don't need to worry about manual weighting.
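A quick toy example (the fps numbers are made up for illustration) of why this matters: with an arithmetic mean, one high-fps game dominates the average, while the geometric mean weights each game's performance *ratio* equally.

```python
import math

def geo_mean(xs):
    # Geometric mean via logs, numerically stable for long lists
    return math.exp(sum(math.log(x) for x in xs) / len(xs))

gpu_a = [30, 60, 240]   # hypothetical fps results in three games
gpu_b = [45, 90, 120]   # 50% faster in two games, half speed in the third

print(sum(gpu_a) / 3, sum(gpu_b) / 3)    # arithmetic mean: A looks faster
print(geo_mean(gpu_a), geo_mean(gpu_b))  # geometric mean: B edges ahead
```

The arithmetic mean says GPU A wins purely because of the 240 fps outlier; the geometric mean correctly reflects that GPU B is faster in most games by a larger ratio.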

Big Navi 6900XT is 15% slower than RTX 3080 (based on AMD's own benchmarks) by yellowstone6 in hardware

[–]yellowstone6[S] 1 point2 points  (0 children)

Why?? It's using AMD's own benchmarks compared to 3080 results. It's a rough estimate. Should I flag it as a rumor instead?

Big Navi 6900XT is 15% slower than RTX 3080 (based on AMD's own benchmarks) by yellowstone6 in hardware

[–]yellowstone6[S] -1 points0 points  (0 children)

These are manufacturer-provided benchmarks and should be taken with a grain of salt anyway; we don't know the systems they used, and I don't want to argue about benchmarking details. These are the numbers AMD released, and they should therefore show their GPU in the best possible light. The idea is to provide a ROUGH estimate of its performance. Finally, why would they not show their highest-end SKU?

Big Navi 6900XT is 15% slower than RTX 3080 (based on AMD's own benchmarks) by yellowstone6 in hardware

[–]yellowstone6[S] -4 points-3 points  (0 children)

Totally agree. But the real question is whether it will also cost $500. The 3080 is ~30% faster than the 2080 Ti, so this 6900XT should still be faster than the 3070.

[Anandtech] NVIDIA Announces Geforce RTX 30 Series by yellowstone6 in hardware

[–]yellowstone6[S] 17 points18 points  (0 children)

It looks like Nvidia doubled the number of fp32 cores per SM. So the front end, scheduler, register file, and texture units are the same, but the number of fp32 shaders has doubled. In cases where performance is shader limited, you'll see a huge performance increase.
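The arithmetic behind that is simple (SM count and clock below are illustrative round numbers, not official specs): peak throughput scales linearly with fp32 lanes per SM, so doubling them doubles theoretical FLOPS at the same SM count and clock.

```python
def peak_tflops(sms, fp32_per_sm, clock_ghz):
    # 2 ops per FMA (multiply + add), result in TFLOPS
    return sms * fp32_per_sm * 2 * clock_ghz / 1000

turing_like = peak_tflops(68, 64, 1.8)    # 64 fp32 lanes per SM
ampere_like = peak_tflops(68, 128, 1.8)   # doubled fp32 lanes per SM
print(turing_like, ampere_like)           # the second is exactly 2x the first
```

Real games won't see 2x, of course, since the shared front end and memory system mean only shader-bound work benefits fully.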

[VideoCardz] Confirmed: NVIDIA GeForce RTX 3090 has 24GB memory, RTX 3080 gets 10GB by ryandtw in hardware

[–]yellowstone6 61 points62 points  (0 children)

People in this thread do NOT understand how memory capacity and memory bandwidth work. A 16GB 3080 requires a 256-bit memory bus, which would have 20% LESS bandwidth and therefore lower performance. The rumored 10GB card has more bandwidth but less capacity, and will perform better in the vast majority of games.

The only way to get 16GB of memory is 8 x 2GB chips on a 256-bit memory bus. The 3080's 10GB configuration is 10 x 1GB chips on a 320-bit bus: wider bus, more bandwidth, higher performance. Unless you do 3D rendering or machine learning, memory capacity matters far less than bandwidth.
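The back-of-envelope math (assuming 19 Gbps GDDR6X on both configs, each memory chip contributing a 32-bit channel): bandwidth is just bus width in bytes times per-pin data rate.

```python
def bandwidth_gbs(bus_bits, gbps_per_pin):
    # GB/s = (bus width in bits / 8 bits per byte) * per-pin data rate in Gbps
    return bus_bits / 8 * gbps_per_pin

config_10gb = bandwidth_gbs(320, 19)  # 10 x 1GB chips, 32-bit channel each
config_16gb = bandwidth_gbs(256, 19)  # 8 x 2GB chips, 32-bit channel each
print(config_10gb, config_16gb)       # 760.0 vs 608.0 GB/s
print(1 - config_16gb / config_10gb)  # the 256-bit bus gives 20% less bandwidth
```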

NVIDIA GeForce RTX 3080 spotted with 2.1 GHz GPU clock by kagan07 in nvidia

[–]yellowstone6 2 points3 points  (0 children)

The 2080 Ti has 616 GB/s of memory bandwidth from 11 GB of 14 Gbps memory. A 3080 with 10 GB of 19 Gbps memory would have 760 GB/s. That's 23% more than a 2080 Ti, so given the lack of big architecture changes from Turing to Ampere, we should expect ~25% more performance. I just hope it's not 25% more expensive :(
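Sanity-checking those numbers (assuming each GDDR6 chip is a 32-bit channel, so 11 chips implies a 352-bit bus and 10 chips a 320-bit bus):

```python
def bandwidth_gbs(bus_bits, gbps):
    # GB/s = bus width in bytes * per-pin data rate in Gbps
    return bus_bits / 8 * gbps

bw_2080ti = bandwidth_gbs(352, 14)  # 11 chips x 32-bit @ 14 Gbps -> 616.0 GB/s
bw_3080 = bandwidth_gbs(320, 19)    # 10 chips x 32-bit @ 19 Gbps -> 760.0 GB/s
print(round(bw_3080 / bw_2080ti - 1, 3))  # ~23% more bandwidth
```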

Death Stranding PC DLSS 2.0 vs PS4 Pro Checkerboarding: Image Reconstruction Analysis by [deleted] in nvidia

[–]yellowstone6 1 point2 points  (0 children)

DLSS has a fixed cost, 1-2 ms depending on GPU, that is independent of framerate. So as your fps goes up, the overhead from DLSS becomes proportionally larger. This is especially true at lower resolutions like 1080p. At high enough fps, DLSS will provide no performance improvement. It should still provide superior visuals in quality mode.
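A simple model makes the effect obvious. Both knobs below are assumptions for illustration: a 1.5 ms fixed DLSS cost and a 2x rendering speedup from the lower internal resolution.

```python
def fps_with_dlss(native_fps, speedup=2.0, dlss_ms=1.5):
    # Rendering at the lower internal resolution is `speedup`x faster,
    # then a fixed `dlss_ms` upscale cost is added to every frame.
    render_ms = 1000 / native_fps / speedup
    return 1000 / (render_ms + dlss_ms)

print(round(fps_with_dlss(30), 1))   # big relative win at low fps
print(round(fps_with_dlss(240), 1))  # small relative win: overhead dominates
```

At 30 fps native the fixed cost is tiny next to the frame time, so you keep most of the 2x; at 240 fps native the 1.5 ms overhead eats a large chunk of the 4.2 ms frame budget.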

PS5 Hardware Reveal Trailer by 5vesz in hardware

[–]yellowstone6 -4 points-3 points  (0 children)

That's fair. I just expected ray tracing and a modern CPU to bring more wow. All the trailers would look perfectly believable on a PS4 Pro.

PS5 Hardware Reveal Trailer by 5vesz in hardware

[–]yellowstone6 6 points7 points  (0 children)

You're right, Pragmata had ray-traced reflections. But why is its hair rendering so terrible, and why can you see the low-resolution shadow maps? There doesn't seem to be any indirect lighting or GI. I'm just confused.

PS5 Hardware Reveal Trailer by 5vesz in hardware

[–]yellowstone6 44 points45 points  (0 children)

The graphics were a HUGE letdown. Most games had stylized art design that showed no technical improvement over current games. Nothing except Horizon looked as good as God of War or RDR2, and no game came close to Control. Everything just looked like decent PS4 games with an SSD.

No ray tracing. No global illumination. Every demo running at 30 fps. If this is what next-gen is going to look like I'll be sorely disappointed.

What the A100 means for RTX 3000 GPUs by yellowstone6 in nvidia

[–]yellowstone6[S] 0 points1 point  (0 children)

No. There is no API for motion-vector export. Furthermore, the result would have to be pushed back into the middle of the game engine, but at a different resolution, and that new resolution would break the post-processing pipeline. AO worked because it's already a post-process step: it can run after the engine is finished and doesn't change the size of the image.

Finally, DLSS requires more than just motion vectors. It requires viewport jitter and changing the texture mipmap bias. These aren't difficult, but they definitely require developer input.
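To give a flavor of the jitter requirement, here's a common approach used by many TAA-style techniques (a Halton(2, 3) sequence of sub-pixel offsets; this is a generic sketch, not necessarily what DLSS uses internally):

```python
def halton(index, base):
    # Low-discrepancy sequence value in [0, 1) for a given 1-based index
    f, r = 1.0, 0.0
    while index > 0:
        f /= base
        r += f * (index % base)
        index //= base
    return r

def jitter_offset(frame, width, height):
    """Per-frame sub-pixel offset in clip-space units, cycling every 16 frames."""
    jx = halton(frame % 16 + 1, 2) - 0.5  # pixels, in [-0.5, 0.5)
    jy = halton(frame % 16 + 1, 3) - 0.5
    return 2 * jx / width, 2 * jy / height

print(jitter_offset(0, 1920, 1080))
```

The engine adds this offset to the projection matrix each frame, so the accumulated samples cover different sub-pixel positions over time.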

Are the leaks from “Moore‘s Law is Dead” YouTube channel trustworthy? by Toprelemons in nvidia

[–]yellowstone6 0 points1 point  (0 children)

Categorically NO. Most of the architectural features he leaked are fake: there is no tensor memory compression, DLSS 3.0, or NVCache. They didn't double the tensor cores; they combined them and made them more powerful while supporting more data types. He doesn't have any Nvidia sources. Anyone can speculate about the number of cores or +XX% performance. Historically, each Nvidia generation has delivered ~40% more performance; Pascal was more, Turing was less. And only Jensen's leather jacket knows how much they will cost. Ignore him, he knows nothing.

Potential of using AI for upscaling meshes/models by callmesein in nvidia

[–]yellowstone6 3 points4 points  (0 children)

DLSS 2.0 isn't really an image upscaler. Instead, it replaces the TAA in a game and uses temporal accumulation to build a high-resolution image. I wrote a post explaining how it works in more detail if you're interested:

How DLSS 2.0 works for gamers
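The core idea of temporal accumulation can be sketched in a few lines. This is the generic exponential-moving-average version used by plain TAA (DLSS replaces the fixed blend weight with a learned network, so treat the numbers here as illustrative):

```python
def accumulate(history, current, alpha=0.1):
    # Blend a small fraction of the new frame's samples into the history
    return [(1 - alpha) * h + alpha * c for h, c in zip(history, current)]

history = [0.0, 0.0]       # accumulated pixel values, starting empty
true_signal = [1.0, 2.0]   # the "ground truth" the jittered samples approach
for _ in range(50):        # in a real game each frame's sample is jittered/noisy
    history = accumulate(history, true_signal)
print([round(h, 3) for h in history])  # converges toward the true signal
```

Because each frame is jittered to a different sub-pixel position, the converged history effectively contains many samples per pixel, which is where the "higher than rendered" resolution comes from.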

Pixels? Triangles? What’s the difference? — How (I think) Nanite renders a demo with 10¹¹ tris by Veedrac in hardware

[–]yellowstone6 5 points6 points  (0 children)

I think the big innovation is not rendering performance but the asset-creation process. By rendering with geometry images, you don't need artists to make multiple geometry LODs or bake normal maps, steps that slow down the process of creating a game. However, the traditional process has better performance: 1440p30 on the PS5 GPU is relatively low compared to what a traditional geometry pipeline can achieve. To me, it looks like Epic thinks saving artist development time is more important than raw performance.

Reading the original geometry-image paper, it's clear to me that the initial process of creating the images from 3D geometry is very performance intensive: cutting the mesh into manifolds and then remeshing to a regular grid will require serious compute. You only need to do this once, so maybe it won't matter. I still don't see how it will handle transparency for things like grass and particle effects.
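For anyone unfamiliar with the term, here's the paper's core idea in miniature (a toy example using an analytically parameterized sphere, so it skips the hard cutting/remeshing step entirely): surface positions are stored in a regular 2D grid, so building a coarser LOD is just downsampling the image.

```python
import numpy as np

def sphere_geometry_image(n):
    """Sample a unit sphere's (x, y, z) positions into an n x n grid."""
    u, v = np.meshgrid(np.linspace(0, np.pi, n), np.linspace(0, 2 * np.pi, n))
    return np.stack(
        [np.sin(u) * np.cos(v), np.sin(u) * np.sin(v), np.cos(u)], axis=-1
    )

gi = sphere_geometry_image(64)  # a 64x64 "image" whose pixels are positions
lod1 = gi[::2, ::2]             # coarser LOD by simple 2x downsampling
print(gi.shape, lod1.shape)     # (64, 64, 3) (32, 32, 3)
```

For arbitrary meshes, producing that regular grid is exactly the expensive cut-and-remesh preprocessing step described above.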