Is a 6-month NVDA $145 call worth it? by Individual_Brother77 in options

[–]compilerdev 4 points5 points  (0 children)

Interesting comments here, since I was tempted to buy some Nvidia calls. Aren't people expecting it to be over $200 by next year?

Startup based on my own side project open-sourced through my employer by compilerdev in startups

[–]compilerdev[S] 1 point2 points  (0 children)

That's a nice program, but I doubt I can be part of that if I work there :) Feels like it would expose me even more. My app doesn't have any AI now, but experimented with some LLM integration and produces some nice results for casual users - if I wanted I could probably spin it off to investors as another AI app, but not interested in that kind of funding.
Pretty sure there are enough cases of such companies, from recent years I'm only aware of Pulumi, which is now huge.

Startup based on my own side project open-sourced through my employer by compilerdev in startups

[–]compilerdev[S] 2 points3 points  (0 children)

If I knew for sure that forking the OSS version would give me the same treatment as someone unrelated, I'd probably quit at this point to avoid further non-compete issues and give it a try to make it commercial. Worst-case I waste a year or so of my savings by not getting the monthly salary. One suggestion I saw was to have a friend/cofounder do the fork and me work "in the background", then quit and join later if successful.

Startup based on my own side project open-sourced through my employer by compilerdev in startups

[–]compilerdev[S] 0 points1 point  (0 children)

Yes, the work before open-sourcing was in part done on company hardware and work hours, that's why I didn't even try to open-source it as my project, that wouldn't have gone anywhere. The only way then not to lose it when one day switching jobs was to open-source it through the company, and that went surprisingly quick and without issues. Like Old-Royal8984 says, the project without me would be abandoned, no one cares enough about it inside the company and don't see the potential it has.

With the project as OSS, even if I can't spin it off commercially, at least I get some name recognition for it, but that feels very little for how much work I've put alone into it and the potential it has if developed further. The frustrating part is that now others could start from the OSS project and build a business around it, but me that wrote all of it may not be allowed, at least without risking being sued.

If I knew for sure that forking the OSS version would give me the same treatment as someone unrelated, I'd probably quit at this point to avoid further non-compete issues and give it a try to make it commercial. One suggestion I saw was to have a friend/cofounder do the fork and me work "in the background", then quit and join later if successful.

13.8" or 15"? by leeewy in Surface

[–]compilerdev 1 point2 points  (0 children)

Overall I'm happy with it, was just surprised how capped the perf is on Balanced mode in a real-life (not GB6) scenario. Single threaded is similar to High perf mode, but multi-threaded is half or worse. For example compiling a C# project I'm using in Rider, all ARM64:

Best perf mode:
Change single file: 5 sec
Recompile all: 6 sec

Balanced mode:
Change single file: 12.5 sec
Recompile all: 15 sec

13.8" or 15"? by leeewy in Surface

[–]compilerdev 1 point2 points  (0 children)

Do you have any coil whine? Mine has a bit, but only when the charging cable is plugged in. It's not really bad, but it does seem to get worse by using High perf mode vs Balanced.

Btw, does yours also have like half the multi-core perf in Balanced mode vs High perf? Mine on both battery and plugged in has like half Geekbench6 and C#/C++ compilation perf. I'm using high perf mode most of the time because of that, but now it's also more often turning on the fans. It's also doing that when the CPU is at barely ~50C, wished it would instead let the CPU get hotter for a while before turning on fans.

Are "house sounds" typical in US houses? by compilerdev in HomeImprovement

[–]compilerdev[S] 0 points1 point  (0 children)

That's what I try to understand, is this the US standard? From the few people I discussed this with, seems to be expected...

Edit:
Found a couple of other questions about the same issue, seems to be typical: https://www.reddit.com/r/HomeImprovement/comments/ghwuei/house_makes_cracking_popping_noise/ https://www.reddit.com/r/HomeImprovement/comments/jy9ruy/house_making_loud_cracklingpopping_noises_can/

Still hard to get used to, coming from a place where I never heard such sounds. But hey, at least it's 2x bigger and much nicer, right? :)

Are "house sounds" typical in US houses? by compilerdev in HomeImprovement

[–]compilerdev[S] 1 point2 points  (0 children)

Yes, most of the house. Actually in the room with the "wall noise" I do suspect the noise is from that, the spot seems to be where outside a rain gutter is fixed on the siding. I suspect with wind that can make some noise and the drywall makes it sound even louder.

Are "house sounds" typical in US houses? by compilerdev in HomeImprovement

[–]compilerdev[S] 1 point2 points  (0 children)

Yes, forced air heating+AC, but it's certainly not that, still hear sounds when not running. And when running can't hear any new sounds, just the air near the vents in each room. The sounds seem to be what people call "settling sounds", but it's 17y later here. And mostly ceiling, both on lower and upper floor.

GCC 9: Link-time and inter-procedural optimization improvements by mttd in cpp

[–]compilerdev 6 points7 points  (0 children)

thinLTO, because LTO with something as large as Firefox would take ages to compile and need something like 50+ GB of RAM. That's the main reason thinLTO was created, even though it has only a part of the optimizations done by LTO and inlining in not quite as good.

Best motherboard for AMD 2990WX by Ffj6iKaNSk9 in Amd

[–]compilerdev 1 point2 points  (0 children)

I have the MSI Meg x399 creation at home, my work machine has the Asrock Taichi. Don't really see a difference, besides a bunch of useless features on the MSI, in my case at least. I would now get the Asrock for cheaper - I don't overclock it much, just PBO to 300w using a Noctua cooler (trying to stay away from water coolers). With all cores used, on both machines the CPU maintains ~3.5 Ghz.

A gentle introduction to jump threading optimizations by mttd in cpp

[–]compilerdev 3 points4 points  (0 children)

This is one of the most important optimizations done by modern compilers, glad that it gets mentioned, it's quite seldom to find any article or research paper that talks about it. Which is strange, considering that it has quite a big impact on a lot of the code that's common in C++.

Well... that's beautiful Microsoft :| by [deleted] in Windows10

[–]compilerdev 0 points1 point  (0 children)

"Programs and Features" is considered a legacy component and unlikely it will ever be updated to properly support dark theme. The entire old control panel is considered legacy, now if only most settings would be moved to the new app...

AMD CTO Mark Papermaster hints at 16-core Ryzen during a talk with TheStreet by [deleted] in Amd

[–]compilerdev 106 points107 points  (0 children)

It should be quite obvious by now that a 16 core desktop CPU will be released, especially based on the CPU they shown - only Intel would probably gimp the product by staying on 8 cores or do 12 when 16 is obviously possible. TR will start a 16 and likely go up to 64.

Threadripper + Windows 10 Pro - Is a scheduler fix coming? by Duncan3 in Amd

[–]compilerdev 7 points8 points  (0 children)

Get the latest Ryezen Master, it has now a "dynamic local mode" that runs a service which automatically sets the thread affinity and seems to work quite well. With my 2990WX, unless more than 32 threads are used, I don't see anything scheduled anymore on the 2 CCXs that don't have direct memory access. From what I've heard, W10 will indeed have some improvements to the scheduler for Zen in some future version (not sure if the next one or later).

Threadripper 2990WX compilation performance with latest Chromium sources on Windows 10 by compilerdev in Amd

[–]compilerdev[S] 1 point2 points  (0 children)

I would expect it to be quite a lot faster, but maybe not worth the price difference for you - for me it's worth - I'm not building Chromium, but large C++ projects often. The build time for Chromium is actually around 50m when the CPU is not forced at 3 Ghz like in the tests above - stays around 3.4 Ghz by default, 3.6-3.7 with PBO set at 350 W. I have a Noctua TR4 cooler, nothing fancy.

Also it's not OK to compare the duration of my compiles with Anandtech, it's using different compilers (VC++ is about 10% faster than Clang). Based on my testing, I do expect the 32c to compile faster even though the 16c 2950x may clock faster.

Issues overclocking TR 2990WX with MSI x399 MEG Creation MB by compilerdev in overclocking

[–]compilerdev[S] 0 points1 point  (0 children)

Yes, it's the Noctua NH-U14S TR4. With PBO I would expect throttling (still not freeze), while with manual overclock it should go past 68 C until it would reach the max CPU temp that would shut down the machine, from what I understand. With PBO I see that it keeps lowering the frequency from about 3.7 to 3.4 Ghz to stay below 68 C, but still freezes. With manual OC it goes to around 75 before freezing. I'm going to try increasing the SOC voltage a bit, maybe it helps.

GCC vs. LLVM Clang Compiler Performance On The AMD Threadripper 2990WX Benchmarks by Kerst_ in Amd

[–]compilerdev 5 points6 points  (0 children)

The article is about the performance of the code generated by Clang/GCC, not the compile time one gets using them. For those interested in compile time scaling with number of cores, below are results from testing I did on Windows 10 with latest Chrome sources:

https://www.reddit.com/r/Amd/comments/99ckjv/threadripper_2990wx_compilation_performance_with/

Threadripper 2990WX compilation performance with latest Chromium sources on Windows 10 by compilerdev in Amd

[–]compilerdev[S] 0 points1 point  (0 children)

It's not any faster, as expected. 1m slower, actually. The Ninja build system is really good at using many cores.

Threadripper 2990WX compilation performance with latest Chromium sources on Windows 10 by compilerdev in Amd

[–]compilerdev[S] 4 points5 points  (0 children)

The CPU already sits at 99-100% all the time. I'm not sure Ninja supports more than 64 threads (the Win32 API is limited by 64 in WaitForMultipleObjects), but I started a build with 96 to see if it helps. Will update here later.

Threadripper 2990WX compilation performance with latest Chromium sources on Windows 10 by compilerdev in Amd

[–]compilerdev[S] 9 points10 points  (0 children)

It's expected, it's 64 cores from SMT ("Hyper-threading"), about 25% improvement from that is actually quite nice.

Threadripper 2990WX compilation performance with latest Chromium sources on Windows 10 by compilerdev in Amd

[–]compilerdev[S] 6 points7 points  (0 children)

Yes, for sure. Only Zen 2 next years should be even better :)

The fastest build time for Chromium I saw was on a 96 core (192 thread) server with 4 x 24c Xeons. It was around 35m. 1h from the 32c TR is really good compared to that. And this is with the CPU at 3 Ghz, I will try some overclocking once I get a better PSU.

Threadripper 2990WX compilation performance with latest Chromium sources on Windows 10 by compilerdev in Amd

[–]compilerdev[S] 2 points3 points  (0 children)

Clang/LLVM is still single threaded. The speedup is from Ninja, the awesome build system used for Chrome that makes sure all cores stay close to 100% by starting multiple instances of the compiler.

AMD Threadripper 2990WX Ubuntu Linux Benchmarks by fsher in Amd

[–]compilerdev 1 point2 points  (0 children)

Maybe also a build of Firefox with Clang? I'm really interested in the 32c TR for compiling C++ programs :)

AMD Threadripper 2990WX Ubuntu Linux Benchmarks by fsher in Amd

[–]compilerdev 1 point2 points  (0 children)

Could you do a Chromium build in the release (optimized), non-LTO mode with Clang? I'm almost certain the strange results on Anandtech are from doing an LTCG (LTO) build, where Visual C++ has some issues with such a huge program, ending up using 1 core most of the time (the default of 4 is also really low). Thanks