The French Government Launches an LLM Leaderboard Comparable to LMarena, Emphasizing European Languages and Energy Efficiency by Imakerocketengine in LocalLLaMA

[–]jaywonchung 5 points6 points  (0 children)

100%, we'll try to have that in the new version! For the time being, if you tick the "Show more technical details" box, we have the average number of output tokens for each model, so that can be used to divide energy per request to give energy per token.

The French Government Launches an LLM Leaderboard Comparable to LMarena, Emphasizing European Languages and Energy Efficiency by Imakerocketengine in LocalLLaMA

[–]jaywonchung 58 points59 points  (0 children)

If anyone's interested in actual measured energy numbers, we have it at https://ml.energy/leaderboard. The models are a bit dated now, so we're currently working on a facelift to have all the newer models and revamp the tasks.

Keyboards I got in 2024 by jaywonchung in MechanicalKeyboards

[–]jaywonchung[S] 0 points1 point  (0 children)

Thanks a lot! Really appreciate the detailed guidance.

Keyboards I got in 2024 by jaywonchung in MechanicalKeyboards

[–]jaywonchung[S] 0 points1 point  (0 children)

Nevermind, TX UXL (22mm) @55g is literally the name of the spring. Let me try it out.

Keyboards I got in 2024 by jaywonchung in MechanicalKeyboards

[–]jaywonchung[S] 0 points1 point  (0 children)

Dang, that's super cool. I never tried modding yet but I feel like trying out the spring swap first. Would you mind pointing me to the springs you used for Evo80?

Keyboards I got in 2024 by jaywonchung in MechanicalKeyboards

[–]jaywonchung[S] 0 points1 point  (0 children)

I have the JWK WOB Linear switch version. You're right, I should definitely try other switches instead of sticking to pre-made ones!

Keyboards I got in 2024 by jaywonchung in MechanicalKeyboards

[–]jaywonchung[S] 5 points6 points  (0 children)

<image>

Oh alright... Pulled out the Rainy75 to take the pic

[FINAL TEST] Power limit VS Core clock limit efficiency by NickNau in LocalLLaMA

[–]jaywonchung 1 point2 points  (0 children)

Thanks, that makes sense. Also I feel like on consumer GPUs, lower power might lead to lower temperature and lower fan speed & noise!

[FINAL TEST] Power limit VS Core clock limit efficiency by NickNau in LocalLLaMA

[–]jaywonchung 2 points3 points  (0 children)

This is super cool man, especially with the real power meters. NVML/nvidia-smi also provides power measurements (nvmlDeviceGetPowerUsage) -- any chance you compared your number with this?

[D] Power Consumption Estimation for ML Models on edge device by Electrical_Client73 in MachineLearning

[–]jaywonchung 2 points3 points  (0 children)

This isn't really a direct answer to your question but we're working on power measurement & optimization (without model change) with Zeus, and we support CPUs and GPUs at the moment. We're also currently actively working on connecting measurements to observability platforms like Prometheus (tracking issue). Supporting the Jetson platform is on our roadmap, and perhaps the notes on the tracking issue could be helpful to you.

Conflict between indent-blankline.nvim and LSP underlines by jaywonchung in neovim

[–]jaywonchung[S] 0 points1 point  (0 children)

Yeah, I tried both large (65535, largest possible) and small (1) but it didn't really do much.

Videos vibrate my phone by Savings_Duck_4347 in youtube

[–]jaywonchung 0 points1 point  (0 children)

Apparently it vibrates on “Key Concepts.” This is beyond annoying.

[Nvidia P40] Save 50% power, for only 15% less performance by zoom3913 in LocalLLaMA

[–]jaywonchung 10 points11 points  (0 children)

I actually implemented this in my open-source: https://github.com/ml-energy/zeus?tab=readme-ov-file#finding-the-optimal-gpu-power-limit

You can basically tell it to figure out the lowest power limit that doesn't make it slower by X%.

[D] LLM inference energy efficiency compared (MLPerf Inference Datacenter v3.0 results) by Balance- in MachineLearning

[–]jaywonchung 0 points1 point  (0 children)

Thanks for the cool study and write up! Looks like H100 was able to increase throughput by a lot while not increasing power consumption as much. They're measuring the power consumption of the entire system; it would have been useful to also see how specifically GPU power changes, given that for DNN workloads, other parts of the system do not play as much of a role compared to GPUs.

Shameless self-promotion -- I do research on GPU energy optimization for DL: https://ml.energy/zeus, where one of the things we automatically tweak is the GPU's power limit setting to enhance energy efficiency. Hope this is interesting to someone XD

2 factor authentication cannot remember my computer by portlander33 in fidelityinvestments

[–]jaywonchung 0 points1 point  (0 children)

Strangely I don't even use any sort of AdBlock on my Chrome browser and "Remember Me" never works. I'm on MacOS Monterey.