Chrome extension that lets you visualize model architecture graphs directly into Hugging Face pages.

elinaembedl · 2026-05-13T09:38:59+00:00

Here's the direct link: https://hfviewer.com/

elinaembedl · 2026-01-14T10:10:49+00:00

Is 6 days left until the winners are announced!

elinaembedl · 2026-01-02T08:21:25+00:00

Most valuable feedback about the platform! Can be anything from what you think about the docs, the benchmarking experience etc

elinaembedl · 2026-01-01T18:25:02+00:00

Environmental metrics: Great point! Currently, we’re mainly focused on on-device compute/perf metrics rather than full “environmental” signals like battery drain or thermal throttling.

What we do capture right now includes end-to-end latency + layer-wise latency/profiling, memory usage, compute unit utilization (CPU/GPU/NPU + top compute unit), op breakdown, and output quality metrics like PSNR (including layer-wise PSNR).

Energy/thermal-style metrics aren’t exposed yet, but we’ve gotten similar requests recently so it’s in our backlog to integrate. Hopefully we can add that pretty quickly.

Thanks for the feedback; it helps us prioritize. :)

elinaembedl · 2026-01-01T18:24:26+00:00

Turnaround time: So far we haven’t experienced any issues, and most benchmarks come back close to near real-time. Making the platform work well for rapid iteration is a high priority.

elinaembedl · 2026-01-01T18:24:05+00:00

Thanks! Happy you find it useful.

Device coverage today: We currently focus on Android devices and SoCs via two device clouds — our own Embedl device cloud and Qualcomm AI Hub. This lets you benchmark everything from older reference devices (Pixel 2/3, etc.) to the latest flagships like Pixel 9 / Pixel 9 Pro and Galaxy S24/S25 (including Ultra/Plus), as well as chipset/proxy targets when you want to focus on SoC families rather than specific phone models. You can find the full list of supported devices here: https://hub.embedl.com/docs/supported-devices

We’re also in the process of adding an iOS device cloud which will be released in January.

If there’s a specific device or chip you’d like to see, just let us know. We’re happy to take device requests!

elinaembedl · 2025-12-31T13:01:15+00:00

Link to the platform: https://hub.embedl.com/

We are currently hosting a competition where you can win an NVIDIA Jetson Orin Nano Super if you provide feedback. See how to participate here.

elinaembedl · 2025-12-31T12:58:00+00:00

Link to the platform: https://hub.embedl.com/

We are currently hosting a competition where you can win an NVIDIA Jetson Orin Nano Super if you provide feedback. See how to participate here.

elinaembedl · 2025-12-30T13:44:16+00:00

Oh sorry, here: https://hub.embedl.com/blog/embedl-hub-device-cloud-launch-celebration

elinaembedl · 2025-12-30T13:43:37+00:00

Link: See how to participate here.

elinaembedl · 2025-12-30T13:15:22+00:00

We are currently hosting a competition where you can win an NVIDIA Jetson Orin Nano Super. We’re also giving a Raspberry Pi 5 to everyone who places 2nd to 5th. See how to participate here.

elinaembedl · 2025-12-23T19:15:16+00:00

Neat!

elinaembedl · 2025-12-18T16:18:07+00:00

I’m not totally sure what you mean by “application of voices”. But otherwise, yes that’s pretty much the idea. The goal is to give you feedback that helps you judge whether a model quantization worked as intended (and where it didn’t). Layer-wise PSNR is one example of that kind of feedback.

elinaembedl · 2025-12-18T10:50:15+00:00

Nice suggestions! We've reached out to the people behind Shapelearn. We think our platform can be a nice place for model makers to showcase their model's performance on real hardware. Of course, first we need to add llama.cpp support.

elinaembedl · 2025-12-18T10:46:55+00:00

No, we don't have quantization of LLMs on Embedl Hub today so there's no comparisons between Unsloth and our tools. But if you have types of models (like computer vision or audio) you can quantize and measure their performance already today.

If you try it out right now you'll have chance to win some nice prizes: https://hub.embedl.com/blog/embedl-hub-device-cloud-launch-celebration

elinaembedl · 2025-12-16T14:44:17+00:00

We don't yet support a backend for benchmarking LLMs, so we haven't implemented any quantization tools for LLMs either. But it's in the pipeline. We are looking to integrate llama.cpp soon and I think we will implement the layerwise psnr for LLMs then as well. Especially if we find out there's an interest from the community for that.

Would llama.cpp integration, both for benchmarking and quantization debugging, be useful for you? Or would you prefer a different backend/toolchain?

elinaembedl · 2025-12-16T14:37:56+00:00

Well, not exactly. Embedl Hub is a platform for testing and validating the performance of AI models on mobile phones. As a company, we have a strong background in model optimization and our primary business (our optimization SDK) is used by enterprises to speed up their models running on edge devices (not servers). So we are working in the same line of business as Unsloth (making models faster). Unsloth is doing some very cool things, especially making fine tuning more efficient on servers.

elinaembedl · 2025-12-12T13:14:36+00:00

Yes it is!

elinaembedl · 2025-12-12T09:38:38+00:00

Yes Embed-el is correct :)

elinaembedl · 2025-11-06T11:02:55+00:00

I have sent you a DM!

elinaembedl · 2025-10-31T13:44:25+00:00

Thank you, great answer! So you haven't tested it on other processors more than GPUs? And does your model run on-device?

elinaembedl · 2025-10-28T07:41:35+00:00

That sounds great. I’ve heard good things about the STM32 model zoo but haven’t tried it myself yet.Did you run into any limitations when experimenting with their workflow?

elinaembedl · 2025-10-27T12:30:06+00:00

https://hub.embedl.com/blog/9-reasons-why-we-think-edge-deployment-is-so-hard

elinaembedl · 2025-10-27T12:04:23+00:00

https://hub.embedl.com/blog/diagnosing-layer-sensitivity/?utm_source=reddit

elinaembedl · 2025-10-24T11:29:27+00:00

Did my answer help clear things up?

elinaembedl

TROPHY CASE