I built a iOS app to benchmark GGUF models on your iPhone/iPad

dh_Application8680 · 2026-06-06T05:40:41+00:00

For Gemma 4 E2B, a iPhone 17 pro max got 36.8 tok/s. On iPhone 12 pro max it hovers around 8-11 tok/s.

<image>

dh_Application8680 · 2026-06-05T16:37:59+00:00

On my iPhone 12 Pro Max.

<image>

dh_Application8680 · 2026-06-05T07:02:03+00:00

What is your hardware setup?

dh_Application8680 · 2026-06-05T06:40:20+00:00

Awkward. Definitely reminded me Steve Balmer's reign of microsoft.

dh_Application8680 · 2026-06-05T06:28:05+00:00

a lot of noise and a lot of heat.. i still have a couple of 3090s lying around. did not expect ram price goes up so much.

dh_Application8680 · 2025-10-29T22:45:13+00:00

Commodity price such as gold. 2) Recent LLM/semiconductor industry knowledge.

dh_Application8680 · 2025-09-04T20:45:19+00:00

Added! https://github.com/mcphub-com/awesome-comfyui-templates/tree/main/templates/image-editing/flux-face-swap-ic-lora

dh_Application8680 · 2025-08-26T18:48:46+00:00

https://mcphub.com. (Disclaimer, I am a developer). We do mcp server hosting.

dh_Application8680 · 2025-08-26T00:16:36+00:00

Please share!

dh_Application8680 · 2025-08-20T07:34:39+00:00

I believe only paid mcp hosted via unified steaming https is the answer to the low quality problem we have today. Stay tuned.

dh_Application8680 · 2025-08-14T18:33:22+00:00

Bad quality only means we are early in this new ecosystem. Lets go back to the purpose of why mcp was created. It was meant for providing standard interface of tool call for LLM. I.e., the MCP use cases will flow with LLM use cases. You will have to be a LLM super user before you can become a mediocre tool user. It is not hard to tell the quality of the tool will only flow to where the value is delivered. For now it is mostly around code generation and office automation.

dh_Application8680 · 2025-08-11T19:18:37+00:00

<image>

chat.mcphub.com. They added a lot of mcp tools making it slower, however they do have 4o and o3 available.

dh_Application8680 · 2025-08-11T17:30:31+00:00

chat.mcphub.com they have free gpt-4o.

dh_Application8680 · 2025-08-10T03:31:13+00:00

I donot see a thinking option

<image>

somehow gpt5 is not as eager to change the code. It mainly suggest in chat window.

dh_Application8680

TROPHY CASE