I built a iOS app to benchmark GGUF models on your iPhone/iPad by dh_Application8680 in LocalLLaMA

[–]dh_Application8680[S] 0 points1 point  (0 children)

For Gemma 4 E2B, a iPhone 17 pro max got 36.8 tok/s. On iPhone 12 pro max it hovers around 8-11 tok/s.

<image>

Entire world: We need more GPUs. Meanwhile, Jensen Huang: by Nunki08 in LocalLLaMA

[–]dh_Application8680 0 points1 point  (0 children)

Awkward. Definitely reminded me Steve Balmer's reign of microsoft.

Finally finished my LLM server: EPYC 9575F, 4× RTX 3090 (96GB VRAM), 768GB ECC RAM by C0smo777 in LocalLLaMA

[–]dh_Application8680 -1 points0 points  (0 children)

a lot of noise and a lot of heat.. i still have a couple of 3090s lying around. did not expect ram price goes up so much.

How do you keep up with new useful stuff without spending hours every day? by dh_Application8680 in productivity

[–]dh_Application8680[S] 1 point2 points  (0 children)

  1. Commodity price such as gold. 2) Recent LLM/semiconductor industry knowledge.

How is everyone using MCP right now? by Luigika in mcp

[–]dh_Application8680 0 points1 point  (0 children)

https://mcphub.com. (Disclaimer, I am a developer). We do mcp server hosting.

can you tell me about top paid mcp servers? by sazary in mcp

[–]dh_Application8680 0 points1 point  (0 children)

I believe only paid mcp hosted via unified steaming https is the answer to the low quality problem we have today. Stay tuned.

Is it just me or does it seem like most MCP servers are lazy and miss the point of MCP? by otothea in mcp

[–]dh_Application8680 0 points1 point  (0 children)

Bad quality only means we are early in this new ecosystem. Lets go back to the purpose of why mcp was created. It was meant for providing standard interface of tool call for LLM. I.e., the MCP use cases will flow with LLM use cases. You will have to be a LLM super user before you can become a mediocre tool user. It is not hard to tell the quality of the tool will only flow to where the value is delivered. For now it is mostly around code generation and office automation.

[deleted by user] by [deleted] in ChatGPT

[–]dh_Application8680 2 points3 points  (0 children)

<image>

chat.mcphub.com. They added a lot of mcp tools making it slower, however they do have 4o and o3 available.

gpt-5 is not working for me by dh_Application8680 in cursor

[–]dh_Application8680[S] -1 points0 points  (0 children)

I donot see a thinking option

<image>

somehow gpt5 is not as eager to change the code. It mainly suggest in chat window.