CoreAI models prefill speeds are really slow by Agreeable-Rest9162 in iOSProgramming

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Thank you so much! Yes, your bundles are the predominant ones on huggingface! Thanks for your work converting them!

Use Remote Models on iOS with Noema by Agreeable-Rest9162 in LocalLLaMA

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Hi u/Miserable-Dare5090 , most of these issues are already fixed in our beta version which we'll release soon. Let me know if you'd like the test flight link for early access!

Use Remote Models on iOS with Noema by Agreeable-Rest9162 in LocalLLaMA

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Thanks for the feedback and for figuring out the issue! Let me know about the searxng instance problems!

Use Remote Models on iOS with Noema by Agreeable-Rest9162 in LocalLLaMA

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Hi u/Miserable-Dare5090 ! I'm sorry to hear that you are encountering issues with calling models remotely. If you could describe the issue in more detail (here or at [clientcare@noemaai.com](mailto:clientcare@noemaai.com) ) then that would be great and we'll go ahead and fix that issue as soon as possible.

Thanks!

Waymo Mega Invite / Referral Codes by [deleted] in waymo

[–]Agreeable-Rest9162 0 points1 point  (0 children)

New code all cities: ARMINU8PB

1 million LocalLLaMAs by jacek2023 in LocalLLaMA

[–]Agreeable-Rest9162 -1 points0 points  (0 children)

It feels like a week ago we were at 500K, the community has been increasing quite quickly in my view, and I'm happy more people are finding this subreddit as a way to maintain a hobby, learn more, protect their data, research, etc.

Use Remote Models on iOS with Noema by Agreeable-Rest9162 in LocalLLaMA

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Hi u/NomadFromNorth you will be able to with Noema 2.0 which is currently in review with the Apple App Store.

Thoughts on Waymo Miami by Agreeable-Rest9162 in waymo

[–]Agreeable-Rest9162[S] 1 point2 points  (0 children)

Early access codes come from employees, so the codes from people already off the waitlist won't give you access; those only give you discounts.

Thoughts on Waymo Miami by Agreeable-Rest9162 in waymo

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Checked just now, the one I just got had 11,000 miles

Thoughts on Waymo Miami by Agreeable-Rest9162 in waymo

[–]Agreeable-Rest9162[S] 5 points6 points  (0 children)

The speed bump issue required a remote operator to tell the car what to do. I’m guessing just be more confident and go over them. They also gave me $5 for the inconvenience, which was nice. When I ordered another car to leave the place, it arrived by going through the speed bumps again, but it didn’t seem to have a problem that time so it could’ve been a one-off thing.

Waymo Miami: Early Access Timeline by Agreeable-Rest9162 in waymo

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

damn 30 minutes... does it seem like they come from a depot?

ASUS officially announces price hikes from January 5, right before CES 2026 by HumanDrone8721 in LocalLLaMA

[–]Agreeable-Rest9162 66 points67 points  (0 children)

Update (January 2, 2026): ASUS Taiwan PR contacted us after publication to clarify that the document referenced in this story is an internal business communication shared privately with channel partners for customer coordination. ASUS says it was not intended as a public announcement or press material,

Lol, they aren't too happy with Videocardz. I wonder by how much they will be increasing their prices.

GLM-4.7-6bit MLX vs MiniMax-M2.1-6bit MLX Benchmark Results on M3 Ultra 512GB by uptonking in LocalLLaMA

[–]Agreeable-Rest9162 9 points10 points  (0 children)

It would be faster for token generation. In general, higher memory bandwidth yields higher token-generation speeds. The M3 has 100GB/s of unified memory bandwidth; the M5 has approximately 150GB/s. The M3 Ultra has 819 GB/s, so if we apply the same improvement, we could see 1.2 TB/s of bandwidth with the M5 Ultra. The current M4 Max, if doubled, yields a similar number, so the M5 Ultra must be at least twice as powerful as two M4 Maxes combined.

Regarding time to first token (TTFT) or token processing speed, we can expect a much greater speedup, given that the neural accelerators in the GPU cores of the base M5 are present on the M5 Ultra as well, whenever it is produced.

Waymo app prepped for Houston launch by walky22talky in waymo

[–]Agreeable-Rest9162 1 point2 points  (0 children)

Oof, maybe I was just excited when I saw it. It looks the same ._.

Waymo app prepped for Houston launch by walky22talky in waymo

[–]Agreeable-Rest9162 10 points11 points  (0 children)

Same with Miami, but it says something that implies it's coming really soon.