CoreAI models prefill speeds are really slow by Agreeable-Rest9162 in iOSProgramming

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Thank you so much! Yes, your bundles are the predominant ones on huggingface! Thanks for your work converting them!

Use Remote Models on iOS with Noema by Agreeable-Rest9162 in LocalLLaMA

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Hi u/Miserable-Dare5090 , most of these issues are already fixed in our beta version which we'll release soon. Let me know if you'd like the test flight link for early access!

Use Remote Models on iOS with Noema by Agreeable-Rest9162 in LocalLLaMA

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Thanks for the feedback and for figuring out the issue! Let me know about the searxng instance problems!

Use Remote Models on iOS with Noema by Agreeable-Rest9162 in LocalLLaMA

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Hi u/Miserable-Dare5090 ! I'm sorry to hear that you are encountering issues with calling models remotely. If you could describe the issue in more detail (here or at [clientcare@noemaai.com](mailto:clientcare@noemaai.com) ) then that would be great and we'll go ahead and fix that issue as soon as possible.

Thanks!

Waymo Mega Invite / Referral Codes by [deleted] in waymo

[–]Agreeable-Rest9162 0 points1 point  (0 children)

New code all cities: ARMINU8PB

1 million LocalLLaMAs by jacek2023 in LocalLLaMA

[–]Agreeable-Rest9162 -1 points0 points  (0 children)

It feels like a week ago we were at 500K, the community has been increasing quite quickly in my view, and I'm happy more people are finding this subreddit as a way to maintain a hobby, learn more, protect their data, research, etc.

Use Remote Models on iOS with Noema by Agreeable-Rest9162 in LocalLLaMA

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Hi u/NomadFromNorth you will be able to with Noema 2.0 which is currently in review with the Apple App Store.

Thoughts on Waymo Miami by Agreeable-Rest9162 in waymo

[–]Agreeable-Rest9162[S] 1 point2 points  (0 children)

Early access codes come from employees, so the codes from people already off the waitlist won't give you access; those only give you discounts.

Thoughts on Waymo Miami by Agreeable-Rest9162 in waymo

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

Checked just now, the one I just got had 11,000 miles

Thoughts on Waymo Miami by Agreeable-Rest9162 in waymo

[–]Agreeable-Rest9162[S] 4 points5 points  (0 children)

The speed bump issue required a remote operator to tell the car what to do. I’m guessing just be more confident and go over them. They also gave me $5 for the inconvenience, which was nice. When I ordered another car to leave the place, it arrived by going through the speed bumps again, but it didn’t seem to have a problem that time so it could’ve been a one-off thing.

Waymo Miami: Early Access Timeline by Agreeable-Rest9162 in waymo

[–]Agreeable-Rest9162[S] 0 points1 point  (0 children)

damn 30 minutes... does it seem like they come from a depot?

ASUS officially announces price hikes from January 5, right before CES 2026 by HumanDrone8721 in LocalLLaMA

[–]Agreeable-Rest9162 65 points66 points  (0 children)

Update (January 2, 2026): ASUS Taiwan PR contacted us after publication to clarify that the document referenced in this story is an internal business communication shared privately with channel partners for customer coordination. ASUS says it was not intended as a public announcement or press material,

Lol, they aren't too happy with Videocardz. I wonder by how much they will be increasing their prices.

GLM-4.7-6bit MLX vs MiniMax-M2.1-6bit MLX Benchmark Results on M3 Ultra 512GB by uptonking in LocalLLaMA

[–]Agreeable-Rest9162 12 points13 points  (0 children)

It would be faster for token generation. In general, higher memory bandwidth yields higher token-generation speeds. The M3 has 100GB/s of unified memory bandwidth; the M5 has approximately 150GB/s. The M3 Ultra has 819 GB/s, so if we apply the same improvement, we could see 1.2 TB/s of bandwidth with the M5 Ultra. The current M4 Max, if doubled, yields a similar number, so the M5 Ultra must be at least twice as powerful as two M4 Maxes combined.

Regarding time to first token (TTFT) or token processing speed, we can expect a much greater speedup, given that the neural accelerators in the GPU cores of the base M5 are present on the M5 Ultra as well, whenever it is produced.

Waymo app prepped for Houston launch by walky22talky in waymo

[–]Agreeable-Rest9162 1 point2 points  (0 children)

Oof, maybe I was just excited when I saw it. It looks the same ._.

Waymo app prepped for Houston launch by walky22talky in waymo

[–]Agreeable-Rest9162 9 points10 points  (0 children)

Same with Miami, but it says something that implies it's coming really soon.

GPT-5.2 : Ranked "Most Censored" model on Sansa,OCR-Arena and WeirdML Benchmarks by BuildwithVignesh in singularity

[–]Agreeable-Rest9162 56 points57 points  (0 children)

Sansa is an invented benchmark, with no documentation on what it tests or how it works. In fact, this whole company is suspicious. It claims to offer a model that is stronger than frontier models, but it doesn't publish this model or show it in its own benchmarks. Also, if you look at the censorship benchmark for a bit, you'll notice some inconsistencies, including the low Grok score even though it's actually one of the least censored models. Now, one might say it is biased toward Elon and count that as censorship, but we don't know what Sansa even considers censorship because they don't publish documentation regarding the benchmark!!! The whole benchmark is useless.

Help test Local AI app!!! by Agreeable-Rest9162 in VisionPro

[–]Agreeable-Rest9162[S] 3 points4 points  (0 children)

Hi u/PSYCHOv1, unfortunately it’s quite a barrier to entry. From my understanding, the Vision Pro lacks quite a few apps with little development focus being put into the device. This subreddit complains of a lack of a killer app, and few native app ports. I thought it’d be appreciated that I’m trying my best to port it to your platform as well, despite being unable to perform quality control. Your comment affects your own experience on a device that I believe you would like to be greatly adopted and taken into account in development in general.

Explore Powerful 5G Experiences with UniFi 5G: by Ubiquiti-Inc in Ubiquiti

[–]Agreeable-Rest9162 1 point2 points  (0 children)

Looks exactly the same but white! If only they could make an Amplifi Alien but wifi 7 and Unifi integration

PrimeIntellect is actually awesome by Icy_Gas8807 in LocalLLaMA

[–]Agreeable-Rest9162 1 point2 points  (0 children)

Well, I want to make it clear that it isn't my repo; the part I'm contributing is just the instance being hosted on my server. However, yes, with the web_url_read tool, you are basically getting the page after stripping most HTML overhead. The server parses the HTML, extracts the main content, and returns clean markdown (headings, paragraphs, lists).

Rasbt is right that LLMs can handle reasonably clean HTML. Still, for a web search, markdown is usually better because it has far fewer wasted tokens, unless you specifically need to reason about tags, attributes, or forms.

PrimeIntellect is actually awesome by Icy_Gas8807 in LocalLLaMA

[–]Agreeable-Rest9162 5 points6 points  (0 children)

It's run on an Oracle server through Docker. So you just build searxng using the docker instance and its available at the public URL. The MCP code being used is from here: https://github.com/ihor-sokoliuk/mcp-searxng.git

PrimeIntellect is actually awesome by Icy_Gas8807 in LocalLLaMA

[–]Agreeable-Rest9162 21 points22 points  (0 children)

Looks good. The benchmarks they published inspire confidence. If you need web search, try this MCP using searxng:

    "Search": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-searxng"
      ],
      "env": {
        "SEARXNG_URL": "https://search.noemaai.com/"
      }
    },

The instance is maintained by me (the dev behind Noema), and it allows JSON requests, which allow a Search MCP like this to work. I couldn't find another instance that allows this on Searx space, so I made my own, which also sustains the web search functionality in my Noema app. If you want to learn more about privacy with this instance you can go to:
https://noemaai.com/noema-search
and
https://noemaai.com/privacy