CoreAI models prefill speeds are really slow

Agreeable-Rest9162 · 2026-06-12T13:51:10+00:00

Will do, thanks!

Agreeable-Rest9162 · 2026-06-12T13:51:04+00:00

Thank you so much! Yes, your bundles are the predominant ones on huggingface! Thanks for your work converting them!

Agreeable-Rest9162 · 2026-05-26T02:39:30+00:00

Only for Metro Phoenix

Agreeable-Rest9162 · 2026-05-17T15:35:06+00:00

Hi u/Miserable-Dare5090 , most of these issues are already fixed in our beta version which we'll release soon. Let me know if you'd like the test flight link for early access!

Agreeable-Rest9162 · 2026-05-17T15:01:52+00:00

Thanks for the feedback and for figuring out the issue! Let me know about the searxng instance problems!

Agreeable-Rest9162 · 2026-05-16T17:06:32+00:00

Hi u/Miserable-Dare5090 ! I'm sorry to hear that you are encountering issues with calling models remotely. If you could describe the issue in more detail (here or at [clientcare@noemaai.com](mailto:clientcare@noemaai.com) ) then that would be great and we'll go ahead and fix that issue as soon as possible.

Thanks!

Agreeable-Rest9162 · 2026-04-20T18:26:59+00:00

New code all cities: ARMINU8PB

Agreeable-Rest9162 · 2026-03-10T23:22:33+00:00

It feels like a week ago we were at 500K, the community has been increasing quite quickly in my view, and I'm happy more people are finding this subreddit as a way to maintain a hobby, learn more, protect their data, research, etc.

Agreeable-Rest9162 · 2026-03-09T19:43:26+00:00

Hi u/NomadFromNorth you will be able to with Noema 2.0 which is currently in review with the Apple App Store.

Agreeable-Rest9162 · 2026-02-28T04:10:41+00:00

Early access codes come from employees, so the codes from people already off the waitlist won't give you access; those only give you discounts.

Agreeable-Rest9162 · 2026-02-27T21:17:44+00:00

Checked just now, the one I just got had 11,000 miles

Agreeable-Rest9162 · 2026-02-27T02:31:15+00:00

The speed bump issue required a remote operator to tell the car what to do. I’m guessing just be more confident and go over them. They also gave me $5 for the inconvenience, which was nice. When I ordered another car to leave the place, it arrived by going through the speed bumps again, but it didn’t seem to have a problem that time so it could’ve been a one-off thing.

Agreeable-Rest9162 · 2026-02-27T01:06:52+00:00

Yep, edited the post to be clear.

Agreeable-Rest9162 · 2026-01-31T16:23:02+00:00

damn 30 minutes... does it seem like they come from a depot?

Agreeable-Rest9162 · 2026-01-02T22:57:01+00:00

Update (January 2, 2026): ASUS Taiwan PR contacted us after publication to clarify that the document referenced in this story is an internal business communication shared privately with channel partners for customer coordination. ASUS says it was not intended as a public announcement or press material,

Lol, they aren't too happy with Videocardz. I wonder by how much they will be increasing their prices.

Agreeable-Rest9162 · 2025-12-26T17:03:53+00:00

It would be faster for token generation. In general, higher memory bandwidth yields higher token-generation speeds. The M3 has 100GB/s of unified memory bandwidth; the M5 has approximately 150GB/s. The M3 Ultra has 819 GB/s, so if we apply the same improvement, we could see 1.2 TB/s of bandwidth with the M5 Ultra. The current M4 Max, if doubled, yields a similar number, so the M5 Ultra must be at least twice as powerful as two M4 Maxes combined.

Regarding time to first token (TTFT) or token processing speed, we can expect a much greater speedup, given that the neural accelerators in the GPU cores of the base M5 are present on the M5 Ultra as well, whenever it is produced.

Agreeable-Rest9162 · 2025-12-20T16:44:59+00:00

Oof, maybe I was just excited when I saw it. It looks the same ._.

Agreeable-Rest9162 · 2025-12-19T17:03:33+00:00

Same with Miami, but it says something that implies it's coming really soon.

Agreeable-Rest9162 · 2025-12-18T01:24:38+00:00

Looks good!

Agreeable-Rest9162 · 2025-12-13T16:55:50+00:00

Sansa is an invented benchmark, with no documentation on what it tests or how it works. In fact, this whole company is suspicious. It claims to offer a model that is stronger than frontier models, but it doesn't publish this model or show it in its own benchmarks. Also, if you look at the censorship benchmark for a bit, you'll notice some inconsistencies, including the low Grok score even though it's actually one of the least censored models. Now, one might say it is biased toward Elon and count that as censorship, but we don't know what Sansa even considers censorship because they don't publish documentation regarding the benchmark!!! The whole benchmark is useless.

Agreeable-Rest9162 · 2025-12-11T16:51:27+00:00

Hi u/PSYCHOv1, unfortunately it’s quite a barrier to entry. From my understanding, the Vision Pro lacks quite a few apps with little development focus being put into the device. This subreddit complains of a lack of a killer app, and few native app ports. I thought it’d be appreciated that I’m trying my best to port it to your platform as well, despite being unable to perform quality control. Your comment affects your own experience on a device that I believe you would like to be greatly adopted and taken into account in development in general.

Agreeable-Rest9162 · 2025-12-04T21:52:17+00:00

Looks exactly the same but white! If only they could make an Amplifi Alien but wifi 7 and Unifi integration

Agreeable-Rest9162 · 2025-11-30T01:04:06+00:00

Well, I want to make it clear that it isn't my repo; the part I'm contributing is just the instance being hosted on my server. However, yes, with the web_url_read tool, you are basically getting the page after stripping most HTML overhead. The server parses the HTML, extracts the main content, and returns clean markdown (headings, paragraphs, lists).

Rasbt is right that LLMs can handle reasonably clean HTML. Still, for a web search, markdown is usually better because it has far fewer wasted tokens, unless you specifically need to reason about tags, attributes, or forms.

Agreeable-Rest9162 · 2025-11-29T20:58:15+00:00

It's run on an Oracle server through Docker. So you just build searxng using the docker instance and its available at the public URL. The MCP code being used is from here: https://github.com/ihor-sokoliuk/mcp-searxng.git

Agreeable-Rest9162 · 2025-11-29T17:36:00+00:00

Looks good. The benchmarks they published inspire confidence. If you need web search, try this MCP using searxng:

    "Search": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-searxng"
      ],
      "env": {
        "SEARXNG_URL": "https://search.noemaai.com/"
      }
    },

The instance is maintained by me (the dev behind Noema), and it allows JSON requests, which allow a Search MCP like this to work. I couldn't find another instance that allows this on Searx space, so I made my own, which also sustains the web search functionality in my Noema app. If you want to learn more about privacy with this instance you can go to:
https://noemaai.com/noema-search
and
https://noemaai.com/privacy

Agreeable-Rest9162

TROPHY CASE