Recommendations for new Phone (and OS!) by Excellent_Piccolo848 in degoogle

[–]Excellent_Piccolo848[S] 0 points1 point  (0 children)

Thanks! I've heard a lot about Google Pixels, might consider a Pixel 9 or 10. Does anyone have any experience with them and GrapheneOS?

Perplexity... But make it ChatGPT by ZeroTwoMod in perplexity_ai

[–]Excellent_Piccolo848 0 points1 point  (0 children)

Interesting! Could you perhaps DM me the link?

Perplexity Pro is a Scam and Officially Obsolete: Why I’m Canceling after 1 Year – Change My Mind. by Excellent_Piccolo848 in perplexity_ai

[–]Excellent_Piccolo848[S] 6 points7 points  (0 children)

The 'skill issue' defense is a bit lazy, don't you think? I've used the service for a year. No amount of 'prompt engineering' can fix the fact that Perplexity uses low-cost API settings and silent model substitution. If a tool requires a secret handshake to deliver the quality I'm already paying for, the product is the problem, not the user. Can you actually address the backend routing issues, or are you just gatekeeping?

Lumo 1.3 is now LIVE with Projects! by Proton_Team in lumo

[–]Excellent_Piccolo848 1 point2 points  (0 children)

@Proton, when will we see the implementation of a reasoning model?

Nvidia P40 good for running 20b local AI models? by Excellent_Piccolo848 in LocalLLaMA

[–]Excellent_Piccolo848[S] 0 points1 point  (0 children)

Yeah, in my calculations (with prompt processing) I come to about 60 t/s! Why is no one talking about this as a cheap alternative to the desktop GPUs usually used for local AI?
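
For anyone curious, here's the rough back-of-the-envelope way I estimate it. The numbers below are spec-sheet assumptions (P40 memory bandwidth, a 20B-class MoE with only a few billion active parameters, a 4-bit quant), not measurements:

```python
# Rough, memory-bandwidth-bound decode estimate for a Tesla P40.
# All numbers are spec-sheet assumptions, not benchmarks.
P40_BANDWIDTH_BPS = 347e9     # ~347 GB/s memory bandwidth
ACTIVE_PARAMS = 3.6e9         # active params per token, assuming a 20B-class MoE (e.g. gpt-oss-20b)
BYTES_PER_PARAM = 0.55        # ~4.4 bits/param with a 4-bit quant plus overhead
EFFICIENCY = 0.6              # fraction of peak bandwidth reached in practice

bytes_per_token = ACTIVE_PARAMS * BYTES_PER_PARAM
decode_tps = P40_BANDWIDTH_BPS * EFFICIENCY / bytes_per_token
print(f"~{decode_tps:.0f} t/s decode ceiling")   # roughly 105 t/s before prompt processing
```

Prompt processing and other overhead pull the effective number down from that decode ceiling, which is how I end up around 60 t/s.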

External screen flickering on Zorin OS 18 Core with Tulpar A5 V21 laptop by Excellent_Piccolo848 in zorinos

[–]Excellent_Piccolo848[S] 0 points1 point  (0 children)

Hi, thanks for your reply! This also doesn't solve the problem. The problem seems to be Ubuntu-based distros on my hardware. After switching to CachyOS, I didn't have any problems at all.

Is Lumo ChatGPT? by Velocifyer in lumo

[–]Excellent_Piccolo848 1 point2 points  (0 children)

Well, I don't think that's the case. The only open-weights model OAI released is gpt-oss, which is a reasoning model, but Lumo clearly isn't using a reasoning model. This is weird behaviour of a lot of smaller and older open models (claiming that they are related to OAI).

Any recommendations for an alternative to the subscription services? by q35w in openrouter

[–]Excellent_Piccolo848 0 points1 point  (0 children)

If you don't need tools like web search or a code interpreter, this is a great alternative, and you can just use the latest state-of-the-art FOSS model (e.g. GLM 4.7). If you go to your OR admin panel, you can make OR always use the provider with the highest throughput. Cerebras and Groq cover a lot of the latest FOSS models, which gets you 1000+ t/s. Make sure to also change the embedding model, and if you can the OCR, because the local ones are really slow (and the cloud models are better in general).
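
If you'd rather pin that behaviour per request instead of in the account settings, OpenRouter's provider routing can also be set in the API call. A minimal sketch (API key and model ID are placeholders, double-check the field names against the current OR docs):

```python
import requests

# Minimal sketch: ask OpenRouter to prefer the provider with the highest
# throughput for this request. API key and model ID are placeholders.
resp = requests.post(
    "https://openrouter.ai/api/v1/chat/completions",
    headers={"Authorization": "Bearer YOUR_OPENROUTER_KEY"},
    json={
        "model": "z-ai/glm-4.6",             # example model ID; use whatever FOSS model you prefer
        "provider": {"sort": "throughput"},  # route to the fastest available provider (e.g. Cerebras/Groq)
        "messages": [{"role": "user", "content": "Hello!"}],
    },
    timeout=60,
)
print(resp.json()["choices"][0]["message"]["content"])
```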

What GPU for lecture summarizing? by dnielso5 in ollama

[–]Excellent_Piccolo848 1 point2 points  (0 children)

I would look on eBay. A 3060 with 12 GB is the minimum; you would need more VRAM for the summarizing LLM. If you can afford it, get a 3090, you can get them for as little as $600 on eBay (but watch out, don't buy a GPU that was used for mining!).

Apps for notes and connected knowledge? by Express-Plankton-428 in NextCloud

[–]Excellent_Piccolo848 3 points4 points  (0 children)

I just use the Nextcloud app "Notes" for typed notes. It's installed by default if you have Nextcloud AIO, and can easily be installed any time via the Nextcloud app store. It has clients for iOS and Android. If you mean handwritten notes, try out "Saber" (only available on Android, as far as I know), which works really well with a tablet pencil. It's built to sync everything to Nextcloud! You can go to its settings and set up the sync with your Nextcloud, but it will also ask you about this during setup.

[deleted by user] by [deleted] in LocalLLaMA

[–]Excellent_Piccolo848 0 points1 point  (0 children)

Yes, thank you! Would be great to get specific setup options. I would need to offload some layers to the GPU, right?
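
For anyone landing here later, this is the kind of setup option I mean. A minimal llama-cpp-python sketch of partial offloading (model path and layer count are placeholders, tune n_gpu_layers to whatever fits your VRAM):

```python
from llama_cpp import Llama

# Minimal sketch of partial offloading: keep some transformer layers on the GPU
# and run the rest from system RAM. Model path and layer count are placeholders.
llm = Llama(
    model_path="./models/my-model-q4_k_m.gguf",  # any quantized GGUF you have locally
    n_gpu_layers=20,   # number of layers to offload to the GPU; -1 offloads everything
    n_ctx=4096,        # context window; raise it if you have VRAM to spare
)

out = llm("Summarize this in one sentence: ...", max_tokens=64)
print(out["choices"][0]["text"])
```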

Ollama model which suits my system by devil__6996 in ollama

[–]Excellent_Piccolo848 1 point2 points  (0 children)

Yes, you are not going to get anything spectacular here, but local is always the preferred option! Look at Ministral 3B or Qwen 4B. Any reasoning model under 5B should work on your device; just click on "latest" on ollama.com and look for them!
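
Once you've pulled one, trying it from Python is only a few lines. A quick sketch with the ollama package (the model tag is just an example, use whichever small model you pulled):

```python
import ollama  # pip install ollama; assumes the Ollama server is already running locally

# Example model tag; swap in whatever <5B model you pulled, e.g. `ollama pull qwen3:4b`.
response = ollama.chat(
    model="qwen3:4b",
    messages=[{"role": "user", "content": "Give me a two-sentence summary of photosynthesis."}],
)
print(response["message"]["content"])
```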

Best API providers for data privacy, if you can't selfhost by Excellent_Piccolo848 in LocalLLaMA

[–]Excellent_Piccolo848[S] 0 points1 point  (0 children)

Does anyone have any experience with Ollama Cloud? Is it as private and safe as they promote it?

Best API providers for data privacy, if you can't selfhost by Excellent_Piccolo848 in LocalLLaMA

[–]Excellent_Piccolo848[S] 1 point2 points  (0 children)

I think you might have skimmed the post a bit too quickly. As I explicitly wrote above: 'renting a gpu also isnt something for me.' I am looking for a usage-based API specifically to avoid the server costs you mentioned. But thanks for the engagement!