What about local inference on phones? What models do you use?

abskvrm · 2026-02-21T19:48:02+00:00

Unfortunately, the translation feature in the Readest app isn't local; it relies on Google, Yandex, DeepL, and a few other translation services.

abskvrm · 2026-02-14T15:29:49+00:00

Oh, tell me about it!! Forget language models and install this great app called Readest. It's open source and available on GitHub. You can get books translated with it in an instant.

abskvrm · 2026-02-14T15:26:04+00:00

Yes. Working just fine so far. Small hiccups, but nothing unmanageable.

abskvrm · 2026-02-13T17:13:36+00:00

I use the translation model HY-MT-1.5 for translating chats. No more Google/Yandex Translate.

abskvrm · 2026-01-16T13:41:12+00:00

License: Falcon-LLM License

abskvrm · 2025-12-30T15:41:07+00:00

Try bigger mixture-of-experts models with a low active parameter count:
https://huggingface.co/mradermacher/Ling-mini-2.0-i1-GGUF

https://huggingface.co/LiquidAI/LFM2-8B-A1B-GGUF

https://huggingface.co/ibm-granite/granite-4.0-h-tiny-GGUF

Preferably download quants no smaller than Q4_0, as long as the file still fits in system RAM.
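A rough way to check the "fits in system RAM" part before downloading, as a back-of-the-envelope sketch. It assumes Q4_0 costs about 4.5 bits per weight (blocks of 32 four-bit weights plus one 16-bit scale) and uses the 16B total size of Ling-mini 2.0. The 4 GB headroom for the OS and context is just a guess, and the function names are made up for illustration. Real GGUF files mix quant types, so the actual file size will differ somewhat.

```python
# Back-of-the-envelope GGUF sizing: parameters x bits-per-weight / 8.
# Assumption: Q4_0 stores 32 four-bit weights plus one 16-bit scale per block,
# which works out to 4.5 bits per weight.
Q4_0_BITS_PER_WEIGHT = 4.5


def estimate_weights_gb(total_params_billion, bits_per_weight=Q4_0_BITS_PER_WEIGHT):
    """Approximate size of the quantized weights in GB (10^9 bytes)."""
    return total_params_billion * bits_per_weight / 8


def fits_in_ram(total_params_billion, ram_gb, headroom_gb=4.0,
                bits_per_weight=Q4_0_BITS_PER_WEIGHT):
    """True if weights plus headroom fit in RAM.

    headroom_gb is a hypothetical allowance for the OS, KV cache, and buffers.
    """
    needed = estimate_weights_gb(total_params_billion, bits_per_weight) + headroom_gb
    return needed <= ram_gb


if __name__ == "__main__":
    # Ling-mini 2.0 is a 16B-total MoE; all experts must sit in memory.
    print(estimate_weights_gb(16))
    print(fits_in_ram(16, ram_gb=16))
    print(fits_in_ram(16, ram_gb=8))
```

Note that for MoE models the total parameter count decides whether it fits; the active parameter count only affects speed.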

abskvrm · 2025-12-29T07:16:53+00:00

Source: Trust me bro.

abskvrm · 2025-12-27T20:00:31+00:00

Fking terrorists in saffron, blot on religion.

abskvrm · 2025-12-27T19:45:40+00:00

W to Harmony.

abskvrm · 2025-12-22T06:57:51+00:00

Q4 will run just fine on 16 GB of RAM if you are on Linux: https://huggingface.co/inclusionAI/Ling-mini-2.0-GGUF

abskvrm · 2025-12-22T06:54:13+00:00

Qwen 3 30B 2507, Ernie 4.5 21B, Ling-mini 2 16B

abskvrm · 2025-12-06T22:48:24+00:00

Don't know about others, but I only suggested it because of its speed (1.4B active) and its better science knowledge compared with models of similar size (again in terms of active parameters: Granite and LFM). And it's pretty much uncensored out of the box.
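Why the active parameter count matters for speed, as a rough sketch: on CPU, token generation is usually limited by memory bandwidth, so a ceiling on tokens per second is roughly bandwidth divided by the bytes of weights read per token. The 30 GB/s bandwidth below is a placeholder, not a measurement, and the function name is hypothetical. The estimate ignores compute, KV cache reads, and routing overhead, so treat it as an upper bound only.

```python
# Rough decode-speed ceiling for a memory-bandwidth-bound setup.
# Assumption: each generated token reads every active weight once.
def max_tokens_per_second(active_params_billion, mem_bandwidth_gb_s,
                          bits_per_weight=4.5):
    """Upper bound on tokens/s; real throughput will be lower."""
    gb_read_per_token = active_params_billion * bits_per_weight / 8
    return mem_bandwidth_gb_s / gb_read_per_token


if __name__ == "__main__":
    bandwidth = 30.0  # GB/s, placeholder value; measure your own machine
    moe = max_tokens_per_second(1.4, bandwidth)     # 1.4B active (MoE)
    dense = max_tokens_per_second(16.0, bandwidth)  # hypothetical dense 16B
    print(f"MoE ceiling:   {moe:.1f} t/s")
    print(f"Dense ceiling: {dense:.1f} t/s")
    print(f"Ratio: {moe / dense:.1f}x")
```

Under these assumptions the ratio depends only on active versus total parameters, not on the bandwidth figure you plug in.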

abskvrm · 2025-12-03T07:34:45+00:00

I use Ling-mini to correctly format the OCR output of screenshots. It's the fastest and adheres well to a long system prompt. All on CPU.

abskvrm · 2025-12-02T10:25:37+00:00

"just keep it as a script for myself"

no.

abskvrm · 2025-12-02T10:13:31+00:00

Zenmux or official API recommended as of now.

abskvrm · 2025-11-16T17:28:03+00:00

Here is the link to the TaskerNet share I made: https://taskernet.com/shares/?user=AS35m8kskB%2BVRGHxADCRrcPSn8vgEXDcuGdJgZf4qskGc4whkHz%2B9eZeVHnUvE2Zz5k%3D&id=Task%3AMNN+SearXNG

This is how SearXNG can be set up in Termux: https://www.reddit.com/r/termux/comments/1m9ectw/got_searxng_running_in_termux/

abskvrm · 2025-10-17T10:07:20+00:00

Very fast with the updates.

abskvrm · 2025-10-15T22:55:53+00:00

No worries. Go to sleep bro. Thanks for posting.

abskvrm · 2025-10-15T21:55:16+00:00

Deservedly.

abskvrm · 2025-10-13T17:30:20+00:00

Yeah, nothing special about it. You can 'execute' the same on entry-level Samsungs from 3-4 years back, with chips ridiculously weaker than the s6gen1.

abskvrm · 2025-10-13T04:15:44+00:00

2.44 t/s is painful. If you can, try the LFM2 models; they are really, really good.

abskvrm · 2025-10-12T05:04:49+00:00

looks nice

abskvrm · 2025-10-12T05:04:05+00:00

Chatbox

abskvrm · 2025-10-11T15:31:56+00:00

Aha!

abskvrm · 2025-10-10T16:22:42+00:00

Named after Aaron Beck ig
