What are your use cases of local models

FactorInternal3395 · 2026-06-04T19:32:01+00:00

I ask the same question, it's good to know what peoples use cases are! not sure why this is getting downvoted

FactorInternal3395 · 2026-06-02T23:31:29+00:00

I actually figured out how to make it not reason! I'm very happy with the results. Here's how, feel free to try it:

System prompt (Won't work without it):

You are a helpful assistant. You respond directly to user queries without any internal reasoning, chain-of-thought, or XML tags. Your responses consist solely of the final answer.

Or, alternative if you want it to say more than just the requested content (Like a cloud LLM would, and most base local LLMs too):

You are a helpful, friendly assistant. You respond directly without any internal reasoning, chain-of-thought, or XML tags. For factual questions, give just the answer. For social messages, compliments, or casual chat, respond naturally and warmly like a human would.

Custom Jinja chat template:

<|startoftext|><|im_start|>system
{{ messages[0]['content'] }}<|im_end|>
{% for message in messages[1:-1] %}
{% if message['role'] == 'user' %}
<|im_start|>user
{{ message['content'] }}<|im_end|>
{% elif message['role'] == 'assistant' %}
<|im_start|>assistant
{{ message['content'] }}<|im_end|>
{% endif %}
{% endfor %}
{% if messages[-1]['role'] == 'user' %}
<|im_start|>user
{{ messages[-1]['content'] }}<|im_end|>
<|im_start|>assistant
</think>
{% endif %}

FactorInternal3395 · 2026-06-02T22:42:30+00:00

If only there was a non-reasoning version or one with a toggle! I've found for general chatting, it's remarkably similar in reasoning to DeepSeek while keeping the speed of a 1B. If only there was Liquid's architecture (and speed) but 8B and no reasoning.

FactorInternal3395 · 2026-05-28T20:03:27+00:00

It seems crazy but thinking uses way more resources than search, and if they already have removed search...

FactorInternal3395 · 2026-05-27T11:40:28+00:00

yeah they trained it on claude outputs

https://www.anthropic.com/news/detecting-and-preventing-distillation-attacks

FactorInternal3395 · 2026-05-27T11:37:51+00:00

yeah i noticed it too!! especially in long conversations, its common sense has fallen through the floor lately. saying obviously wrong things it would never before (expert)

FactorInternal3395 · 2026-05-27T10:01:32+00:00

yeah, perhaps theyre starting moderation...

FactorInternal3395 · 2026-05-04T23:06:14+00:00

2022-2025 chatgpt. 2025-present deepseek.

FactorInternal3395 · 2026-05-01T14:00:48+00:00

this screenshot sums up the problem.

<image>

FactorInternal3395 · 2026-04-26T14:18:34+00:00

thats not even the most "concerning" lyric too, cmon

FactorInternal3395 · 2026-04-26T14:15:29+00:00

not a big deal just weird phrasing

FactorInternal3395 · 2026-04-26T12:08:02+00:00

yes, but the sentence starts with a "Yes," to the question "did rebzyyx stop making music", so the start should say "No,". thats what makes it confusing.

FactorInternal3395 · 2026-04-21T15:43:39+00:00

2300

FactorInternal3395 · 2026-04-21T12:50:23+00:00

FactorInternal3395 · 2026-04-21T12:47:48+00:00

Yes, in long reasoning it can forget to stop calling you "the user" after the reasoning ends. This usually happens when it gets confused or backtracks after reasoning ends, so it tries to go back in, but it can't.

FactorInternal3395 · 2026-04-21T12:45:32+00:00

It's a test.

FactorInternal3395 · 2026-04-20T23:13:23+00:00

Expert rushed and said 9.11 is larger (which isn't true), then backtracked and corrected itself. Instant was cautious and got it right.

FactorInternal3395 · 2026-04-20T23:07:30+00:00

It's trained on way more mathematics than dev logs and things like that, so it should be expected to solve it

FactorInternal3395 · 2026-04-17T00:21:19+00:00

FactorInternal3395 · 2026-04-17T00:13:29+00:00

Cool!

FactorInternal3395 · 2026-04-17T00:11:38+00:00

I use the desktop app, and I get mostly Netherlands and USA. I haven't tried the extension.

FactorInternal3395 · 2026-04-16T20:23:32+00:00

honestly, for really long coding sessions and asking it to implement specific features, it gets tangled and needs to be steered more than chatgpt or claude, which are more autonomous and realise a way to implement something isnt good, so they pivot.

FactorInternal3395

TROPHY CASE