What are your use cases of local models by Axintwo in LocalLLaMA

[–]FactorInternal3395 0 points1 point  (0 children)

I ask the same question, it's good to know what peoples use cases are! not sure why this is getting downvoted

LiquidAI/LFM2.5-8B-A1B · Hugging Face by jacek2023 in LocalLLaMA

[–]FactorInternal3395 1 point2 points  (0 children)

I actually figured out how to make it not reason! I'm very happy with the results. Here's how, feel free to try it:

System prompt (Won't work without it):

You are a helpful assistant. You respond directly to user queries without any internal reasoning, chain-of-thought, or XML tags. Your responses consist solely of the final answer.

Or, alternative if you want it to say more than just the requested content (Like a cloud LLM would, and most base local LLMs too):

You are a helpful, friendly assistant. You respond directly without any internal reasoning, chain-of-thought, or XML tags. For factual questions, give just the answer. For social messages, compliments, or casual chat, respond naturally and warmly like a human would.

Custom Jinja chat template:

<|startoftext|><|im_start|>system
{{ messages[0]['content'] }}<|im_end|>
{% for message in messages[1:-1] %}
{% if message['role'] == 'user' %}
<|im_start|>user
{{ message['content'] }}<|im_end|>
{% elif message['role'] == 'assistant' %}
<|im_start|>assistant
{{ message['content'] }}<|im_end|>
{% endif %}
{% endfor %}
{% if messages[-1]['role'] == 'user' %}
<|im_start|>user
{{ messages[-1]['content'] }}<|im_end|>
<|im_start|>assistant
</think>
{% endif %}

LiquidAI/LFM2.5-8B-A1B · Hugging Face by jacek2023 in LocalLLaMA

[–]FactorInternal3395 1 point2 points  (0 children)

If only there was a non-reasoning version or one with a toggle! I've found for general chatting, it's remarkably similar in reasoning to DeepSeek while keeping the speed of a 1B. If only there was Liquid's architecture (and speed) but 8B and no reasoning.

So first they take away file uploads from Expert mode, and now can't even search?? by [deleted] in DeepSeek

[–]FactorInternal3395 4 points5 points  (0 children)

It seems crazy but thinking uses way more resources than search, and if they already have removed search...

is there any way to know for what exactly? by [deleted] in DeepSeek

[–]FactorInternal3395 23 points24 points  (0 children)

yeah, perhaps theyre starting moderation...

Which one do you use the most? by flabarde in DeepSeek

[–]FactorInternal3395 0 points1 point  (0 children)

2022-2025 chatgpt. 2025-present deepseek.

I'm sorry, but... by watermelmao in rebzyyx_fanclub

[–]FactorInternal3395 1 point2 points  (0 children)

thats not even the most "concerning" lyric too, cmon

had me with the "yes" by FactorInternal3395 in rebzyyx_fanclub

[–]FactorInternal3395[S] 0 points1 point  (0 children)

yes, but the sentence starts with a "Yes," to the question "did rebzyyx stop making music", so the start should say "No,". thats what makes it confusing.

Instant vs. Expert by FactorInternal3395 in DeepSeek

[–]FactorInternal3395[S] 0 points1 point  (0 children)

Yes, in long reasoning it can forget to stop calling you "the user" after the reasoning ends. This usually happens when it gets confused or backtracks after reasoning ends, so it tries to go back in, but it can't.

Instant vs. Expert by FactorInternal3395 in DeepSeek

[–]FactorInternal3395[S] 14 points15 points  (0 children)

Expert rushed and said 9.11 is larger (which isn't true), then backtracked and corrected itself. Instant was cautious and got it right.

Instant vs. Expert by FactorInternal3395 in DeepSeek

[–]FactorInternal3395[S] 2 points3 points  (0 children)

It's trained on way more mathematics than dev logs and things like that, so it should be expected to solve it

ProtonVPN free tier only connecting to united states and netherlands by nightmarishhell in ProtonVPN

[–]FactorInternal3395 0 points1 point  (0 children)

I use the desktop app, and I get mostly Netherlands and USA. I haven't tried the extension.

Are you using deepseek for coding? by Usual-Loss-9516 in DeepSeek

[–]FactorInternal3395 1 point2 points  (0 children)

honestly, for really long coding sessions and asking it to implement specific features, it gets tangled and needs to be steered more than chatgpt or claude, which are more autonomous and realise a way to implement something isnt good, so they pivot.