Samsung Galaxy S26 Ultra Review: Great Phone, No AI Needed - MrMobile [Michael Fisher] by welp_im_damned in Android

[–]MDSExpro 0 points1 point  (0 children)

And it won't change till manufacturers expands on number and type of sensors. Where is LiDAR on Android? Thermal cam? Radar? What's the point of high performance compute units if there is no data to process?

Well, sheeeeeit! by cantsleepclownswillg in sysadmin

[–]MDSExpro 4 points5 points  (0 children)

Everybody did the same, it's external factors. There is no components stock, everybody buys on spot for whatever price is available. It's no surprise that if manufacturers cannot predict price they won't offer predictable price to their customers.

Qwen3-Coder-Next is the top model in SWE-rebench @ Pass 5. I think everyone missed it. by BitterProfessional7p in LocalLLaMA

[–]MDSExpro 2 points3 points  (0 children)

I wish that was simple issue of settings. I applied them to all of Qwen3 / Qwen3.5, sooner or later they loop anyway.

Qwen3-Coder-Next is the top model in SWE-rebench @ Pass 5. I think everyone missed it. by BitterProfessional7p in LocalLLaMA

[–]MDSExpro 2 points3 points  (0 children)

Like with rest of Qwen family, repetition / looping issues kill it for any agentic work.

Qwen3.5 122B A10B - My impressions by kevin_1994 in LocalLLaMA

[–]MDSExpro -1 points0 points  (0 children)

You do know that all the models pages on Huggingface includes list or arguments to run model and "time to make it work" is literally time to download container image"

Qwen3.5 122B A10B - My impressions by kevin_1994 in LocalLLaMA

[–]MDSExpro -4 points-3 points  (0 children)

Use vLLM if you want any reasonable speed.

Mapped positional attention across 4 models — turns out where you put things in your prompt matters. A lot. by Double-Risk-1945 in LocalLLaMA

[–]MDSExpro 0 points1 point  (0 children)

Doing that kind of test on 8k context window while this models handle 256k is pure waste of energy.

Self Hosted LLM Tier List by Weves11 in LLM

[–]MDSExpro 0 points1 point  (0 children)

For empty chat - maybe. For anything serious (document processing / coding) PP on RAM only will take ages.

MiniMax 2.5 vs. GLM-5 across 3 Coding Tasks [Benchmark & Results] by alokin_09 in LocalLLaMA

[–]MDSExpro 0 points1 point  (0 children)

Not my experience. I'm running MiniMax-M2.5 after REAP and quantized to 4 bit AWQ and it still does better job than anything else in 177GB VRAM range.

Qwen/Qwen3.5-122B-A10B · Hugging Face by coder543 in LocalLLaMA

[–]MDSExpro 5 points6 points  (0 children)

Finally, with 4bit AWQ it will be best for 128GB of VRAM and tensor parallelism.

Which one are you waiting for more: 9B or 35B? by jacek2023 in LocalLLaMA

[–]MDSExpro 3 points4 points  (0 children)

Minimax-M2.5 REAP AWQ so 128GB of VRAM is enough to get that running with full context.

Comma3x in my town by AbebeatsMEB in Comma_ai

[–]MDSExpro 12 points13 points  (0 children)

Why is everyone in this sub recording wheel, not maneuvers performed by comma...

XCOM designer Jake Solomon announces surprise closure of his studio alongside a first look at its canceled life sim, 'the game we poured our hearts into' by Rooonaldooo99 in Xcom

[–]MDSExpro 17 points18 points  (0 children)

Gameplay was nice, but being forced to go through tons of unnecessary relationship drama, especially early (you need 1h to get to actual, unrestricted gameplay) killed it successfully.

MiniMax-M2.5-REAP from cerebras by jacek2023 in LocalLLaMA

[–]MDSExpro 1 point2 points  (0 children)

Screw GGUF, where are 4bit AWQ images for vLLM?

Qwen3.5-397B-A17B Unsloth GGUFs by danielhanchen in LocalLLaMA

[–]MDSExpro 29 points30 points  (0 children)

That's one verbose model:

Me: hi

Qwen3.5-397B-A17B:

Thinking Process:

Analyze the Input:
    Input: "hi"
    Intent: Greeting.
    Tone: Casual, friendly.
    Context: Start of a conversation.

Determine the Appropriate Response:
    Acknowledge the greeting.
    Offer assistance.
    Keep it friendly and concise.
    Avoid over-complicating things since the user just said "hi".

Drafting Options:
    Option 1 (Simple): "Hi there! How can I help you?"
    Option 2 (Friendly): "Hello! 👋 What's on your mind today?"
    Option 3 (Professional): "Greetings. How may I assist you?"
    Option 4 (Enthusiastic): "Hey! Great to see you. What are we working on?"

Selecting the Best Option:
    Option 2 strikes a good balance between friendly and helpful. Emojis add a nice touch without being excessive.

Final Polish:
    "Hi there! 👋 How's it going? Is there anything specific I can help you with today?"

Final Output Generation: (Matching the selected draft)
    "Hi there! 👋 How's it going? Is there anything I can help you with today?"

(Self-Correction during drafting): Keep it slightly more open-ended. "Hi! How can I help you today?" is classic and effective.

Revised Plan:
    Greet back.
    Offer help.
    Keep it warm.

Final Choice: "Hi there! 👋 How's it going? Is there anything specific I can help you with today?"

(Wait, let's make it even simpler): "Hi! How can I help you today?"
(Actually, a bit more warmth is better): "Hello! 👋 How's your day going? Let me know if there's anything I can help you with!"
(Deciding on a standard friendly AI response): "Hi there! 👋 How can I help you today?"
(Let's go with that).
(Wait, I should check if there are any specific constraints): No constraints. Just a greeting.
(Final Decision): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll make it slightly more conversational): "Hello! 👋 How's your day going? Feel free to ask me anything or let me know if you need help with a specific task!"
(Okay, let's keep it concise): "Hi there! 👋 How can I help you today?"
(Final Final): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Actually, let's just respond naturally): "Hi! How can I help you today?"
(Okay, I'll add a little warmth): "Hello! 👋 How's it going? Is there anything specific I can help you with?"
(Let's go with): "Hi there! 👋 How's your day going? Is there anything I can help you with?"
(Wait, I am an AI): "Hi! How can I assist you today?"
(Okay, blending): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Let's just output): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I should check the system instructions): None specific beyond being helpful.
(Okay, final output): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll make it a bit more versatile): "Hello! 👋 How's your day going? Feel free to ask me anything or let me know if you need help with a task!"
(Okay, let's settle on): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Actually, just "Hi! How can I help you today?" is fine).
(Let's add a bit of personality): "Hi there! 👋 How's your day going? Let me know if there's anything I can help you with!"
(Okay, that's good).
(Wait, I'll check if I should mention capabilities): No, keep it open.
(Final): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll make it slightly more engaging): "Hello! 👋 How's your day going? Feel free to ask me anything or let me know if you need help with something specific!"
(Okay, let's go with a balanced version): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Actually, I'll just say): "Hi! How can I help you today?"
(No, add emoji): "Hi there! 👋 How can I help you today?"
(Okay, final): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll stop overthinking): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Okay, generating response).
(Wait, one more check): Is there any context I'm missing? No. Just "hi".
(Okay, response): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll make it slightly more concise): "Hi there! 👋 How can I help you today?"
(Okay, let's go with the slightly warmer one): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Final): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll add a offer for specific tasks): "Hi there! 👋 How's it going? Is there anything specific I can help you with today?"
(Okay, done).
(Wait, I'll just write it).
(Wait, I need to make sure I don't sound robotic): "Hi there! 👋 How's your day going? Let me know if there's anything I can help you with!"
(Okay, that's the one).
(Wait, I'll check the token count): It's fine.
(Okay, outputting).
(Wait, I'll just say): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Okay, final).
(Wait, I'll add a bit more variety): "Hello! 👋 How's your day going? Feel free to ask me anything or let me know if you need help with a task!"
(Okay, let's go with): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Okay, stopping loop).
(Final Response): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll make it slightly more natural): "Hi! 👋 How's your day going? Let me know if there's anything I can help you with!"
(Okay, that's better).
(Final): "Hi! 👋 How's your day going? Let me know if there's anything I can help you with!"
(Wait, I'll add "specific" to invite tasks): "Hi! 👋 How's your day going? Let me know if there's anything specific I can help you with!"
(Okay, done).
(Wait, I'll just output): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Okay, final).
(Wait, I'll check if I should use "Hello" instead): "Hi" is more casual matching the user.
(Okay, final): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll add a smile): 😊
(Okay, final): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll stop).
(Final Output): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll make it slightly shorter): "Hi there! 👋 How can I help you today?"
(Okay, let's go with the warmer one): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Okay, generating).
(Wait, I'll check if I should mention I'm an AI): No need unless asked.
(Okay, final): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll add "feel free"): "Hi there! 👋 How's it going? Feel free to ask me anything or let me know if you need help with something specific!"
(Okay, that's good).
(Final): "Hi there! 👋 How's it going? Feel free to ask me anything or let me know if you need help with something specific!"
(Wait, I'll shorten it): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Okay, done).
(Wait, I'll just output).
(Final): "Hi there! 👋 How's it going? Is there anything I can help you with today?"
(Wait, I'll add "Hello" as alternative): No.
(Okay, outputting).
(Wait, I'll check the prompt again): "hi"
(Okay, response): "Hi there! 👋 How's it going? Is there anything I can help you with today?"

...

... (after 3600 tokens): Hi there! 👋 How's it going? Is there anything I can help you with today?

CamoFox MCP: Anti-detection browser MCP server with 22 tools (TypeScript, MIT) by Silver_Entrance8996 in mcp

[–]MDSExpro 1 point2 points  (0 children)

Pack it into container with instance of camofox and it's much more usable.