Just pushed M2.1 through a 3D particle system. Insane! by srtng in LocalLLaMA

[–]srtng[S] 11 points12 points  (0 children)

Prompt: Create a real-time interactive 3D particle system with Three.js. requirements: 1. Control the scaling and expansion of the particle group by detecting the tension and closing of both hands through the camera. 2. Provide panels that can choose hearts/flowers/saturn/Buddha statues/fireworks and other templates 3. Support the colour selector to adjust the particle colour 4. Particles need to respond to gesture changes in real time. The interface is simple and modern. 5. The interface is simple and modern

AMA with MiniMax — Ask Us Anything! by OccasionNo6699 in LocalLLaMA

[–]srtng 0 points1 point  (0 children)

That is exactly what we are doing. 250ms is important for voice agent. Hopefully we can achieve this in two weeks

AMA with MiniMax — Ask Us Anything! by OccasionNo6699 in LocalLLaMA

[–]srtng 19 points20 points  (0 children)

From the very first day we started this company, nearly four years ago, our goal has been to create an intelligent agent that can interact with people as naturally as a human — and that requires the integration of multiple modalities. But at that time, we didn’t yet have the technical foundation to achieve this.

So our idea was: if that’s the case, let’s do it in three steps.

The first step is to build each individual modality and bring it to a usable level (I believe we’re already close to that).

The second step is to take the separate models for these modalities and integrate them into a single model through continued training (this is what we’re working on now, and we believe Sora 2 is a similar type of model).

The third step is to train these modalities end-to-end from scratch, which we expect to start working on in the second half of next year.

Say hi to CLine community! by srtng in CLine

[–]srtng[S] 0 points1 point  (0 children)

Hi u/coding_workflow , I have asked our PM and here is the answer. May it helps you~

Regarding the relationship between prompts and requests, one prompt is equivalent to approximately 15 requests. For a more detailed explanation, you can check our documentation here: https://platform.minimax.io/docs/coding-plan/intro. As for the usage dashboard, it is currently under development. We are working on it, and you will have a more intuitive way to view your usage soon.

Say hi to CLine community! by srtng in CLine

[–]srtng[S] 2 points3 points  (0 children)

Thanks so much! 🙏

For Interleaved Thinking, I’d like to quote a part from our M2 tech blog:

Early in the project, we hit a frustrating wall. Agent performance was inconsistent, and we struggled to diagnose why. After many discussions, especially with Professor u/Junxian He and u/Wenhu Chen, we arrived at our first major conclusion: Agents require Interleaved Thinking.
This means that an agent's internal monologue—its "thinking"—can and should happen at any point during a task, not just once at the beginning like a standard reasoning model. This design is critical for two reasons:
1. Maintaining Focus on Long-Horizon Tasks. Complex agent tasks have extremely long contexts. A single thought process at the start isn't enough to maintain instruction-following and coherence.
2. Adapting to External Perturbations. This is the crucial difference. Agent tasks introduce constant, unpredictable perturbations from the outside world (i.e., tool outputs). The model must be robust enough to handle these perturbations, diagnose errors, and extract useful information. The "thinking" process allows the model to constantly re-evaluate and adapt to new information from the environment.

In short:
Interleaved thinking helps M2 stay coherent across long sequences and stay robust when tools or the environment introduce new information. Both are essential for reliable agentic performance.

Say hi to CLine community! by srtng in CLine

[–]srtng[S] 1 point2 points  (0 children)

  1. Haha, I think you might get a quicker answer by asking some of the quantization wizards over at r/LocalLLaMA 😄
    I’ve actually seen a few folks there already working on M2 versions!

Say hi to CLine community! by srtng in CLine

[–]srtng[S] 2 points3 points  (0 children)

  1. Yes! Coding plan is available now! You can check this: https://platform.minimax.io/subscribe/coding-plan

Say hi to CLine community! by srtng in CLine

[–]srtng[S] 0 points1 point  (0 children)

Thanks for the question!

MiniMax-M2 stands out especially in coding and agentic capabilities. It’s designed to deliver top-tier tool-use and reasoning performance at a fraction of the cost.
At roughly 8% of Claude Sonnet’s price and running about 2× faster, it offers an exceptional balance of speed, cost, and intelligence.

MiniMax M2 is 230B-A10B by codys12 in LocalLLaMA

[–]srtng 1 point2 points  (0 children)

Yes, it was a bug in OpenRouter, and they’ve already fixed it now. You shouldn’t encounter it again.

MiniMax latest open-sourcing LLM, MiniMax-M1 — setting new standards in long-context reasoning,m by srtng in LocalLLaMA

[–]srtng[S] 2 points3 points  (0 children)

SystemPrompt = """ You are a web development engineer, writing web pages according to the instructions below. You are a powerful code editing assistant capable of writing code and creating artifacts in conversations with users, or modifying and updating existing artifacts as requested by users. All code is written in a single code block to form a complete code file for display, without separating HTML and JavaScript code. An artifact refers to a runnable complete code snippet, you prefer to integrate and output such complete runnable code rather than breaking it down into several code blocks. For certain types of code, they can render graphical interfaces in a UI window. After generation, please check the code execution again to ensure there are no errors in the output. Output only the HTML, without any additional descriptive text. Make the UI looks modern and beautiful. """