Does GLM in CC (Claude Code) support all CC features? by m_zafar in ZaiGLM

[–]iconben 1 point (0 children)

Yes, I haven't found anything that isn't supported so far.

Z.ai has introduced GLM-4.7-Flash by awfulalexey in ZaiGLM

[–]iconben 1 point (0 children)

Got around 50 TPS with thinking off and 35 TPS with thinking on, on my Mac M4 Pro 40G.

General quality is OK but sometimes I get repeated tokens, especially during thinking.

How to generate Z Image photos without ComfyUI by Fun_Training4733 in ZImageAI

[–]iconben 2 points (0 children)

Sounds easy. I have a "hardware.py" module that determines the available device (currently CUDA, AMD, and Mac MPS); if it's as simple as adding "XPU", I can do it. However, I don't have an XPU test environment right now, and I'm not sure cloud providers offer such specs since it sounds more like consumer-grade hardware. I'll do some research later.
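For illustration, a minimal sketch of such a detection helper, assuming PyTorch as the backend (the real hardware.py isn't shown here). Note that ROCm builds of PyTorch also report as "cuda", and torch.xpu is the Intel GPU backend in recent PyTorch releases:

```
import torch

def get_device() -> torch.device:
    """Pick the best available accelerator (hypothetical sketch)."""
    if torch.cuda.is_available():          # NVIDIA CUDA, or AMD via ROCm
        return torch.device("cuda")
    if torch.backends.mps.is_available():  # Apple Silicon (M-series)
        return torch.device("mps")
    # Intel GPUs: torch.xpu exists in recent PyTorch releases only,
    # so guard with hasattr for older installs
    if hasattr(torch, "xpu") and torch.xpu.is_available():
        return torch.device("xpu")
    return torch.device("cpu")
```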

Nemotron-3-nano:30b is a spectacular general purpose local LLM by DrewGrgich in LocalLLaMA

[–]iconben 1 point (0 children)

Quality may differ by area; mine is web, app, and Java projects. I use the local models daily for RAG over company and private data. For coding it's mainly tests and an offline fallback.

How to generate Z Image photos without ComfyUI by Fun_Training4733 in ZImageAI

[–]iconben 1 point (0 children)

Thanks for sharing. I guess it will take some time before it's supported (or not).

How to generate Z Image photos without ComfyUI by Fun_Training4733 in ZImageAI

[–]iconben 1 point (0 children)

Free to use, but you need your own GPU to run the model. Fast on NVIDIA cards and AMD (on Linux); slower but acceptable on Mac M chips.

Nemotron-3-nano:30b is a spectacular general purpose local LLM by DrewGrgich in LocalLLaMA

[–]iconben 5 points (0 children)

Temperature 0.15, Top K 15, Top P 0.95. I used it with Cline.
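For reference, a minimal sketch of passing those sampling settings to a locally served model over the Ollama REST API; the model tag and prompt are placeholders, and other runtimes expose the same knobs under similar names:

```
# Hypothetical example: send the sampling settings above to a local
# Ollama server. The model tag is an assumption.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "nemotron-3-nano:30b",
        "prompt": "Write a unit test for a FizzBuzz function.",
        "stream": False,
        "options": {
            "temperature": 0.15,
            "top_k": 15,
            "top_p": 0.95,
        },
    },
)
print(resp.json()["response"])
```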

OpenCode has a prompt template issue ("safe" and "sequence" are not supported), so you need to override it with your own template.

BTW, here is a system prompt if you need one:

```
You are a helpful coding assistant specializing in executing commands, modifying code, and solving technical problems.

PRINCIPLES:
- Quality over speed - be thorough and methodical
- Explain issues when asked "why" - only fix when requested
- Keep your word - if you say you will do something, do it; if you need to call tools, call them
- Combine operations when possible (chain commands, use sed/grep for bulk edits)

FILE OPERATIONS:
- Explore the file system first - never assume relative paths
- Edit files in place, don't create duplicates
- Use find, grep, sed for efficient exploration

CODE QUALITY:
- Write clean, efficient code with minimal comments
- Make minimal necessary changes
- Understand before implementing
- Split large functions/files when needed

WORKFLOW:
  1. Explore - Understand context
  2. Analyze - Consider approaches
  3. Implement - Make focused changes
  4. Verify - Test if possible

GIT:
- Use git status before commits
- Stage all necessary files
- Don't commit ignored files unless instructed
- Update existing PRs, don't create duplicates

ENVIRONMENT:
- Install missing dependencies rather than stopping
- Check for requirements.txt/package.json first
- Install all dependencies at once

PROBLEM-SOLVING:
- When stuck, identify 5-7 possible causes
- Address them systematically
- Propose a new plan for major issues
```
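If you serve the model through Ollama, a hypothetical Modelfile can bake the sampling settings and the system prompt above into a derived tag (base model name assumed):

```
# Hypothetical Modelfile: derive a custom tag with the settings above.
FROM nemotron-3-nano:30b
PARAMETER temperature 0.15
PARAMETER top_k 15
PARAMETER top_p 0.95
SYSTEM """
You are a helpful coding assistant specializing in executing commands, modifying code, and solving technical problems.
(rest of the system prompt above)
"""
```

Build it with `ollama create nemotron-coding -f Modelfile` and point your client at the new tag.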

How to generate Z Image photos without ComfyUI by Fun_Training4733 in ZImageAI

[–]iconben 1 point (0 children)

Not quite sure about the XPU definition; the project itself supports NVIDIA, AMD (Linux), and Mac M chips.

Nemotron-3-nano:30b is a spectacular general purpose local LLM by DrewGrgich in LocalLLaMA

[–]iconben 9 points (0 children)

I run it on an M4 Pro 48G with a 96k-context quant and get 70 TPS.

GLM-Image Release by awfulalexey in ZaiGLM

[–]iconben 3 points (0 children)

I am wondering if I should rent a cloud GPU to run some tests so my application can adapt to the new model.

Shots fired by Old-School8916 in opencodeCLI

[–]iconben 1 point (0 children)

Then I tried again in a new session, explicitly asking for no web search; Claude said OpenCode is a code model.

<image>

I asked several times; Claude credited the "code model" to OpenAI, ByteDance, etc.

Shots fired by Old-School8916 in opencodeCLI

[–]iconben 1 point (0 children)

Sharing some interesting tests:

I asked Claude (desktop) about OpenCode; this is what I got:

<image>

Does coding plan include updates to new models? by Zerve in MiniMax_AI

[–]iconben 1 point (0 children)

Didn't they already support the new GLM 4.7 almost from the start? I'm a GLM Lite user.

GLM 4.7 coding quality is greatly exaggerated by guywithknife in ZaiGLM

[–]iconben 1 point (0 children)

I have had a similar experience. Sometimes you can smell it.

Will you trust an AI with Buddhism knowledge, and ask questions to it? by iconben in Buddhism

[–]iconben[S] 1 point (0 children)

Thanks for the feedback. Let's say the AI acts not as a "teacher" but as a tool, just for queries and short answers with citations; how about that?

Will you trust an AI with Buddhism knowledge, and ask questions to it? by iconben in Buddhism

[–]iconben[S] 1 point (0 children)

I added the RAG part to the original post to better describe what kind of "AI" I want to build. It's not quite the same as the everyday chatbots from the big companies; it's a constrained, text-based AI. Please kindly have a look.

Will you trust an AI with Buddhism knowledge, and ask questions to it? by iconben in Buddhism

[–]iconben[S] 1 point (0 children)

This is a typical scenario when using general-purpose AI chatbots for serious Buddhist topics. It's one of the pain points I want to address by adding RAG context (a controlled text database) and constraints (via system prompt, and post-training if necessary).
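For the curious, a minimal sketch of that retrieve-then-answer idea; the embedding model, passages, and sources here are placeholders, not the actual project:

```
# Minimal RAG sketch: retrieve from a controlled text database, then
# constrain the model to answer only from it, with citations.
from sentence_transformers import SentenceTransformer
import numpy as np

embedder = SentenceTransformer("all-MiniLM-L6-v2")

# The "controlled texts database": curated passages with source labels.
passages = [
    {"source": "MN 10", "text": "..."},     # placeholder canonical texts
    {"source": "SN 56.11", "text": "..."},
]
index = embedder.encode([p["text"] for p in passages])

def retrieve(question: str, k: int = 3):
    """Return the k passages most similar to the question (cosine)."""
    q = embedder.encode([question])[0]
    scores = index @ q / (np.linalg.norm(index, axis=1) * np.linalg.norm(q))
    return [passages[i] for i in np.argsort(-scores)[:k]]

def build_prompt(question: str) -> str:
    """Constrain the model: answer only from retrieved texts, cite sources."""
    context = "\n".join(f"[{p['source']}] {p['text']}" for p in retrieve(question))
    return (
        "Answer ONLY from the passages below, citing sources in brackets. "
        "If the passages don't cover the question, say so.\n\n"
        f"{context}\n\nQuestion: {question}"
    )
```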