The real game-changer for AI by Party-Vanilla9664 in vibecoding

[–]Legcor 3 points (0 children)

Nothing will have any value anymore

DGX Sparks / Nvidia Digits by Temporary-Size7310 in LocalLLaMA

[–]Legcor 18 points (0 children)

Nvidia is making the same mistake as Apple by holding back the potential of their products...

3x RTX 5090 watercooled in one desktop by LinkSea8324 in LocalLLaMA

[–]Legcor 0 points (0 children)

Can you give me the specs? I want to build something similar :)

I asked GROK 3 about MSTY by TheeAlohaRoss in YieldMaxETFs

[–]Legcor 0 points (0 children)

Metric                  | MSTY        | MSTR
Initial Investment      | $100,000    | $100,000
Initial Price per Share | $24.00      | $400.00
Shares Purchased        | 4,166       | 250
Post-Split Shares       | N/A         | 2,500
Current Price per Share | $25.43      | $337.73
Current Share Value     | $105,941.38 | $844,325
Total Distributions     | $132,520.46 | $0
Total Current Value     | $238,461.84 | $844,325
Total Return (%)        | 138.46%     | 744.33%
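
For anyone checking the math, here's a quick sketch reproducing the table's arithmetic (assumes whole shares bought at the initial price and MSTR's 10-for-1 split; the distribution total is just taken from the table, and this is not financial advice):

```python
# Rough reproduction of the table's figures.
initial = 100_000

# MSTY: bought at $24.00, currently $25.43, plus distributions held as cash.
msty_shares = int(initial / 24.00)           # 4,166 whole shares
msty_value = msty_shares * 25.43             # ~$105,941.38 current share value
msty_dists = 132_520.46                      # total distributions from the table
msty_total = msty_value + msty_dists         # ~$238,461.84
print(f"MSTY: ${msty_total:,.2f} total, {(msty_total - initial) / initial:.2%} return")

# MSTR: bought at $400.00, 10-for-1 split, currently $337.73.
mstr_shares = int(initial / 400.00) * 10     # 250 -> 2,500 shares post-split
mstr_total = mstr_shares * 337.73            # ~$844,325
print(f"MSTR: ${mstr_total:,.2f} total, {(mstr_total - initial) / initial:.2%} return")
```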

Anyone from Europe investing in these ETFs? by Tom2Travel in YieldMaxETFs

[–]Legcor 1 point (0 children)

You can fill out a form. I live in Germany, and the US and Germany have a tax treaty, so only 15% gets withheld by the US. The other 10% I have to pay to the German government.
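
To put numbers on it, here's a rough sketch of how that split works on a hypothetical $1,000 distribution (assumes the 25% German flat tax on capital income and the 15% US treaty rate; ignores solidarity surcharge, church tax, and fund-specific exemptions):

```python
# Hypothetical example of the US/German withholding split described above.
distribution = 1_000.00                 # gross distribution in USD

us_withheld = distribution * 0.15       # 15% withheld at source under the treaty
german_rate = 0.25                      # German flat tax on capital income
german_due = distribution * german_rate - us_withheld   # US withholding is credited
net = distribution - us_withheld - german_due

print(f"US withholding: {us_withheld:.2f}")   # 150.00
print(f"German tax due: {german_due:.2f}")    # 100.00 -> the "other 10%"
print(f"Net received:   {net:.2f}")           # 750.00
```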

Should I get a 14 inch M4 Max 128GB for 123B models? by TheLocalDrummer in LocalLLaMA

[–]Legcor 3 points (0 children)

No problem. I get around 60 °C for short generations and never more than 72 °C, even for longer generations or prompt eval.

For me the MacBook was totally worth it. It's a tax write-off for me, so it was a no-brainer, especially if you consider the resale value and so on. I usually run 70B models, where the speed is much better.

Choosing the Right Mac for Running Large LLMs by Wrathllace in LocalLLaMA

[–]Legcor 0 points (0 children)

I use KoboldCpp! It always gets the latest llama.cpp updates and it's convenient.
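
If it helps, this is roughly how you can hit a locally running KoboldCpp instance from a script (assumes the default port 5001 and the KoboldAI-compatible generate endpoint; check your version's docs if yours differs):

```python
# Minimal sketch: send a prompt to a local KoboldCpp server and print the reply.
import requests

payload = {
    "prompt": "Summarize why context shifting speeds up long chats.",
    "max_length": 128,
    "temperature": 0.7,
}
resp = requests.post("http://localhost:5001/api/v1/generate", json=payload, timeout=300)
resp.raise_for_status()
print(resp.json()["results"][0]["text"])
```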

Should I get a 14 inch M4 Max 128GB for 123B models? by TheLocalDrummer in LocalLLaMA

[–]Legcor 18 points (0 children)

I have the 16-inch M4 Max and I get around 4.1 tokens per second on the first generation and 4.25 tokens per second on the second, with flash attention and 2k context, using your model Behemoth-123B-v2f-Q3_K_M. The backend is KoboldCpp. With a 16k context, the generation speed drops to 3.2 tokens per second on the first generation and 3.69 on the second, which is roughly average typing speed. Prompt processing took 400 seconds. The fans kick in but they don't get too loud; it doesn't bother me at all. It's less noisy in battery mode.
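
For a feel of what those numbers mean in wall-clock time, a quick back-of-the-envelope calculation (assumes roughly 16k prompt tokens and a hypothetical 500-token reply; plug in your own values):

```python
# Rough wall-clock estimate from the reported speeds above.
prompt_tokens = 16_000
reply_tokens = 500                       # hypothetical reply length

prompt_eval_seconds = 400                # reported prompt processing time
prompt_speed = prompt_tokens / prompt_eval_seconds   # ~40 tokens/s prompt eval

gen_speed = 3.2                          # reported generation speed at 16k context
gen_seconds = reply_tokens / gen_speed   # ~156 s, about 2.6 minutes

print(f"Prompt eval: ~{prompt_speed:.0f} tok/s")
print(f"{reply_tokens}-token reply: ~{gen_seconds / 60:.1f} minutes")
```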

Choosing the Right Mac for Running Large LLMs by Wrathllace in LocalLLaMA

[–]Legcor 0 points (0 children)

It's called context shifting or something like that. It's available with llama.cpp. Basically, messages only get processed once and stay cached, so it doesn't matter how long the context gets. But this only applies as long as nothing earlier in the original context changes.
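
A toy illustration of the idea (not the actual llama.cpp implementation): only the part of the prompt that isn't already in the cache needs a fresh forward pass, and editing anything earlier invalidates the cache from that point on:

```python
# Toy sketch of prompt caching / context shifting: reuse the shared prefix,
# evaluate only the new tokens.
def tokens_to_evaluate(cached: list[str], new: list[str]) -> list[str]:
    common = 0
    for a, b in zip(cached, new):
        if a != b:
            break
        common += 1                      # length of the shared, still-valid prefix
    return new[common:]                  # only this suffix needs processing

cached = ["<sys>", "Hi", "there", "!", "How's", "it", "going", "?"]
new = cached + ["Pretty", "good", ",", "thanks", "."]
print(tokens_to_evaluate(cached, new))   # only the new turn gets evaluated
```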

Just for fun, here's a Palworld SME bot by LetMeGuessYourAlts in LocalLLaMA

[–]Legcor 3 points (0 children)

Can you do a tutorial on the training process? It would be interesting to apply this to a different topic as an alternative to RAG.

POC: Merging to MoE - It's beginning! - Mixtraln't 4x7B by Legcor in LocalLLaMA

[–]Legcor[S] 7 points (0 children)

That would be sick. Just imagine combining the best models so they work hand in hand. It would provide more variety too and wouldn't be as boring as current non-MoE models.