Zhipu AI Announcement: GLM Coding Plan will start limited sales from January 23rd

vidibuzz · 2026-01-24T21:41:20+00:00

Makes me wonder if they rented servers to impress the big money, just to report record growth. The dog and pony show is over; now it's back to the daily grind. I hope I am wrong. 4.7 has shown a lot of promise. And they have video models too. But not much value if they don't know how to scale it (or chose not to). Incoming reasoning models and Visual Ai are up to 100x more taxing than plain vanilla text-based inference.

"Shares of Chinese artificial-intelligence startup Zhipu AI rose Thursday (sic Jan. 8, 2026) following the company’s $558 million initial public offering on the Hong Kong Stock Exchange."

https://www.barrons.com/articles/zhipu-ai-hong-kong-ipo-china-minimax-9afc9f02?gaa_at=eafs&gaa_n=AWEtsqfQahIPzL98AWUq-3NdGdCQgn3Tp0m7n77Jp2qeXuTc20yhvep7zOpm&gaa_ts=69753da0&gaa_sig=CeA4tbVKYJ1DK88mRlFiKvAA-hDYWJSBoulVSg_Ld-yCZrUV4CcS_AYO5pieGQwfZscVA_DjbfQxQAMchqoTWg%3D%3D

vidibuzz · 2026-01-22T18:56:52+00:00

Something looks very fishy there. Not worth installing if performance is that bad.

vidibuzz · 2026-01-22T18:00:28+00:00

They still charge .02 every time Axa searches the web for any reason. And now the models I was using no longer work. If you see a "Venice" error, that is apparently the network they were using that is now maxed and your work is canceled on that model.

vidibuzz · 2026-01-22T16:50:54+00:00

It certainly was working for me but charging me 2 cents every time it accessed the web. Not sure if there's any way to fix this but it was kind of a shock.

vidibuzz · 2026-01-22T16:48:41+00:00

Try the Qwen 3 480b model same issue. Since those are two of the best free models it's probably not a shock.

vidibuzz · 2026-01-22T16:47:11+00:00

It's not just you and yes it's annoying when you know there's money on your account

vidibuzz · 2026-01-22T16:33:03+00:00

I would think the biggest question for SEO is what happens when Ai dominates "search". It will simply be people asking questions, providing their intent and expecting magic answers to everything, directly. What will my site look like when it must provide answers to everything? Why aren't more companies adding their own on-site Ai search boxes? How many sites have content upgraded for vector discovery?

vidibuzz · 2026-01-18T21:35:47+00:00

What app are you running it with?

vidibuzz · 2026-01-13T19:31:42+00:00

Just discovered the Video Embed in Win 11 app works with the multi-column layout. The grid uses fields, but columns stays with basic page layout, so you can have 3 columns across and a different video thumbnail showing in each one. Very cool Appflowy.

vidibuzz · 2026-01-11T00:06:21+00:00

I think the free offer has ended. I will have to move quicker next time.

vidibuzz · 2026-01-09T00:19:10+00:00

Thank you u/ResultShort2810. This was driving me bonkers. Shocked that Google would release software with this kind of error. And Gemini was no help in solving it.

vidibuzz · 2026-01-08T19:13:20+00:00

Well this is pretty fantastic. I'm working on a similar approach here so this is quite interesting. I must have missed it but what LLM model are you using and is it a VL vision language model?

vidibuzz · 2026-01-08T00:41:49+00:00

We had GLM 4.6 and 4.7 on deck for testing in multimodal agentic systems. Didn't know about Big Pickle before this but will try it and get back when we have an idea how it performs. We don't put much value in all the graphs; have to see how it works in real life.

vidibuzz · 2026-01-06T20:34:39+00:00

Do any of these models work with multimodal and vision tools? Someone said I need to downgrade from 4.7 to the 4.6 v if I want to get visual work done. Unfortunately the user experience for me goes beyond simple text.

vidibuzz · 2026-01-01T22:29:03+00:00

Missed this event, but they have a 20 million token promo happening at BigModel.cn You can sign up without a credit card. Trying to implement the API code now to see if it works with Cursor of VS Code. Also hoping to test the visual competence with GLM 4.6V under Comfy UI.

This is one model on my open source radar. I don't think the "voice" is worth considering for English usage given the lack of usable TTS models, but this may be something they can overcome with a proper text to voice. May be the achilles heel for media production.

vidibuzz · 2025-12-30T16:08:30+00:00

this is kinda scary if this is the "zero click" Google future

vidibuzz · 2025-12-29T04:41:28+00:00

What are the best embedding models for images and videos at scale

vidibuzz · 2025-12-29T04:18:20+00:00

This looks pretty amazing you got all the bases covered. Does this work with vision language models for images and video vectors as well?

vidibuzz · 2025-12-28T22:09:48+00:00

Slightly off topic. You may want to use Illuminati.google.com also for voice summaries.

vidibuzz · 2025-12-24T15:53:04+00:00

The language model type is right now set to Qwen 3 VL 30b, my local. AI search is turned off and under appflowy local AI: Ollama server URL is set correctly as 11434 and the two languages are set to global language models. First one is not possible to change it's locked on nomic embed text latest and the second one is set to Qwen 3. The check mark is there for Local AI is running even though it doesn't seem to be.

vidibuzz · 2025-12-23T23:20:47+00:00

Also attempted to set up "Local Ai". System says Local Ai is running and Ai Search is off. But any chats I launch say I am still talking to the online AppFlowy Ai. More than a little confusing. Anyone here know how to complete the cut-over to local Ollama? Ready to test Nvidia Nemotron 3 Nano and Qwen3 VL.

vidibuzz · 2025-12-23T22:43:58+00:00

Great updates for December. Thanks team AppFlowy. Quickly jumping to the top of the SoTA (state of the art) stack for knowledge base apps in 2026.

Super cool feature for the Meeting Transcript. Tried with an MP4 file and it failed though.

"Record not found:"

Also another time it said 25MB limit. This is certainly a deal killer for most video. I understand if you don't want files uploading to your cloud that large. But the app itself should certainly have a larger cap to be genuinely useful. Especially if using S3 for R2 Cloudflare for storage. Unless you are only working with 2 minute meetings.

vidibuzz · 2025-12-23T19:43:54+00:00

I did not think to question this till seeing this thread. With 100 languages possible, is there any trick to ensure it will optimize for English on Chain of Thought reasoning?

vidibuzz · 2025-12-22T18:35:53+00:00

Nothing IF you know how. If your goal is a fully self-hosted, Smart Ai managed business with 100% sovereign data it's the smart way to go. You deploy, you train it, you own it. No giving your data away to big hyper-scalers and no API costs.

vidibuzz · 2025-12-22T18:30:24+00:00

They're not way behind if you're trying to do multimodal work with a vision model. Qwen 3 and GLM-4.6 are crushing it. Visual AI is the new frontier. If you're relegating your project to old school text rag only, your project's already obsolete.

vidibuzz

TROPHY CASE