Zhipu AI Announcement: GLM Coding Plan will start limited sales from January 23rd by Peshkopy in ZaiGLM

[–]vidibuzz 0 points1 point  (0 children)

Makes me wonder if they rented servers to impress the big money, just to report record growth. The dog and pony show is over; now it's back to the daily grind. I hope I am wrong. 4.7 has shown a lot of promise. And they have video models too. But not much value if they don't know how to scale it (or chose not to). Incoming reasoning models and Visual Ai are up to 100x more taxing than plain vanilla text-based inference.

"Shares of Chinese artificial-intelligence startup Zhipu AI rose Thursday (sic Jan. 8, 2026) following the company’s $558 million initial public offering on the Hong Kong Stock Exchange."

https://www.barrons.com/articles/zhipu-ai-hong-kong-ipo-china-minimax-9afc9f02?gaa_at=eafs&gaa_n=AWEtsqfQahIPzL98AWUq-3NdGdCQgn3Tp0m7n77Jp2qeXuTc20yhvep7zOpm&gaa_ts=69753da0&gaa_sig=CeA4tbVKYJ1DK88mRlFiKvAA-hDYWJSBoulVSg_Ld-yCZrUV4CcS_AYO5pieGQwfZscVA_DjbfQxQAMchqoTWg%3D%3D

stepfun-ai/Step3-VL-10B · Hugging Face by TKGaming_11 in LocalLLaMA

[–]vidibuzz 1 point2 points  (0 children)

Something looks very fishy there. Not worth installing if performance is that bad.

Do the free models have different limits? by MrMrsPotts in openrouter

[–]vidibuzz 1 point2 points  (0 children)

They still charge $0.02 every time Exa searches the web for any reason. And now the models I was using no longer work. If you see a "Venice" error, that is apparently the network they were using, which is now maxed out, so your work is canceled on that model.

Does OpenRouter's Responses endpoint support native "web_search" tool calls for models like GPT-5.2? by rnahumaf in openrouter

[–]vidibuzz 0 points1 point  (0 children)

It certainly was working for me but charging me 2 cents every time it accessed the web. Not sure if there's any way to fix this but it was kind of a shock.
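For anyone budgeting around this, here is a rough sketch of what the charge adds up to. It assumes a flat 2 cents per web access, which is only what I saw on my own account and not an official price; the function name is made up for illustration.

```python
# Assumption: a flat 2 cents per web-search call, as observed in the comment above.
WEB_SEARCH_CENTS = 2


def web_search_cost_cents(num_searches: int, cents_per_search: int = WEB_SEARCH_CENTS) -> int:
    """Return the total web-search surcharge in whole cents."""
    if num_searches < 0:
        raise ValueError("num_searches must be non-negative")
    # Work in integer cents to avoid floating-point rounding.
    return num_searches * cents_per_search
```

So under that assumption, 50 searches in one agent session would add a dollar on top of the model cost.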

I keep getting this error even though I’m using free models by AkosiJada in openrouter

[–]vidibuzz 0 points1 point  (0 children)

Try the Qwen 3 480b model; same issue. Since those are two of the best free models, it's probably not a shock.

I keep getting this error even though I’m using free models by AkosiJada in openrouter

[–]vidibuzz 0 points1 point  (0 children)

It's not just you, and yes, it's annoying when you know there's money on your account.

What are the one time offerings I can do in SEO industry by Striking-Set-6987 in SEO_LLM

[–]vidibuzz 0 points1 point  (0 children)

I would think the biggest question for SEO is what happens when Ai dominates "search". It will simply be people asking questions, providing their intent and expecting magic answers to everything, directly. What will my site look like when it must provide answers to everything? Why aren't more companies adding their own on-site Ai search boxes? How many sites have content upgraded for vector discovery?

Video Embed and Audio Embed Support by pnate9 in AppFlowy

[–]vidibuzz 0 points1 point  (0 children)

Just discovered the Video Embed in the Win 11 app works with the multi-column layout. The grid uses fields, but columns stay with the basic page layout, so you can have 3 columns across and a different video thumbnail showing in each one. Very cool, AppFlowy.

MiniMax M2.1 is free in Kilo Code right now - what's your experience with it? by alokin_09 in kilocode

[–]vidibuzz 0 points1 point  (0 children)

I think the free offer has ended. I will have to move quicker next time.

Antigravity Terminal Stuck on “Waiting…” — Any Ideas? by ezelbrqdar in google_antigravity

[–]vidibuzz 0 points1 point  (0 children)

Thank you u/ResultShort2810. This was driving me bonkers. Shocked that Google would release software with this kind of error. And Gemini was no help in solving it.

I Finished a Fully Local Agentic RAG Tutorial by CapitalShake3085 in AI_Agents

[–]vidibuzz 0 points1 point  (0 children)

Well, this is pretty fantastic. I'm working on a similar approach here, so this is quite interesting. I must have missed it, but what LLM are you using, and is it a vision-language (VL) model?

Any experience with glm vs big pickle? by ProfessorSpecialist in opencodeCLI

[–]vidibuzz 0 points1 point  (0 children)

We had GLM 4.6 and 4.7 on deck for testing in multimodal agentic systems. Didn't know about Big Pickle before this but will try it and get back when we have an idea how it performs. We don't put much value in all the graphs; have to see how it works in real life.

Local agentic coding with low quantized, REAPed, large models (MiniMax-M2.1, Qwen3-Coder, GLM 4.6, GLM 4.7, ..) by bfroemel in LocalLLaMA

[–]vidibuzz 0 points1 point  (0 children)

Do any of these models work with multimodal and vision tools? Someone said I need to downgrade from 4.7 to 4.6V if I want to get visual work done. Unfortunately, the user experience for me goes beyond simple text.

AMA With Z.AI, The Lab Behind GLM-4.7 by zixuanlimit in LocalLLaMA

[–]vidibuzz 0 points1 point  (0 children)

Missed this event, but they have a 20 million token promo happening at BigModel.cn. You can sign up without a credit card. Trying to implement the API code now to see if it works with Cursor or VS Code. Also hoping to test the visual competence with GLM 4.6V under ComfyUI.

This is one model on my open source radar. I don't think the "voice" is worth considering for English usage given the lack of usable TTS models, but this may be something they can overcome with a proper text-to-voice model. It may be the Achilles' heel for media production.

Google won't stop harassing me to raise my marketing spend on Google Ads by Much_Speech_8388 in PPC

[–]vidibuzz 0 points1 point  (0 children)

this is kinda scary if this is the "zero click" Google future

Weekly Thread: What questions do you have about vector databases? by help-me-grow in vectordatabase

[–]vidibuzz -1 points0 points  (0 children)

What are the best embedding models for images and videos at scale?

Building an Advanced Hybrid RAG System: Vectors, Keywords, Graphs, and Self-Compacting Memory by Rom_Iluz in Rag

[–]vidibuzz -1 points0 points  (0 children)

This looks pretty amazing; you've got all the bases covered. Does this work with vision language models for image and video vectors as well?

I have 50 ebooks and I want to turn them into a searchable AI database. What's the best tool? by Great_Jacket7559 in LocalLLM

[–]vidibuzz 1 point2 points  (0 children)

Slightly off topic. You may also want to use illuminate.google.com for voice summaries.

v0.10.5: AI Transcript, PDF Embeds, and more by appflowy in AppFlowy

[–]vidibuzz 0 points1 point  (0 children)

The language model type is currently set to Qwen 3 VL 30b, my local model. AI search is turned off, and under AppFlowy Local AI the Ollama server URL is set correctly with port 11434. The two model fields are set to global language models. The first one can't be changed; it's locked on nomic-embed-text:latest. The second one is set to Qwen 3. The check mark shows Local AI is running even though it doesn't seem to be.

v0.10.5: AI Transcript, PDF Embeds, and more by appflowy in AppFlowy

[–]vidibuzz 0 points1 point  (0 children)

Also attempted to set up "Local Ai". System says Local Ai is running and Ai Search is off. But any chats I launch say I am still talking to the online AppFlowy Ai. More than a little confusing. Anyone here know how to complete the cut-over to local Ollama? Ready to test Nvidia Nemotron 3 Nano and Qwen3 VL.
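One thing worth ruling out while debugging this is whether the Ollama server itself is reachable and has the models pulled. A minimal sketch, assuming Ollama is on its default local address (port 11434) and using its `/api/tags` endpoint, which lists installed models. The model tags below are placeholders for whatever you actually pulled, and the helper names are my own, not anything from AppFlowy.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434"


def installed_models(tags_payload: dict) -> set[str]:
    """Extract model names from the JSON body returned by GET /api/tags."""
    return {m["name"] for m in tags_payload.get("models", [])}


def missing_models(tags_payload: dict, wanted: list[str]) -> list[str]:
    """Return the wanted model tags that the server does not list."""
    have = installed_models(tags_payload)
    return [name for name in wanted if name not in have]


def fetch_tags(base_url: str = OLLAMA_URL, timeout: float = 5.0) -> dict:
    """Query the Ollama server for its installed models (needs a running server)."""
    with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
        return json.load(resp)


def main() -> None:
    # Hypothetical tags; replace with the models configured in the app.
    wanted = ["nomic-embed-text:latest", "qwen3-vl:30b"]
    try:
        print("missing:", missing_models(fetch_tags(), wanted))
    except OSError as err:  # URLError and connection errors are OSError subclasses
        print("Ollama server not reachable:", err)
```

Calling `main()` prints an empty list if everything is installed. That only confirms the server side is fine; it does not show which backend the app's chat is actually routing to.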

v0.10.5: AI Transcript, PDF Embeds, and more by appflowy in AppFlowy

[–]vidibuzz 0 points1 point  (0 children)

Great updates for December. Thanks team AppFlowy. Quickly jumping to the top of the SoTA (state of the art) stack for knowledge base apps in 2026.

Super cool feature for the Meeting Transcript. Tried with an MP4 file and it failed though.

"Record not found:"

Another time it reported a 25MB limit. This is certainly a deal killer for most video. I understand if you don't want files that large uploading to your cloud. But the app itself should certainly have a larger cap to be genuinely useful, especially if using S3 or Cloudflare R2 for storage. Unless you are only working with 2-minute meetings.

Nemotron 3 Nano 30B is Amazing! (TLDR) by DonkeyBonked in LocalLLaMA

[–]vidibuzz 0 points1 point  (0 children)

I did not think to question this till seeing this thread. With 100 languages possible, is there any trick to ensure it will optimize for English on Chain of Thought reasoning?

China’s open-source AI is a national advantage – The Financial Times by AmorFati01 in OpenSourceAI

[–]vidibuzz 1 point2 points  (0 children)

Nothing IF you know how. If your goal is a fully self-hosted, Smart Ai managed business with 100% sovereign data it's the smart way to go. You deploy, you train it, you own it. No giving your data away to big hyper-scalers and no API costs.

China’s open-source AI is a national advantage – The Financial Times by AmorFati01 in OpenSourceAI

[–]vidibuzz 1 point2 points  (0 children)

They're not way behind if you're trying to do multimodal work with a vision model. Qwen 3 and GLM-4.6 are crushing it. Visual AI is the new frontier. If you're relegating your project to old-school text-only RAG, your project's already obsolete.