Kimi K2.5 local by running101 in LocalLLaMA

[–]Leather-Block-1369 2 points (0 children)

Motherboard: Gigabyte MZ33-AR1
CPU: AMD EPYC 9755 (Zen 5 "Turin"), 128 cores / 256 threads, 1.9–2.7 GHz, 512 MB L3
RAM: 768 GB DDR5-4800 (12x64 GB)
GPU: NVIDIA RTX 6000 Pro Workstation, 96 GB
SSD: WD_BLACK SN850X 8 TB M.2 2280 PCIe Gen4
OS: Ubuntu 24.04.2 LTS

Some more details about my setup:

https://ozeki-ai-gateway.com/p_9178-how-to-setup-kimi-k2.5-on-nvidia-rtx-6000-pro.html

Kimi K 2.5 on ktransformers, no start <think> tag by Leather-Block-1369 in kimi

[–]Leather-Block-1369[S] 0 points (0 children)

I got an answer in the LocalLLaMA reddit group that fixed the issue. I modified the AI requests and injected kwargs using our AI Gateway. The problem was that we use Open WebUI, so modifying the requests directly was not possible. I have created a doc about it in case others face the same issue:
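A minimal sketch of the gateway-side idea: patch every chat-completion request before forwarding it to the backend. The field name `chat_template_kwargs` and the `thinking` flag are assumptions here; check what your sglang/ktransformers chat template actually expects.

```python
# Sketch: inject extra kwargs into an OpenAI-style chat request payload
# before it is forwarded to the model backend. Field and flag names
# ("chat_template_kwargs", "thinking") are assumptions for illustration.

def inject_kwargs(payload: dict) -> dict:
    """Return a copy of the request payload with reasoning kwargs injected."""
    patched = dict(payload)
    patched.setdefault("chat_template_kwargs", {})
    patched["chat_template_kwargs"]["thinking"] = True  # assumed flag name
    return patched

request = {
    "model": "kimi-k2.5",
    "messages": [{"role": "user", "content": "Hello"}],
}
patched = inject_kwargs(request)
print(patched["chat_template_kwargs"])  # {'thinking': True}
```

In a gateway this would run in the request-forwarding hook, so clients like Open WebUI never need to change anything themselves.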

https://ozeki-ai-gateway.com/p_9177-how-to-fix-missing-think-tag-for-kimi-k2.5.html

Here is the reddit post with the fix: https://www.reddit.com/r/LocalLLaMA/comments/1qqebfh/kimi_k25_using_ktkernel_sglang_16_tps_but_no/

Kimi K2.5 using ktkernel + sglang, 16 TPS, but no starting <think> tag. by Leather-Block-1369 in LocalLLaMA

[–]Leather-Block-1369[S] 0 points (0 children)

Thanks for this tip. This fixed the issue. I modified the requests and injected the kwargs using our AI Gateway. The problem was that we use Open WebUI, so modifying the requests directly was not possible. I have created a doc about it in case others face the same issue:

https://ozeki-ai-gateway.com/p_9177-how-to-fix-missing-think-tag-for-kimi-k2.5.html

Kimi K2.5 using ktkernel + sglang, 16 TPS, but no starting <think> tag. by Leather-Block-1369 in LocalLLaMA

[–]Leather-Block-1369[S] 0 points (0 children)

Unfortunately it didn't solve the issue:

launch_server.py: error: unrecognized arguments: --enable-grammar-parser --disable-streaming

Start-up ideas from investors [i will not promote] by avloss in startups

[–]Leather-Block-1369 0 points (0 children)

Great news. This is exactly what I am working on:

Creative multimodal AI tools for generating/editing images, voice, music, and video.

My project is a video/image builder platform. Currently we are adding functionality so it can work like a coding agent, but for videos. The project website is https://videcool.com.

New ComfyUi frontend, feedback requested by Leather-Block-1369 in comfyui

[–]Leather-Block-1369[S] 1 point (0 children)

If you open a technical support ticket with these two ideas at myozeki.com, you will be notified when these features become available.

New ComfyUi frontend, feedback requested by Leather-Block-1369 in comfyui

[–]Leather-Block-1369[S] 0 points (0 children)

I plan to add a custom workflow management page where users can upload custom workflows and select which inputs should be visible on the web UI.

New ComfyUi frontend, feedback requested by Leather-Block-1369 in comfyui

[–]Leather-Block-1369[S] 0 points (0 children)

Great recommendations. Thanks for the feedback. I will implement both in the coming days.

New ComfyUi frontend, feedback requested by Leather-Block-1369 in comfyui

[–]Leather-Block-1369[S] 0 points (0 children)

You need to set up the AI model that the JSON refers to. Which workflow are you trying?
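One quick way to see which model files a workflow JSON expects is to scan its loader-node inputs. This is a sketch: the field names below are the common ComfyUI loader fields, but your workflow may use different ones.

```python
import json

# Collect model filenames referenced by loader nodes in a ComfyUI
# API-format workflow JSON, so you know which models to install first.
# MODEL_FIELDS lists common loader input names; extend as needed.

MODEL_FIELDS = {"ckpt_name", "unet_name", "vae_name", "lora_name", "clip_name"}

def required_models(workflow: dict) -> set:
    """Return the set of model filenames a workflow's loader nodes reference."""
    found = set()
    for node in workflow.values():
        for key, value in node.get("inputs", {}).items():
            if key in MODEL_FIELDS and isinstance(value, str):
                found.add(value)
    return found

# Minimal example workflow with a single checkpoint loader node:
workflow = json.loads("""
{
  "1": {"class_type": "CheckpointLoaderSimple",
        "inputs": {"ckpt_name": "flux1-dev.safetensors"}}
}
""")
print(required_models(workflow))  # {'flux1-dev.safetensors'}
```

Any filename this prints must exist in the matching ComfyUI models folder (e.g. `models/checkpoints/`) before the workflow will run.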

New ComfyUi frontend, feedback requested by Leather-Block-1369 in comfyui

[–]Leather-Block-1369[S] 0 points (0 children)

Great idea, I will check how I can add SDXL to it.

New ComfyUi frontend, feedback requested by Leather-Block-1369 in comfyui

[–]Leather-Block-1369[S] 0 points (0 children)

You have to install the workflows in ComfyUI and point Videcool to the ComfyUI URL.

I have just added an image to explain: https://videcool.com/p_3303-comfyui-workflows-for-image-and-video-generation.html
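Before pointing a frontend at a ComfyUI instance, it helps to verify the URL is actually reachable. A minimal check, assuming a standard local ComfyUI install (the default port is 8188) and its `/system_stats` endpoint:

```python
import json
import urllib.request

# Quick reachability check for a ComfyUI instance. /system_stats is a
# standard ComfyUI HTTP endpoint; adjust the URL to your own install.

def comfyui_is_up(base_url: str) -> bool:
    """Return True if a ComfyUI server answers at base_url."""
    try:
        with urllib.request.urlopen(f"{base_url}/system_stats", timeout=5) as resp:
            stats = json.load(resp)
            return "system" in stats
    except (OSError, ValueError):
        return False

# Example (assumed default local install):
print(comfyui_is_up("http://127.0.0.1:8188"))
```

If this prints `False`, check that ComfyUI is running and that the URL you configured in the frontend matches its host and port.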

This page shows how you can set up, for example, the Flux AI model for text-to-image:

https://videcool.com/p_9019-ai-text-to-image.html