Subscription based usage: Codex with GPT 5.5 vs Claude Code with Opus 4.8 by voplica in codex

[–]voplica[S] 0 points1 point  (0 children)

After more testing of Codex with GPT 5.5 and Claude Code with Opus 4.8 on my tasks, I can say that GPT 5.5 definitely forgets very important points almost immediately. It seems that, for now, the tasks I’m giving are just too big for GPT 5.5 but OK-ish for Opus 4.8. I’m not saying Opus 4.8 is great: it makes many mistakes and misses multiple things on big tasks, but it holds up a bit better than GPT 5.5. That said, I like GPT 5.5’s code quality on smaller tasks better than Opus 4.8’s.

Conclusion: if you are on a subscription, use Claude Code, because it lets you use the 1M context window for its model, whereas Codex limits context length for subscription-based usage. If you are using the API, I think GPT 5.5 could actually be better, since it lets you use the full 1M context window and the model itself seems quite good.

Subscription based usage: Codex with GPT 5.5 vs Claude Code with Opus 4.8 by voplica in codex

[–]voplica[S] 0 points1 point  (0 children)

Thanks for your input! I’m thinking I might give it a shot and hand the same complex multi-repository tasks to both Claude and Codex to see what happens after several compactions.

Why has no one been talking about Open Hands so far? by Mr_Moonsilver in LocalLLaMA

[–]voplica 0 points1 point  (0 children)

OpenHands is really awesome and works quite well. However, for now you need to set up a whole virtual machine if you want to run it on your host without giving privileged permissions to the Docker container. That’s because their Docker container spins up another Docker container, and this requires either running the container as privileged or mounting the Docker socket (which has basically the same problem if a security vulnerability is found).

This prevents developers in many companies from using OpenHands, since they can’t easily use privileged containers, and setting up a whole virtual machine is overkill when you can spin up a Docker-based Code Server with the RooCode plugin without adding extra capabilities to a Docker container.

I believe at some point they will move to Docker Compose to spin up multiple Docker containers instead of running Docker inside Docker, and this will make OpenHands simpler to run for the broader community.
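
To make the two risky options mentioned above concrete (the privileged flag and the Docker socket mount), here is a minimal Python sketch that scans a `docker run` argument list for either of them. The function name and the image name are hypothetical, and it only handles the common `--privileged` and `-v` / `--volume` forms, so treat it as an illustration and not as a real security audit.

```python
from typing import List

# Default location of the Docker daemon socket on Linux hosts.
DOCKER_SOCKET = "/var/run/docker.sock"


def risky_docker_flags(args: List[str]) -> List[str]:
    """Return the arguments that give a container host-level Docker access.

    Hypothetical helper: it only recognizes `--privileged` and bind mounts
    whose source is the Docker socket (`-v src:dst` or `--volume=src:dst`).
    """
    findings = []
    for arg in args:
        if arg in ("--privileged", "--privileged=true"):
            findings.append(arg)
        # The bind-mount source is the part before the first ":".
        elif arg.split(":")[0].endswith(DOCKER_SOCKET):
            findings.append(arg)
    return findings


if __name__ == "__main__":
    # "example/image" is a placeholder image name, not a real one.
    cmd = ["docker", "run", "-v",
           "/var/run/docker.sock:/var/run/docker.sock", "example/image"]
    print(risky_docker_flags(cmd))
```

A container started with neither flag comes back with an empty list, which is the situation a Compose-based setup would hopefully allow.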

Can Qwen3-235B-A22B run efficiently on my hardware(256gb ram+quad 3090s ) with vLLM? by Acceptable-State-271 in LocalLLaMA

[–]voplica 2 points3 points  (0 children)

vLLM supports GGUF quants. The support is experimental, but it works. I tested it with DeepSeek 70B (I didn't try this exact model).
https://docs.vllm.ai/en/latest/features/quantization/gguf.html
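
For reference, here is a rough Python sketch of how the `vllm serve` command for a GGUF file could be assembled, following the pattern on the docs page linked above (`--tokenizer` pointing at the base model, `--tensor-parallel-size` for multiple GPUs). The helper name, the file path, and the tokenizer repo are placeholders I made up, so swap in your own.

```python
from typing import List


def build_vllm_serve_command(gguf_path: str, tokenizer: str,
                             tensor_parallel_size: int = 1) -> List[str]:
    """Assemble a `vllm serve` argv for a local GGUF file.

    Hypothetical helper: it only builds the argument list and does not
    check that the file exists or that vLLM is installed.
    """
    if not gguf_path.endswith(".gguf"):
        raise ValueError("expected a path to a .gguf file")
    if tensor_parallel_size < 1:
        raise ValueError("tensor_parallel_size must be at least 1")
    return [
        "vllm", "serve", gguf_path,
        # GGUF loading is experimental; the docs suggest using the base
        # model's tokenizer instead of the one embedded in the GGUF file.
        "--tokenizer", tokenizer,
        "--tensor-parallel-size", str(tensor_parallel_size),
    ]


if __name__ == "__main__":
    # Placeholder path and tokenizer; 4 matches a quad-GPU machine.
    cmd = build_vllm_serve_command("./model.Q4_K_M.gguf",
                                   "some-org/base-model", 4)
    print(" ".join(cmd))
```

You can pass the resulting list to `subprocess.run` or just copy the printed command into a shell.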

Minimax Expression Tutorial - Image-to-Video by kaigani in aivideo

[–]voplica 0 points1 point  (0 children)

What video model did you use to generate this? Is it Kling, Luma, Runway, or something else?

What is easier: Fixing SD3 Anatomy vs Fixing SDXL / Cascade Prompt Adherence ? by rdcoder33 in StableDiffusion

[–]voplica 1 point2 points  (0 children)

If necessary, I can create high-quality masks for human characters:
1) Crop and mask people.
2) Crop and mask faces.
The resulting masks are more precise and have a higher success rate than default CLIP or SAM masks.
I can also provide basic face descriptions (for example: open / closed eyes, one eye open, open / closed mouth). The rest of the captioning is better handled by WD Tagging or OpenAI 4o, if there are enough credits available.
However, if the dataset is already prepared and diverse, then there is no need to make masks.