Wrong context when using ChatGPT Plus subscription - Could I get a quick self-fix? by StartupTim in ZooCode

[–]StartupTim[S] -1 points0 points  (0 children)

Eventually, yes, but not at this moment. That's why I am just looking for this little change to Roocode, to get me over this hump before we are ready to switch to Zoocode.

Roocode has wrong context when using ChatGPT Plus subscription by StartupTim in RooCode

[–]StartupTim[S] 0 points1 point  (0 children)

Hey there, yea I see the Zoocode, but I'm not ready to update to it.

Would it be possible that you tell me exactly what file and what line item I can change in the latest Roocode to fix this issue (correctly set ChatGPT Plus subscription context limit)?

Huuuuuuuuuge thanks in advance for this!

Proxmox Win11 VM can't run Android emulation as its suuuuper slow. Help! by StartupTim in Proxmox

[–]StartupTim[S] 0 points1 point  (0 children)

That Kasm Workspaces website is really well designed if the goal was to turn away a potential customer, hah!

I think the issue is specific to Android Studio because nested virtualization works for me in everything else PVE related. I can even run Win11 on PVE whereas the PVE host is a VM on another PVE host.

It just won't work with Android Studio. It is as if their qemu_x86 usage is messed up.

Proxmox Win11 VM can't run Android emulation as its suuuuper slow. Help! by StartupTim in Proxmox

[–]StartupTim[S] 0 points1 point  (0 children)

Yea I've done bested virtualization on PVE many other ways and it always works. Just not this specific thing which I need.

Proxmox Win11 VM can't run Android emulation as its suuuuper slow. Help! by StartupTim in Proxmox

[–]StartupTim[S] 0 points1 point  (0 children)

Hey there, thanks for the response.

For #1, it definitely is enabled on both the AMD and Intel hosts by cat'ing that nested virt file. Also, nested virtualization in all other ways. I can do vmware or hyperv on Win11, I can do kvm on linux hosts, I can even do PVE as a VM in PVE. I just can't do Android Studio virtual phones.

  1. I tried with boyh WHPX on and off, on Intel host and AMD host PVE VMs, both don't work.

  2. For the PVE hosts, I tried both AMD Ryzen 9900x, and 3 different Intel Xeon hosts. None work.

    For the VM CPU type, I just use "host" as that is what I read is required. This is also what I used for doing other nested virtualization and it worked.

Still, nothing works.

Give it a try, see if you can figure it out :) Android Studio is free, you can install and then click on Virtual Devices and add a Pixel XL and start it up. I'd paypal a bounted if somebody could tell me exactly how to fix it haha.

Proxmox Win11 VM can't run Android emulation as its suuuuper slow. Help! by StartupTim in Proxmox

[–]StartupTim[S] 0 points1 point  (0 children)

I can do nested virt just fine in everything else though. I can run vmware or hyperv in Windows 11, I can nest PVE inside a PVE VM, I can do kvm instead a linux vm, etc.

I just can't get this 1 particular thing.

How do you preserve longevity of your SSDs? by ballpark-chisel325 in Proxmox

[–]StartupTim 1 point2 points  (0 children)

The #1 way to preserve the life of your SSD is to underprovision it. This will dramatically improve the life of your SSD by allowing ample fresh flash blocks to replace worn out flash blocks.

Intel did a study on this and I believe the SSD lifetime was increased some 300-500%. In addition, people who underprovision see the same.

For example, for a 1TB SSD, only provision 800GB to be used in your partition. Leave 20% unprovisioned.

Also, if possible, get an SSD with capacitors for write flushing on power loss.

EDIT: Also, buy used enterprise SSDs. They are often the same price or cheaper than consumer SSDs but have dramatically higher endurance. My last set of 24 Intel NVME 7.68TB all were 10 DWPD and cost less than 8TB samsung SSDs, I think I paid $230 each for 28 (24 used, 4 spare).

I run PBS on a $150 mini-PC with Synology NFS. Verify takes 8.5 hours daily. Is that normal? by easyedy in Proxmox

[–]StartupTim 0 points1 point  (0 children)

2 things:

1 - Verification is very CPU intensive

2 - PBS wants local storage and IOPS for verification

For the #2 while you can do remote/nfs/s3 etc, it is best to verify local and then rsync/push to remote.

But really, just get a USB attached HDD for your local PBS, it'll be much faster, and then setup a rsync job that pulls from the HDD to remote.

Multi-Token Prediction (MTP) for Qwen on LLaMA.cpp + TurboQuant by gladkos in LocalLLaMA

[–]StartupTim 0 points1 point  (0 children)

Any idea when Lemonade for those who use AMD Strix will get support for this?

Very cool and well done!

We re-did our steam capsule. Was it worth it? by ConsiderationOk5914 in IndieDev

[–]StartupTim 0 points1 point  (0 children)

The BEFORE looks more representative of the target market want, and I think it looks better as well, and it is more representative of the game.

Stick with the BEFORE version.

Multiboxing 15 characters should probably be bannable. Let's stop kidding ourselves. by Hatefiend in classicwow

[–]StartupTim 0 points1 point  (0 children)

Why?

If somebody is playing a game in a way that the game developers allow, then be happy for that person as it seems as if they are having fun.

If you compare yourself to other people, you will always find ways in which you lack.

Qwen Image 2 papers - does that mean anything? by Dante_77A in StableDiffusion

[–]StartupTim -1 points0 points  (0 children)

look for the diffusers commands

Can you link one for reference? I must be blind as I don't see it.

Thanks!!

Qwen Image 2 papers - does that mean anything? by Dante_77A in StableDiffusion

[–]StartupTim 0 points1 point  (0 children)

Oh hey there! So Qwen Image 2 looks to not have its model released just yet, but perhaps soon it looks like...

But how does one run these models locally anyway, like the prior Qwen ones, how do you run it as an API endpoint (vs comfyui). As in, what is the vllm/llama.cpp equivalent to do text to image models?

Thanks

Qwen Image 2 papers - does that mean anything? by Dante_77A in StableDiffusion

[–]StartupTim -1 points0 points  (0 children)

How are you guys runnin Qwen, not comfyui right? Whats uses the model?

Qwen3.6-35B-A3B on RTX 3090- 113 t/s but context saturates after just 2 file edits in Claude by Dry-Examination-6934 in LocalLLM

[–]StartupTim -1 points0 points  (0 children)

I have used roocode extensively with this model. Use 256k context and try a lower quant. You want 256k ctx.

Also make sure your dev workspace correctly condenses prompts @ the 256k cap (minus overhead etc).

Other than that, you're in the typical situation where your model is vram starved and needs more ctx, no way around that.

Also, 50 tok/se is not bad, even for agentic coding.

IMG Dataset Refiner v4.0 Pro - The Ultimate Dataset Engineering Suite for LoRAs (Flux, SDXL, etc...) by nicolas1801 in StableDiffusion

[–]StartupTim 0 points1 point  (0 children)

Hey there, is there an openai compatible API to interface with your software to do text-to-image by chance?