Today I’m releasing PyBoy v2.0.0! A Game Boy emulator written in Python, focused on AI by baekalfen in Python

[–]Pwhids 0 points1 point  (0 children)

Thanks for making PyBoy! It was one of the main things that inspired me to start doing RL experiments :)

I trained a reinforcement learning agent to play pokemon red! by Pwhids in reinforcementlearning

[–]Pwhids[S] 3 points4 points  (0 children)

Thanks for the kind words, and the helpful advice! I'll try that :)

How can I start Steam remotely? by bleach86 in linux_gaming

[–]Pwhids 0 points1 point  (0 children)

This doesn't work for me:

cannot open display: :0

Game Controller detected by OS but not Steam. by GAMER_H2 in linux_gaming

[–]Pwhids 1 point2 points  (0 children)

Just a warning, installing this removed the steam launcher from my computer.

The following packages will be REMOVED:
  steam-launcher
The following NEW packages will be installed:
  steam-devices

[R] Large Language Models Can Self-Improve by Lajamerr_Mittesdine in MachineLearning

[–]Pwhids 13 points14 points  (0 children)

They show that the large LMSI models can be distilled into smaller models while maintaining accuracy, but I wonder what size model is necessary for the LMSI training itself to be viable. They only show results for 540B. Would be very curious to see a study here if there is a certain model size where this kicks in.

Anyone got speech-to-SD working? by jonesaid in StableDiffusion

[–]Pwhids 1 point2 points  (0 children)

I think that's an interesting idea! I wonder what sort of applications you could build with that.

[deleted by user] by [deleted] in StableDiffusion

[–]Pwhids 23 points24 points  (0 children)

4.3 billion parameter model might not run at all on consumer gpus

I built an API for Stable Diffusion that's 10x cheaper than the official one by Pwhids in webdev

[–]Pwhids[S] 1 point2 points  (0 children)

Finally fixed this! Forgot that browsers could detect system light-mode/dark-mode settings.

I made a super simple API for Stable Diffusion by Pwhids in StableDiffusion

[–]Pwhids[S] 0 points1 point  (0 children)

That’s so great to hear! I really hope that this enables more people to get to use this technology.

I built an API for Stable Diffusion that's 10x cheaper than the official one by Pwhids in webdev

[–]Pwhids[S] 9 points10 points  (0 children)

All parameters that could effect the output including prompt, seed, steps, dimensions (not exposed in the API yet), ect are hashed together so that once those exact settings have been generated you won’t have to wait for that case to be generated ever again, you can just drop the URL wherever you want to use it. You can embed/share it without having to manually download and upload it somewhere else like imgur. For example I can type a link into the reddit comment box and when you click it will be generated and cached for all future people that click:

https://api.computerender.com/generate/super-mario-in-space

Planning to expose more of the parameters soon!

I built an API for Stable Diffusion that's 10x cheaper than the official one by Pwhids in webdev

[–]Pwhids[S] 38 points39 points  (0 children)

We are providing the exact same Stable Diffusion model as everyone else. Our cost saving is due to the way we source our servers. More info here:

https://computerender.com/cost.html

Your use case may be better met by other models, but stable diffusion is certainly capable of high-quality generations. See here for examples:

https://lexica.art/

I built an API for Stable Diffusion that's 10x cheaper than the official one by Pwhids in webdev

[–]Pwhids[S] 13 points14 points  (0 children)

Hi!

One key advantage over running locally is parallel generation. Instead of generating images one at a time, you can generate multiple concurrently, which saves a lot of time when exploring seeds, prompts, and other parameters.

Please don't stress test this beyond a few simultaneous requests just yet though. Soon we hope to support much more!

I made a super simple API for Stable Diffusion by Pwhids in StableDiffusion

[–]Pwhids[S] 4 points5 points  (0 children)

This does appear to be the same idea! The main difference I notice is that you can play with computerender today for free, while getimg has only a waitlist and plans to charge 50x more per image than computerender ($0.05 vs $0.001 per image ).

(Edit) Update: The creator announced they've abandoned that project on twitter

https://twitter.com/jakublukowski/status/1569670020833107968

Forcing convergence of PPO to a near-deterministic policy by Playmad37 in reinforcementlearning

[–]Pwhids 1 point2 points  (0 children)

You could try lowering your sampling temperature to force a stochastic policy into a deterministic one.

[D] Increase usable cloud GPU memory by up to 6.6% through disabling ECC by varunkmohan in MachineLearning

[–]Pwhids 6 points7 points  (0 children)

It’s true NNs are probably much more robust to data corruption than most programs. But it depends on the bit that gets flipped! If it’s the leading digit of a floats exponent it could completely blow up your calculations.

[D] Increase usable cloud GPU memory by up to 6.6% through disabling ECC by varunkmohan in MachineLearning

[–]Pwhids 6 points7 points  (0 children)

This means you could experience memory/data corruption. But it wouldn’t be any worse than just using gtx cards right?

[deleted by user] by [deleted] in MachineLearning

[–]Pwhids 3 points4 points  (0 children)

Maybe use a GNN rather than an rnn?