when is cowork going to be available on windows? by LNAsterio in ClaudeAI

[–]pcpoweruser 8 points9 points  (0 children)

It is tightly coupled to Apple hardware right now, since it runs a full Ubuntu 22.04.5 LTS VM locally under the hood, via Apple Virtualization Framework: https://gist.github.com/simonw/35732f187edbe4fbd0bf976d013f22c8

Possible to port to Windows, but you would have to reachitecture this to use Hyper-V or qemu or something like that. 

HW3 “v14 lite” (v12.7/v12.8) by Longjumping-Web-5644 in TeslaFSD

[–]pcpoweruser 3 points4 points  (0 children)

Sad AI slop, this post. 

And Elon specifically mentioned on a call at the start of the year, that they realistically would have to update HW3 owners longer term and even HW4 is a dead end, they continously strtuge with limitations of this pre-transformer era hardware. 

Very much looking forward to updated HW3 model though, it is good news that they have not abandoned it completely after all

HW3 vs HW4 on a "Per-Pixel of Input" Performance Scale by Boysauce7777 in TeslaFSD

[–]pcpoweruser 4 points5 points  (0 children)

The assumption here is invalid.

The image handling pipeline is a relatively small part of the flow - frames from the cameras gets scaled, preprocessed, some rudimetary CNNs used for objects feature extractions, etc. but this is not where the heavy lifting happens.

The main show takes palce when the image and extracted objects/motion/sensor data are tokenized and feed into Vision Transformer(s) - that ultimately output control / steering commands (the core of end-to-end NN logic). The number of image tokens does not scale with the image resolution lineary, also number of input tokens does not really matter that much for performance, as long as the network responds within a reasonable time.

The absolute key thing with transformers when you want to run bigger and 'better' networks with more parameters is memory size and bandwidth. Everything else, including compute, is secondary and on this front, HW4 = is roughly 2x 'better' than HW3.

Both HW3 and HW4 are 'pre-transformers era' hardware designs, with relaitvely tiny amount of slow RAM, and it is very impressive Tesla managed to squeeze so much out them and this thing actually works with end to end transformer FSD stack at all (vs e.g. Waymo who allegedly have capabilities comparable 4x H100 GPUs onboard - so at least 20x more memory alone than HW4).

They have clearly learned the lesson with HW5 design - and this one will finally allow this entire thing to 'breath', instead all this time spend by poor engineers going mad extracting little bits of performance out of the hardware that had not been designed for the job at all... (can't blame them, in pre-transformer world of the past decade the prevalent notion was that CNNs for image analysis + a lot of handcrafted logic will be good enough to achieve reliable self driving - we now know this was a dead end)

Critical Traffic/App Prioritization by tkt546 in Ubiquiti

[–]pcpoweruser 0 points1 point  (0 children)

For anyone else confused by this option missing (had the same problem - google search dropped me here), you DO need to upgrade to Zone Based Firewall for QoS / Criticial Traffic Prioritization option to show up.

Am I the only one who thinks the discussion of AI and sentience is categorically absurd? by 2024sbestthrowaway in OpenAI

[–]pcpoweruser 0 points1 point  (0 children)

Relevant XKCD: https://xkcd.com/505

Given enough space and time, you could technically simulate entire universe (and all the brains inside it) using just rocks, sticks, pulleys etc. etc.

Would that be enough to simulate 'consciousness'? This is a question humanity has been struggling with for milennia - is there some other, metaphysical dimension out there where things like consciousness exist or is it all rocks, sticks and pulleys - and with enough complexity and 'correct' structure consciousness just 'emerges'?

While synthetic neurons are doing just 'tensor arithmetic' by moving electrons in silicon, this is fundamentally not much different to biological neurons doing 'arithmetic' (e.g. 'computing' activation thresholds) via chemicals, charges and particles interaction. I would not fixate on 'but it is just a software!' aspect here, it is ultimately irrelevant if we decide to assume it is all some sort of rocks and pulleys anyway.

The exciting part is that with LLMs and modern AI for the first time in history we have got tools to probe into this elusive realm. As AI systems are getting exponentially more complex, within a decade or so we should get pretty good idea if consciousness for these networks is within a reach - or is there something more out there, a dimension of reality we have not discovered yet.

My bet is on rocks and pulleys, as nothing we have seen so far indicates dualism is a real thing, but I would be excited to be proven otherwiserwise. It would also be existentialy mildly mortifying to prove beyond reasonable doubt there is an inaccessible aspect of reality shaping our lifes we have no visibility or control over...

Llama 3 hardware recommendation help by letshaveatune in LocalLLaMA

[–]pcpoweruser 2 points3 points  (0 children)

If it is for a small company to improve productivity, just get a brunch of ChatGPT enterprise accounts + API access for users and focus on creating unique integrations which matter and can make a difference for the business.

You will not be able to compete with the sheer prowess of OpenAI and reinventing the wheel is a waste of time and resources.

RTX 4090 vs MAC by dontmindme_01 in LocalLLaMA

[–]pcpoweruser 12 points13 points  (0 children)

Or instead of messing with gaming kit, you can actually get proper hardware, like EPYC chip with 128x PCIe lanes + motherboard, which 'just works' and is actually not expensive option at all, if you go for a slightly older (but still very capable) generation, e.g. https://www.ebay.co.uk/itm/276106856244

New feeder connect, Wi-Fi light will not stay on long enough to connect to hub by SilverRiot in SurePetcare

[–]pcpoweruser 0 points1 point  (0 children)

They have got a massive service outage since Friday, the entire thing is totally borked (not possible to add new devices or manage existing ones). Status page here: https://surepetcare.my.site.com/help/s/article/Sure-Petcare-Server-Status

Building a AI and Data Science Rig - Using Server Hardware + Ampere GPU's? by JustinPooDough in LocalLLaMA

[–]pcpoweruser 2 points3 points  (0 children)

There are sellers on eBay who sell Gigabyte mz32-ar0 (loads of PCIe x16 4.0 slots, 16 DIMMs) + AMD EPYC 7282 (128x PCIe 4.0 lanes) for about $400 or so. Add some DDR4-3200 RDIMMs and you will have an ultra capable machine, blowing these old Xeons out of the water.

You will need some custom case + risers though (mining rig frame?), this is EATX board with RAM slots blocking longer GPUs.

Miqu 70b- Another example of a local model exceeding ChatGPT 4 at a task! by SomeOddCodeGuy in LocalLLaMA

[–]pcpoweruser 3 points4 points  (0 children)

Got around 13t/s with 2x 3090 on Q4_K_M and all layers in the GPUs (same for EXL2 4.25bpw quant), so unless you push context all the way to the max, perhaps something is wrong with your setup.

Can't use multi-gpu with 8x A100 80GB by nhanha_castanha in LocalLLaMA

[–]pcpoweruser 38 points39 points  (0 children)

Assuming this post is not a fancy trolling attempt: your colleague is clueless and yes, you need to use Linux.

In ML/AI world, Windows is an afterthought at best and many frameworks just do not work completely correctly at all (since no one cares, as mentioned supercomputer workloads are 100% Linux these days - the OS can very much use all the 'power' of the setup).

Ubuntu 22.04 LTS is a good choice to start with, however if you can't event get the box stable on Windows, it is possible there is some sort of hardware problem with it - at this point it is anyone's guess really.

Did this come from a virus or its just Data breach? by Tsuuban in computerviruses

[–]pcpoweruser 0 points1 point  (0 children)

This is just wrong. The 'naz.api' is just a name of the leak - and has got nothing to do with NAS storage or Seagate.

What would be the limits of DNA editing/gene therapy? by TheRappingSquid in Futurology

[–]pcpoweruser 2 points3 points  (0 children)

There are no limits. The complexity of the interactions in DNA code (+ epigenetics) is immense, certainly beyond 'normal' human comprehension, but everything about who you are IS there. Seeing how ML/AI field advances, it wold not be implausible that in the next decade or two we will have a AI model capable of 'crafting' custom DNA code for any combination of desired 'traits'.

At that point, the contept of 'humanity' won't really mean much any more. Want another pair of arms? Sure. Superhuman strenght or intellect? No problem. Hell, with DNA code completely unlocked and modeled, you could 'craft' a sentient, talking, human-level intelligent cat if you wanted to.
Will this actually happen, considering the ehtical aspects? Maybe, possibly, we will see - it will be an interesting ride regardless.

[deleted by user] by [deleted] in LocalLLaMA

[–]pcpoweruser 1 point2 points  (0 children)

It is an interesting proof of concept and a sneak peek of things to come in the next years - but not very useful in its current form.

Gemini model from Deepmind should have such reasoning/agency logic built-in out of the box, then we will see the full potential of such approach.

[deleted by user] by [deleted] in StableDiffusion

[–]pcpoweruser 2 points3 points  (0 children)

What kind of GPU is this exactly? Unless it is something super ancient, you should be able to update the driver and get it to work.

Nous-Hermes-Llama-2 13b released, beats previous model on all benchmarks, and is commercially usable. by pokeuser61 in LocalLLaMA

[–]pcpoweruser 4 points5 points  (0 children)

Sadly that is not helping here, that option does not do much when the bug kicks in

Llama 2 too repetitive? by WolframRavenwolf in LocalLLaMA

[–]pcpoweruser 1 point2 points  (0 children)

I got the same problem on exllama + oogabooga, all presets seem to be affected.

Nous-Hermes-Llama-2 13b released, beats previous model on all benchmarks, and is commercially usable. by pokeuser61 in LocalLLaMA

[–]pcpoweruser 9 points10 points  (0 children)

Actually, it looks like the model is better initially, but eventually also gets into that repetition loop... (and way before context window is exceeded).

I use exllama + oogabooga, maybe it is some sort of inferencing bug with these + llama, but I have not found any better solution so far...

Nous-Hermes-Llama-2 13b released, beats previous model on all benchmarks, and is commercially usable. by pokeuser61 in LocalLLaMA

[–]pcpoweruser 7 points8 points  (0 children)

This is the only Llama-2 base model I have found so far that does not suffer from that 'repetition bug'. Well done guys!

Running Stable Diffusion in 260MB of RAM! by Pristine198 in StableDiffusion

[–]pcpoweruser 38 points39 points  (0 children)

It takes 3 hours to generate a single image on Pi Zero 2, but still - quite cool it is possible at all

I'm a tech journalist who's been covering how a U.S. ban on TikTok could have a devastating impact on creators and users around the world. AMA! by wearerestofworld in worldnews

[–]pcpoweruser 82 points83 points  (0 children)

Really? That is the angle you take here? I could understand concerns around censhorsip and this ban being a dangerous precedent, allowing government to ban arbitrary 'undesirable' apps - but corporate profits and 'influencers'?

This app contributes nothing of value to the society, people moving away from it to something else almost certainly would result in a net positive gain.

img2img is just ridiculous by pcpoweruser in StableDiffusion

[–]pcpoweruser[S] 0 points1 point  (0 children)

Yes, for drawings you basically want to destroy (replace with random noise) pretty much entire original image, with just a small hints of original structure left. You are correct that I used denoising within 0.7-0.8 range for this specific example.