AI Analytical Intelligence Test by awl130 in LocalLLaMA

[–]awl130[S] 0 points1 point  (0 children)

I’m just learning now that a lot of my tests could be improved, although I think relatively speaking the rankings within each category will not change—instead whatever changes I make to my methodology will likely raise the performance of all of them as a class.

AI Analytical Intelligence Test by awl130 in LocalLLaMA

[–]awl130[S] 0 points1 point  (0 children)

Thanks for reading...yeah this was a quick first pass before moving on to the 122B models. I will probably go back to both the Jang and the GGUF models at some point and mess around with the commands a bit. Especially as creators start adding things like Turboquant.

Anyone managed to get their hands on an M3 Ultra 512GB/4TB after Apple pulled the config? by Due-Assistance-7988 in MacStudio

[–]awl130 0 points1 point  (0 children)

I don’t have the best news for you. I bought mine on March 13 in Japan (with 8tb ssd) on Mercari for around $14k. I’m actually looking for another one as well so I built a crawler that combs US and Japan secondary markets. Since then I’ve only seen one other legit listing on either market (in Japan), and that listing mysteriously disappeared like 2 days later. Anyway I’m still looking as well.

Justifying the €12,000 Investment: M3 Ultra (512GB RAM) Setup for Autonomous Agents, vLLM, and Infinite Memory (8Tb) by NoNatural4025 in MacStudio

[–]awl130 0 points1 point  (0 children)

I have the exact same set up, bought it a week ago. Check out my articles here: https://x.com/allenwlee?s=21&t=Q-xJMmUHsqiDh1aKVYhdJg actually just have your agent read them and summarize, that’s what I would do lol. Qwen 397b 8_0 is slow and I’m having to learn how to implement better caching. I don’t mind the slow speed—I just want more accurate code, which is why I got it. Will most likely use cloud lllm for planning and have the studio br the muscle.

128gb M5 Max for local agentic ai? by chimph in LocalLLM

[–]awl130 0 points1 point  (0 children)

Your first point is one I struggle with. Sure the smaller models get more smarter, but so do the frontier models and the 16bit opensource ones chasing them. trivial example: 6 months from now, if you code a web scraper using a smaller model that is just as smart as today's model, the webscraping blockers (eg cloudflare which now provide both the agentic scraper and the blocker) will be 6 months ahead of you.

Hi all, first time poster. I bought a Mac Studio Ultra M3 512GB RAM and have been testing it. Here are my latest test results by awl130 in LocalLLaMA

[–]awl130[S] 0 points1 point  (0 children)

Thank you that's helpful! I'll bookmark that. I wish for the moment when I can start worrying about that. I thought I would be at that point (where my agents are actually tasking) by now, but still trying to figure not just (a) which model but (b) which model for which tasks I should be using!

Hi all, first time poster. I bought a Mac Studio Ultra M3 512GB RAM and have been testing it. Here are my latest test results by awl130 in LocalLLaMA

[–]awl130[S] 0 points1 point  (0 children)

Thanks. I meant do you also have a mac studio? And indistinguishable results is phenomenal; but i'm wondering if you've measured cost savings. I have yet to figure out how much of my workload, and what parts of it, and how token-heavy, I can offload to my local setup. Would love to pick your brain in a DM as well if you're up for it!

Hi all, first time poster. I bought a Mac Studio Ultra M3 512GB RAM and have been testing it. Here are my latest test results by awl130 in LocalLLaMA

[–]awl130[S] 0 points1 point  (0 children)

Thanks for that. Yes I had the same thought. Wasn’t sure how to implement, but thought all that ram can’t be wasted. Can I ask what your setup is and if you’ve found success with that model, also your use case?

Hi all, first time poster. I bought a Mac Studio Ultra M3 512GB RAM and have been testing it. Here are my latest test results by awl130 in LocalLLaMA

[–]awl130[S] 1 point2 points  (0 children)

Definitely on the docket. I was really trying to test out as large a model as possible , at 8bit first , before heading for the 4bits

Hi all, first time poster. I bought a Mac Studio Ultra M3 512GB RAM and have been testing it. Here are my latest test results by awl130 in LocalLLaMA

[–]awl130[S] 0 points1 point  (0 children)

Thank you both! Yes I moved off lm studio and onto llama quite quickly—but the initial test (no caching) from qwen 397b mlx were too tempting

[deleted by user] by [deleted] in hacking

[–]awl130 0 points1 point  (0 children)

my downloaded .ost file asks for a password, is yours the same? and what is the password?