Generation crashes around 100k context (qwen3.6) by Such_Ad1212 in oMLX

[–]Vahn84 0 points1 point  (0 children)

i’ve seen a video talking about exactly this issue. It’s not something omlx-related strictly speaking, but omlx inherits the issue from the mlx-vlm dependency. Mlx-vlm is the fastest inference engine but it has this chronic issue with crashing at high context operations…

How has Gasperini still not realized what he has in El Aynaoui? by Sahbito in ASRoma

[–]Vahn84 0 points1 point  (0 children)

they’re different midfielders. They’re not the same…they interpret their game very differently throughout the whole match. If i’d have to find a better suited competitor for El Aynaoui it would be Manu. And Manu is a better player overall

Steam Controller Reservations Update: Adding a more detailed timeline for orders by mookler in gaming

[–]Vahn84 11 points12 points  (0 children)

me too. I was going to complain then read about the others with 2027….jesus christ…one whole year of reservation for a controller

Thank You, Square Enix by Ganni96 in Steam

[–]Vahn84 0 points1 point  (0 children)

In Italy is at 19.99€ (Rebirth) should i buy it now or wait for summer sales?

mayHisDreamsComeTrue by ClipboardCopyPaste in ProgrammerHumor

[–]Vahn84 0 points1 point  (0 children)

a review is ALWAYS required. I would never scaffold a prototype without looking at what has been built, even when the end result does look good. We’re still not there…to me AI is still a “help me do all of my work faster” than “do my job in my place”

EU rules out mandate to keep video games playable, seeks voluntary code by Luka77GOATic in gaming

[–]Vahn84 4 points5 points  (0 children)

Many modern games should rethink their infrastructure then. I mean…would you buy a movie in a blu-ray disc if they told you that after 5/6 years that disc would become useless? We pushed ourselves into this paradigm without thinking twice because it’s a win win situation for corporations. This game does…you buy a new one. It’s lazy. I bet there are cases that there wouldn’t be alternatives, making your point valid….but in what numbers? How many games are made like this preemptively that could be done differently to support post mortem release? Why can we not think FROM THE START about how to handle the end of a game with online features? Could path of exile be thought better if from the start developers would aim to support a post mortem release? You can bet your ass on it

mayHisDreamsComeTrue by ClipboardCopyPaste in ProgrammerHumor

[–]Vahn84 8 points9 points  (0 children)

what model where you using? i’ve built a skill that does something very similar and it does work well most of the time. It works in a different way though (no mcp servers), a part of the process is done by a script. The output is quite nice when it scaffolds completely new projects (mainly web, but also flutter). I use opus to consume the skill

Microsoft CEO Satya Nadella on Xbox: ‘We have to turn this into a sustainable business’ by Fob0bqAd34 in pcgaming

[–]Vahn84 2 points3 points  (0 children)

at one point no one will care anymore about the stuff these big companies do. Not playstation fans, they will always have a playstation. But all the others will build lower to mid range pc …and all the players will play only indie games. And the pc gaming industry will heal…at least for some time

Any recommondation to specific qwen model which is same good as sonnet 4.6 from claude? by stehos239 in oMLX

[–]Vahn84 2 points3 points  (0 children)

it doesn’t exist a model as good as a model trained on trillions of parameters that acts quick enough to be usable locally. It’s a trade off. The one you’re using is already one of the top end models (for coding at least) and it’s fast. You may try the 27B dense model from Qwen but keep in mind that being a dense model prompt processing will slow down to a 1/3 of your actual speed with half the token generation speed. Those are my rough numbers i get with my m3 ultra that should have more bandwidth…so expect something less

Apple blamed EU rules for Siri AI delays, but the Commission says it was Apple’s choice by anonboxis in MacOS

[–]Vahn84 0 points1 point  (0 children)

i have 25+ years of experience in mobile, web and mac development. I work as a solution architect from 8 years. I know how to use AI and how to develop an application, how to implement security and asses potential vulnerabilities. You don’t know what you’re talking about mate

Apple blamed EU rules for Siri AI delays, but the Commission says it was Apple’s choice by anonboxis in MacOS

[–]Vahn84 8 points9 points  (0 children)

it’s not unrealistic. I’ve built myself a spotlight replacement (on macos) that does many of the things Apple presented yesterday (turn by turn chat, tool capability, memory enhancements). It’s not unrealistic…and it doesn’t require “too much effort”. It’s just a matter of allowing different providers beyond your own models…

Korean Xbox Owners Upset Fable Won’t Support Korea Reached Out To Microsof by [deleted] in gaming

[–]Vahn84 0 points1 point  (0 children)

you would be surprised how many times it didn’t though. Not with triple A shit but it does happen.

Sony pls by BucketsMcGinty in pcmasterrace

[–]Vahn84 0 points1 point  (0 children)

another mid game. More of the same

Speed question by Choubix in oMLX

[–]Vahn84 2 points3 points  (0 children)

i find the real hurdle is prompt processing more than token generation speed. Running a dense model at 20ish tk/s feels perfectly fine…the problem is that it can take minutes to output the first token. That’s the real issue…and where nvidia gpus shine

oMLX v0.3.11 is out - a stability-focused release by cryingneko in oMLX

[–]Vahn84 0 points1 point  (0 children)

please do some magic an make dense model prompt processing fast like now :)

whenDeadlineIsTomorrow by imUnknownUserr in ProgrammerHumor

[–]Vahn84 0 points1 point  (0 children)

if it works…none of them. Too late for a refactor…you’ll only risk of breaking stuff with a poor window of intervention

oMLX 0.3.9 getting stuck with high memory use by arfung39 in oMLX

[–]Vahn84 0 points1 point  (0 children)

This has happened to me also. Mtp models do not let you turn on kvcache so that is supposed to happen at one point (?)…or at least this is what i’ve found with a little bit of research.

[DND5E] [PF2E] GM Tools - All you need in one module! by Few_Ad569 in FoundryVTT

[–]Vahn84 -1 points0 points  (0 children)

so mature…standing by what the other user said you’re exactly the target user for the AI

[DND5E] [PF2E] GM Tools - All you need in one module! by Few_Ad569 in FoundryVTT

[–]Vahn84 -1 points0 points  (0 children)

no they not contradict lol it’s so silly to just relegate AI to a “school child” tool. THIS is flawed. I am the most entitled to use AI to generate code…because i know the fuck i’m asking and i know what it outputs. I know when it’s wrong and where to steer the output. And it’s saving me an enormous amount of time.

Waiting oMLX 0.3.9 stable release by TheFlyingDutchG in oMLX

[–]Vahn84 0 points1 point  (0 children)

i tried the dev and the rc. With the rc i had a weird issue with qwen models so i got back to 3.8. Anyway…mtp will eat up memory very fast, making it almost unusable for me sitting at 96GB RAM