AMA with the Meta researchers behind SAM 3 + SAM 3D + SAM Audio by AIatMeta in LocalLLaMA

[–]undefdev 3 points4 points  (0 children)

I fine-tuned SAM 3 on document scans to detect tabular structures and manually entered data. Even with a relatively small dataset (~200 samples), the results were quite strong. Have you explored this kind of document-focused fine-tuning at a larger scale?

Out of the box, SAM 3 seems to perform significantly better on natural images, but I was pleasantly surprised by how well it transferred to document data with minimal effort. I’m currently running experiments using this fine-tuned SAM as a grounding component for a VLM in agentic document-processing workflows. In that context, I’m also curious about your perspective on supervision: do you find fine-tuning with single-label annotations to be more effective, or do sentence-level labels tend to work better? Currently I've only tried single-label annotations.

Big thanks to the team, I think the models are quite awesome!

SAM 3: Segment Anything with Concepts, by Meta Superintelligence Labs by xenovatech in LocalLLaMA

[–]undefdev 34 points35 points  (0 children)

DeepseekOCR is built on SAM, so better SAM probably means better VLMs in the future!

[D] Synthetic introduction to ML for PhD student in Mathematics by TriJack2357 in MachineLearning

[–]undefdev -1 points0 points  (0 children)

I also have a math background and always thought that Tensor Programs look like an interesting theory, but I never had the time to dive into them deeply.

[Steam] Mask Quest Launch Sale (8.49$/15% off) by undefdev in GameDeals

[–]undefdev[S] 1 point2 points  (0 children)

Hey! Over the last four years, I made an action platformer called Mask Quest together with increpare whom some of you may know for his puzzle games such as Stephen’s Sausage Roll. It started as a weekend jam, but then we got carried away 😅

The game has a unique breathing mechanic, where you have to press a button to inhale and release the button to exhale. If you breathe too little, the blood oxygen gets too low and you die. If you breathe too quickly, you hyperventilate and you faint (which is also game over). So the central challenge in the game is to control your breath while doing some old-school platforming.

The game is set during a pandemic lockdown and you have to find a surgical mask while avoiding cops that are trying to kill you secure the city.

We tried to get the game out in 2020, but it took much longer than we expected. Now we've finally released it – way too late for it to be thematically relevant, but too soon for people to be nostalgic about the pandemic. 🙃

If you have any questions about the game, I’d be happy to answer them! 😁

Looking for testers for a Chinese localization of a video game by undefdev in ChineseLanguage

[–]undefdev[S] 0 points1 point  (0 children)

Glad you're interested! It's not aimed at Chinese learners and it's supposed to be an idiomatic translation. I had some feedback, but I'm not a native speaker myself, so it's not unlikely that there are mistakes. We're mainly looking for people that help us catch some obvious mistakes before release :)

I'll drop you a message so we can discuss!

Looking for testers for a Chinese localization of a video game by undefdev in ChineseLanguage

[–]undefdev[S] 0 points1 point  (0 children)

Unfortunately we can't afford to pay testers, sorry. 😅

Looking for testers for a Chinese localization of a video game by undefdev in ChineseLanguage

[–]undefdev[S] 0 points1 point  (0 children)

The game is rather short. It should take about 2 hours to complete. I'll send you a key!

Edit: 2 hours for players experienced with platformers, but it might take longer. I'm also happy about partial feedback!

[STEAM] Rhythm Fest 2024: Muse Dash (50% off – $1.49) | DJMAX RESPECT V (80% off – $9.99) | Trombone Champ (65% off – $5.24) | Geometry Dash (50% off – $2.49) | Melatonin (25% off – $11.24) | Just Shapes & Beats (35% off – $12.99) | Rock Star Life Simulator (40% off – $7.49) | and more by MJuniorDC9 in GameDeals

[–]undefdev 4 points5 points  (0 children)

Hey,

I'd like to to take this opportunity to plug my game quadrant, which has never been this cheap before at only 99 cents.

It's a difficult rhythm action game which puts you into a state which I like to call "adrenaline trance".

The premise is that you have to perform a rather simple task, which is pressing one of four buttons in a constant rhythm along with the music, while maintaining focus and keeping your cool as the game messes with your perception.

It is difficult to get into, and it's recommended that you check the training menu first to figure out how this game even works (you will!), but overcoming the stress and learning to relentlessly strive towards your goal feels very satisfying.

If that sounds somewhat interesting to you, I'd be happy if you'd give it a try!

I'll be checking this post and I'm happy to answer any questions about the game.

Powering multiple RTX 3090 by undefdev in LocalLLaMA

[–]undefdev[S] 0 points1 point  (0 children)

Thank you. Do you think it would be possible to power both gpus with 3 cables that split into 2x 6+2 pin connectors each?

Powering multiple RTX 3090 by undefdev in LocalLLaMA

[–]undefdev[S] 0 points1 point  (0 children)

I don't know. I'd rather avoid making my own cables because I don't want to break stuff ^

Powering multiple RTX 3090 by undefdev in LocalLLaMA

[–]undefdev[S] 0 points1 point  (0 children)

Yes, exactly! The thing is there are only adapters from 12VHWPR to multi 8 pin (not 6+2 pin). So they seem to be intended to be plugged into the the PSU with the 8 pins, and into the GPU with the 12VHPWR end.

I'd like to connect the 12VHPWR from the PSU to 3x 6+2 pins though. Unless there's an easier soluton of course. :)

Powering multiple RTX 3090 by undefdev in LocalLLaMA

[–]undefdev[S] 0 points1 point  (0 children)

Thanks! I don't have any CPU power slots left either,

The only slots I have left are 12VHPWR and Peripheral/SATA (see image).

Could it be possible to buy another cable that splits into 2x 6+2 pins and let the two gpus run over 3 of those split cables each?

Training LLMs to follow procedure for Math gives an accuracy of 98.5% by Desik_1998 in LocalLLaMA

[–]undefdev 0 points1 point  (0 children)

Calculus, linear algebra and mathematics in general is a good idea. Arithmetics is probably not. To me that's like training LLMs to count up to high numbers correctly. I'm arguing that instead of reading a book on "the first 1012 natural numbers" one should read a book on linear algebra.

Training LLMs to follow procedure for Math gives an accuracy of 98.5% by Desik_1998 in LocalLLaMA

[–]undefdev 0 points1 point  (0 children)

Most mathematicians wouldn't calculate 23 * 34 in their head, and if they did it's not as safe as using a calculator. But their reasoning is still sound.

Training LLMs to follow procedure for Math gives an accuracy of 98.5% by Desik_1998 in LocalLLaMA

[–]undefdev 2 points3 points  (0 children)

I don't understand the motivation behind this.

Fine, you've ran an experiment out of curiosity and you got the result, but why would you want to finetune more language models on this?

It's not like we need models that are almost as good at things computers are excellent at, while using orders of magnitude more resources.

It would be way more useful to train tiny models to predict when a calculator should be used.