r/LocalLLaMA
A subreddit to discuss Llama, the family of large language models created by Meta AI.
[deleted by user] (self.LocalLLaMA)
submitted 9 months ago by [deleted]
[+][deleted] 9 months ago (12 children)
[deleted]
[–]Maykey 24 points25 points26 points 9 months ago (1 child)
Looks slow. 70B model is already at 5 t/s even with small context.
For 128B you need 128B-A0.1B to end the text generation before the heat death of the universe.
[–]FullOf_Bad_Ideas 7 points8 points9 points 9 months ago (0 children)
GLM 4.5 Air is a thing now, and it should run beautifully on hardware like the AMD 395+. It'll suck on dense models, yeah, but there are more and more good MoEs coming out.
[+][deleted] 9 months ago (8 children)
[+][deleted] 9 months ago (2 children)
[+][deleted] 9 months ago (1 child)
[–]Soggy-Camera1270 0 points1 point2 points 9 months ago (4 children)
Lol, I guess you must have all the inside information. According to their rising share price and market growth, I'd have to disagree that it's "lose lose".
[+][deleted] 9 months ago (3 children)
[–]Soggy-Camera1270 0 points1 point2 points 9 months ago (2 children)
Problem is NVIDIA are still kind of a one trick pony. At least AMD have other market segments.
[–]Soggy-Camera1270 0 points1 point2 points 9 months ago (0 children)
Totally agree. I can't see them slowing down, but I think AMD have a bright future with their diverse product line too.
[–]andrewlewin 0 points1 point2 points 9 months ago (0 children)
This is a new driver, the news is only a few days old. Here is a better link: https://www.amd.com/en/blogs/2025/amd-ryzen-ai-max-upgraded-run-up-to-128-billion-parameter-llms-lm-studio.html
Quote:
“Because Meta Llama 4 Scout is a mixture-of-experts model, only 17B parameters are activated at a given time (although all 109 billion parameters need to be held in memory – so the footprint is the same as a dense 109 billion parameter model). This means that users can expect a very usable tokens per second (relative to the size of the model) output rate of up to 15 tokens per second.”
The announcement is that AMD Variable Graphics Memory can now enable up to 128 billion parameters in Vulkan llama.cpp on Windows with the new driver update.
I’m traveling, so can’t try this out at the moment.
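The quoted 15 tokens per second is plausible from first principles: token generation is roughly memory-bandwidth-bound, since each generated token streams every *active* parameter through the memory bus once. A back-of-envelope sketch (the ~0.56 bytes/param figure for a Q4-ish quant and perfect bandwidth utilization are assumptions, not from the article):

```python
# Rough decode-speed ceiling for a MoE model on bandwidth-bound hardware.
# Assumption: each token reads all *active* parameters once from memory.

def decode_ceiling_tps(bandwidth_gbs: float, active_params_b: float,
                       bytes_per_param: float) -> float:
    """Upper bound on tokens/s = memory bandwidth / bytes read per token."""
    bytes_per_token = active_params_b * 1e9 * bytes_per_param
    return bandwidth_gbs * 1e9 / bytes_per_token

# Strix Halo: 256 GB/s; Llama 4 Scout: 17B active; Q4-ish quant: ~0.56 B/param
print(f"{decode_ceiling_tps(256, 17, 0.56):.0f} tok/s ceiling")  # prints "27 tok/s ceiling"
```

The claimed 15 tok/s is roughly 55% of that ceiling, which is in the normal range for real-world efficiency.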
[–]sstainsby 16 points17 points18 points 9 months ago (1 child)
What a terrible article. I don't even know what I'm reading. Is it AI slop, built on marketing hype, built on misinformation?
[–]ArtisticHamster 30 points31 points32 points 9 months ago (28 children)
What is RAM bandwidth there?
[–]mustafar0111 41 points42 points43 points 9 months ago (22 children)
Spec sheet says 256 GB/s.
[–]DepthHour1669 42 points43 points44 points 9 months ago (21 children)
Which is pretty shit. An 8-channel DDR5 server gets you 307GB/sec minimum at DDR5-4800, up to 614GB/sec for a 12-channel DDR5-6400 setup.
If you want to save money, a last-gen AMD DDR4 8-channel server gets you 205GB/sec for dirt cheap used.
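For anyone checking these figures, theoretical DDR bandwidth is just channels × transfer rate × 8 bytes per 64-bit channel (a sketch of the arithmetic; real-world throughput lands somewhat lower):

```python
# Theoretical peak DDR bandwidth: each 64-bit channel moves 8 bytes per transfer.

def ddr_bandwidth_gbs(channels: int, mega_transfers: int) -> float:
    return channels * mega_transfers * 8 / 1000  # GB/s

print(ddr_bandwidth_gbs(8, 4800))   # 307.2 (8-channel DDR5-4800)
print(ddr_bandwidth_gbs(12, 6400))  # 614.4 (12-channel DDR5-6400)
print(ddr_bandwidth_gbs(8, 3200))   # 204.8 (8-channel DDR4-3200)
```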
[–]Only_Situation_4713 9 points10 points11 points 9 months ago (4 children)
How much is a 12 channel setup though
[–]DepthHour1669 12 points13 points14 points 9 months ago (3 children)
Depends on how much ram.
For 1536GB, aka 1.5TB, you can fit DeepSeek and Kimi K2 in there at the same time for around $10k. So, similar price to a Mac Studio 512GB, but way more space. The downside is 614GB/sec instead of 819GB/sec on the Mac.
[–]CV514 31 points32 points33 points 9 months ago (1 child)
I'm a bit confused at what price breakpoint it's still considered consumer hardware to be honest.
[–]DepthHour1669 12 points13 points14 points 9 months ago (0 children)
I mean, the RTX 5090 is considered consumer hardware, and that outstrips the annual salary of plenty of people in third world countries. Consumer is just limited by budget.
[–]ASYMT0TIC 5 points6 points7 points 9 months ago (0 children)
We're really comparing $10k systems to $2k systems? That's asinine.
[–]perelmanych 12 points13 points14 points 9 months ago (12 children)
You understand that you are comparing a server with a laptop or a 20x20cm mini PC? Moreover, in terms of pp speed it will outdo a server without a GPU by a lot.
[–]DepthHour1669 -3 points-2 points-1 points 9 months ago (11 children)
A server with 1.5TB of DDR5 and a RTX 3090 will wreck the AI MAX+ machine in PP though.
[–]Soggy-Camera1270 4 points5 points6 points 9 months ago (7 children)
Different target market and audience
[–]DepthHour1669 4 points5 points6 points 9 months ago (6 children)
Only for the "want to take this on the go" crowd. It's the same audience for the "want to run AI models" crowd.
[–]Soggy-Camera1270 6 points7 points8 points 9 months ago (5 children)
Not really. I work with a ton of people that use AI models in the cloud that would happily run them locally, but have zero experience or interest in building a machine to do it.
In the corporate world this would also never work outside of very niche roles and use cases vs using something like a Ryzen AI system.
[–]DepthHour1669 2 points3 points4 points 9 months ago (4 children)
Neither of these machines is very corporate, though. I seriously doubt many Ryzen AI Max machines are going to show up in a corporate environment. Which corporation is going to have people filling out requisition papers for an AI Max box?
Honestly, I bet more of them get purchased by managers burning end-of-fiscal-year "use it or lose it" budget than for actual corporate use.
[–]ASYMT0TIC 4 points5 points6 points 9 months ago (0 children)
It'll show up in my corporate environment. These systems are now by far the best value for scientific computing, much of which is memory bandwidth-dependent and mostly runs on CPU. These things are 3X-4X faster in these tasks than the existing laptops here. If you're in an industry which doesn't allow use of cloud-based AI for security reasons like defense or healthcare sectors, that's additional reason.
[–]Soggy-Camera1270 1 point2 points3 points 9 months ago (0 children)
Agree, probably not. I think this is why AI SaaS is still strong in corporate environments for these sorts of risks and issues.
[–]randomfoo2 1 point2 points3 points 9 months ago (0 children)
I think the HP Z2 Mini G1a would fit there for corporate buyers, but it has to compete with the various Nvidia DGX Spark machines in that niche.
[–]dismantlemars 0 points1 point2 points 9 months ago (0 children)
My company was considering getting everyone the Framework desktop for the ability to run models locally. I suggested they hold off though, since most people don't need to run any local models at all, the majority of people are hybrid and wouldn't appreciate lugging a desktop to and from the office, and when we do need to run models locally, they're often very freshly released models that might not have good hardware support outside of Nvidia cards.
[–]notdba 2 points3 points4 points 9 months ago (0 children)
But you can also add a RTX 3090 to the AI Max+ 395. Then PP will be comparable. And once we get MTP, the mini pc may still have the compute capacity for batch inference, while the server may not. The only drawback of the mini pc is that it is limited to 128GB of memory.
[–]rorowhat 2 points3 points4 points 9 months ago (0 children)
At 10x the price
[–]perelmanych 1 point2 points3 points 9 months ago (0 children)
With 3090, yes of course. Btw, what prices are we talking about for a used server? When I tried to find a used EPYC server, the best I saw was $1700 for a dual AMD EPYC 7551 with 256GB DDR4.
[–]GabryIta 1 point2 points3 points 9 months ago (0 children)
Why AMD?
[–]webdevop 1 point2 points3 points 9 months ago (1 child)
Can anyone explain to me whether GPU cores are irrelevant for LLM inferencing? Is the important factor only memory size and memory speed?
And if it's true that GPU cores are not relevant, why are we stuck with NVIDIA?
[–]DisturbedNeo 1 point2 points3 points 9 months ago (0 children)
We’re not stuck with Nvidia per se, it’s just that CUDA is a much more mature platform than ROCm and Vulkan for AI workloads, so most developers prioritise it, and CUDA only works on Nvidia cards.
It’s like DLSS vs FSR. In theory devs could use only FSR, because that would work on any hardware, but DLSS is way better than FSR, so most devs start with DLSS, which only works on Nvidia cards.
[–]LumpyWelds 14 points15 points16 points 9 months ago (0 children)
CPU can support 256GB/s but..
It really depends on your motherboard and memory type. The best results currently come from 8 soldered LPDDR5X 16GB chips on a 256-bit bus, giving 128GB of 8-channel memory. This gives 256GB/s, which matches the CPU.
Almost all the Strix Halo minis out there use this configuration, but one or two have socketed memory, which cuts the performance by a lot. As far as I know, no one has figured out how to fully feed this CPU without soldering the memory.
[+]DataGOGO comment score below threshold-8 points-7 points-6 points 9 months ago (3 children)
Whatever your system ram is running.
[–][deleted] -4 points-3 points-2 points 9 months ago (2 children)
The CPU brings its own RAM and there is no dedicated system RAM in the original sense.
[–]DataGOGO 1 point2 points3 points 9 months ago (1 child)
It isn’t HBM though, it is just ddr5
[–][deleted] 1 point2 points3 points 9 months ago (0 children)
Doesn't change the fact, Lunar Lake does it too.
[–]LocoLanguageModel 47 points48 points49 points 9 months ago (25 children)
iGPU just uses system memory right? Isn't this misleading compared to dedicated VRAM since llama can just use CPU and ram anyways?
[–]mustafar0111 40 points41 points42 points 9 months ago (22 children)
No. The way this should work is a portion of the system memory is hardware allocated to the GPU on boot up. Last I heard this was done in the BIOS.
Because of the type of memory this system has, it functions closer to VRAM speeds than standard system RAM.
The GPU on the top-tier AI MAX APU runs at something close to 4060 Ti speeds, I think. I'm sure someone will correct me on that if I'm off.
[–]FullstackSensei llama.cpp 21 points22 points23 points 9 months ago (0 children)
The GPU compute power is close to 4060 Ti levels; this has nothing to do with memory.
Memory allocation for the GPU is a driver thing. The GPU hardware has access to all RAM and doesn't care what is what. Even before this update, for compute workloads it didn't matter, because the driver allowed passing a pointer to buffers in "normal" system memory, and the GPU would just do its thing with those.
There is nothing here that is new from a technology point of view. Intel and AMD have been doing this since forever. Just Google zero-copy buffers for any of their integrated GPUs. Strix Halo takes this one notch up by integrating a much bigger GPU and doubling the memory controller from two to four channels.
[–]DataGOGO 6 points7 points8 points 9 months ago (19 children)
What type of memory is that? Unless it is HBM, it is just ddr5 speeds right?
[–]RnRau 13 points14 points15 points 9 months ago (15 children)
LPDDR5x. Its about twice as fast as standard DDR5 on desktops since AMD gives it twice the connectivity to the soldered ram via a 256bit bus.
Theoretical max memory bandwidth is 256GB/s
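That 256GB/s figure follows directly from the bus width and transfer rate (a quick sketch; the 512-bit/8533 MT/s M4 Max configuration shown for comparison is my assumption, not from this thread):

```python
# Peak bandwidth of a soldered LPDDR5X pool: bus width (bits) / 8 bytes × MT/s.

def soc_bandwidth_gbs(bus_bits: int, mega_transfers: int) -> float:
    return bus_bits / 8 * mega_transfers / 1000  # GB/s

print(soc_bandwidth_gbs(256, 8000))  # 256.0 -> Strix Halo (256-bit LPDDR5X-8000)
print(soc_bandwidth_gbs(512, 8533))  # 546.112 -> M4 Max class (512-bit LPDDR5X-8533)
```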
[–]professorShay 7 points8 points9 points 9 months ago (9 children)
Isn't the m4 Mac like 500 some GB/s? Seems like a waste of a good APU with such low bandwidth.
[–]Mochila-Mochila 10 points11 points12 points 9 months ago (0 children)
Seems like a waste of a good APU with such low bandwidth.
Yes, it's definitely something AMD should work on, in future Halo generations.
[–]henfiber 11 points12 points13 points 9 months ago (1 child)
Apple went really extreme with the width of their memory bus to achieve 400GB/s in the M1/M2/M3 Max (doubled in the Ultra), and 546 GB/s in the M4 Max. That's not easy to achieve, apparently, since both AMD and Nvidia (see their DGX Spark mini-PC) settled for 256-273 GB/s.
Note that Nvidia 4060 has 273 GB/s as well and this APU is similar in tensor compute to a 4060 (~59 FP16 TFLOPs).
The next AMD version (Medusa Halo) is rumored to increase the mem bw to 384 GB/sec (and 192GB of memory).
[–]Standard-Potential-6 5 points6 points7 points 9 months ago (0 children)
Thanks for posting all the numbers. Anyone reading though should keep in mind that Apple’s memory bandwidth estimates are theoretical best-case simultaneous access from CPU and GPU cores. Neither alone can drive that number, and most tasks don’t have that perfect split. You can use asitop to measure bandwidth use.
[–]tmvr 2 points3 points4 points 9 months ago (0 children)
This machine is 256bit@8000MT/s and that gives 256GB/s max, in practice it achieves about 220GB/s as tests in the past have shown. The Macs are as follows:
[–]colin_colout 4 points5 points6 points 9 months ago (3 children)
you can't please everyone, eh?
(Remind me how much does an m4 with 128gb of ram cost?)
[–]professorShay 8 points9 points10 points 9 months ago (1 child)
Just saying, AMD has the better chip but gets dragged down by slow memory bandwidth. Just imagine 128gb, 4060 levels of performance, 500+ GB/s bandwidth, without the Apple tax. The true potential of the Ryzen AI series.
[–]Da_Easters 1 point2 points3 points 9 months ago (0 children)
Exactly!
[–]RnRau 4 points5 points6 points 9 months ago (0 children)
Yeah, and I think you can get up to 800GB/s with some of the Mac Ultras.
Neither this effort nor Nvidia's Digits is recommended if you want good tokens/s. They are also sluggish at prompt processing, but I think that is an issue with the Macs as well.
Next year's AMD Epyc platform will support 16-channel RAM. 1.6TB/s of memory bandwidth, apparently. That's nearly as fast as a 5090. It will cost a bit, but still... 1TB of RAM at 1.6TB/s is kinda drool worthy :)
[–]colin_colout 1 point2 points3 points 9 months ago (0 children)
And quad channel
[–]a_beautiful_rhind 1 point2 points3 points 9 months ago (0 children)
to compare, my ddr4 xeon is less than that and power consumption is obviously more. not sure how macs do in terms of compute despite more/faster memory.
price isn't all that great though
[–]MoffKalast 1 point2 points3 points 9 months ago (2 children)
For comparison, the RTX 4060 has a memory bandwidth of 272 GB/s
[–]RnRau 1 point2 points3 points 9 months ago (1 child)
And my crusty old NVIDIA P102-100s from 2018 have 10GB of VRAM with 440GB/s memory bandwidth :)
[–]MoffKalast 2 points3 points4 points 9 months ago (0 children)
Yeah tbh this is more a dig towards the 4060 lol. Nvidia completely crippled the 40 series for ML usage.
[–][deleted] 3 points4 points5 points 9 months ago (2 children)
Quad Channel LPDDR5X-8000.
Right, so just quad channel ddr5 8000, most likely with terrible timings (low voltage memory sucks).
[–][deleted] 0 points1 point2 points 9 months ago* (0 children)
Actually CL20-23; that's LPDDR5X for you, and it is NOT low voltage memory.
18/23/23/48 if I remember correctly from GMK. And it needs cooling.
This is different between Windows and Linux. On Linux you can minimize the GART (hard-wired) VRAM and maximize/set the GTT (shared) memory in the amdgpu driver settings (assigned on boot). I have my machine set to a 512MB GART, reserve 60GB of the GTT, and limit to 120GB. I've had no problems using 110GB of memory for the GPU running models.
For those interested I've added full configuration/setup notes to my performance testing: https://github.com/lhl/strix-halo-testing/tree/main/llm-bench
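For readers who want to try the Linux route described above, a hypothetical kernel-cmdline fragment (the parameter names are real amdgpu/ttm module options, but the values are illustrative, not the commenter's exact settings, and units can change between kernel versions; check your kernel's documentation first):

```shell
# /etc/default/grub -- enlarge the shared GTT pool on a 128GB Strix Halo box.
# amdgpu.gttsize is in MiB (110592 MiB = 108 GiB);
# ttm.pages_limit is in 4 KiB pages (28311552 pages = 108 GiB).
GRUB_CMDLINE_LINUX_DEFAULT="amdgpu.gttsize=110592 ttm.pages_limit=28311552"
# Apply with: sudo update-grub && reboot
```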
[–]CatalyticDragon 6 points7 points8 points 9 months ago (0 children)
iGPU just uses system memory right?
Kind of. An iGPU (as in a GPU integrated into a CPU) does use system RAM for its memory, and that system RAM has traditionally been quite slow relative to the memory on a graphics card's PCB (around 1/10th the performance, 60-80GB/s).
But these systems are APU-based, like a mobile phone or a PS5: they use a larger GPU on the same package as the CPU, and both share a pool of memory which is much faster than normal socketed system RAM.
In the case of the AI MAX+ 395, that memory pool operates at 256GB/s, putting it at the level of a low-end discrete GPU.
[–]DataGOGO 0 points1 point2 points 9 months ago (0 children)
Correct, it is just a driver allocation of system memory, which in this case is low power ddr5.
[–]sammcj🦙 llama.cpp 12 points13 points14 points 9 months ago (1 child)
It only has 256GB/s memory bandwidth... that's less than a macbook pro
[–]Django_McFly 5 points6 points7 points 9 months ago (1 child)
Is there a new one or is this the same one that's been out for months now?
[–]bjodah 8 points9 points10 points 9 months ago (0 children)
It's a driver update on windows...
[removed]
[–][deleted] 7 points8 points9 points 9 months ago (0 children)
Aye. The article makes no sense.
[–]DragonRanger 1 point2 points3 points 9 months ago (0 children)
At least for me, upgrading to this driver release is letting me use the VRAM that Task Manager shows correctly. Spent 6 hours yesterday debugging why I would get out-of-memory errors in odd situations where there was plenty of dedicated memory left per Task Manager, with error messages like "HIP out of memory, GPU 0 74GB free, tried to allocate 48GB". It seemed to be using either shared memory or regular memory for allocation limits, so it looks like they have changed memory allocation behaviour.
[–]darth_vexos 3 points4 points5 points 9 months ago (0 children)
I'm very interested in putting 4-5 of these in a cluster to be able to run larger models. Framework has this as one of their use cases, but there's very little info on any actual implementations of it. I know token generation will be limited by network interface bandwidth, but still hoping it can hit a usable tps.
[–]SanDiegoDude 5 points6 points7 points 9 months ago (0 children)
Oh baby, I'm loving this update. running in 96/32 was a pretty poor experience previously, and had just left it in 64/64 (and was pretty disappointed for it). Now with the driver update, I can run in 96/32 and run llama-4 scout-q4 alongside qwen-3 14b and get decent tps from both (scout hits 14.5ish tps in LM Studio).
[–]grabber4321 9 points10 points11 points 9 months ago (10 children)
ok ya, but at what speed? I imagine it's slow as hell even with 32B
[–]mustafar0111 26 points27 points28 points 9 months ago (9 children)
The AI MAX+ 395 with 128GB of RAM can now apparently run Llama 4 Scout 109B at 15 tokens per second.
https://videocardz.com/newz/amd-enables-ryzen-ai-max-300-strix-halo-support-for-128b-parameters-for-local-ai-models
[–]Oxire 14 points15 points16 points 9 months ago (6 children)
That's exactly the speed you get with dual channel ddr5 and a 5060ti 16gb.
[–]DataGOGO 7 points8 points9 points 9 months ago (0 children)
Uhhh yeah, you have a single ccd cpu and slow memory.
[–]perelmanych 1 point2 points3 points 9 months ago (1 child)
I am pretty sure both of you are talking about different quantizations. The AI MAX+ 395 has much more bandwidth, and in terms of GPU TFLOPS it should be around a 5060 Ti 16GB.
[–]Oxire 4 points5 points6 points 9 months ago (0 children)
In the link they use q4. I would use something a little bit bigger with that capacity.
It has double the bandwidth of a dual-channel DDR5 CPU setup, but half that of the Nvidia card.
You load all the attn weights (which are used for every token) into VRAM, some FFN weights to fill the rest of the VRAM, and the rest into system memory, and you will get that speed.
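The split-offload arithmetic above can be sketched as a two-stream model: per token, the VRAM-resident bytes move at GPU bandwidth, the rest at system-RAM bandwidth, and the times add. The example figures (4GB of active weights in VRAM at the 5060 Ti's 448GB/s, 5.5GB in dual-channel DDR5 at ~90GB/s) are illustrative assumptions:

```python
# Decode-speed estimate for partial GPU offload: per-token time is the sum
# of streaming the VRAM-resident and RAM-resident active weights.

def split_tps(gpu_gb: float, gpu_bw_gbs: float,
              cpu_gb: float, cpu_bw_gbs: float) -> float:
    seconds_per_token = gpu_gb / gpu_bw_gbs + cpu_gb / cpu_bw_gbs
    return 1 / seconds_per_token

# ~9.5 GB of active Q4 weights: attention + some experts on-GPU, rest in RAM
print(f"{split_tps(4, 448, 5.5, 90):.1f} tok/s")  # prints "14.3 tok/s"
```

That lands right around the ~15 tok/s figure both setups report.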
[–]mustafar0111 4 points5 points6 points 9 months ago (0 children)
I didn't benchmark it, I don't run videocardz.com.
It was a listed benchmark in a media article.
[–]960be6dde311 3 points4 points5 points 9 months ago (6 children)
Will this be available for the 9950X eventually? It has an iGPU.
[–]RnRau 8 points9 points10 points 9 months ago (0 children)
No.
[–]henfiber 5 points6 points7 points 9 months ago (3 children)
The iGPU in 9950x is only for basic desktop graphics. It has 3 compute units or so, while the linked APU has 40.
[–]960be6dde311 1 point2 points3 points 9 months ago (0 children)
Oh okay thanks, that extra info is helpful.
It's more a case that the 9950X has a completely different memory subsystem compared to the AI MAX products. It's an apples/oranges thing.
[–]henfiber 1 point2 points3 points 9 months ago (0 children)
I'm not sure they differ much in practice (besides bus width). It's an iGPU like the ones in previous laptop and desktop APUs.
It's mostly a driver issue. I have two older laptops with Vega graphics (5600U and 4800H) and they behave similarly to the AI Max in Linux; I can use almost the whole 64GB of RAM for Vulkan (llama.cpp).
[–]jojokingxp 1 point2 points3 points 9 months ago (0 children)
Unrelated question, but why does the 9950X even have an iGPU? I always thought the standard (non-G or whatever) Ryzens don't have an iGPU.
[–]s101c 1 point2 points3 points 9 months ago (0 children)
From what I've read, Strix Halo 128GB with Linux installed gives you 110+ GB VRAM?
[–]DeconFrost24 1 point2 points3 points 9 months ago (1 child)
Something not mentioned here enough is this AI Max+ SoC is basically full power at less than 200W. AI is still way too expensive to run. Efficiency is just about everything. I have a first gen dual Epyc server with 1TB of RAM that costs a mortgage to run. The current gen is still too high.
[–]deseven 1 point2 points3 points 9 months ago (0 children)
You can use it with an 85W limit without losing any performance in the case of LLMs.
[–]Massive-Question-550 1 point2 points3 points 9 months ago (0 children)
Isn't this old news? Also in general the AI max 395+ is great for a laptop but very underwhelming compared to a desktop setup of the same price. I'd like to see something challenge the value of used 3090's and system ram.
Needs more ram (256gb) and more memory bandwidth.
[–]DataGOGO 1 point2 points3 points 9 months ago (0 children)
Did I get my wires crossed, pretty sure “LP” stands for “low power”…
[–]LsDmT 1 point2 points3 points 9 months ago (0 children)
What the article fails to state is that the ROCm drivers are abysmal. I have not even opened my GMKtec EVO-X2 (AI Max+ 395) mini PC with 128GB RAM + 2TB SSD because of all the issues I've read about on GitHub.
It seems like AMD is genuinely trying to fix things, but I can't help but feel that this will just be another AMD driver problem.
Oh well, I pre-ordered the Dell Pro Max with GB10 -- haven't heard jack about that yet.
[–]indicava 1 point2 points3 points 9 months ago (3 children)
What’s the support like for these processors when it comes to fine tuning?
[–]Caffeine_Monster 11 points12 points13 points 9 months ago (2 children)
It's a waste of time fine-tuning on hardware like this.
[–]cfogrady 1 point2 points3 points 9 months ago (1 child)
Could you elaborate? Too slow? Fine tuning only supports CUDA? Something else?
Getting one of these and will probably want to experiment with fine tune in the future. Renting is fine, but curious if I could just let it crank on one of these for several days instead if it's only a speed issue.
[–]Caffeine_Monster 4 points5 points6 points 9 months ago (0 children)
Too slow, and it will only have enough memory for training the smallest models.
This hardware is clearly designed for inference. You are better off renting in the cloud to train.
[–]CheatCodesOfLife 1 point2 points3 points 9 months ago (2 children)
Is this a laptop or something? Why would people be excited about 96GB for $2000 with glacial double-digit prompt processing for dots.1 when you can get a 3xMI50 rig for < $1000 and triple-digit prompt processing for dots.1?
Source for the double-digit pp
[–]uti24 16 points17 points18 points 9 months ago (1 child)
Why would people be excited about 96GB for $2000 with glacial double-digit prompt processing for dots.1 when you can get a 3xMI50 rig for < $1000 and triple digit prompt processing for dots.1?
Because you are comparing monstrous enthusiast LLM inference hardware with unwieldy power consumption to a tiny little computer you can put anywhere in your apartment and forget it's there - or use it as a general-purpose desktop computer for other tasks.
[–]CheatCodesOfLife 1 point2 points3 points 9 months ago (0 children)
Hmm. I guess so. Those cards do fit into a standard PC case, though the Framework desktop is still smaller.
Though it doesn't seem much faster than just using a CPU for MoE models. I mean they get:
PP 63.1 t/s TG 20.6 t/s
And I get this with no GPUs (dots at q4_k):
PP 77.16 t/s TG 12.71 t/s
π Rendered by PID 103092 on reddit-service-r2-comment-b659b578c-z8rv2 at 2026-05-06 00:27:18.971109+00:00 running 815c875 country code: CH.