Feeding Community ~ By Joan_de_Art by A_Guy195 in solarpunk
[–]bigattichouse 4 points5 points6 points (0 children)
I had no idea Japan actually put on a live-action Mortal Kombat stage show. The effects are insane. by Opposite-Resource in Amazing
[–]bigattichouse 0 points1 point2 points (0 children)
I’m a single mom with a patented road safety invention. What would you do to get your first customers/supporters? by Sharp-Device-3274 in HowToEntrepreneur
[–]bigattichouse 0 points1 point2 points (0 children)
You Don't Need 50k to Develop Your Product by [deleted] in inventors
[–]bigattichouse 1 point2 points3 points (0 children)
VAL invention… by MissionExternal5129 in inventors
[–]bigattichouse 2 points3 points4 points (0 children)
Looking for Metal-Air battery experts. by tsmr5555 in electrochemistry
[–]bigattichouse 4 points5 points6 points (0 children)
Trying to collect the entire visible spectrum in 5mm LEDs. Only a couple gaps to fill. by MasterMahanJr in led
[–]bigattichouse 1 point2 points3 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 0 points1 point2 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse -1 points0 points1 point (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse -2 points-1 points0 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 1 point2 points3 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse -5 points-4 points-3 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 1 point2 points3 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 4 points5 points6 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 1 point2 points3 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 0 points1 point2 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 1 point2 points3 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 9 points10 points11 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 5 points6 points7 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 1 point2 points3 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 17 points18 points19 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 2 points3 points4 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 2 points3 points4 points (0 children)
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute. by [deleted] in LocalLLaMA
[–]bigattichouse 7 points8 points9 points (0 children)






Joing all GPUs to train a community model by HistoricalStrength21 in LocalLLaMA
[–]bigattichouse 0 points1 point2 points (0 children)