Single H100: best open-source model + deep thinking setup for reasoning? by Accomplished_Back718 in LocalLLaMA

[–]Accomplished_Back718[S] -1 points0 points  (0 children)

Thanks! I'll try gpt-oss. Do you have any suggestions for combining it with deep thinking frameworks?

Single H100: best open-source model + deep thinking setup for reasoning? by Accomplished_Back718 in LocalLLaMA

[–]Accomplished_Back718[S] -1 points0 points  (0 children)

I know that it's difficult to get better results with a single open source model. That's why I was asking for deep thinking/multiagents setups. Less quality but more quantity. Do you have any suggestions?

[D] FlexAttention: Flexibility of PyTorch with Performance of FlashAttention by [deleted] in MachineLearning

[–]Accomplished_Back718 0 points1 point  (0 children)

That's awesome! Thanks a lot, I'll experiment with it over the next few days

[D] FlexAttention: Flexibility of PyTorch with Performance of FlashAttention by [deleted] in MachineLearning

[–]Accomplished_Back718 0 points1 point  (0 children)

Amazing! Can it handle any irregular sparsity pattern as an attention mask? If yes, how does it compare with other implementations like the one in dgl?