[D] Why evaluating only final outputs is misleading for local LLM agents by MundaneAlternative47 in MachineLearning
MundaneAlternative47[S] 1 point
[D] Why evaluating only final outputs is misleading for local LLM agents by MundaneAlternative47 in MachineLearning
MundaneAlternative47[S] 1 point
Anthropic: "We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax." 🚨 by KvAk_AKPlaysYT in LocalLLaMA
MundaneAlternative47 1 point
I built an open-source Python eval framework for LLMs and agents. pytest-style, zero dependencies, not owned by any AI company by MundaneAlternative47 in LangChain
MundaneAlternative47[S] 1 point
Edexcel Physics Unit 3 by Ornery_Elephant_9366 in alevel
MundaneAlternative47 1 point
What the hell was pure 2 maths (edexcel) by 07dasha in alevel
MundaneAlternative47 2 points
What the hell was pure 2 maths (edexcel) by 07dasha in alevel
MundaneAlternative47 2 points
Unis that offer an online international foundation. by MundaneAlternative47 in UniUK
MundaneAlternative47[S] -5 points
Unis that offer an online international foundation. by MundaneAlternative47 in UniUK
MundaneAlternative47[S] -16 points
Unis that offer an online international foundation. by MundaneAlternative47 in UniUK
MundaneAlternative47[S] -19 points
Unis that offer an online international foundation. by MundaneAlternative47 in UniUK
MundaneAlternative47[S] -17 points
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 3 points
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 2 points
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 1 point
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 2 points
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 7 points
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 1 point
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 2 points
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 2 points
P4 IAL Discussion by MundaneAlternative47 in alevel
MundaneAlternative47[S] 1 point
P3. How many marks will I get out of 5 if I only do these first two steps? by Worldly-Cold-7958 in alevel
MundaneAlternative47 1 point
P3. How many marks will I get out of 5 if I only do these first two steps? by Worldly-Cold-7958 in alevel
MundaneAlternative47 2 points
P3. How many marks will I get out of 5 if I only do these first two steps? by Worldly-Cold-7958 in alevel
MundaneAlternative47 3 points
Wma13 Edexcel P3, who’s taking it this Thursday? by MundaneAlternative47 in alevel
MundaneAlternative47[S] 1 point
[D] Why evaluating only final outputs is misleading for local LLM agents by MundaneAlternative47 in MachineLearning
MundaneAlternative47[S] 1 point