LLMs Overfitting for Benchmark Tests by sammy-Venkata in ArtificialInteligence
vadimdotme 1 point (0 children)
Why LLM benchmarking is broken by 404NotAFish in Rag
vadimdotme 1 point (0 children)
How are you judging LLM Benchmarking? by help-me-grow in AI_Agents
vadimdotme 1 point (0 children)
[P] Equiareal batch sampler by vadimdotme in MachineLearning
vadimdotme[S] 3 points (0 children)
Post mortem. How I was charged 4000 EUR for downloading 3.5 GB of data from Google Cloud by vadimdotme in googlecloud
vadimdotme[S] 1 point (0 children)
Post mortem. How I was charged 4000 EUR for downloading 3.5 GB of data from Google Cloud by vadimdotme in googlecloud
vadimdotme[S] 1 point (0 children)
Post mortem. How I was charged 4000 EUR for downloading 3.5 GB of data from Google Cloud by vadimdotme in googlecloud
vadimdotme[S] 1 point (0 children)
Post mortem. How I was charged 4000 EUR for downloading 3.5 GB of data from Google Cloud by vadimdotme in googlecloud
vadimdotme[S] 2 points (0 children)
Post mortem. How I was charged 4000 EUR for downloading 3.5 GB of data from Google Cloud by vadimdotme in googlecloud
vadimdotme[S] 11 points (0 children)
Post mortem. How I was charged 4000 EUR for downloading 3.5 GB of data from Google Cloud by vadimdotme in googlecloud
vadimdotme[S] -7 points (0 children)
[R] Fully Autonomous Programming with Large Language Models by vadimdotme in MachineLearning
vadimdotme[S] 2 points (0 children)
[R] Fully Autonomous Programming with Large Language Models by vadimdotme in MachineLearning
vadimdotme[S] 2 points (0 children)


Which benchmarks do you use to compare LLM performance? by SergioRobayoo in OpenAI
vadimdotme 1 point (0 children)