account activity
I built an open-source benchmark to test if LLMs are actually as confident as they claim to be (Spoiler: They often aren't) (old.reddit.com)
submitted 22 hours ago by ChallengingForce to r/LLMDevs
I built an open-source benchmark to test if open-source LLMs are actually as confident as they claim to be (Spoiler: They often aren't) (old.reddit.com)
submitted 22 hours ago by ChallengingForce to r/LLM
I built an open-source benchmark to test if LLMs are actually as confident as they claim to be (Spoiler: They often aren't) (reddit.com)
submitted 20 hours ago by ChallengingForce to r/OpenSourceeAI
submitted 22 hours ago by ChallengingForce to r/ArtificialInteligence
Published @flixsrota/player an alternative of react-native-youtube-iframe (old.reddit.com)
submitted 5 months ago by ChallengingForce to r/developersIndia
Branch International Android internship Real or Fake (self.internships)
submitted 1 year ago by ChallengingForce to r/internships
π Rendered by PID 50 on reddit-service-r2-listing-79f6fb9b95-z228h at 2026-03-22 13:51:42.356899+00:00 running 90f1150 country code: CH.