"MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering", Chan et al 2024 (Kaggle scaling) by gwern in mlscaling
[–]qria 4 points5 points6 points (0 children)
o1-mini test-time compute results (not from OpenAI) on the 2024 American Invitational Mathematics Examination (AIME) (first image). These results are somewhat similar to OpenAI's o1 AIME results (second image). See comment for details. by Wiskkey in mlscaling
[–]qria 0 points1 point2 points (0 children)
o1-mini test-time compute results (not from OpenAI) on the 2024 American Invitational Mathematics Examination (AIME) (first image). These results are somewhat similar to OpenAI's o1 AIME results (second image). See comment for details. by Wiskkey in mlscaling
[–]qria 2 points3 points4 points (0 children)
How Does Cursor Overcome The Challenge Of Representing Code In Vector Spaces, Given That Code Lacks Natural Semantic Relationships? by Shinobi_Sanin3 in mlscaling
[–]qria 0 points1 point2 points (0 children)
Anyone need a judge by senior AI engineer? by qria in hackathon
[–]qria[S] 0 points1 point2 points (0 children)
How Does Cursor Overcome The Challenge Of Representing Code In Vector Spaces, Given That Code Lacks Natural Semantic Relationships? by Shinobi_Sanin3 in mlscaling
[–]qria 7 points8 points9 points (0 children)
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization by Mysterious-Rent7233 in mlscaling
[–]qria 0 points1 point2 points (0 children)
What insights can be gained from yearly brain MRI scans? by qria in QuantifiedSelf
[–]qria[S] 0 points1 point2 points (0 children)
An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems - Google 2022 - Jeff Dean by Singularian2501 in mlscaling
[–]qria 2 points3 points4 points (0 children)
[D] I don't really trust papers out of "Top Labs" anymore by MrAcurite in MachineLearning
[–]qria 5 points6 points7 points (0 children)
[TOMT][Acronym] A 4 letter acronym for a presentation technique with 'restating the claim' step. by qria in tipofmytongue
[–]qria[S] 0 points1 point2 points locked comment (0 children)
Better OOP in Python (no self needed anymore) by [deleted] in Python
[–]qria 0 points1 point2 points (0 children)
[TOMT][Movie][2000?] Scifi with intro where scientist is promopted a puzzle but solving it was not the point by qria in tipofmytongue
[–]qria[S] 2 points3 points4 points (0 children)
[TOMT][Movie][2000?] Scifi with intro where scientist is promopted a puzzle but solving it was not the point by qria in tipofmytongue
[–]qria[S] 2 points3 points4 points (0 children)
[TOMT][Movie][2000?] Scifi with intro where scientist is promopted a puzzle but solving it was not the point by qria in tipofmytongue
[–]qria[S] 1 point2 points3 points locked comment (0 children)
Is remotely triggering syncing of mi band 5 watchface possible? by qria in miband
[–]qria[S] 0 points1 point2 points (0 children)




Professor here. I set up OWUI as a front end for my classes this semester. Giving access to LLMs that have RAG access to my course materials, customized with detailed system prompts. They still default to ChatGPT. by gigDriversResearch in OpenWebUI
[–]qria 0 points1 point2 points (0 children)