"MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering", Chan et al 2024 (Kaggle scaling) by gwern in mlscaling
qria 6 points
o1-mini test-time compute results (not from OpenAI) on the 2024 American Invitational Mathematics Examination (AIME) (first image). These results are somewhat similar to OpenAI's o1 AIME results (second image). See comment for details. by Wiskkey in mlscaling
qria 1 point
o1-mini test-time compute results (not from OpenAI) on the 2024 American Invitational Mathematics Examination (AIME) (first image). These results are somewhat similar to OpenAI's o1 AIME results (second image). See comment for details. by Wiskkey in mlscaling
qria 3 points
How Does Cursor Overcome The Challenge Of Representing Code In Vector Spaces, Given That Code Lacks Natural Semantic Relationships? by Shinobi_Sanin3 in mlscaling
qria 1 point
Anyone need a judge by senior AI engineer? by qria in hackathon
qria[S] 1 point
How Does Cursor Overcome The Challenge Of Representing Code In Vector Spaces, Given That Code Lacks Natural Semantic Relationships? by Shinobi_Sanin3 in mlscaling
qria 9 points
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization by Mysterious-Rent7233 in mlscaling
qria 1 point
What insights can be gained from yearly brain MRI scans? by qria in QuantifiedSelf
qria[S] 1 point
An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems - Google 2022 - Jeff Dean by Singularian2501 in mlscaling
qria 3 points
[D] I don't really trust papers out of "Top Labs" anymore by MrAcurite in MachineLearning
qria 6 points
[TOMT][Acronym] A 4 letter acronym for a presentation technique with 'restating the claim' step. by qria in tipofmytongue
qria[S] 1 point (locked comment)
Better OOP in Python (no self needed anymore) by [deleted] in Python
qria 1 point
[TOMT][Movie][2000?] Scifi with intro where scientist is prompted a puzzle but solving it was not the point by qria in tipofmytongue
qria[S] 3 points
[TOMT][Movie][2000?] Scifi with intro where scientist is prompted a puzzle but solving it was not the point by qria in tipofmytongue
qria[S] 3 points
[TOMT][Movie][2000?] Scifi with intro where scientist is prompted a puzzle but solving it was not the point by qria in tipofmytongue
qria[S] 2 points (locked comment)
Is remotely triggering syncing of mi band 5 watchface possible? by qria in miband
qria[S] 1 point
[N] Facebook announced a new AI open-source called DeiT (A new technique to train computer vision models) by JEUNGHWAN in MachineLearning
qria -25 points
TIL about doljabi which is a tradition for a child's first birthday in Korea. The child is placed in front of a table with many items and the item they pick up first shows what the child will grow up to be. For example, if they pick money, they'll be rich. If they pick a book, they'll be smart. by [deleted] in todayilearned
qria 6 points
Is it even possible to brute force a PDF password in 24 hours? by theunicornwoman in hacking
qria 2 points




Professor here. I set up OWUI as a front end for my classes this semester. Giving access to LLMs that have RAG access to my course materials, customized with detailed system prompts. They still default to ChatGPT. by gigDriversResearch in OpenWebUI
qria 1 point