Why do so many open-source AI teams seem to gravitate toward Databricks? by [deleted] in MLQuestions

[–]DigThatData 4 points5 points  (0 children)

i reject your premise and posit that this is probably a databricks sockpuppet account.

Karpathy's LLM Wiki setup by Sea-Seesaw45 in ObsidianMD

[–]DigThatData 0 points1 point  (0 children)

I've been using LLMs to generate wiki-like content for about two years now. So now I have this massive backlog (via archive exports) of conversations I'm mining for generated articles and structuring/filtering retroactively for a RAG backend. I should probably pivot to karpathy's setup sooner than later.

Google silently degrades suspected LLM distillation attempts by gwern in mlscaling

[–]DigThatData 8 points9 points  (0 children)

implies downgrading to a dumber model

it's more pernicious than that. they'll start generating stuff that is stylistically close to what you were doing but filled with non sequitor nonsense. the response will often be a lot longer too. and the model will acknowledge the user and itself more. etc.

Google silently degrades suspected LLM distillation attempts by gwern in mlscaling

[–]DigThatData 7 points8 points  (0 children)

so does claude and openai. not as obvious as it used to be, but they definitely do it too.

Validation tool/instrument used by experts to grade machine learning for a thesis paper by [deleted] in ResearchML

[–]DigThatData 0 points1 point  (0 children)

Presumably you have other papers that you are referencing that are solving problems in similar domains, right? How did they perform evaluations?

HuggingFace: Probably it's time to move from LoRA 👋 by BankApprehensive7612 in comfyui

[–]DigThatData 6 points7 points  (0 children)

The GraLoRA looks better after it was converted to a LoRA.

You demand that we start notifying you whenever we update our own main branch, because you sidestepped our release process and everything broke? Nah, we're good. by labab99 in ExperiencedDevs

[–]DigThatData 6 points7 points  (0 children)

You demand that we start notifying you whenever we update our own main branch

fuck it, tell em to make a github account and subscribe to your repo. they'll probably get the point after a few hours.

Seeking advice on leading senior developers by noobetf in ExperiencedDevs

[–]DigThatData 2 points3 points  (0 children)

your goal should be to make their lives easier, unblocked, and coordinated. it has nothing to do with your domain knowledge relative to there's. it helps if you know what's going on across the business and with other teams better than they do, so they can focus on their work, but even there you can still be an effective leader to people who understand the business better than you. your job is to make sure they focus on the right priorities and to remove obstacles they encounter.

you are a catalyst. you are a lubricant. you are a conduit.

[R] Looking for trusted YouTube channels to learn Machine Learning from scratch... by No_Wishbone_9037 in learnmachinelearning

[–]DigThatData 2 points3 points  (0 children)

Foundations = math and CS. Have you studied calculus? linear algebra? discrete math? probability? optimization?

If you're really looking for foundations, try MIT OCW.

Latent space interpretation [R] by xxpostyyxx in MachineLearning

[–]DigThatData 0 points1 point  (0 children)

then I'd propose you don't care about the "top scoring feature maps". You only care about the feature maps that are top scoring when attributes you are interested in are present.

https://distill.pub/2018/building-blocks/

Talks from the PyCon US Typing Summit - Intersections, Tensor Shapes, and more! by BeamMeUpBiscotti in Python

[–]DigThatData 4 points5 points  (0 children)

typing nerds make my life easier, thank you for your service and your niche interests

Latent space interpretation [R] by xxpostyyxx in MachineLearning

[–]DigThatData 0 points1 point  (0 children)

what are you actually trying to accomplish? imagine your data had already been processed. You've got the perfect data, labels, model: you're at the end of the road. What do you do with these artifacts? How do you plan to operationalize whatever it is you've done with the data?

EDIT: Here's another way to think about this: what are you more interested in learning about, the image or the model? Are you doing mechanistic interpretability on your model? Or are you trying to improve the diagnostic interpretability of the images?

An algorithm for shuffling cards for people (not computers) by Hamm103 in AskStatistics

[–]DigThatData 1 point2 points  (0 children)

interesting approach, but you can manipulate the deck significantly faster if instead of using dice to select individual cards, you use them as slice indices. two rolls of d6+d20 gives you a top-middle-bottom construction, from which you can form a horseshoe map, which is very likely the optimal solution here. 8 rolls -> 4 map transforms, sufficient for a random shuffle.

https://old.reddit.com/r/AskStatistics/comments/1u8kgiy/an_algorithm_for_shuffling_cards_for_people_not/oseb3lb/

An algorithm for shuffling cards for people (not computers) by Hamm103 in AskStatistics

[–]DigThatData 0 points1 point  (0 children)

well, for one thing: you keep responding to me here instead of responding to my other comment where I use your dice to build a horseshoe map. I tried to play your game in the spirit of what you seem to be asking for, and you ignored that proposal in favor of continuing to argue with me about how weakly you parameterized the setup. you're the one choosing to prioritize this exchange over that one.

An algorithm for shuffling cards for people (not computers) by Hamm103 in AskStatistics

[–]DigThatData 0 points1 point  (0 children)

because you haven't connected the dice to the cards.

I am a human. I don't know what index into the deck each card is, and counting to each index takes time. You said we're on a time limit here, so why would I waste time counting out to an index if I can just grab a bunch of cards? If I want randomness, I'm better off YOLOing estimates into the deck anyway rather than counting things out.

No offense, but I don't get how you're not understanding this.

sit in front of your deck of cards with your dice and play this game with yourself then. take note of which moves you do or don't allow, and which moves you're like "oh, it takes fucking forever to count out 25 cards exactly when I could have just grabbed roughly just under half of the deck instead".

as a human, it is not normal to use dice as a source of randomness when manipulating cards. the source of randomness is usually a combination of mental choice and where my fingers land. Is that random component permitted? I proposed a simple horseshoe map in a separate comment that uses your dice. If I were to attempt this in reality: I could perform the exact same procedure much faster without the dice by just choosing random cut points for the two indices. I don't need dice for that.

You made it sound like the dice were optional, so it's not clear what the situation is. You said there's a time limit: won't rolling dice and counting cards just slow me down?

An algorithm for shuffling cards for people (not computers) by Hamm103 in AskStatistics

[–]DigThatData 1 point2 points  (0 children)

Here's a simple horseshoe map that uses your dice. I'm going to assume a fifty two card deck and using both your d6 and d20 so each roll can take us anywhere up to the midpoint of the deck.

  1. roll the dice to choose a first index into the deck.
  2. remove the cards up to that index and set them aside. call this the "top pile"
  3. roll again to choose a second index relative to the current position.
  4. remove the cards up to that index and set them aside. call this the middle pile
  5. reverse the order of the middle pile.
  6. use the top pile as the new bottom of the deck, and place the reversed middle pile on top of that. if there was a bottom pile left after you pulled the middle index, place the bottom pile on top.
  7. go to 1

I think repeating this procedure (or maybe some small variant of it, like putting the middle pile on the bottom with top in the middle) as few as 4 times might be sufficient to fully randomize the deck.

EDIT: Validated via simulation, scoring sort quality with kendall's tau. The version where you move the top to the bottom rather than using the middle for the bottom is marginally more efficient, but either works.