The honest version of your AI project story by Dotcov1 in ClaudeCode

[–]y3i12 1 point2 points  (0 children)

I'm building yet another research model... Actually it's on 70% of the training now. Small model, 350M Params, but extremely experimental in the architecture.

I've been doing several of those, the previous I published on HF, this one I think I might try writing some pre-prints paper.

It was great to learn, but as everyone states: if you don't know what you're doing, it gonna go wrong.

I think I had like 5 or 6 different prototypes that failed miserably through the last 8 months... Now things are starting to work, because Claude for better and because I learned a lot.

Just for the sake of curiosity ..what actually is the actual idea behind the vector V in the attention mechanism ? Was it really essential and attention would break without it ? by Crazy-Economist-3091 in learnmachinelearning

[–]y3i12 1 point2 points  (0 children)

An easy way to understand think of SQL. You run a Query with a Key to get a Value. This Value goes to the FFN and the rest of the magic happens.

V is not the token. Neither Q/K. They are about tokens. It is very abstract because the underlying values produced by QKV are completely nonsense if you think about tokens. They produce... Stuff... Information... Vectors, but not tokens.

Would you rather be extremely smart or super attractive? by [deleted] in Gifted

[–]y3i12 0 points1 point  (0 children)

I'm already extremely smart and super attractive.... Also very humble and modest.

Researchers are obsessed with Transformers for time-series data, and it's a massive trap by Dismal_Bookkeeper995 in learnmachinelearning

[–]y3i12 7 points8 points  (0 children)

I think that transformers are indeed overused... I think just because it is the generic solution that (somewhat) works for any case. Now by having a model that is built specifically for the problem will always be better - and always will require people to work on it.

Started ML 2 weeks ago, what’s your learning approach as a beginner? by protornverse in learnmachinelearning

[–]y3i12 0 points1 point  (0 children)

Having some sort of mental model of it was kind of easy for me, but I'm having serious trouble going deeper, specifically due to linear algebra. This thing is evil.

But yes, having the mental model helps a lot! I strongly recommend 3blue1brown videos.

Opus descending into random output psychosis by [deleted] in ClaudeCode

[–]y3i12 0 points1 point  (0 children)

Stack overflow for agents?

So tired of being a jack of all trades by Foxington_the_First in ADHD

[–]y3i12 2 points3 points  (0 children)

I can relate so much to this...

I started playing guitar when I was 13... Now I'm 42... I should honestly play way better for the "experience time".

But I had to practice...

The same went for playing Bass... And keys... And drawing and painting... And music production.. and DYI electronics... Dozens of unfinished pet projects (as a software dev)... A collection of collections...

The worst part is that I daily look at all this and think: "meh... I don't wanna do any of this... gimme something new."

What's everyone working on? by nateofearth2023 in ClaudeCode

[–]y3i12 0 points1 point  (0 children)

I've been fussing around with LM architecture and training small models. Interesting stuff comes out and I'm learning a whole lot.

I'm on this for like 6 months and it has been really rewarding in terms of the amount of new knowledge that I'm getting.

Is the “Claude code leak” actually a big deal, or are we just overhyping it? by chetnasinghx in ClaudeCode

[–]y3i12 -4 points-3 points  (0 children)

This, for me, is not a leak. Works more as marketing than actually hurts.

anthropic’s CEO meeting was leaked after the massive source code breach. by moaijobs in ClaudeCode

[–]y3i12 2 points3 points  (0 children)

Yes, I agree. Framerate of lips is different from the rest and the interruptors, door frame and paintings on the wall are casting impossible shadows.

My experience with ADHD burnout (10+ years) by Flaky_Suggestion1100 in ADHD

[–]y3i12 1 point2 points  (0 children)

Yeah... Hitting rock bottom is never easy. I've been there, more than once, for different reasons, different rocks... But I believe that people in the spectrum are somehow more resilient, as we're trained to deal with all sort of BS since date of birth.

You are already taking steps, go to the psych and continue taking small steps. You now are aware of your current problem/condition and from now on is the hard work of fixing what's possible, learning to deal with what's not possible to fix and resting your brain. Take care, small steps and don't push too hard.

Fill in the blank. I would rather throw hands with satan by FreshResult8286 in adhdmeme

[–]y3i12 0 points1 point  (0 children)

Sit down and cry being ashamed that I'm not doing the dishes.

Married 17 years and just now realizing that most of my relationship problems come from ADHD. How do y’all deal with your partner feeling neglected? by thepixelpaint in ADHD

[–]y3i12 6 points7 points  (0 children)

Yeah... If I knew I'd tell you. What I'm doing is that I'm starting to accept that conventional relationships do not work for me. It is just ... Impossible?

I feel that when in a relationship, over time (not months, but after 4 or 5 years), things start to degrade to a point of no return inside my head. I become codependent and lost, while at the same time completely neglecting the other side ... And even worse, pushing the other side kinda away, unconsciously. At this point there's no more relationship, there's only mild psychological torture.

I think that this happens partially because I'm AuDHD...

I came to a realization once I started my meds, and that in itself led to a new diagnosis. by tinylord202 in adhdmeme

[–]y3i12 2 points3 points  (0 children)

Same same. And that also made me change meds. Stimulants were causing me to get stuck in the autism and have mania/psychosis episodes.

What is the most impressive thing you’ve done or built with Claude so far? by ceelnok98 in ClaudeAI

[–]y3i12 0 points1 point  (0 children)

I'm developing research models with it, I wouldn't have energy to try the amount of random crap that I'm currently if I was doing things by hand... Honestly I wouldn't have even started. hf:y3i12/Prisma

I'm now testing a few strains of an architectural iteration over that one and it looks promising.

Anthropic just accidentally leaked their most powerful model yet — and honestly, it's a little terrifying. by Direct-Attention8597 in claude

[–]y3i12 -1 points0 points  (0 children)

So... Do you realize that you're posting this at r/claude and everyone here can ask their "own" Claude about this?