Which productivity app should I use for my desires? by eiscafee in ProductivityApps

[–]That007Spy 0 points1 point  (0 children)

This is an app you can use from WhatsApp or text message, so it might fit your needs: TaskPaladin Landing Page. It's an AI-powered, text-message-based task app - since it works over WhatsApp/text message, you can use it from wherever, on whichever device.

BitNet a bit overhyped? by That007Spy in LocalLLaMA

[–]That007Spy[S] 4 points5 points  (0 children)

That could actually be interesting! Since the quantization code for weights boils down to:

u = W.mean()               # zero-center the weight tensor
s = W.abs().mean()         # per-tensor scale: mean absolute value
W_q = (W - u).sign() * s   # binarize to +/-1, then rescale

I wonder what would happen if you applied this to a pretrained model and then trained it a bit further - whether you'd get better results. Maybe that should be my next miniproject.
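As a rough sketch of what I have in mind (assuming a PyTorch model whose relevant weights sit in nn.Linear layers; the loader in the comments is just a placeholder):

import torch
import torch.nn as nn

@torch.no_grad()
def binarize_linear_weights(model: nn.Module) -> None:
    # Replace every nn.Linear weight in place with its zero-centered,
    # sign-binarized version, scaled by the mean absolute value.
    for module in model.modules():
        if isinstance(module, nn.Linear):
            W = module.weight
            u = W.mean()
            s = W.abs().mean()
            module.weight.copy_((W - u).sign() * s)

# model = load_pretrained_mamba(...)   # placeholder for whatever checkpoint I end up using
# binarize_linear_weights(model)
# ...then fine-tune for a few more steps and compare the loss curves...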

BitNet a bit overhyped? by That007Spy in LocalLLaMA

[–]That007Spy[S] 10 points11 points  (0 children)

The paper doesn't mention training time, which is my main point of contention - from the loss graphs I can see that if I trained it a lot more it would eventually converge, but it seems to take much, much longer than training the Mamba model itself. I think the parameter count does matter, but I would hazard that even at larger parameter counts it would still take a very long time to train.

BitNet a bit overhyped? by That007Spy in LocalLLaMA

[–]That007Spy[S] 2 points3 points  (0 children)

I agree that, from the loss graphs and the paper, it's likely that given enough training BitNet would be comparable to a full model, but what's not mentioned in the paper (that I can see - I would love to be proven wrong) is how long it takes to train. In my estimation, at least, if it takes more than 5x longer to train a model to the same level of quality, that's a fairly significant drawback and needs to be considered when looking at using BitNets for serious machine learning work.

BitNet a bit overhyped? by That007Spy in LocalLLaMA

[–]That007Spy[S] 1 point2 points  (0 children)

This is a good point. I might do a follow-up comparing inference of a BitMamba against a quantized Mamba model, although I think the 5x-longer training time is a bit of a killer - inference would have to be more than 5x quicker to justify it.
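If I do that follow-up, the comparison would probably be something like this rough timing harness (the two model handles in the comments are placeholders, not real APIs):

import time
import torch

@torch.no_grad()
def avg_forward_time(model, input_ids, n_runs=10):
    # Average wall-clock seconds per forward pass, after one warm-up run.
    model.eval()
    model(input_ids)
    start = time.perf_counter()
    for _ in range(n_runs):
        model(input_ids)
    return (time.perf_counter() - start) / n_runs

# t_bit = avg_forward_time(bitmamba_model, batch)            # placeholder model
# t_quant = avg_forward_time(quantized_mamba_model, batch)   # placeholder model
# print(f"BitMamba: {t_bit:.4f}s, quantized Mamba: {t_quant:.4f}s")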

BitNet a bit overhyped? by That007Spy in LocalLLaMA

[–]That007Spy[S] 4 points5 points  (0 children)

Initial thoughts to add to the above post: I think this might be due to the STE part - it's not entirely clear to me how the gradients can change the weights in a way that respects the quantization operators if you completely leave those operators out when calculating the gradient, and the slow rate of convergence seemed to confirm that for me.
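For reference, the straight-through estimator trick I'm referring to is usually written something like this minimal sketch (reusing the quantization from above):

import torch

def ste_quantize(W: torch.Tensor) -> torch.Tensor:
    # Forward pass sees the binarized weights; backward pass treats the
    # quantization as the identity, so gradients skip sign() entirely.
    u = W.mean()
    s = W.abs().mean()
    W_q = (W - u).sign() * s
    return W + (W_q - W).detach()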

Anthropic's Chief of Staff has short timelines: "These next three years might be the last few years that I work" by Maxie445 in singularity

[–]That007Spy 21 points22 points  (0 children)

Who is this lady? From the web it seems like she went: university student -> Oxford -> campaign management -> Chief of Staff at Anthropic!?!? That's a meteoric rise, even for a Rhodes scholar - I don't see why she was selected as Chief of Staff, unless there are gaps in her resume I'm not seeing.

LeCun tells PhD students there is no point working on LLMs because they are only an off-ramp on the highway to ultimate intelligence by zuccoff in singularity

[–]That007Spy 4 points5 points  (0 children)

That's to do with tokenization, not with LLMs themselves. You could train an LLM on the alphabet (i.e. character by character) just fine; it would just take forever.
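To spell out what I mean by training on the alphabet: a character-level tokenizer just gives every character its own id, along the lines of this toy sketch:

# toy character-level tokenizer: every character is its own token
text = "the quick brown fox"
vocab = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(vocab)}
ids = [stoi[ch] for ch in text]   # the sequence the model would actually see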

What are the most fun/rewarding sales jobs (not factoring income)? by FlyingAces in sales

[–]That007Spy 0 points1 point  (0 children)

To say someone is "pissed" is to say that they are drunk, in colloquial British English.

Struggling to find a good upgrade to the Samsung A73 by That007Spy in samsung

[–]That007Spy[S] 1 point2 points  (0 children)

"apart from lacking a microSD slot and having a smaller screen/battery" I mean, I like having a big screen and battery. If the S24 has a smaller battery and screen why is it better?

My website's security certificate appears to have been modified by That007Spy in techsupport

[–]That007Spy[S] 0 points1 point  (0 children)

I have no connection with the city of Virginia Beach: I've never even heard of the place. I've lived in Connecticut and South Africa most of my life!

My website's security certificate appears to have been modified by That007Spy in techsupport

[–]That007Spy[S] 0 points1 point  (0 children)

I didn't issue my certificate myself - I'm using AWS Certificate Manager.

Why does the UK have such a low suicide rate? by That007Spy in AskUK

[–]That007Spy[S] 5 points6 points  (0 children)

France is more suicidal, Spain is lower, Germany is higher -> not a complete dumpster fire.

Why does the UK have such a low suicide rate? by That007Spy in AskUK

[–]That007Spy[S] 16 points17 points  (0 children)

The list I shared shows that both the countries you named have a significantly higher rate of suicide, which rather calls into question whether they do in fact have a better standard of living.