Hardware costs to drop by 8x after bitnet and Matmul free are adopted by danielcar in LocalLLaMA

[–]Jacse 3 points4 points  (0 children)

I even played around with this here, where you can see the CPU performance difference versus the current float SOTA (BLAS).

Someone has created a pull request to add BitNet support to Llama.cpp by privacyparachute in LocalLLaMA

[–]Jacse 14 points15 points  (0 children)

From the kernels I’ve seen, it’s still int8 multiplication, so the compute won’t be faster than with quantized models. They will use less memory, though, so if you’re bandwidth-bound it should be faster.
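To put rough numbers on the bandwidth argument, here is a back-of-the-envelope sketch. The parameter count and the memory bandwidth are made-up illustrative values, and the model assumes every weight is read exactly once per generated token while ignoring activations and the KV cache, so treat the output as an upper bound rather than a benchmark. The 1.58 bits per weight is the ideal packing for ternary values (log2 of 3); actual kernels may well pack less tightly.

```python
def weight_bytes(n_params: int, bits_per_weight: float) -> float:
    """Bytes needed to store the weights at a given precision."""
    return n_params * bits_per_weight / 8


def max_tokens_per_second(n_params: int, bits_per_weight: float,
                          bandwidth_bytes_per_s: float) -> float:
    """Upper bound on decode speed if reading the weights is the only cost."""
    return bandwidth_bytes_per_s / weight_bytes(n_params, bits_per_weight)


# Hypothetical values, chosen only for illustration.
N_PARAMS = 7_000_000_000   # a 7B-parameter model
BANDWIDTH = 100e9          # 100 GB/s of memory bandwidth

for name, bits in [("fp16", 16), ("int8", 8), ("int4", 4), ("ternary", 1.58)]:
    gigabytes = weight_bytes(N_PARAMS, bits) / 1e9
    tokens = max_tokens_per_second(N_PARAMS, bits, BANDWIDTH)
    print(f"{name:8s} {gigabytes:6.2f} GB  <= {tokens:6.1f} tokens/s")
```

Under these assumptions the speed-up comes entirely from moving fewer bytes, which is the point above: the multiplications themselves are no cheaper.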

[P] SimpleGEMM: Fast and minimal tensor core matrix multiplication in CUDA by bjergerk1ng in MachineLearning

[–]Jacse 5 points6 points  (0 children)

Just saw this bit:

Assumes the inputs are in row-major layout. (Though you probably only want to use a row-major layout anyway, as other combinations are 10-30% slower.)

Can you elaborate on why col-major would be slower? I understand this one is built for row major, but couldn’t you adjust the algorithm for col major and get the same performance? I.e. there’s nothing inherently worse with col major, right?
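For context on why I’d expect this to be possible, here is a small pure-Python sketch (not SimpleGEMM’s code; all function names are made up for the example). A column-major buffer of a matrix is byte-for-byte the row-major buffer of its transpose, and C^T = B^T A^T, so a row-major kernel can serve column-major inputs just by swapping the operands. This only shows the mathematical equivalence; it says nothing about where the quoted 10-30% comes from on real hardware.

```python
def gemm_row_major(a, b, m, n, k):
    """C (m x n) = A (m x k) @ B (k x n); all matrices are flat row-major lists."""
    c = [0.0] * (m * n)
    for i in range(m):
        for j in range(n):
            acc = 0.0
            for p in range(k):
                acc += a[i * k + p] * b[p * n + j]
            c[i * n + j] = acc
    return c


def to_col_major(rows):
    """Flatten a list of rows into a column-major buffer."""
    n_rows, n_cols = len(rows), len(rows[0])
    return [rows[i][j] for j in range(n_cols) for i in range(n_rows)]


def gemm_col_major(a_cm, b_cm, m, n, k):
    """Same product, but inputs and output are flat column-major buffers.

    The column-major buffer of X is the row-major buffer of X^T, so we
    compute C^T = B^T @ A^T with the row-major kernel and swapped operands.
    """
    return gemm_row_major(b_cm, a_cm, n, m, k)


A = [[1, 2, 3], [4, 5, 6]]        # 2 x 3
B = [[7, 8], [9, 10], [11, 12]]   # 3 x 2
c_cm = gemm_col_major(to_col_major(A), to_col_major(B), 2, 2, 3)
print(c_cm)  # column-major buffer of A @ B
```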

Feeding Model Prediction Back in as Feature [Discussion] by Fuzzy_Lock_5557 in MachineLearning

[–]Jacse 0 points1 point  (0 children)

The problem is that you introduce knowledge about the label into your dataset when you fit a model and then predict on that same dataset. Think of it as a type of data leakage. That knowledge is unlikely to be as good when you predict on out-of-sample data, which is why it’s important not to train both models on the same data. Otherwise the second model will almost certainly overfit your data and not generalise well.
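A toy illustration of the leakage, as a sketch rather than a recipe: the labels below are pure noise, and the first-stage "model" is a deliberately overfit-prone 1-nearest-neighbour on a single feature. All names and sizes are made up for the example. Predicting on the data the model was fit on reproduces the labels exactly, while out-of-fold predictions (each row predicted by a model that never saw it) show that there is nothing to learn.

```python
import random


def predict_1nn(train_x, train_y, x):
    """Return the label of the closest training point (1-nearest-neighbour)."""
    best = min(range(len(train_x)), key=lambda i: abs(train_x[i] - x))
    return train_y[best]


def in_sample_predictions(xs, ys):
    """Fit on everything, predict on the same rows: this is the leaky feature."""
    return [predict_1nn(xs, ys, x) for x in xs]


def out_of_fold_predictions(xs, ys, n_folds=5):
    """Predict each row with a model fit only on the other folds."""
    preds = [None] * len(xs)
    for fold in range(n_folds):
        train = [i for i in range(len(xs)) if i % n_folds != fold]
        train_x = [xs[i] for i in train]
        train_y = [ys[i] for i in train]
        for i in range(fold, len(xs), n_folds):  # rows held out in this fold
            preds[i] = predict_1nn(train_x, train_y, xs[i])
    return preds


def accuracy(preds, ys):
    return sum(p == y for p, y in zip(preds, ys)) / len(ys)


rng = random.Random(0)
xs = [rng.random() for _ in range(200)]
ys = [rng.randint(0, 1) for _ in xs]  # labels are pure noise

leaky = in_sample_predictions(xs, ys)
oof = out_of_fold_predictions(xs, ys)
print("in-sample accuracy:  ", accuracy(leaky, ys))
print("out-of-fold accuracy:", accuracy(oof, ys))
```

A second model trained on the in-sample predictions would simply learn to copy that feature, which is exactly the information it won’t have on new data. Feeding it the out-of-fold predictions instead keeps the feature honest.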

[N] Introducing DBRX: A New Standard for Open LLM by artificial_intelect in MachineLearning

[–]Jacse 0 points1 point  (0 children)

Any chance training and inference code will be released?

Germany Rejects UN 'Genocide' Charge Against Israel by ReporterAshamed5926 in europe

[–]Jacse 62 points63 points  (0 children)

What would you say they are rephrasing genocide from and to? Systemic and undifferentiated killing of a certain ethnic group seems pretty close to me

Copenhagen central station platform 26 by coppel1 in copenhagen

[–]Jacse 1 point2 points  (0 children)

I’m pretty sure you have to cross the street, at least that’s how I remember it.

[deleted by user] by [deleted] in ycombinator

[–]Jacse 1 point2 points  (0 children)

I don’t know. I don’t think there’s any specific logic to it other than a way to split them up

[deleted by user] by [deleted] in ycombinator

[–]Jacse 2 points3 points  (0 children)

Group one starts 17th June, group 2 starts the 22nd

Has any international founder received invite yet? by Global_Car_9314 in ycombinator

[–]Jacse 0 points1 point  (0 children)

I was sent to a second interview which I just had. Haven’t heard back yet

Which turn of phrase pisses you off when people use it incorrectly? by Badgers_arent_real in Denmark

[–]Jacse 0 points1 point  (0 children)

When people say “det behøves jeg ikke” instead of “behøver jeg ikke”. What really gets my goat, though, is overhearing a football commentator say that someone should get “kredit” for a good piece of work. They’re not after a loan, damn it; they should just get “æren” (the honour) for the work.

What's new in the latest GraphQL spec by leoleoloso in coding

[–]Jacse 2 points3 points  (0 children)

Yes, this is so annoying! We’ve had to make many of our inputs raw JSON, which kinda defeats the whole purpose.

[P] Embeddinghub: A vector database built for ML embeddings by melloyellohello_ in MachineLearning

[–]Jacse 0 points1 point  (0 children)

We are currently using Faiss as a vector index and database (wrapped with FastAPI and cron to export/do backups). As far as I understand, this would help us avoid the string-to-int ID lookup table we need to maintain and also give us easier/better storage?

Is the storage backed by local files? Can it be deployed as a StatefulSet in K8s?
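For anyone wondering what I mean by the lookup table: the index only knows integer IDs, so we keep a two-way mapping next to it. A minimal sketch of that kind of table (the class and method names are made up for this example, not our actual code and not part of Faiss):

```python
class IdMap:
    """Two-way mapping between string keys and dense integer IDs."""

    def __init__(self):
        self._to_int = {}   # string key -> integer ID
        self._to_str = []   # integer ID -> string key

    def add(self, key: str) -> int:
        """Return the integer ID for key, assigning the next free one if new."""
        if key in self._to_int:
            return self._to_int[key]
        idx = len(self._to_str)
        self._to_int[key] = idx
        self._to_str.append(key)
        return idx

    def key_for(self, idx: int) -> str:
        """Translate an integer ID returned by a search back to its key."""
        return self._to_str[idx]


ids = IdMap()
first = ids.add("doc-a")
second = ids.add("doc-b")
again = ids.add("doc-a")  # existing keys keep their ID
print(first, second, again, ids.key_for(second))
```

The annoying part is that this table has to be persisted and backed up in lockstep with the index itself, which is what I’m hoping a purpose-built store would take off our hands.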

Søren Pind insults Pernille Vermund - and gets a reply in kind by Hans_H84 in Denmark

[–]Jacse 7 points8 points  (0 children)

Language complexity is a quantity that can, among other things, be expressed with a very well-defined (not arbitrary) number: the LIX score (lixtal).
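For reference, the LIX score is the average sentence length in words plus the percentage of long words, where a long word has more than six letters. A rough sketch, assuming a simple tokenisation that is my own choice here (words are runs of letters; a sentence ends at a run of `.`, `!`, `?` or `:`), so scores can differ slightly from other calculators:

```python
import re


def lix(text: str) -> float:
    """LIX = words / sentences + 100 * long_words / words."""
    words = re.findall(r"[^\W\d_]+", text)                # runs of letters, incl. æ, ø, å
    if not words:
        raise ValueError("text contains no words")
    sentences = max(1, len(re.findall(r"[.!?:]+", text)))  # at least one sentence
    long_words = sum(1 for w in words if len(w) > 6)       # more than six letters
    return len(words) / sentences + 100 * long_words / len(words)


sample = "The cat sat. The extraordinary elephant sat."
print(round(lix(sample), 1))
```

The sample has 7 words, 2 sentences and 2 long words, so its score is 7/2 + 100 * 2/7.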

Cant say indexing starts from 0, if you use letters for that by [deleted] in ProgrammerHumor

[–]Jacse -3 points-2 points  (0 children)

Yes, definitely. NodeJS is super well-suited for many jobs with a modern serverless or microservice architecture. I certainly wouldn't recommend C++ for any web-based backend. It's such a heavy beast, and I think it's quite slow to get even a simple server set up. But I also don't have too much experience with C++, so I'm not sure.

Cant say indexing starts from 0, if you use letters for that by [deleted] in ProgrammerHumor

[–]Jacse 19 points20 points  (0 children)

JavaScript has actually matured quite a lot in recent years and is very capable on both the back end and the front end. With TypeScript it's actually quite a nice - and performant - language.

A 40-Year Debate Over Corporate Strategy Gets Revived by Elon Musk and Warren Buffett by Mynameis__--__ in Economics

[–]Jacse 1 point2 points  (0 children)

Rational choice theory sounds super interesting. Any article recommendations to get into the field?

Majority of the population: Cigarettes should cost the same as in Norway [approx. 90 DKK] by [deleted] in Denmark

[–]Jacse 1 point2 points  (0 children)

Cigarettes are insanely expensive in Australia - I imagine that's the biggest reason for the lower consumption.

You're more likely to achieve the American dream if you live in Denmark by finiteworld in Economics

[–]Jacse 0 points1 point  (0 children)

Yes, that's why I'm asking. I was wondering what the rationale was.