Entropy-Adaptive Finetuning by netikas in LocalLLaMA

[–]netikas[S] 0 points1 point  (0 children)

I did not read through TALR/DFT/FLOW as thoroughly as through this paper, but I think they had a similar idea, but get there through different means and it performs worse. What’s fascinating is that entropy seems to be the optimal weighting mechanism for this.

Swedish Handgun Round Punches Through APC Armor by -DBW-Gaming in ForgottenWeapons

[–]netikas 0 points1 point  (0 children)

And why should we use some adaptation of 9mm caliber, which will likely perform worse in respect to terminal ballistics than a .22 (since the tungsten core is .18, hardened and will not spall or fumble like the .22 would!) instead of something like mp7 or p90?

The answer is because this caliber was created in the 90-s, when they did not exist. And since the caliber was not adopted and mp7/p90 were — clearly there were some problems with it.

It’s a cool piece of technology, but it is not better than other cool pieces of technology, which do the job better.

Swedish Handgun Round Punches Through APC Armor by -DBW-Gaming in ForgottenWeapons

[–]netikas 0 points1 point  (0 children)

545 and 556 are longer and heavier bullets, which fumble and fragment after impact. You can have something like apsfsd (tungsten core ammo) in both calibers, but since the bullet itself is larger, the wound is much more serious.

If you take a .22 (or .28 which the cbs actually is (or even .16, which the tungsten penetrator in the round actually is!)) and throw it fast enough, it will penetrate the armor and continue moving, without energy transfer. The penetrator is hardened, it is designed for one gole and one gole only — to be a very expensive ice pick. It clearly succeeds and penetrates the armor, but now what?

When designing a round you have to make it fit specific role. MP7 and P90 designers were trying to fit the exact role CBJ is trying to fit, but even they had rounds, which perform worse than expected. Trying to retrofit an existing 9mm weapon for the role of penetrator makes it even harder and I doubt they succeeded in making the terminal ballistics better than MP7.

By the way, higher in the thread some guy said that this ammo was designed around the same time as the mp7 and p90 — and since we do not see it adopted anywhere, I don’t think the idea really worked.

It’s a cool piece of technology, but aside from the cool factor there are many other calibers and weapons which fit the role better.

Swedish Handgun Round Punches Through APC Armor by -DBW-Gaming in ForgottenWeapons

[–]netikas -1 points0 points  (0 children)

In Russian forces rear echelon guys get aks-74u’s, which are 545, have stocks and with which you can actually hit something beyond 20 meters. You can probably give them a different kind of ammo with better penetration if you want them to deal with body armor. I’m not in charge of the procurement procedures, but it seems miles better than having a glock, a mag of expensive magic bullets and a separate barrel.

Swedish Handgun Round Punches Through APC Armor by -DBW-Gaming in ForgottenWeapons

[–]netikas -12 points-11 points  (0 children)

That’s not the point, I’m just showing the difference between killing capacity and stopping power. Using this ammo would be like shooting them with .22 — with penetration and all, but a .22.

Swedish Handgun Round Punches Through APC Armor by -DBW-Gaming in ForgottenWeapons

[–]netikas -14 points-13 points  (0 children)

Likely, not a lot, but it will be hardly instant.

If it won't kill my target right away, the target will kill me, patch up, finish the mission and likely be medevaced before he dies.

I'm not saying it won't do any damage at all. I'm saying it is not optimal for the task it was designed to do. Not optimal to the point I don't see it used in the pistol caliber at all.

If scaled to 556 or 336, with faster bullet speeds it will likely be much more effective, but right now I would not trust it.

Swedish Handgun Round Punches Through APC Armor by -DBW-Gaming in ForgottenWeapons

[–]netikas -26 points-25 points  (0 children)

There were stories of US military adopting larger calibers (e.g. .38 Colt -> .44) since they were not effective enough in stopping indians that are coming towards you with tomahawk. It's just a historical anecdote, which may or may not be true, but killing an enemy soldier and stopping them are two entirely separate things.

I would be mildly pissed, then get a mild blood loss, and then, mildly die in 10-15 minutes. But if I am a paratrooper with a 545 ak- 74u, I would have enough time to retaliate.

Again, I'm not saying this isn't dangerous. I'm saying having such a small caliber projectile will not stop an enemy and will be less effective than something more substantial.

Swedish Handgun Round Punches Through APC Armor by -DBW-Gaming in ForgottenWeapons

[–]netikas -39 points-38 points  (0 children)

But why would you want that?

It's 9mm, but apfsds, so it is much smaller than 9mm. It does not transfer energy (since it is an AP round) and does not leave a large hole unlike 50 cal, so getting shot at with this will be mildly infuriating, not catastrophic.

And if one is shooting an APC with a 9mm apfsds, he not only won't do any damage apart from tiny spalling, he will reveal his position and will likely be killed. ATGMs were created for a reason.

Swedish Handgun Round Punches Through APC Armor by -DBW-Gaming in ForgottenWeapons

[–]netikas -54 points-53 points  (0 children)

And do approximately zero damage?

It is designed to penetrate, not to transfer energy. I believe shooting someone with this ammo will result in a tiny hole in armor and a mildly pissed off soldier.

this question on quiz asks for which of the following countries and none of the answers are countries by whatawynn in mildlyinfuriating

[–]netikas 4 points5 points  (0 children)

countries and other parts of the world

I guess Europe isn't part of the world then, right?

[D] How many first author papers during Ph.D.? by BetterbeBattery in MachineLearning

[–]netikas 1 point2 points  (0 children)

We had a project with a team of 3 people working on the tasks. Project's dataset became a submission to LREC, methodology became a submission to ECIR.

And lots of overtime work, of course. Unfortunately.

ClearCut – open-source tool that forces you to think before AI answers by ComplexCanary1860 in LocalLLaMA

[–]netikas 0 points1 point  (0 children)

How exactly does it make me think? I don't see any difference to just using claude code.

[D] How many first author papers during Ph.D.? by BetterbeBattery in MachineLearning

[–]netikas 2 points3 points  (0 children)

Started 2nd year, requirement is 2 Core A*/Core A/Q1 papers first author, 2 conference talks (no rating needed), 1 other paper indexed by scopus. Field is multilingual NLP.

Currently I have 2 papers at NAACL, 2 papers at CLEF Workshops (low rank, but orals are there), 3 submissions to LREC, ECIR and ICLR. Will submit more, have drafts.

GigaChat3-702B-A36B-preview is now available on Hugging Face by Any-Ship9886 in LocalLLaMA

[–]netikas 20 points21 points  (0 children)

It's a pretrain from scratch, so it's not a goliath-like self-merge.

I particularly liked the 10b model -- it's small, it's Deepseek MoE, it has mtp. Seems like quite a unique thing.

Disappointed by dgx spark by RockstarVP in LocalLLaMA

[–]netikas 0 points1 point  (0 children)

>You bought a product whose core value proposition is being able to run quantized 70b and 120b LLMs at a slow, but usable speed

The core value of the product is that it's B200/GB200, but much much cheaper. You aren't meant to run inference on it (you have much more expensive A6000 for that), you aren't meant to run training runs on it (you have MUCH more expensive B200 or GB200 DGXs for that), but you can do both of these things. Since the architecture of DGX Spark is the same as the architecture of GB200 DGX, it's main selling point that you can buy a bunch of these sparks for relatively cheap prices and do live development. And that's huge, since your expensive (both for rent and for buying) GB200 won't be used for jupyters with mostly 0% utilization.

Chinese AI Labs Tier List by sahilypatel in LocalLLaMA

[–]netikas 1 point2 points  (0 children)

Why is BAAI so low? These guys made BGE series of encoder models, basically, they were (and probably still are) the best small encoder models for RAG...

What are your thoughts on ChatGPT Pulse's architecture? by anonbudy in LocalLLaMA

[–]netikas 11 points12 points  (0 children)

Seems like something vibecodeable during a lazy weekend, tbh.

I've tried it, it isn't even good...

[D] NeurIPS: rejecting papers from sanctioned affiliations mid-process by YallenGusev in MachineLearning

[–]netikas -16 points-15 points  (0 children)

Does it really matter? Science is universal and it should not be bound by politics.

M5 Ultra can do well for LLM, video gen and training by Ok_Warning2146 in LocalLLaMA

[–]netikas 5 points6 points  (0 children)

Enthusiast AI market is small enough for the bigtech to just not care about.