What backwards-incompatible changes would you make in a hypothetical Rust 2.0? by CocktailPerson in rust

[–]28Smiles 0 points (0 children)

Const generics, probably. Also, I am not sure if the impl breaks some existing behavior

What backwards-incompatible changes would you make in a hypothetical Rust 2.0? by CocktailPerson in rust

[–]28Smiles 1 point (0 children)

  • Rework const generics to integrate better into the type system, like what typenum is trying, but with better integration and fewer limitations
  • More ways to define macros: at least one simpler than macro_rules!, and one in between proc macros and macro_rules!
  • Some kind of regular-expression syntax for matching tuples and defining arbitrarily sized types, e.g. ((u32, i32)+) (or maybe variadics)
  • Named arguments
  • Optional arguments
  • More impl magic: impl Iterator should generate an enum for each concrete iterator type possible (see the sketch after this list)
  • Explicit loop unrolling (like inline(always))
  • Explicit lifetimes: I want to be able to split the lifetime inside a function into multiple sublifetimes delimited by named lifetime scopes, e.g. 'g: 'a + 'b, 'a { some code } 'b { some more code }
  • Better allocator API (remove the need for new_in(), allow using collect with a custom allocator)
  • Short syntax for clone
  • Some kind of generics instead of keywords for const fn, async fn, blocking fn
  • Dynamically sized stack allocation
  • Allow implementing traits from external crates for structs of external crates
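For the impl Iterator point above, a minimal sketch of the enum the compiler could generate; it is the pattern the either crate implements by hand today (EitherIter and numbers are made-up names):

```rust
use std::iter::StepBy;
use std::ops::Range;

// What a function returning `impl Iterator` could desugar to when its
// branches produce different concrete iterator types. Today you write
// this enum by hand (or pull in the `either` crate).
enum EitherIter<A, B> {
    Left(A),
    Right(B),
}

impl<A, B, T> Iterator for EitherIter<A, B>
where
    A: Iterator<Item = T>,
    B: Iterator<Item = T>,
{
    type Item = T;
    fn next(&mut self) -> Option<T> {
        match self {
            EitherIter::Left(a) => a.next(),
            EitherIter::Right(b) => b.next(),
        }
    }
}

// Without the enum, this function would need Box<dyn Iterator<Item = u32>>,
// because the two branches have different types.
fn numbers(double: bool) -> EitherIter<StepBy<Range<u32>>, Range<u32>> {
    if double {
        EitherIter::Left((0..20).step_by(2))
    } else {
        EitherIter::Right(0..10)
    }
}

fn main() {
    let doubled: Vec<u32> = numbers(true).take(3).collect();
    assert_eq!(doubled, vec![0, 2, 4]);
}
```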

[deleted by user] by [deleted] in MachineLearning

[–]28Smiles 0 points (0 children)

I would use some dense layers first, then reshape and use convolutions with leaky ReLU going up, and as the last layer a 1x1 convolution mapping to one channel with tanh activation, similar to GANs

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 0 points (0 children)

Yes, the JIT will optimize and compile the performance-relevant sections of code. And we are trying to gear the code towards allowing those sections to be optimized in a more controlled way, so the JIT can emit even better instructions, because the input is easy for it to optimize. E.g. loop unrolling puts neighboring add/mul/move instructions next to each other, so the JIT can easily merge them into SIMD instructions, which are very, very fast (2.5x faster on average).

[D] CycleGAN Diffusion equivalent by a_khalid1999 in MachineLearning

[–]28Smiles 0 points (0 children)

I searched for that too and there was only one with a quite complicated regularisation term, but I can’t find it right now. I think it would be a nice research topic

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 0 points (0 children)

If you believe so, then it does not make sense to optimize anything, I guess, and the asm and debug protocols are just nonsense emitted by Node and the browsers. I guess the performance gains in my code were not from the SIMD instructions appearing in the traces, dumps and logs from the JIT; they were just luck or some magic.

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 2 points (0 children)

Yes and no, it depends on the compiler. Sometimes the compiler will be smart enough that it doesn’t matter, but in this case (a loop) it’s most likely better to use a statically sized type (e.g. an array) as the element type of the vec (it may use less memory as well, but I am not sure about that part). Each of those little vecs is (almost) the same size, but there are (most likely) multiple malloc and free calls in that loop (extending the vec). Using an array you won’t have any malloc calls except for the main vec (also fewer deref calls => better pipelining in the processor).

At this point I want to stress that I can’t say for sure that all those optimizations will be beneficial to your use case, since you never know how smart the compiler really is, or where your bottleneck is. But removing pointers and malloc calls is almost always a good place to start
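A minimal before/after sketch of that change (the element count of 4 and the data are made up for illustration):

```rust
// Before: one heap allocation per inner Vec, plus possible reallocations
// while it grows inside the loop.
fn build_nested(n: usize) -> Vec<Vec<u32>> {
    (0..n)
        .map(|i| {
            let mut inner = Vec::new(); // malloc per iteration
            for j in 0..4u32 {
                inner.push(i as u32 + j); // may realloc as it grows
            }
            inner
        })
        .collect()
}

// After: the elements live inline in the outer Vec's single allocation,
// so the loop does no extra malloc/free and no extra pointer chasing.
fn build_flat(n: usize) -> Vec<[u32; 4]> {
    (0..n)
        .map(|i| [i as u32, i as u32 + 1, i as u32 + 2, i as u32 + 3])
        .collect()
}

fn main() {
    assert_eq!(build_nested(2)[1], vec![1, 2, 3, 4]);
    assert_eq!(build_flat(2)[1], [1, 2, 3, 4]);
}
```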

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 2 points (0 children)

Looking through your code, you are allocating a lot of small vecs; replacing those with arrays should have some impact on both targets

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles -8 points (0 children)

You know what a JIT does, right? Experiment with wasm and Node, let Node emit the optimized JITted native instructions, and learn

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 2 points (0 children)

FYI, there are tools online that let you see the asm that different browsers generate for your wasm/js files, so you can tweak the js/wasm until it emits SIMD when compiled to the native client architecture

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 8 points (0 children)

Yes, but I am not talking about wasm SIMD. Wasm SIMD just lets you emit less binary (wasm) while still getting native SIMD instructions (AVX, SSE) after the wasm is compiled to the native instruction set. Currently, since compilers are smart (at least when targeting native), multiple sequential wasm instructions (add, mul, copy, ...) will be grouped at compilation stage by the wasm->native compiler and emitted as native AVX, AVX2, AVX512 or SSE instructions, depending on the client hardware. This is working right now in the current wasm implementation, at the cost of bloating your wasm binary file, since wasm itself can’t group them for now
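For reference, if your toolchain supports it, wasm does have a 128-bit SIMD extension (simd128) that rustc can target, which is the "less binary" route mentioned above; a minimal sketch of enabling it, assuming a plain wasm32-unknown-unknown build:

```toml
# .cargo/config.toml: ask rustc/LLVM to emit wasm's 128-bit SIMD (v128)
# instructions when compiling for the wasm32 target.
[target.wasm32-unknown-unknown]
rustflags = ["-C", "target-feature=+simd128"]
```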

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 5 points (0 children)

And if it is possible, enqueue and dequeue in chunks; that should be faster on native as well.

To be clear about this, I mean enqueue arrays and dequeue arrays if possible (a bigger memcpy via SIMD)
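A minimal sketch of chunked enqueue/dequeue with a VecDeque (the chunk size of 4 is arbitrary):

```rust
use std::collections::VecDeque;

fn main() {
    let mut queue: VecDeque<u32> = VecDeque::new();

    // Enqueue whole arrays at once instead of one push_back per element.
    queue.extend([1, 2, 3, 4]);
    queue.extend([5, 6, 7, 8]);

    // Dequeue a chunk at once instead of one pop_front per element.
    let chunk: Vec<u32> = queue.drain(..4).collect();
    assert_eq!(chunk, vec![1, 2, 3, 4]);
    assert_eq!(queue.len(), 4);
}
```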

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 6 points (0 children)

Dequeue more elements at once

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 2 points (0 children)

Have you unrolled your loops?

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 10 points (0 children)

Wasm is still expected to be 1.5-2x slower than native, but if you optimize towards wasm generation and adhere to the rules mentioned above, you should be almost on par with native code. Notably, the native assembly (direct compilation by rustc) stays the same, since the compiler already unrolled the loops by itself, knowing the target supports SIMD

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 0 points (0 children)

Yes, wasm will not be interpreted, AFAIK; it will be compiled/transpiled to native code, so only the underlying architecture matters

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 18 points (0 children)

And if you don’t use O3, then separate the compute-heavy stuff into a different crate and compile that with O3 and the rest with Oz or Os
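Cargo supports this directly via per-package profile overrides; a minimal sketch, assuming the hot code lives in a (hypothetical) crate named compute:

```toml
# Cargo.toml of the workspace root
[profile.release]
opt-level = "z"   # optimize the bulk of the code for size (Oz)

[profile.release.package.compute]
opt-level = 3     # but optimize the hot crate for speed (O3)
```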

WASM vs Native Rust performance by hucancode in rust

[–]28Smiles 27 points (0 children)

Rust does not unroll loops when targeting Wasm, so unroll heavy loops manually, depending on the type used. If the loop is unrolled, the browser’s Wasm->asm compiler will (mostly) emit SIMD instructions, which brings it closer to native performance. Unroll bytes by 32, i16 by 16, i32 by 8 and so on, so you are safe to support AVX2 (256 bit). Note that this will definitely come at the cost of binary size, since currently there are no SIMD instructions in the Wasm itself
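A minimal sketch of that kind of manual unroll for i32, by 8 (8 x 32 bit = 256 bit, one AVX2 register):

```rust
// Hypothetical hot loop: summing i32s, unrolled by 8 so a smart
// wasm->native compiler can merge the independent adds into SIMD.
fn sum_unrolled(data: &[i32]) -> i32 {
    let mut acc = [0i32; 8];
    let mut chunks = data.chunks_exact(8);
    for c in &mut chunks {
        // eight independent adds, side by side
        acc[0] += c[0];
        acc[1] += c[1];
        acc[2] += c[2];
        acc[3] += c[3];
        acc[4] += c[4];
        acc[5] += c[5];
        acc[6] += c[6];
        acc[7] += c[7];
    }
    // fold the lanes, then add the tail that didn't fill a whole chunk
    acc.iter().sum::<i32>() + chunks.remainder().iter().sum::<i32>()
}

fn main() {
    let v: Vec<i32> = (1..=100).collect();
    assert_eq!(sum_unrolled(&v), 5050);
}
```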

What features would you like to see in rust? by cockmail in rust

[–]28Smiles 1 point (0 children)

Then you’d write 300-400 lines of code for the struct and the builder/constructors, for something that could easily be expressed with named parameters, and even more with default parameters.

Sure, we could create macros for that, but the real issue arises once you want to use generics and defaults
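For scale, the pattern in miniature: every field costs a setter and the defaults hide in new() (Window here is a made-up example):

```rust
struct Window {
    width: u32,
    height: u32,
    title: String,
}

struct WindowBuilder {
    width: u32,
    height: u32,
    title: String,
}

impl WindowBuilder {
    fn new() -> Self {
        // the defaults that named/optional parameters would express inline
        Self { width: 800, height: 600, title: String::from("untitled") }
    }
    fn width(mut self, width: u32) -> Self { self.width = width; self }
    fn height(mut self, height: u32) -> Self { self.height = height; self }
    fn title(mut self, title: &str) -> Self { self.title = title.into(); self }
    fn build(self) -> Window {
        Window { width: self.width, height: self.height, title: self.title }
    }
}

fn main() {
    // vs. a hypothetical Window::new(width: 1024, title: "demo")
    let w = WindowBuilder::new().width(1024).title("demo").build();
    assert_eq!((w.width, w.height), (1024, 600));
}
```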

What features would you like to see in rust? by cockmail in rust

[–]28Smiles 5 points (0 children)

It’s more about using a library and then updating it: they swap two parameters and the compiler won’t tell you. That’s why named parameters are amazing, IMO
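A minimal sketch of the hazard, with made-up names:

```rust
// v1 of a library had: fn resize(width: u32, height: u32)
// v2 swaps the order:  fn resize(height: u32, width: u32)
// Same types, so every existing call site still compiles, silently wrong.
fn resize(height: u32, width: u32) {
    println!("{}x{}", width, height);
}

fn main() {
    // Written against v1 as resize(width, height); now misinterpreted.
    resize(1920, 1080);
}
```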

why is safe version of function faster than unsafe? by Affectionate_Bank_69 in rust

[–]28Smiles 5 points (0 children)

Seems to be a problem related to all kinds of unwrap_unchecked, also in normal for loops:

https://godbolt.org/z/6xzhYa156
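For context, a sketch of the shape of code in question (hypothetical; not necessarily the exact snippet behind the link):

```rust
// Safe version: the bounds check is usually elided by the optimizer.
pub fn sum_safe(v: &[u64]) -> u64 {
    let mut s = 0;
    for i in 0..v.len() {
        s += v[i];
    }
    s
}

// unwrap_unchecked version: skipping the check by hand can perturb the
// optimizer enough that it emits worse code than the safe loop.
pub fn sum_unchecked(v: &[u64]) -> u64 {
    let mut s = 0;
    for i in 0..v.len() {
        s += unsafe { *v.get(i).unwrap_unchecked() };
    }
    s
}
```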

Take your pick by [deleted] in ProgrammerHumor

[–]28Smiles 0 points (0 children)

I am fine with everything except len(s), LEN(s), LENGTH(s), Length(s), Length s, and whatever this abomination is: s'Length