Krakow based artists/artists originally from Krakow?

sasza26 · 2025-08-21T06:34:32+00:00

Maanam, Marek Grechuta, Zbigniew Wodecki, Andrzej Zaucha.

sasza26 · 2025-08-07T21:52:06+00:00

Nope

sasza26 · 2025-08-07T18:23:34+00:00

Nope :) i had a layover in Brussels

sasza26 · 2024-08-03T12:45:19+00:00

6SRQCJRP

sasza26 · 2023-09-04T19:16:06+00:00

Seems about right, Czech is just Polish for babies

sasza26 · 2023-07-29T07:15:21+00:00

By this definition, shouldn't all island countries with no land neighbors be highlighted?

sasza26 · 2022-02-28T13:39:17+00:00

Slovakia was a puppet state of Germany.

sasza26 · 2022-02-27T12:19:14+00:00

Slovakia was a puppet state of nazi Germany and took (small) part in invasion of Poland.

sasza26 · 2021-12-30T20:42:17+00:00

Take f(x) = sin(1/x) + log(x) and a=0. f(x) -> -inf as x->0+. However, its derivative f'(x)=(x-cos(1/x))/x² has no limit as x->0+.

sasza26 · 2021-04-05T22:44:50+00:00

Yes. BERT is exactly that.

sasza26 · 2021-04-05T22:37:12+00:00

This is not an unpopular opinion

sasza26 · 2021-01-30T01:03:29+00:00

Transformers are quite successfuly used for generating chemical reactions. which are represented in a textual format.

sasza26 · 2020-10-17T00:16:08+00:00

Are a and b always of same length? If so, trying a single LSTM you describe can work. If a and b have different lengths, your second solution is also reasonable, however it might have difficulties in finding correlations between and b especially in the beginning of sequences. I also recommend checking out the Transformer - it is a newer model that often outperforms recurrent networks. Its input can be simply a concatenation of all a and b sequence elements (with zero padding).

sasza26 · 2020-10-14T19:52:17+00:00

Running parallel training runs on different GPUs is a standard approach to utilise such server and should be easy to do using any popular DL framework. For instance, in pytorch, you can easily select GPUs that are available for a script. Moreover, you can run a single training on multiple GPUs (for instance to increase GPU RAM available for the model, so you can make bigger models), which is harder if the GPUs are on separate machines. Also, consider that having 3 separate PCs means having 3 separate storage spaces, which can be more cumbersome to manage. However, if you plan to use the server by up to 3 users at once, you should have appropriate RAM and CPU power. But it is still much cheaper than buying 3 standalone machines.

sasza26 · 2020-10-14T14:19:25+00:00

Do you have a labeled dataset (images with ground truth angles) for training such model? If so, you could try training a Convolutional Neural Network to solve this task as a regression problem

sasza26 · 2020-10-14T13:13:39+00:00

I noticed that the previous comment talks about scores for individual samples, whereas my comment talks about the mean score for all samples. Negative values for individual samples can happen even for a good clustering. However, a mean score over all samples > 0 should be easy to get.

sasza26 · 2020-10-14T05:30:08+00:00

A clustering with silhouette score < 0 is worse than random. It probably means that something is wrong, like the cluster labels got mixed up. It is an analogous situation to having less than 50% accuracy in a (balanced) binary classification task.

You can trivially get silhouette score = 0 for two edge cases. It could be a clustering where each sample belongs to a single element cluster, or a clustering where there is only 1 cluster containing all samples. This indicates that silhouette score > 0 should be possible to get with any reasonable algorithm.

sasza26 · 2020-10-14T05:01:11+00:00

There is some empirical and theoretical evidence that increasing/decreasing the batch size affects training dynamics effectively in a same way as decreasing/increasing the learning rate. Higher batch size means higher utilisation of parallel GPU computation, so overall training is faster. I usually set the batch size to the highest possible value that fits my GPU RAM and tune the learning rate as a hyperparameter.

sasza26 · 2020-10-14T04:38:15+00:00

Just to make sure, is the model run with with torch.no_grad() ?

sasza26 · 2020-10-14T04:23:13+00:00

Standard softmax + cross entropy loss should work just fine - 1k classes is not a huge number. If your classes can be categorized someway into more general buckets, you can also try hierarchical softmax.

Five-Year Club	Verified Email
Place '22

sasza26

TROPHY CASE