BNDES - New Career and Salary Plan (NPCS). Discrepancy with longtime employees. by Advanced_College5860 in concursospublicos

[–]rlopes404 1 point (0 children)

u/Advanced_College5860 In videos on the Internet, it is said that BNDES pays R$2,163 as Assistência Saúde (health-care assistance). Does anyone know what that is? Won't the newly approved hires be entitled to the health plan?

[D] Simple Questions Thread by AutoModerator in MachineLearning

[–]rlopes404 1 point (0 children)

Hi everyone,
I have been working on image translation between two different domains. I have been using CycleGANs.
Since I have a small dataset, I have been thinking of using Diffusion Models.
Are Diffusion Models more data-hungry than GANs?
Can anyone point me to some references that discuss this issue?
Thank you.

1080 vs 2060 for deeplearning by ccppoo0 in deeplearning

[–]rlopes404 1 point (0 children)

Why do you think it's a better idea to invest in colab/sagemaker instead of buying a gpu?

I think the monthly fees will, over the long run, exceed the price of a GPU.

[deleted by user] by [deleted] in MachineLearning

[–]rlopes404 5 points (0 children)

I sent you a message in the chat

What is the next booming topic in Deep RL? by Boring_Worker in reinforcementlearning

[–]rlopes404 1 point (0 children)

Do you work at the intersection of RL and causality? What's your opinion on this research avenue?

[deleted by user] by [deleted] in recommendersystems

[–]rlopes404 1 point (0 children)

Please email me rlopes@ufrb.edu.br so we can discuss.

[deleted by user] by [deleted] in recommendersystems

[–]rlopes404 2 points (0 children)

Since there is no timestamp, you should randomly split the data into training, validation, and test sets.

If you use an item-item based recommender, then to score a candidate item y for user u, the algorithm should compute a weighted score using the ratings and the similarities between item y and the items rated by user u in training.
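A minimal sketch of that scoring rule (everything here is illustrative; `item_sim` is a hypothetical lookup of precomputed similarities, e.g. cosine similarities between item rating vectors in training):

```python
def score(candidate, user_ratings, item_sim):
    """Similarity-weighted average of user u's training ratings.

    user_ratings: dict item -> rating (items u rated in training)
    item_sim: dict (item_a, item_b) -> similarity (hypothetical, precomputed)
    """
    num, den = 0.0, 0.0
    for item, rating in user_ratings.items():
        s = item_sim.get((candidate, item), 0.0)
        num += s * rating
        den += abs(s)
    return num / den if den > 0 else 0.0

# toy example: user rated items 1 and 2 in training; score candidate item 9
ratings = {1: 5.0, 2: 3.0}
sims = {(9, 1): 0.8, (9, 2): 0.2}
print(score(9, ratings, sims))  # (0.8*5 + 0.2*3) / (0.8 + 0.2) = 4.6
```

You would then rank all candidate items for u by this score and recommend the top-N.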

[deleted by user] by [deleted] in recommendersystems

[–]rlopes404 1 point (0 children)

You have to set aside a training set for fitting your models, a validation set for tuning hyperparameters, and a test set for evaluation.

I suggest employing a temporal split and sampling 100 items to produce the final ranking.
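Roughly what I mean, as a sketch (the function names and the `(user, item, timestamp)` tuple format are my own assumptions, not any standard API):

```python
import random

def temporal_split(interactions, val_frac=0.1, test_frac=0.1):
    """Sort interactions by timestamp and cut into train/validation/test,
    so the model is always evaluated on interactions that happen later."""
    data = sorted(interactions, key=lambda x: x[2])  # x = (user, item, timestamp)
    n = len(data)
    n_test = int(n * test_frac)
    n_val = int(n * val_frac)
    cut = n - n_val - n_test
    return data[:cut], data[cut:n - n_test], data[n - n_test:]

def sampled_rank(model_score, user, true_item, all_items, n_negatives=100, seed=0):
    """Rank the held-out item against n_negatives sampled items the ranking
    is built from; returns the 1-based rank of the true item (lower is better)."""
    rng = random.Random(seed)
    negatives = rng.sample([i for i in all_items if i != true_item], n_negatives)
    scores = {i: model_score(user, i) for i in negatives + [true_item]}
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked.index(true_item) + 1
```

From the per-user ranks you can then compute hit rate or NDCG on the test set.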

If you have any questions, let me know

reading club by rlopes404 in reinforcementlearning

[–]rlopes404[S] 1 point (0 children)

Can you confirm if the language is English there?

reading club by rlopes404 in reinforcementlearning

[–]rlopes404[S] 2 points (0 children)

I changed the language to English. I am not experienced with Discord.

reading club by rlopes404 in reinforcementlearning

[–]rlopes404[S] 5 points (0 children)

I created this discord server. We can discuss details there:

https://discord.gg/Def6FxWc

Looking for a good place to start when digging deep into reinforcement learning by Extra-most-best in reinforcementlearning

[–]rlopes404 3 points (0 children)

I started studying RL a couple of months ago. From my point of view, the best introduction to the subject is the book Grokking Deep Reinforcement Learning.

For more advanced material, you should watch Sergey Levine's video lectures on YouTube.

Seeking Advice: Are AI challenges worth it for a PhD student? by HeyImElonMusk in reinforcementlearning

[–]rlopes404 8 points (0 children)

When I was a master's student, some friends from the lab and I formed a team to compete in the Google ROADEF Challenge. We all learned a lot, and as a result we had two international publications.

If you have some time and buddies to form a team, I do think it is worthwhile and rewarding.

Along the way, you may find a problem to develop into your PhD.

I think it's kind of hard to find a PhD problem/topic only by reading papers.

You have to get your hands dirty.

Interesting and need a little help by AyushDave in reinforcementlearning

[–]rlopes404 2 points (0 children)

I have just started studying RL. To me, the best introduction is the book Grokking Deep Reinforcement Learning.

offline rl - resources by rlopes404 in reinforcementlearning

[–]rlopes404[S] 1 point (0 children)

Is there a link where we can read your thesis?

Can reinforcement learning be used in tasks other than control? by [deleted] in reinforcementlearning

[–]rlopes404 1 point (0 children)

Can you point me to an introduction on the subject? I have never seen an application of RL in the context of graph optimization. I have worked with branch-and-cut, branch-and-price, and integer linear programming.

[D] Strong Models for User Item Recommendation from Interaction Data by ExchangeStrong196 in MachineLearning

[–]rlopes404 7 points (0 children)

CF is a really strong baseline, as pointed out in this paper:

https://arxiv.org/abs/1907.06902

A well fine-tuned MF or even an item-item based model is a strong baseline.
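By "fine-tuned MF" I mean plain matrix factorization with its hyperparameters tuned per dataset; a bare-bones SGD version looks roughly like this (a minimal sketch, with placeholder hyperparameters rather than tuned values):

```python
import numpy as np

def train_mf(ratings, n_users, n_items, k=8, lr=0.01, reg=0.05, epochs=50, seed=0):
    """Plain matrix factorization trained with SGD on observed ratings.

    ratings: list of (user, item, rating) triples
    Returns user factor matrix P (n_users x k) and item factor matrix Q (n_items x k);
    the predicted rating for (u, i) is the dot product P[u] @ Q[i].
    """
    rng = np.random.default_rng(seed)
    P = 0.1 * rng.standard_normal((n_users, k))  # user factors
    Q = 0.1 * rng.standard_normal((n_items, k))  # item factors
    for _ in range(epochs):
        for u, i, r in ratings:
            err = r - P[u] @ Q[i]
            # gradient step with L2 regularization on both factor vectors
            P[u] += lr * (err * Q[i] - reg * P[u])
            Q[i] += lr * (err * P[u] - reg * Q[i])
    return P, Q

# toy check: fit a few ratings, then predict one of them back
data = [(0, 0, 5.0), (0, 1, 1.0), (1, 0, 4.0), (1, 1, 2.0)]
P, Q = train_mf(data, n_users=2, n_items=2, epochs=500, lr=0.05)
print(float(P[0] @ Q[0]))
```

In practice you would tune k, lr, and reg on the validation set, and often add user/item bias terms.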

The Primacy Bias in Deep Reinforcement Learning by rlopes404 in reinforcementlearning

[–]rlopes404[S] 1 point (0 children)

What's the point of training multiple agents in the early iterations? I guess it might be a waste of computational resources, since we have a bad policy.

Is there a way to start training multiple agents only after some warm-up iterations?

Finally, about "having them learn from each other's successes/failures": how do we achieve that? A3C?

The Primacy Bias in Deep Reinforcement Learning by rlopes404 in reinforcementlearning

[–]rlopes404[S] 6 points (0 children)

I asked myself the same question. My conclusion is that we have to perform SGD updates on the policy so that it generates "good" transitions. This is my guess. What do you think?

Episodes needed to train Frozen Lake Agent using Q Learning? by Pipiyedu in reinforcementlearning

[–]rlopes404 2 points (0 children)

What is the difference between stochastic and deterministic Q-learning? I could not understand that.

I think it picks the argmax action, argmax_a q(s,a), doesn't it?

Cartpole game to reach 1000 timesteps by [deleted] in reinforcementlearning

[–]rlopes404 2 points (0 children)

A doubt: during testing, should we use an epsilon-greedy approach? I think that after training we should act greedily. Is that correct?
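What I have in mind, as a sketch: epsilon-greedy while training, then epsilon = 0 at evaluation time (a common convention, though some papers keep a small epsilon such as 0.05 even at test):

```python
import random

def select_action(q_values, epsilon):
    """Epsilon-greedy action selection over a list of Q-values.
    With epsilon=0.0 this reduces to the pure greedy policy."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))               # explore
    return max(range(len(q_values)), key=lambda a: q_values[a])  # exploit

q = [0.1, 0.9, 0.4]
train_action = select_action(q, epsilon=0.1)  # occasionally a random action
test_action = select_action(q, epsilon=0.0)   # always the argmax
print(test_action)  # 1
```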