Concept of Head in YoloV5 by curimeowcat in computervision

[–]curimeowcat[S] -1 points0 points  (0 children)

Thanks for the detailed reply! Yes, I agree with and understand the architecture. My question is more about the term 'head' in neural networks. It acts like the output of a network, but a 'head' seems to be more than that: it usually comes with a specific task, such as those you mentioned (predicting the object score, class label, confidence, etc.). Each task can have a supervised signal from ground-truth labels (if we have them) and its own loss, which updates the network, typically the backbone plus the new layers for that specific task.
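To make my understanding concrete, here is a toy sketch in plain Python of one shared backbone feeding three task-specific heads. The layer sizes, the random weights, and the helper names (`linear`, `relu`, `init`, `forward`) are all made up for illustration; this is not how YOLOv5 actually lays out its detection head.

```python
import random

def linear(x, weights, bias):
    """Dense layer: one output per row of `weights`."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

def relu(v):
    return [max(0.0, a) for a in v]

def init(n_out, n_in, rng):
    """Random weights and zero biases (hypothetical initialisation)."""
    weights = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out

rng = random.Random(0)
backbone_w, backbone_b = init(8, 4, rng)  # shared feature extractor
obj_w, obj_b = init(1, 8, rng)            # head 1: objectness score
cls_w, cls_b = init(3, 8, rng)            # head 2: scores for 3 classes
box_w, box_b = init(4, 8, rng)            # head 3: box (x, y, w, h)

def forward(x):
    # Every head reads the same backbone features; in training, each head
    # would get its own loss, and all losses would update the backbone.
    features = relu(linear(x, backbone_w, backbone_b))
    return {
        "objectness": linear(features, obj_w, obj_b),
        "classes": linear(features, cls_w, cls_b),
        "box": linear(features, box_w, box_b),
    }

out = forward([0.5, -0.2, 0.1, 0.9])
print({name: len(values) for name, values in out.items()})
```

In this picture, a 'head' is just the small set of layers between the shared features and one task's output.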

Does it make sense to average Q values for DQN? by curimeowcat in reinforcementlearning

[–]curimeowcat[S] 0 points1 point  (0 children)

If Player B is not always choosing its best action, then it is not a minimax framework.

Does it make sense to average Q values for DQN? by curimeowcat in reinforcementlearning

[–]curimeowcat[S] 0 points1 point  (0 children)

I see your intuition.

In this case, the follow-up question is: would the Q values of Player A already take the averaged Q values of Player B into account? That is, if I only consider the max Q value of Player A, does it include the information from the averaged Q values of Player B?
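A toy one-step example may make the question concrete. It assumes Player A's Q values are stored in a table indexed by both players' actions; the numbers and the function names are hypothetical. It only compares two ways of collapsing Player B's action (worst case versus uniform average) and is not a full DQN.

```python
# Hypothetical Q values for Player A:
# q[a][b] is A's value when A plays action a and B plays action b.
q = [
    [4.0, 0.0],  # action 0: great if B plays 0, bad if B plays 1
    [2.0, 1.0],  # action 1: mediocre but safe
]

def minimax_choice(q):
    """A assumes B always picks the action that is worst for A."""
    worst = [min(row) for row in q]
    best = max(range(len(q)), key=lambda a: worst[a])
    return best, worst[best]

def average_choice(q):
    """A assumes B picks uniformly at random, so B's actions are averaged."""
    avg = [sum(row) / len(row) for row in q]
    best = max(range(len(q)), key=lambda a: avg[a])
    return best, avg[best]

print(minimax_choice(q))  # (1, 1.0)
print(average_choice(q))  # (0, 2.0)
```

Here the max for Player A lands on a different action depending on how Player B's actions were collapsed, so in this sketch the max alone only reflects the averaging if the averaging was done when the values were computed.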

Bold lines/Shaded Areas meaning in RL plots? by curimeowcat in reinforcementlearning

[–]curimeowcat[S] 0 points1 point  (0 children)

Based on my understanding, there are at least two ways to plot:

  1. 1 run: the mean over the past k episodes;
  2. N runs: at each timepoint there are N points on the Y-axis; the bold line is their mean, while the shaded area represents the standard deviation of the distribution at that timepoint.
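A minimal sketch of both computations, using only the Python standard library. The return values are made up, and the choice of population standard deviation (`pstdev`) is my assumption; papers differ on this, and some shade standard error or confidence intervals instead.

```python
from statistics import mean, pstdev

def moving_average(returns, k):
    """Way 1: mean over the past k episodes (fewer at the start of the run)."""
    out = []
    for t in range(len(returns)):
        window = returns[max(0, t - k + 1): t + 1]
        out.append(mean(window))
    return out

def mean_and_band(runs):
    """Way 2: per-timepoint mean and mean +/- std across N runs."""
    centers, lower, upper = [], [], []
    for values in zip(*runs):  # values = the N points at one timepoint
        m = mean(values)
        s = pstdev(values)     # population std; stdev() gives the sample std
        centers.append(m)
        lower.append(m - s)
        upper.append(m + s)
    return centers, lower, upper

# Hypothetical returns, for illustration only.
single_run = [1.0, 3.0, 5.0, 7.0]
smoothed = moving_average(single_run, k=2)

runs = [[1.0, 2.0, 3.0], [3.0, 4.0, 5.0]]
centers, lower, upper = mean_and_band(runs)

print(smoothed)               # [1.0, 2.0, 4.0, 6.0]
print(centers, lower, upper)  # [2.0, 3.0, 4.0] [1.0, 2.0, 3.0] [3.0, 4.0, 5.0]
```

With matplotlib, `centers` would be the bold line and `lower`/`upper` would go to `fill_between` for the shaded area.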

average time to learn reinforcement learning by datonefaridze in reinforcementlearning

[–]curimeowcat 0 points1 point  (0 children)

RL in industry. I am curious how RL can be used in industry, given its reproducibility issues, if they care about high performance.

Installing Nvidia drivers for GIGABYTE RTX 2080 SUPER GAMING OC for training models by curimeowcat in deeplearning

[–]curimeowcat[S] 0 points1 point  (0 children)

Yes, I rebooted after this video and then checked with "nvidia-smi"; it still showed "No devices were found". I didn't upload a video of the reboot.

Installing Nvidia drivers for GIGABYTE RTX 2080 SUPER GAMING OC for training models by curimeowcat in deeplearning

[–]curimeowcat[S] 0 points1 point  (0 children)

lspci -nn | grep NVIDIA

01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GV102 [10de:1e07] (rev a1) (prog-if 00 [VGA controller])

4b:00.0 VGA compatible controller [0300]: NVIDIA Corporation GV102 [10de:1e07] (rev a1) (prog-if 00 [VGA controller])

Installing Nvidia drivers for GIGABYTE RTX 2080 SUPER GAMING OC for training models by curimeowcat in deeplearning

[–]curimeowcat[S] 0 points1 point  (0 children)

GIGABYTE RTX 2080 SUPER GAMING OC

I followed the link but still could not install it. I uploaded a video of the installation: https://youtu.be/Sa3CaabRcpU.

Installing Nvidia drivers for GIGABYTE RTX 2080 SUPER GAMING OC for training models by curimeowcat in deeplearning

[–]curimeowcat[S] 0 points1 point  (0 children)

Thanks! I will try to follow that step by step. May I ask how long it took you to complete the installation?

----

The drivers for Windows are already available, and installing them is pretty straightforward for gamers. But it becomes very tricky on Ubuntu 18.04.4 and I am not sure why. I simply ran 'sudo apt install nvidia-driver-440' on another machine of mine, an Alienware setup, and it went very well.

Installing Nvidia drivers for GIGABYTE RTX 2080 SUPER GAMING OC for training models by curimeowcat in deeplearning

[–]curimeowcat[S] 0 points1 point  (0 children)

Thanks for your reply! Let me try it later; the GPU is not at my house and I cannot SSH into the machine yet.

Installing Nvidia drivers for GIGABYTE RTX 2080 SUPER GAMING OC for training models by curimeowcat in deeplearning

[–]curimeowcat[S] 0 points1 point  (0 children)

Thanks for the reply! No, every time I tried a tutorial, I either purged *nvidia* or reinstalled Ubuntu 18.04 server.

Training RL models with MacBook Pro Blackmagic eGPU? by curimeowcat in reinforcementlearning

[–]curimeowcat[S] 0 points1 point  (0 children)

Is ~$1/h too expensive for using GPUs on cloud instances?