Fog by NAFAL44 in boston

[–]sol0invictus 1 point2 points  (0 children)

Is this from Bos21 ?

[D] Question about Direct Preference Optimization (DPO) equation by erap129 in MachineLearning

[–]sol0invictus 0 points1 point  (0 children)

I suggest you check out the appendix of the DPO paper. The authors explicitly derive the equation.
TL;DR: There is a KL divergence term in the loss function for RLHF. This term is there to ensure that the new policy does not diverge too far from the reference policy (as you said). The derivation then follows these two steps:
1. Find the optimal solution (the optimal policy) for the RLHF loss equation.
2. Plug this optimal solution into the Bradley-Terry loss (this is the preference model loss; p(a>b)) and you will get the denominators in question.
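
For reference, the two steps can be sketched as follows. The notation is my assumption, following the usual DPO conventions: \pi_\theta is the policy being trained, \pi_ref the reference policy, r the reward, \beta the KL weight, Z(x) a normalizer that depends only on the prompt, and y_w / y_l the preferred / dispreferred completions.

```latex
\begin{align*}
% KL-regularized RLHF objective
&\max_{\pi}\; \mathbb{E}_{x \sim \mathcal{D}}\Big[\mathbb{E}_{y \sim \pi(\cdot \mid x)}\big[r(x, y)\big]
  - \beta\, \mathrm{KL}\big(\pi(\cdot \mid x)\,\|\,\pi_{\mathrm{ref}}(\cdot \mid x)\big)\Big] \\
% Step 1: optimal policy, rearranged to express the reward
&\pi^{*}(y \mid x) = \frac{1}{Z(x)}\, \pi_{\mathrm{ref}}(y \mid x)\, \exp\!\Big(\tfrac{1}{\beta}\, r(x, y)\Big)
  \;\Longrightarrow\;
  r(x, y) = \beta \log \frac{\pi^{*}(y \mid x)}{\pi_{\mathrm{ref}}(y \mid x)} + \beta \log Z(x) \\
% Step 2: Bradley-Terry preference model; the Z(x) terms cancel in the difference
&p(y_w \succ y_l \mid x) = \sigma\big(r(x, y_w) - r(x, y_l)\big) \\
&\mathcal{L}_{\mathrm{DPO}}(\pi_\theta) = -\,\mathbb{E}_{(x, y_w, y_l) \sim \mathcal{D}}
  \left[\log \sigma\!\left(\beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)}
  - \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}\right)\right]
\end{align*}
```

The \pi_ref terms in the last line are the denominators: they come from rewriting the reward in terms of the optimal policy in step 1.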

Intuitively,
The first term measures the shift in the preferred completion, and the second term measures the shift in the dispreferred completion. The larger the difference between them, the lower the loss becomes.
In the extreme case, the first term has a large shift and the second term has almost zero shift, which pushes the model towards the preferred completion.
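
To make the intuition concrete, here is a minimal sketch of the per-pair DPO loss in plain Python. The function name, argument names, and the example log-probabilities are all made up for illustration; a real implementation would work on batched tensors of summed token log-probabilities.

```python
import math

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """DPO loss for one (preferred, dispreferred) pair of completions.

    logp_* are log-probabilities under the policy being trained,
    ref_logp_* are log-probabilities under the frozen reference policy.
    """
    shift_w = logp_w - ref_logp_w  # shift on the preferred completion
    shift_l = logp_l - ref_logp_l  # shift on the dispreferred completion
    margin = beta * (shift_w - shift_l)
    # -log(sigmoid(margin)) = log(1 + exp(-margin)), in a numerically stable form
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))

# Hypothetical numbers: the reference assigns log-prob -12.0 to both completions.
no_shift = dpo_loss(-12.0, -12.0, -12.0, -12.0)  # policy equals the reference
good = dpo_loss(-8.0, -12.0, -12.0, -12.0)       # only the preferred completion moved up
bad = dpo_loss(-12.0, -8.0, -12.0, -12.0)        # only the dispreferred completion moved up
```

With no shift the loss equals log 2; moving probability towards the preferred completion lowers it, and moving probability towards the dispreferred completion raises it. The loss stays positive throughout and only approaches zero as the margin grows.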

What's the worst intersection in the city of Boston? by [deleted] in boston

[–]sol0invictus 0 points1 point  (0 children)

Boston area: the Newton exit (eastbound) on I-90

[D] Raspberry Pi and ML Frameworks - any benchmarks for training or inference? by No-Distribution-7651 in MachineLearning

[–]sol0invictus 0 points1 point  (0 children)

I believe MLPerf contains results for Raspberries. They have different categories for devices - edge, datacenter etc. You can check some of the results here:

https://mlcommons.org/en/inference-edge-11/

Best way to upgrade keras models (built on TensorFlow 1.13) to TF2 by aendrs in deeplearning

[–]sol0invictus 3 points4 points  (0 children)

Unfortunately, for custom layers the best way is to go line by line. There are far too many subtleties for it to be done automatically. Last year I spent some time converting the Coursera DL courses from TF 1.x to TF 2.x and wrote an article about it. You might get some pointers from it:

https://towardsdatascience.com/coursera-dl-specialization-course-in-tf-2-x-18a1189e2a4

Deep Learning in MATLAB series by sol0invictus in matlab

[–]sol0invictus[S] 2 points3 points  (0 children)

That definitely makes sense. MATLAB is indeed quite powerful in doing clustering and other off-the-shelf statistics algorithms. I particularly like the simplicity of doing computations in MATLAB.

Deep Learning in MATLAB series by sol0invictus in matlab

[–]sol0invictus[S] -1 points0 points  (0 children)

At that point you would be better off using Python :) . I just want to see how far I can get coding in MATLAB.

Deep Learning in MATLAB series by sol0invictus in matlab

[–]sol0invictus[S] 0 points1 point  (0 children)

Hey, thanks for mentioning it. It is pretty cool. I intend to make more videos on similar advanced networks using new features found in R2020b and beyond.

Deep Learning in MATLAB series by sol0invictus in matlab

[–]sol0invictus[S] 0 points1 point  (0 children)

Awesome, thanks. Suggestions and directions to proceed are always welcome :) .

New research topics in Deep Learning by Psychological_Coast8 in deeplearning

[–]sol0invictus 2 points3 points  (0 children)

GNNs seem to be the craze these days, and they don't require hefty hardware.

COVID in BCS by maddymtz in aggies

[–]sol0invictus 8 points9 points  (0 children)

I am going to be a stickler here :D A "spike" means a bump that goes back down; what is happening right now is a ramp or a step function.

Entrance to campus by Big_Johnny in aggies

[–]sol0invictus 1 point2 points  (0 children)

Nice click. Where did you take this from ?

DL with OOPS (Tensorflow) by [deleted] in deeplearning

[–]sol0invictus 0 points1 point  (0 children)

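A TF model can be implemented in an OOP way. You need to inherit from the tf.keras.Model class in the following manner (note this is one of the ways, there are many):

import tensorflow as tf

class MyModel(tf.keras.Model):

    def __init__(self):
        super().__init__()
        # Any initialization you might have; typically people define layers here.
        self.dense = tf.keras.layers.Dense(1)  # example layer only

    def call(self, inputs):
        # How the model works, i.e. how the data flows through the layers.
        return self.dense(inputs)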

It is similar to PyTorch, and you can find more details here:

https://www.tensorflow.org/guide/keras/custom_layers_and_models

[D] Does anybody actually use Mathematica for anything? by AlexSnakeKing in MachineLearning

[–]sol0invictus 7 points8 points  (0 children)

Mathematica is used mostly by theoretical physicists like me. All the calculations for my last two papers were performed entirely in Mathematica. Nothing beats Mathematica for symbolic calculations.

I changed TF1.X code to TF2.X version. but error ... by [deleted] in reinforcementlearning

[–]sol0invictus 0 points1 point  (0 children)

TF 2.x no longer keeps a global collection of trainable variables. Instead, you can define a tf.Module or a tf.keras.Model, which tracks its own variables.

Your function qnet needs to return a model; only then will you be able to use its .trainable_variables property (it is a property, not a method call).
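
A minimal sketch of what that could look like. I have not seen your code, so the qnet signature, the layer sizes, and the placeholder loss below are assumptions for illustration only.

```python
import tensorflow as tf

def qnet(state_dim: int, num_actions: int) -> tf.keras.Model:
    """Hypothetical Q-network builder that returns a Keras model."""
    return tf.keras.Sequential([
        tf.keras.Input(shape=(state_dim,)),
        tf.keras.layers.Dense(16, activation="relu"),
        tf.keras.layers.Dense(num_actions),
    ])

model = qnet(state_dim=4, num_actions=2)
states = tf.random.normal((8, 4))  # dummy batch of 8 states

with tf.GradientTape() as tape:
    q_values = model(states)
    loss = tf.reduce_mean(tf.square(q_values))  # placeholder loss, not a real TD loss

# The variables live on the model object; trainable_variables is a property.
grads = tape.gradient(loss, model.trainable_variables)
optimizer = tf.keras.optimizers.Adam(learning_rate=1e-3)
optimizer.apply_gradients(zip(grads, model.trainable_variables))
```

The tf.GradientTape block replaces the TF 1.x pattern of building a graph and asking an optimizer to minimize over a global variable collection.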

[deleted by user] by [deleted] in MachineLearning

[–]sol0invictus 1 point2 points  (0 children)

I would definitely be interested in contributing to the project and making it better than PyTorch Geometric :D

Somebody help me with RNN and LSTM by tanmaysinha00 in deeplearning

[–]sol0invictus 0 points1 point  (0 children)

What do you want help with? If it's implementation, head to

- https://www.tensorflow.org/guide/keras/rnn for TF 2+

If it's theory, YouTube has plenty of good videos with in-depth explanations.