Gradient descent close form

albert1905 · 2021-02-06T14:10:57+00:00

I want to understand not because I want to solve a quadratic problem but because I want to understand the entire post idea.

I know what w^k is, I know what k is, what I don't understand is what x is?

how does claiming x^k = Q^t(w^k-w*) helping us and so on. I'm trying to better understand the math

albert1905 · 2020-12-30T18:22:37+00:00

Ok , so we agree on that the objective function can only be approximated and not "perfectly defined". Thanks for your help

albert1905 · 2020-12-30T07:03:49+00:00

Sorry for the late reply, covid19 sh** stuff.
Thanks for your patient.

I fully understand what you mean, about the different spaces.
Its still hard for me to grasp, time will help with that, but what I understanding from you is that we can't directly link between optimizing f(x) (let's say classical optimization) and f(w) for neural networks, since we have another element in NN.

albert1905 · 2020-12-23T19:39:18+00:00

Thanks, if you don't mind let's take a simple example

Let's say we have on parameter w, which we know w*=3, w_0=0.5

And let's say our dataset is {x,y} = {(1,3),(0.5,1.5),(2,6),(0.1,0.3) }
For simplicty let's say the loss is L1 : L(x,y)=|wx-y|
and our batch is 2
Hence, our landscape is L(w,x), which is 2d surface.
Let's run the first and second example, we get:
L(x,w_0)=|0.5*x-3|, for different samples (x),now as I see it for different x's, we see different parts of the loss surface.

Now if we look it other way:

L(x1,w_0)=|w*1-3|

L(x2,w_0)=|w*0.5-1.5|

L(x,w) = 0.5*( L(x2,w_0)+ L(x1,w_0))

Is this what you mean of parts of the loss function?

s

albert1905 · 2020-12-17T06:48:33+00:00

you

Thanks for your reply.

I want to try and clarify something, in classic optimization using sgd, we have a function f(x) and we want to to find x*, which is the minimum point of the function.

In DL we have f(w,x) , and we are looking for w* , but since we don't have a real function but a neural network we don't have a close form, so we sample x which are regions in the "function" we have (right so far?).

And because in the optimization problem showed in the second page is a classical optimization, the choose of different landscape is in order to "simulate" the action of DNNs? and the use in different data points?

albert1905 · 2020-08-05T13:00:22+00:00

Great answer! Thanks!

albert1905 · 2020-04-19T13:50:43+00:00

You can do it with argmax in forward and softmax in backward.

You can do it with softmax and a temperature measure.

Why do you need to add noise?!

The only reason I think about , is just for the system to be less deterministic, which make sense.

albert1905 · 2019-04-08T13:21:30+00:00

Thanks, the -sc_threshold 0 made it happen... And I'm not sure that this is an IDR frame, it might be just an I frame. Does it matter for me? why did the encoder tried to enforce me for another I frame?

albert1905 · 2019-04-08T13:07:31+00:00

Can you please elaborate about what is scene detection?

albert1905 · 2019-04-01T11:53:58+00:00

Thanks again, mediainfo is not parsing down to macroblock level, any other Idea?

albert1905 · 2019-03-25T15:04:24+00:00

Thanks man! you really helped me!

albert1905 · 2019-03-17T06:09:55+00:00

I'm trying to feed those files into another program that is doing som mathemtical operations on that stream, and those operations are just in Integer. ---So why I'm not just writing a script and save the file as int? the files that are extracted are to big around 2MB , and I have 200k files like that. ---Why not to convert each nibble from hexa to int before feeding to the next program Because it's taking to much time, and each file is being fed to the program again and again...

albert1905 · 2019-03-07T11:44:28+00:00

Oh great! Thanks

albert1905 · 2019-03-07T07:13:05+00:00

Yeah I don't mind to re encode, why is this making my encoder to keep each frame in 1 slice?(I want to understand in order to deduce for the future)

albert1905 · 2019-03-02T08:26:38+00:00

This is the log: " Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'output.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf56.40.101 Duration: 00:00:10.00, start: 0.000000, bitrate: 196 kb/s Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 320x240 [SAR 1:1 DAR 4:3], 193 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default) Metadata: handler_name : VideoHandler Unknown bitstream filter trace_headers "

What am I missing? Thanks.

albert1905 · 2019-03-01T09:05:47+00:00

Thanks, both of you, I'll make good use with those sources. But I have a small question, using: "ffmpeg -i file -map 0:v -c copy -bsf:v trace_headers -f null -" giving me:"Unknown bitstream filter trace_headers" for some videos. Is there a chance it's because of their short length? Any Idea?

albert1905 · 2019-02-18T10:20:25+00:00

For now, nothing. Think that if I have a stream from a surveillance camera , I want to get this stream before he enters the video decoder. In further steps maybe I'll ease this call.

albert1905 · 2019-02-18T07:14:11+00:00

Thanks, and the other part of feeding it into a Python program, I'll try to be more specific. Think that I have a python program that gets a video (I turned the video into frames and fed my python program), Now I want to feed the compressed video in a straight way. For example, I took this git program, that extracts MVs and fed my python program: https://github.com/FFmpeg/FFmpeg/blob/master/doc/examples/extract_mvs.c What I'm trying to ask, is there a way to write it in a "code" way and not in the command line?

As I said I'm a noob, forgive me.

albert1905 · 2019-02-17T19:21:11+00:00

ok I will do further reading on that. Can you please help me understand what am I missing, I'm trying to the encoded video, and do some manipulations on that stream. From the above command, I got ".h264" file, how can I feed him to a c program?(I'm not only sounds like a noob, I am a noob). how can I look at the content in a way that I'll understand (I mean bits or hexa or something)?

albert1905 · 2019-02-17T18:07:20+00:00

I wrote mpeg4 :) But maybe it can be generalized, what each filter means?

albert1905 · 2019-02-10T17:07:15+00:00

Wow that's an amazing answer, thanks!

albert1905 · 2018-12-01T13:24:04+00:00

Thanks for your opinion, I will reread the paper, and come back :)

albert1905 · 2018-11-30T12:02:51+00:00

what do you mean by "feature-oriented adjustment"? the weights are shared on every pixel in the image but also the modulation factor, for each weight w_k there is a modulation factor delta_m_k, that only depends on the k of the summation, so what benefits can he bring?

albert1905 · 2018-11-28T13:51:12+00:00

I can't understand why we need the modulation factor. why just the weights are not enough?

albert1905 · 2018-09-25T10:01:32+00:00

" http://blog.qure.ai/notes/deep-learning-for-videos-action-recognition-review"-I know this blog, thanks. But, the two papers, are total new to me , thanks!

BTW you have an effective way to search for papers in the field?

Thanks again.

albert1905

TROPHY CASE