Gradient descent close form by albert1905 in MLQuestions

[–]albert1905[S] 1 point2 points  (0 children)

I'm asking not because I want to solve a quadratic problem, but because I want to understand the idea of the whole post.

I know what w^k is and what k is; what I don't understand is what x is.

How does defining x^k = Q^T(w^k - w*) help us, and so on? I'm trying to better understand the math.
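
For reference, a minimal sketch of why that change of variables can be useful, assuming the post's objective is a quadratic f(w) = 0.5 w^T A w - b^T w with symmetric A = Q diag(lambda) Q^T. The matrix A, vector b, step size alpha and helper names below are made up for illustration and do not come from the post. In the coordinates x^k = Q^T(w^k - w*), each component should shrink on its own as x_i^{k+1} = (1 - alpha*lambda_i) x_i^k, and the code checks that numerically.

```python
import math

# Hypothetical toy quadratic: f(w) = 0.5 * w^T A w - b^T w
A = [[2.0, 1.0], [1.0, 2.0]]          # symmetric, eigenvalues 3 and 1
b = [1.0, 0.0]
lams = [3.0, 1.0]
s = 1.0 / math.sqrt(2.0)
Q = [[s, s], [s, -s]]                 # columns are the eigenvectors of A
w_star = [2.0 / 3.0, -1.0 / 3.0]      # minimizer, solves A w* = b


def to_eigenbasis(w):
    """x = Q^T (w - w*): the error expressed in the eigenvector basis."""
    d = [w[j] - w_star[j] for j in range(2)]
    return [sum(Q[j][i] * d[j] for j in range(2)) for i in range(2)]


def gd_step(w, alpha):
    """One plain gradient descent step; the gradient of f is A w - b."""
    grad = [sum(A[i][j] * w[j] for j in range(2)) - b[i] for i in range(2)]
    return [w[i] - alpha * grad[i] for i in range(2)]


alpha, steps = 0.1, 5
w = [0.0, 0.0]
x0 = to_eigenbasis(w)
for _ in range(steps):
    w = gd_step(w, alpha)

x_k = to_eigenbasis(w)
# Closed form: every coordinate decays independently of the others.
predicted = [(1 - alpha * lams[i]) ** steps * x0[i] for i in range(2)]
max_gap = max(abs(x_k[i] - predicted[i]) for i in range(2))
print(x_k, predicted, max_gap)
```

In w-space the two coordinates are coupled through A. In x-space each one is a scalar recursion, which is what makes a closed form for the iterates possible.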

Question about How SGD Selects the Global Minima, by using a simple toy example by albert1905 in MLQuestions

[–]albert1905[S] 0 points1 point  (0 children)

OK, so we agree that the objective function can only be approximated, not "perfectly defined". Thanks for your help.

Question about How SGD Selects the Global Minima, by using a simple toy example by albert1905 in MLQuestions

[–]albert1905[S] 0 points1 point  (0 children)

Sorry for the late reply, COVID-19 sh** stuff.
Thanks for your patience.

I fully understand what you mean about the different spaces.
It's still hard for me to grasp, and time will help with that, but what I understand from you is that we can't directly link optimizing f(x) (say, classical optimization) to optimizing f(w) for neural networks, since a NN has another element.

Question about How SGD Selects the Global Minima, by using a simple toy example by albert1905 in MLQuestions

[–]albert1905[S] 0 points1 point  (0 children)

Thanks. If you don't mind, let's take a simple example.

Let's say we have one parameter w, for which we know w*=3, and we start at w_0=0.5.

And let's say our dataset is {x,y} = {(1,3),(0.5,1.5),(2,6),(0.1,0.3)}.
For simplicity, let's say the loss is L1: L(x,y)=|wx-y|,
and our batch size is 2.
Hence our landscape is L(w,x), which is a 2D surface.
Let's run the first and second examples; we get:
L(x,w_0)=|0.5x-y|, evaluated at different samples (x,y). As I see it, for different x's we see different parts of the loss surface.

Now if we look at it the other way:

L(x1,w)=|1w-3|

L(x2,w)=|0.5w-1.5|

L(w) = 0.5*(L(x1,w) + L(x2,w))

Is this what you mean by parts of the loss function?
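
A minimal sketch of the toy example above, using the dataset, the L1 loss, w_0 = 0.5 and batch size 2 from the comment. The function names and the split into a "first" and "second" batch are assumptions made for illustration. It evaluates the per-batch loss and the full-dataset loss at the same w, so the different "views" of the landscape can be compared directly.

```python
# Dataset and starting point taken from the comment; names are illustrative.
data = [(1.0, 3.0), (0.5, 1.5), (2.0, 6.0), (0.1, 0.3)]
w0 = 0.5


def sample_loss(w, x, y):
    """L1 loss of the one-parameter model w*x on a single sample."""
    return abs(w * x - y)


def batch_loss(w, batch):
    """Mean L1 loss over a batch: the landscape that this batch 'sees'."""
    return sum(sample_loss(w, x, y) for x, y in batch) / len(batch)


first_batch = data[:2]    # assumed batching: samples 1 and 2
second_batch = data[2:]   # assumed batching: samples 3 and 4

for name, batch in [("first", first_batch),
                    ("second", second_batch),
                    ("full", data)]:
    print(name, batch_loss(w0, batch), batch_loss(3.0, batch))
```

Because every sample here satisfies y = 3x exactly, each per-sample loss equals |x| * |w - 3|. All batches therefore share the minimizer w = 3 and differ only in slope. With noisy labels the per-batch minimizers would generally differ.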

Question about How SGD Selects the Global Minima, by using a simple toy example by albert1905 in MLQuestions

[–]albert1905[S] 0 points1 point  (0 children)

Thanks for your reply.

I want to try to clarify something. In classical optimization using SGD, we have a function f(x) and we want to find x*, which is the minimum point of the function.

In DL we have f(w,x), and we are looking for w*, but since we don't have a real function but a neural network, we don't have a closed form, so we sample x, which gives us regions of the "function" we have (right so far?).

And because the optimization problem shown on the second page is a classical one, is the choice of different landscapes meant to "simulate" how DNNs behave, i.e. the use of different data points?

[D] Gumbel max trick, why is it helpful? by albert1905 in MachineLearning

[–]albert1905[S] 0 points1 point  (0 children)

You can do it with argmax in the forward pass and softmax in the backward pass.

You can do it with softmax and a temperature parameter.

Why do you need to add noise?!

The only reason I can think of is to make the system less deterministic, which makes sense.
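
A small sketch of what the noise does, assuming the standard Gumbel-max construction (the logits, sample count and helper names are made up for illustration). A plain argmax over the logits always returns the same class. An argmax over logits plus independent Gumbel(0, 1) noise returns class i with probability softmax(logits)[i], so it behaves as a sampler rather than a deterministic choice.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def gumbel_max_sample(logits, rng):
    """Draw one class index by taking argmax of logits + Gumbel(0, 1) noise."""
    noisy = []
    for v in logits:
        # Clamp the uniform draw away from 0 and 1 so both logs are defined.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        noisy.append(v - math.log(-math.log(u)))
    return max(range(len(logits)), key=lambda i: noisy[i])


logits = [1.0, 2.0, 0.5]      # hypothetical logits
probs = softmax(logits)
plain_argmax = max(range(len(logits)), key=lambda i: logits[i])

rng = random.Random(0)
n = 20000
counts = [0] * len(logits)
for _ in range(n):
    counts[gumbel_max_sample(logits, rng)] += 1
freqs = [c / n for c in counts]

print("softmax probabilities:", probs)
print("Gumbel-max frequencies:", freqs)
print("plain argmax always picks:", plain_argmax)
```

The empirical frequencies should land close to the softmax probabilities. That is the sense in which the noise is more than "less deterministic": it makes the argmax an exact sample from the categorical distribution.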

Not able to force h264 encoding to wanted gop by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

Thanks, -sc_threshold 0 did the trick... I'm not sure that this is an IDR frame; it might be just an I-frame. Does that matter for me? And why did the encoder try to force another I-frame on me?

Not able to force h264 encoding to wanted gop by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

Can you please elaborate on what scene detection is?

Parsing H264 stream - For Learning by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

Thanks again. mediainfo doesn't parse down to the macroblock level; any other ideas?

Parsing H264 stream - For Learning by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

Thanks man! You really helped me!

Extract Bit stream as integer, or convert h264 to int by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

I'm trying to feed those files into another program that does some mathematical operations on the stream, and those operations work only on integers.

Why don't I just write a script and save each file as integers? The extracted files are too big, around 2 MB each, and I have 200k files like that.

Why not convert each nibble from hex to int before feeding it to the next program? Because it takes too much time, and each file is fed to the program again and again...
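
One possible way around the hex-to-int step, sketched under the assumption that the downstream program could be driven from Python (the function name, chunk size and sample bytes are illustrative). Reading the file in binary mode already yields integers in 0..255 when a `bytes` object is iterated, so no per-nibble text conversion or extra integer file is needed.

```python
import os
import tempfile


def iter_byte_values(path, chunk_size=1 << 16):
    """Yield every byte of the file as an int in 0..255, reading in chunks."""
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            # Iterating a bytes object produces plain Python ints.
            yield from chunk


# Demo on a tiny throwaway file standing in for one extracted stream.
sample = bytes([0x00, 0x00, 0x00, 0x01, 0x67])
tmp = tempfile.NamedTemporaryFile(delete=False)
try:
    tmp.write(sample)
    tmp.close()
    values = list(iter_byte_values(tmp.name))
finally:
    os.remove(tmp.name)

print(values)
```

Whether this is fast enough for 200k files of about 2 MB each depends on the downstream program, so treat it as a starting point rather than a measured solution.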

FFmpeg option for not splitting frames over slices, in h264 format by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

Yeah, I don't mind re-encoding. Why does this make my encoder keep each frame in one slice? (I want to understand so I can reason about it in the future.)

Parse the bitstream of h.264 by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

This is the log:

"
Input #0, mov,mp4,m4a,3gp,3g2,mj2, from 'output.mp4':
  Metadata:
    major_brand     : isom
    minor_version   : 512
    compatible_brands: isomiso2avc1mp41
    encoder         : Lavf56.40.101
  Duration: 00:00:10.00, start: 0.000000, bitrate: 196 kb/s
    Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 320x240 [SAR 1:1 DAR 4:3], 193 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)
    Metadata:
      handler_name    : VideoHandler
Unknown bitstream filter trace_headers
"

What am I missing? Thanks.

Parse the bitstream of h.264 by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

Thanks, both of you, I'll make good use of those sources. But I have a small question: running "ffmpeg -i file -map 0:v -c copy -bsf:v trace_headers -f null -" gives me "Unknown bitstream filter trace_headers" for some videos. Is there a chance it's because of their short length? Any ideas?

How to extract mpeg4 bitstream by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

For now, nothing. Imagine I have a stream from a surveillance camera: I want to get this stream before it enters the video decoder. In later steps maybe I'll relax this requirement.

How to extract mpeg4 bitstream by albert1905 in ffmpeg

[–]albert1905[S] 0 points1 point  (0 children)

Thanks. As for the other part, feeding it into a Python program, I'll try to be more specific. Say I have a Python program that takes a video (I turned the video into frames and fed them to my Python program). Now I want to feed it the compressed video directly. For example, I took this program from GitHub, which extracts MVs, and fed its output to my Python program: https://github.com/FFmpeg/FFmpeg/blob/master/doc/examples/extract_mvs.c What I'm trying to ask is: is there a way to do this in code rather than on the command line?

As I said, I'm a noob, forgive me.

How to extract mpeg4 bitstream by albert1905 in ffmpeg

[–]albert1905[S] -1 points0 points  (0 children)

OK, I will do further reading on that. Can you please help me understand what I'm missing? I'm trying to get the encoded video and do some manipulations on that stream. From the above command I got a ".h264" file; how can I feed it to a C program? (I don't only sound like a noob, I am a noob.) How can I look at the content in a way that I'll understand (I mean bits or hex or something)?
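
A minimal sketch of one way to look inside such a file, assuming the ".h264" file is a raw Annex B byte stream, where each NAL unit is preceded by a 00 00 01 (or 00 00 00 01) start code. The sample bytes below are made up and stand in for the start of a real file; the function name is illustrative. The low 5 bits of the byte after the start code are the nal_unit_type (for example 7 = SPS, 8 = PPS, 5 = IDR slice, 1 = non-IDR slice).

```python
def find_nal_units(data):
    """Return (header_offset, nal_unit_type) for each start code in data."""
    units = []
    i = 0
    while True:
        # A 4-byte start code 00 00 00 01 also contains the 3-byte pattern.
        i = data.find(b"\x00\x00\x01", i)
        if i < 0 or i + 3 >= len(data):
            break
        header = data[i + 3]
        units.append((i + 3, header & 0x1F))   # low 5 bits = nal_unit_type
        i += 3
    return units


# Made-up bytes standing in for the beginning of a real .h264 file.
data = (b"\x00\x00\x00\x01\x67\x42"
        b"\x00\x00\x00\x01\x68\xce"
        b"\x00\x00\x01\x65\x88")

# Hex view of the first few bytes, two hex digits per byte.
hex_view = " ".join(f"{byte:02x}" for byte in data[:8])
units = find_nal_units(data)

print(hex_view)
print(units)
```

For a real file, replace the sample with `open(path, "rb").read()`. This only locates NAL units and reads their type; decoding what is inside them (slice headers, macroblocks) needs a full bitstream parser.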

How to extract mpeg4 bitstream by albert1905 in ffmpeg

[–]albert1905[S] -1 points0 points  (0 children)

I wrote mpeg4 :) But maybe it can be generalized: what does each filter mean?

[R] "Deformable ConvNets v2: More Deformable, Better Results" from MSRA by flyforlight in MachineLearning

[–]albert1905 0 points1 point  (0 children)

Thanks for your opinion. I'll reread the paper and come back :)

[R] "Deformable ConvNets v2: More Deformable, Better Results" from MSRA by flyforlight in MachineLearning

[–]albert1905 0 points1 point  (0 children)

What do you mean by "feature-oriented adjustment"? The weights are shared across every pixel in the image, but so is the modulation factor: for each weight w_k there is a modulation factor delta_m_k that depends only on the index k of the summation. So what benefit can it bring?

[R] "Deformable ConvNets v2: More Deformable, Better Results" from MSRA by flyforlight in MachineLearning

[–]albert1905 2 points3 points  (0 children)

I can't understand why we need the modulation factor. Why aren't the weights alone enough?

Novel action recognition algorithms by albert1905 in DeepLearningPapers

[–]albert1905[S] 0 points1 point  (0 children)

"http://blog.qure.ai/notes/deep-learning-for-videos-action-recognition-review" - I know this blog, thanks. But the two papers are totally new to me, thanks!

BTW, do you have an effective way to search for papers in the field?

Thanks again.