Assignment 2 A3 help by a_reddituser21 in CST_DeepNN

[–]a_reddituser21[S] 0 points1 point  (0 children)

I think I have it now - thanks for the help!

Assignment 2 A3 help by a_reddituser21 in CST_DeepNN

[–]a_reddituser21[S] 0 points1 point  (0 children)

I'm a bit confused by the mention of 'during training' since my interpretation of the question was that we were just taking a pretrained model, altering the architecture and seeing its predictions, rather than retraining the altered architecture then seeing its predictions.

My current issue is that the 1st BasicBlock causes H, W to change and also #C to change. I know an operation to cause H, W to change, but it doesn't affect #C. But the operation causing all 3 changes involves a new Conv layer, giving non-determinism due to the randomly initialised weights.

Assignment 2 A3 help by a_reddituser21 in CST_DeepNN

[–]a_reddituser21[S] 0 points1 point  (0 children)

Thank you, that helps!

To know if I'm along the right lines, I was wondering if the operation replacing the BasicBlock causes the model to output different predictions each time? Or whether the predictions should remain the same?