Deep Learning: Conv Nets
Assessment • Josiah Wang • Mathematics • University • 36 plays • Hard
10 questions
1.
Multiple Choice
How is shift invariance achieved in ConvNets?
Through convolutional equivariance
Through convolutional equivariance and approximate translation invariance with pooling
Through convolutional equivariance and exact pooling invariance
They exist in a higher dimensional invariant space
Answer explanation
The convolutional layers are shift equivariant. If an input image is shifted a little bit, the convolutional filters will produce the same response at the shifted location. The pooling layers are approximately shift invariant. For example, if an input image is shifted a little bit under a max pooling layer, the maximum value will still be the same. Overall, given an input image x, a shift S, a shift equivariant convolutional layer f, and a shift invariant pooling layer g, the ConvNet g(f(x)) is shift invariant because g(f(Sx)) = g(Sf(x)) = g(f(x)). (see Note02)
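A minimal numpy sketch of this argument, using a 1-D signal, a hand-picked filter, and a global max pool as the shift-invariant layer g (in a real ConvNet, local max pooling makes the last step only approximately true for small shifts); all values are illustrative:

```python
import numpy as np

def conv1d(x, w):
    # shift-equivariant layer f: 'valid' cross-correlation of signal x with filter w
    return np.array([np.dot(x[i:i + len(w)], w) for i in range(len(x) - len(w) + 1)])

x = np.array([0., 0., 1., 3., 2., 0., 0., 0.])   # input signal
Sx = np.roll(x, 1)   # np.roll stands in for the shift S; the borders are zero, so wrap-around is harmless
w = np.array([1., 2., 1.])                        # convolutional filter

fx, fSx = conv1d(x, w), conv1d(Sx, w)

# Equivariance: f(Sx) is f(x) shifted by the same amount
print(fSx)             # [0. 1. 5. 9. 7. 2.]
print(np.roll(fx, 1))  # [0. 1. 5. 9. 7. 2.]

# Invariance after pooling g (a global max pool here): g(f(Sx)) == g(f(x))
print(fx.max(), fSx.max())  # 9.0 9.0
```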
2.
Multiple Select
Why do we include dropout in the network architecture?
Offers regularization and helps build deeper networks
Can help with uncertainty estimation through Monte-Carlo use
Increases the capacity of the model
Prevents vanishing gradients
None of these
Answer explanation
Dropout randomly removes connections between neurons in neural networks. It offers regularization by preventing neurons from relying on the presence of specific other features (often termed co-adaptation), helping reduce overfitting. In addition, Monte Carlo dropout can be used as a Bayesian approximation to estimate uncertainties in neural networks (see https://arxiv.org/abs/1506.02142).
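A short PyTorch sketch of both uses, with a hypothetical small classifier: dropout regularizes during training, and keeping it active at test time while averaging several stochastic forward passes gives a Monte Carlo dropout uncertainty estimate in the spirit of the linked paper:

```python
import torch
import torch.nn as nn

# Hypothetical classifier with dropout for regularization during training
model = nn.Sequential(
    nn.Linear(20, 64), nn.ReLU(),
    nn.Dropout(p=0.5),          # randomly zeroes activations, discouraging co-adaptation
    nn.Linear(64, 3),
)

x = torch.randn(1, 20)

# Monte Carlo dropout: keep dropout active at test time and average T stochastic passes
model.train()                   # leaves dropout switched on
with torch.no_grad():
    samples = torch.stack([model(x).softmax(dim=-1) for _ in range(100)])

mean_prob = samples.mean(dim=0)   # predictive mean over the stochastic passes
std_prob = samples.std(dim=0)     # spread across passes ~ uncertainty estimate
print(mean_prob, std_prob)
```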
3.
Multiple Choice
Model Ensembling is:
Having multiple instances of the network(s) and averaging their responses
Having a single instance of the network and passing the input through multiple times, each time altered in a small way
The perfect string quartet
None of the above
4.
Multiple Select
Which of the following activation functions helps with the vanishing gradients problem?
Sigmoid
Tanh
ReLU
SELU
Softmax
Answer explanation
Sigmoid and tanh activations have gradients bounded above by 1 (at most 0.25 for the sigmoid), and large parts of their input range are saturated regions that yield gradients close to zero. As backpropagation applies the chain rule, repeatedly multiplying partial derivatives layer by layer, the gradients passed back become vanishingly small. This does not occur with ReLUs, for example, as the gradient is either 0 or 1.
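A small PyTorch sketch that makes this concrete by checking the largest gradient each of these activations can pass back (a sketch only; the exact SELU value depends on its fixed scale constant):

```python
import torch

x = torch.linspace(-6, 6, 1001, requires_grad=True)

for name, act in [("sigmoid", torch.sigmoid), ("tanh", torch.tanh),
                  ("relu", torch.relu), ("selu", torch.selu)]:
    y = act(x)
    # d(act)/dx at every point, obtained by backpropagating through the activation
    grad, = torch.autograd.grad(y.sum(), x)
    print(f"{name:8s} max gradient = {grad.max():.3f}")
# sigmoid  max gradient = 0.250   <- at most 1/4 per layer, so products shrink quickly
# tanh     max gradient = 1.000   <- 1 only near 0, much smaller once saturated
# relu     max gradient = 1.000   <- exactly 1 for all positive inputs
# selu     max gradient ~ 1.051   <- slightly above 1 on the positive branch
```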
5.
Multiple Choice
True or False: two 3x3 convolutional layers have the same receptive field as one 5x5 convolutional layer, result in more non-linearities, and require fewer weights.
True
False
Answer explanation
True. Stacking two 3x3 layers gives each output a 5x5 receptive field on the input, applies two non-linearities instead of one, and needs 2 x (3 x 3) = 18 weights per input-output channel pair instead of 25.
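A quick PyTorch check of the weight counts, assuming 64 input and output channels throughout and no bias terms (both figures scale with the square of the channel count):

```python
import torch.nn as nn

c = 64  # hypothetical channel count, kept the same throughout

stacked = nn.Sequential(            # two 3x3 layers -> 5x5 receptive field
    nn.Conv2d(c, c, 3, padding=1, bias=False), nn.ReLU(),
    nn.Conv2d(c, c, 3, padding=1, bias=False),
)
single = nn.Conv2d(c, c, 5, padding=2, bias=False)   # one 5x5 layer

count = lambda m: sum(p.numel() for p in m.parameters())
print(count(stacked), count(single))   # 2*9*c*c = 73728  vs  25*c*c = 102400
```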
6.
Multiple Choice
What causes vanishing gradients?
The Wizard Merlin
Large changes in X cause small changes in Y
Large changes in Y cause small changes in X
ReLU activations 'dying'
Answer explanation
Vanishing gradients occur when the derivative of a function becomes very close to zero, meaning large changes in the input (X) cause only small changes in the output (Y). This is a problem because backpropagation computes the derivatives of the error with respect to the weights, so if those derivatives are very small the parameters barely change and the error stops decreasing.
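A tiny numeric sketch of why this compounds with depth: by the chain rule, the gradient reaching early layers is a product of per-layer derivatives, so even the sigmoid's best-case factor of 0.25 shrinks it towards zero:

```python
per_layer = 0.25   # best-case derivative a sigmoid activation can contribute
for depth in (5, 10, 20, 50):
    print(depth, per_layer ** depth)
# 5   ~1e-03
# 10  ~9.5e-07
# 20  ~9.1e-13
# 50  ~7.9e-31  -> early-layer weights receive almost no gradient signal
```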