Convolutional Layer

Convolutional Layer

Convolutional layer is the core building block of a convolutional neural network. The layer’s parameters consist of a set of learnable filters (or kernels), which have a small receptive field, but extend through the full depth of the input volume. During the forward pass, each filter is convolved across the width and height of the input volume: where is the -th feature map in the -th layer, is the -th feature map in the -th layer, is the kernel, and is the kernel size. We call the stride of the convolution the number of pixels by which we move the filter at each step. |300

Receptive Field

The receptive field of a neuron is the area in the input space that can affect the neuron’s output. In a convolutional neural network, the receptive field of a neuron in layer is the area in the input space that can affect the neuron’s output in layer .

Pooling Layer

Pooling layer is a special convolutional layer used to reduce the spatial dimensions of the input volume.

Max Pooling

The most common pooling operation is max pooling, which partitions the input volume into a set of non-overlapping rectangles and, for each such sub-region, outputs the maximum value. |350

Proposition

If convolutional filters have size and stride , and pooling layer has pools of size , then each unit in the pooling layer has a receptive field of size: .

Convolutional Neural Network

A convolutional neural network (CNN) is a neural network whose hidden layers contain at least one convolutional layer, and some pooling layers. CNNs are particularly effective for image recognition tasks.