Perceptrons, backpropagation and deep architectures
A perceptron computes a weighted sum of inputs passed through an activation function. Stacking several layers of neurons forms a multilayer perceptron able to approximate complex non-linear functions.
A network learns by measuring a loss, propagating its gradient backward through the layers using the chain rule, and updating the weights with gradient descent over many epochs.
Convolutional neural networks use convolution and pooling to excel at images, while recurrent networks and transformers process sequences such as text and time series. Dropout and regularisation curb overfitting.
This unit introduced neural networks from the perceptron to multilayer and deep architectures, the backpropagation training procedure, and the convolutional and recurrent models used in modern deep learning.