Online Learning Dynamics of ReLU Neural Networks

Name: Online Learning Dynamics of ReLU Neural Networks
Start: 2019-03-29T14:30:00+01:00
Location: University of Groningen

Abstract

The rectifier activation function (Rectified Linear Unit: ReLU) has become popular in deep learning applications, mostly because the activation function often yields better performance than sigmoidal activation functions. Although there are known advantages of using ReLU, there is still a lack of mathematical arguments that explain why ReLU networks have the ability to learn faster and show better performance. In this project, the Statistical Physics of Learning framework is used to derive an exact mathematical description of the learning dynamics of the ReLU perceptron and ReLU Soft Committee Machines. The mathematical description consists of a system of ordinary differential equations that describe the evolution of so-called order parameters, which summarize the state of the network relative to the target rule. The correctness of the theoretical results is verified with simulations and several learning scenarios will be discussed.

Date

Mar 29, 2019 2:30 PM

Event

Second International Workshop on Intelligent Systems and Computational Intelligence - WISCI 2019

Location

University of Groningen

Michiel Straat

Postdoctoral Research Group Leader “Lifelong Machine Learning for Physical Systems”

My research interests include Machine Learning for Physical Systems and the theory of Neural Networks.