Daten exportieren

 

Forschungsprojekt ::
Stability and Solvability in Deep Learning

Projektbeschreibung

One of the challenges of the current digital era is automated decision making, which is often based on classification. For example, in image classification the task of the computer is to identify which objects are shown in a given image. This is too complicated of a task to solve by explicitly writing a program. Therefore, one uses artificial intelligence approaches that learn to solve the given task based on a set of training samples. In the last decade, the cutting edge technique for this in fields like imagine classification and game intelligence has been Deep Learning, the heuristic adaptation of weights of large neural networks based on training samples. These weights determine the strength of the connection between the different neurons of the network. The standard procedure for training the network weights is to initialize the weights randomly and then iteratively adapt them by minimizing (via the method of steepest descent, more formally called gradient descent) the error made by the network on the given training samples.Empirically, this approach outperforms all classical classification methods. Formally though, the mathematical understanding of Deep Learning is far from complete. An example of a not fully understood phenomenon with significant practical impact is the instability of neural networks with respect to very small changes in the inputs. For instance, a network might correctly classify an image of a cat as depicting a cat but when a small perturbation (imperceptible to a human) is applied to the picture, the same network will classify it as depicting a dog.The goal of this project is to investigate and formalize this phenomenon, focusing on the following three points:1. Mathematically prove that the current state of the art training methods necessarily induce this instability as a side effect and mathematically study the most widely used algorithms for generating perturbations that lead to misclassification.2. Examine partially successful existing ideas to mitigate this instability. Propose a new training dogma with stability guarantees that do not affect the classification accuracy.3. Analyze the computability of this and other training methods. In full generality, the problem of minimizing the error on the training sample is not computable, however empirical evidence suggests that neural networks can be trained successfully using computers. This warrants a more precise investigation regarding what conditions guarantee computability.

Angaben zum Forschungsprojekt

Beginn des Projekts:2021
Ende des Projekts:30. September 2025
Projektstatus:laufend
Projektleitung:Voigtlaender, Prof. Dr. Felix
Beteiligte Personen:Geuchen, Paul
Lehrstuhl/Institution:
Finanzierung des Projekts:Begutachtete Drittmittel
Geldgeber:Deutsche Forschungsgemeinschaft (DFG)
Projektpartner:
  • Prof. Dr. Felix Krahmer, Technische Universität München, München
  • Prof. Dr. Anders Hansen, University of Cambridge, Cambridge (England)
  • Prof. Dr. Philipp Petersen, Universität Wien, Wien (Österreich)
  • Prof. Dr. Sjoerd Dirksen, Utrecht University, Utrecht (Niederlande)
Schlagwörter:Mathrmatics of deep learning, Adversarial examples, computability
Projekttyp:Grundlagenforschung
Link zu Gepris:https://gepris.dfg.de/gepris/projekt/448518204?lan...
Fördernummer:448518204
Projekt-ID:3299
Eingestellt am: 13. Feb 2023 15:25
Letzte Änderung: 20. Jul 2023 03:35
URL zu dieser Anzeige: https://fordoc.ku.de/id/eprint/3299/
Analytics