Description
According to the predictive coding theory in neuroscience, the brain works like a hierarchical generative model, continuously predicting incoming stimuli. Learning and inference are achieved by minimizing the difference between predicted and actual stimuli [1].
Computational models of predictive coding, known as predictive coding networks (PCNs), encapsulate this concept within a mathematical structure. In the context of image classification, these models approach prediction as a task of reconstruction, aiming to infer an input image's class label by attempting its reconstruction from that label. Nevertheless, PCNs typically excel in either reconstruction or classification tasks, but not both [2].
Recent work in our group suggests that classification- and reconstruction-driven information must be traded off when integrated into shared representation in deep learning architectures [3]. This trade-off effect suggests that the observed specialization in PCNs might stem from the inherent trade-off between these two types of information, potentially elucidating their limitations in performing both tasks proficiently.
This project aims to explore the potential occurrence of the trade-off effect in PCNs. The candidate will implement a PCN in the line of [4] and analyze the training dynamics from the perspective of the classification-reconstruction trade-off. Additionally, the candidate will implement recently suggested alterations of PCNs and, again, evaluate them from the classification-reconstruction trade-off perspective [2, 5, 6, 7].
Requirements:
- Interest in machine learning and computational neuroscience
- Programming experience with Python and preferably a deep learning framework, e.g., PyTorch
Supervision: Jan Rathjens and Prof. Dr. Laurenz Wiskott
Contact: jan.rathjens@ini.rub.de
[1] https://arxiv.org/abs/2107.12979
[2]https://pubmed.ncbi.nlm.nih.gov/32795234/
[3]https://arxiv.org/abs/2401.09237
[4]https://osf.io/preprints/psyarxiv/4hb58
[5]https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011280