LLCs and Unlearning

Investigating how unlearning procedures like LEACE affect the Local Learning Coefficient.

Type: Applied

Difficulty: Medium

Status: Unstarted

This project aims to investigate how unlearning procedures, particularly LEACE (Linear Concept Erasure), affect the Local Learning Coefficient (LLC) of neural networks. Understanding the impact of unlearning on the LLC could provide insights into how these procedures affect model complexity and internal structure.

Key research questions:

How do unlearning procedures like LEACE change the LLC of a model?
Can changes in the LLC during unlearning reveal information about the concepts being erased?
How does the impact of unlearning on the LLC vary across different model architectures and sizes?
Can LLC analysis be used to improve or validate unlearning procedures?

Methodology:

Implement LEACE and other unlearning procedures for a variety of model architectures.
Estimate LLCs before, during, and after the unlearning process.
Analyze how different types of concept erasure affect the LLC.
Investigate whether changes in the LLC correlate with successful concept erasure.
Compare the impact of unlearning on the LLC across different model architectures and sizes.

Expected outcomes:

Empirical data on how unlearning procedures affect the LLC.
Insights into the relationship between concept erasure and model complexity as measured by the LLC.
Potential development of LLC-based metrics for evaluating the effectiveness of unlearning procedures.
Improved understanding of how unlearning affects the internal structure of neural networks.

This research could provide valuable insights into the nature of unlearning and its effects on model complexity, potentially leading to improved techniques for concept erasure and model editing.

Where to begin:

If you have decided to start working on this, please let us know in the Discord. We'll update this listing so that other people who are interested in this project can find you.