Automated Mechanistic Interpretability of Generative AI Models Trained on Biological Data
Develop automated approaches for mechanistic interpretability of virtual
neural network models trained on biological state data. This would enable the
extraction of mechanistic insights from predictive models, potentially
informing both basic science and therapeutic design.
Resources (1)
Advances in interpretability and our understanding of neural computation
Whitepapers and Essays