Schedule
-
EventDateDescriptionCourse Material
-
Lecture07/30/2024
Tuesday(xml-1) Introduction[slides]Suggested Readings:
- Ch-3.1 from Interpretable Machine Learning book by Christoph Molnar
- Towards A Rigorous Science of Interpretable Machine Learning, Doshi-Velez and Kim, 2017
- Interpretable machine learning: definitions, methods, and applications, Murdoch et al. 2019
- Explanation in artificial intelligence: Insights from the social sciences, Tim Miller, 2019
- Examples are not Enough, Learn to Criticize! Criticism for Interpretability, Kim et al. 2016
-
Lecture08/06/2024
Tuesday(xml-2) Taxonomy, Scope, and Evaluation of Explainability[slides]Suggested Readings:
-
Lecture08/09/2024
Friday(xml-3) Inherently Interpretable Models[slides] -
Lecture08/13/2024
Tuesday(xml-4) Local Model-Agnostic Methods - LIME.[slides]Suggested Readings:
-
Lecture08/16/2024
Friday(xml-5) Local Model-Agnostic Methods - Counterfactual Explanations.[slides] -
Lecture08/16/2024
Friday(xml-6) Interpreting Neural Networks (Feature Visualization)[slides]Suggested Readings:
-
Lecture08/30/2024
Friday(xml-7) Interpreting Neural Networks (Pixel Attribution)[slides]Suggested Readings:
- Visualising image classification models and saliency maps (ICLRW-2012)
- Visualizing and understanding CNNs (ECCV-2013)
- SmoothGrad (2017)
- Interpreting DNNs is fragile (AAAI-2019)
- Sanity Checks for Saliency Maps (NeurIPS-2018)
- The (un) reliability of saliency methods (Explainable AI-2019)
- Sanity Checks for Saliency Metrics (AAAI-2020)
-
Lecture09/03/2024
Tuesday(xml-8) Interpreting Neural Networks (Concept-based Explanations)[slides] -
Lecture09/06/2024
Friday(xml-9) Attention and Explanations[slides]Suggested Readings: