Schedule
-
EventDateDescriptionCourse Material
-
Lecture08/07/2023
Monday(xml-0) Introduction[slides]Suggested Readings:
- Ch-3.1 from Interpretable Machine Learning book by Christoph Molnar
- Towards A Rigorous Science of Interpretable Machine Learning, Doshi-Velez and Kim, 2017
- Interpretable machine learning: definitions, methods, and applications, Murdoch et al. 2019
- Explanation in artificial intelligence: Insights from the social sciences, Tim Miller, 2019
- Examples are not Enough, Learn to Criticize! Criticism for Interpretability, Kim et al. 2016
-
Lecture08/09/2023
Wednesday(xml-1) Taxonomy, Scope, and Evaluation of Explainability[slides]Suggested Readings:
-
Lecture08/10/2023
Thursday(xml-2) Inherently Interpretable Models (Linear and Logistic Regression)[slides] -
Lecture08/14/2023
Monday(xml-3) Inherently Interpretable Models (GLM, GLA, DT)[slides]Suggested Readings:
-
Lecture08/21/2023
Monday(xml-4) Model-Agnostic Methods - PDP and ALE[slides] -
Lecture08/23/2023
Wednesday(xml-5) Model-Agnostic Methods - feature interactions, surrogate, etc.[slides]Suggested Readings:
-
Lecture08/24/2023
Thursday(xml-6) Local Model-Agnostic Methods - LIME.[slides]Suggested Readings:
-
Lecture08/31/2023
Thursday(xml-7) Local Model-Agnostic Methods - Counterfactual Explanations.[slides] -
Lecture09/07/2023
Thursday(xml-8) Interpreting Neural Networks (Feature Visualization)[slides]Suggested Readings:
-
Lecture09/14/2023
Thursday(xml-9) Interpreting Neural Networks (Pixel Attribution)[slides] -
Lecture09/21/2023
Thursday(xml-10) Interpreting Neural Networks (Concept-based Explanations)[slides] -
Lecture11/06/2023
Monday(xml-11) Causal Inference and Explainability[slides] -
Lecture11/08/2023
Wednesday(xml-12) Attention and Explanations[slides]Suggested Readings:
-
Lecture11/09/2023
Thursday(xml-13) Generating Robust Counterfactuals (Algorithmic Recourse)[slides]Suggested Readings:
-
Lecture11/16/2023
Thursday(xml-14) Causal Antehoc Explanations[slides]Suggested Readings:
-
Lecture11/20/2023
Monday(xml-15) Mechanistic Interpretability[slides]Suggested Readings: