Skip to main content

Showing 1–1 of 1 results for author: Blackwell, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.02933  [pdf, other

    cs.LG cs.CY cs.HC

    InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts

    Authors: Vinitra Swamy, Syrielle Montariol, Julian Blackwell, Jibril Frej, Martin Jaggi, Tanja Käser

    Abstract: Interpretability for neural networks is a trade-off between three key requirements: 1) faithfulness of the explanation (i.e., how perfectly it explains the prediction), 2) understandability of the explanation by humans, and 3) model performance. Most existing methods compromise one or more of these requirements; e.g., post-hoc approaches provide limited faithfulness, automatically identified featu… ▽ More

    Submitted 29 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.