Bayesian networks for interpretable machine learning and optimization
[摘要] As artificial intelligence is being increasingly used for high-stakes applications, it is becoming more and more important that the models used be interpretable. Bayesian networks offer a paradigm for inter-pretable artificial intelligence that is based on probability theory. They provide a semantics that enables a compact, declarative representation of a joint probability distribution over the variables of a domain by leveraging the conditional independencies among them. The representation consists of a directed acyclic graph that encodes the conditional independencies among the variables and a set of parameters that encodes conditional distributions. This representation has provided a basis for the development of algo-rithms for probabilistic reasoning (inference) and for learning probability distributions from data. Bayesian networks are used for a wide range of tasks in machine learning, including clustering, super -vised classification, multi-dimensional supervised classification, anomaly detection, and temporal mod-eling. They also provide a basis for estimation of distribution algorithms, a class of evolutionary algorithms for heuristic optimization. We illustrate the use of Bayesian networks for interpretable machine learning and optimization by presenting applications in neuroscience, the industry, and bioin-formatics, covering a wide range of machine learning and optimization tasks. (c) 2021 Published by Elsevier B.V.
[发布日期] 2021-10-07 [发布机构]
[效力级别] [学科分类]
[关键词] Interpretability;Explainable machine learning;Probabilistic graphical models [时效性]