Awad, Edmond, Levine, Sydney, Anderson, Michael, Anderson, Susan Leigh, Conitzer, Vincent, Crockett, M.J., Everett, Jim A.C., Evgeniou, Theodoros, Gopnik, Alison, Jamison, Julian C., Kim, Tae Wan, Liao, S. Matthew, Meyer, Michelle N., Mikhail, John, Opoku-Agyemang, Kweku, Borg, Jana Schaich, Schroeder, Juliana, Sinnott-Armstrong, Walter, Slavkovik, Marija, and Tenenbaum, Josh B.
Technological advances are enabling roles for machines that present novel ethical challenges. The study of 'AI ethics' has emerged to confront these challenges, and connects perspectives from philosophy, computer science, law, and economics. Less represented in these interdisciplinary efforts is the perspective of cognitive science. We propose a framework – computational ethics – that specifies how the ethical challenges of AI can be partially addressed by incorporating the study of human moral decision-making. The driver of this framework is a computational version of reflective equilibrium (RE), an approach that seeks coherence between considered judgments and governing principles. The framework has two goals: (i) to inform the engineering of ethical AI systems, and (ii) to characterize human moral judgment and decision-making in computational terms. Working jointly towards these two goals will create the opportunity to integrate diverse research questions, bring together multiple academic communities, uncover new interdisciplinary research topics, and shed light on centuries-old philosophical questions. The past 15 years have seen an increased interest in developing ethical machines; manifested in various interdisciplinary research communities (under the umbrella term 'AI ethics'). Less represented in these interdisciplinary efforts is the perspective of cognitive science. We propose a framework – computational ethics – that specifies how the ethical challenges of AI can be addressed better by incorporating the study of how humans make moral decisions. As the driver of this framework, we propose a computational version of reflective equilibrium. The goal of this framework is twofold: (i) to inform the engineering of ethical AI systems, and (ii) to characterize human moral judgment and decision-making in computational terms. Working jointly towards these two goals may prove to be beneficial in making progress on both fronts. [ABSTRACT FROM AUTHOR]