Author: "Kolt, Noam" - Searchworks@Jio Institute Digital Library Search Results

Your search for '"Kolt, Noam"' returned 18 results

1. IDs for AI Systems

Author: Chan, Alan, Kolt, Noam, Wills, Peter, Anwar, Usman, de Witt, Christian Schroeder, Rajkumar, Nitarshan, Hammond, Lewis, Krueger, David, Heim, Lennart, and Anderljung, Markus
Subjects: Computer Science - Artificial Intelligence
Abstract: AI systems are increasingly pervasive, yet information needed to decide whether and how to engage with them may not exist or be accessible. A user may not be able to verify whether a system has certain safety certifications. An investigator may not know whom to investigate when a system causes an incident. It may not be clear whom to contact to shut down a malfunctioning system. Across a number of domains, IDs address analogous problems by identifying particular entities (e.g., a particular Boeing 747) and providing information about other entities of the same class (e.g., some or all Boeing 747s). We propose a framework in which IDs are ascribed to instances of AI systems (e.g., a particular chat session with Claude 3), and associated information is accessible to parties seeking to interact with that system. We characterize IDs for AI systems, provide concrete examples where IDs could be useful, argue that there could be significant demand for IDs from key actors, analyze how those actors could incentivize ID adoption, explore a potential implementation of our framework for deployers of AI systems, and highlight limitations and risks. IDs seem most warranted in settings where AI systems could have a large impact upon the world, such as in making financial transactions or contacting real humans. With further study, IDs could help to manage a world where AI systems pervade society.
Comment: Under review; accepted to RegML workshop at NeurIPS 2024
Published: 2024

2. Responsible Reporting for Frontier AI Development

Author: Kolt, Noam, Anderljung, Markus, Barnhart, Joslyn, Brass, Asher, Esvelt, Kevin, Hadfield, Gillian K., Heim, Lennart, Rodriguez, Mikel, Sandbrink, Jonas B., and Woodside, Thomas
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence
Abstract: Mitigating the risks from frontier AI systems requires up-to-date and reliable information about those systems. Organizations that develop and deploy frontier systems have significant access to such information. By reporting safety-critical information to actors in government, industry, and civil society, these organizations could improve visibility into new and emerging risks posed by frontier systems. Equipped with this information, developers could make better informed decisions on risk management, while policymakers could design more targeted and robust regulatory infrastructure. We outline the key features of responsible reporting and propose mechanisms for implementing them in practice.
Published: 2024

3. Black-Box Access is Insufficient for Rigorous AI Audits

Author: Casper, Stephen, Ezell, Carson, Siegmann, Charlotte, Kolt, Noam, Curtis, Taylor Lynn, Bucknall, Benjamin, Haupt, Andreas, Wei, Kevin, Scheurer, Jérémy, Hobbhahn, Marius, Sharkey, Lee, Krishna, Satyapriya, Von Hagen, Marvin, Alberti, Silas, Chan, Alan, Sun, Qinyi, Gerovitch, Michael, Bau, David, Tegmark, Max, Krueger, David, and Hadfield-Menell, Dylan
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security
Abstract: External audits of AI systems are increasingly recognized as a key mechanism for AI governance. The effectiveness of an audit, however, depends on the degree of access granted to auditors. Recent audits of state-of-the-art AI systems have primarily relied on black-box access, in which auditors can only query the system and observe its outputs. However, white-box access to the system's inner workings (e.g., weights, activations, gradients) allows an auditor to perform stronger attacks, more thoroughly interpret models, and conduct fine-tuning. Meanwhile, outside-the-box access to training and deployment information (e.g., methodology, code, documentation, data, deployment details, findings from internal evaluations) allows auditors to scrutinize the development process and design more targeted evaluations. In this paper, we examine the limitations of black-box audits and the advantages of white- and outside-the-box audits. We also discuss technical, physical, and legal safeguards for performing these audits with minimal security risks. Given that different forms of access can lead to very different levels of evaluation, we conclude that (1) transparency regarding the access and methods used by auditors is necessary to properly interpret audit results, and (2) white- and outside-the-box access allow for substantially more scrutiny than black-box access alone.
Comment: FAccT 2024
Published: 2024
Full Text: View/download PDF

4. Visibility into AI Agents

Author: Chan, Alan, Ezell, Carson, Kaufmann, Max, Wei, Kevin, Hammond, Lewis, Bradley, Herbie, Bluemke, Emma, Rajkumar, Nitarshan, Krueger, David, Kolt, Noam, Heim, Lennart, and Anderljung, Markus
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence
Abstract: Increased delegation of commercial, scientific, governmental, and personal activities to AI agents -- systems capable of pursuing complex goals with limited supervision -- may exacerbate existing societal risks and introduce new risks. Understanding and mitigating these risks involves critically evaluating existing governance structures, revising and adapting these structures where needed, and ensuring accountability of key stakeholders. Information about where, why, how, and by whom certain AI agents are used, which we refer to as visibility, is critical to these objectives. In this paper, we assess three categories of measures to increase visibility into AI agents: agent identifiers, real-time monitoring, and activity logging. For each, we outline potential implementations that vary in intrusiveness and informativeness. We analyze how the measures apply across a spectrum of centralized through decentralized deployment contexts, accounting for various actors in the supply chain including hardware and software service providers. Finally, we discuss the implications of our measures for privacy and concentration of power. Further work into understanding the measures and mitigating their negative impacts can help to build a foundation for the governance of AI agents.
Comment: Accepted to ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT 2024)
Published: 2024

5. LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

Author: Guha, Neel, Nyarko, Julian, Ho, Daniel E., Ré, Christopher, Chilton, Adam, Narayana, Aditya, Chohlas-Wood, Alex, Peters, Austin, Waldon, Brandon, Rockmore, Daniel N., Zambrano, Diego, Talisman, Dmitry, Hoque, Enam, Surani, Faiz, Fagan, Frank, Sarfaty, Galit, Dickinson, Gregory M., Porat, Haggai, Hegland, Jason, Wu, Jessica, Nudell, Joe, Niklaus, Joel, Nay, John, Choi, Jonathan H., Tobia, Kevin, Hagan, Margaret, Ma, Megan, Livermore, Michael, Rasumov-Rahe, Nikon, Holzenberger, Nils, Kolt, Noam, Henderson, Peter, Rehaag, Sean, Goel, Sharad, Gao, Shang, Williams, Spencer, Gandhi, Sunny, Zur, Tom, Iyer, Varun, and Li, Zehua
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisciplinary process, in which we collected tasks designed and hand-crafted by legal professionals. Because these subject matter experts took a leading role in construction, tasks either measure legal reasoning capabilities that are practically useful, or measure reasoning skills that lawyers find interesting. To enable cross-disciplinary conversations about LLMs in the law, we additionally show how popular legal frameworks for describing legal reasoning -- which distinguish between its many forms -- correspond to LegalBench tasks, thus giving lawyers and LLM developers a common vocabulary. This paper describes LegalBench, presents an empirical evaluation of 20 open-source and commercial LLMs, and illustrates the types of research explorations LegalBench enables.
Comment: 143 pages, 79 tables, 4 figures
Published: 2023

6. Frontier AI Regulation: Managing Emerging Risks to Public Safety

Author: Anderljung, Markus, Barnhart, Joslyn, Korinek, Anton, Leung, Jade, O'Keefe, Cullen, Whittlestone, Jess, Avin, Shahar, Brundage, Miles, Bullock, Justin, Cass-Beggs, Duncan, Chang, Ben, Collins, Tantum, Fist, Tim, Hadfield, Gillian, Hayes, Alan, Ho, Lewis, Hooker, Sara, Horvitz, Eric, Kolt, Noam, Schuett, Jonas, Shavit, Yonadav, Siddarth, Divya, Trager, Robert, and Wolf, Kevin
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence
Abstract: Advanced AI models hold the promise of tremendous benefits for humanity, but society needs to proactively manage the accompanying risks. In this paper, we focus on what we term "frontier AI" models: highly capable foundation models that could possess dangerous capabilities sufficient to pose severe risks to public safety. Frontier AI models pose a distinct regulatory challenge: dangerous capabilities can arise unexpectedly; it is difficult to robustly prevent a deployed model from being misused; and, it is difficult to stop a model's capabilities from proliferating broadly. To address these challenges, at least three building blocks for the regulation of frontier models are needed: (1) standard-setting processes to identify appropriate requirements for frontier AI developers, (2) registration and reporting requirements to provide regulators with visibility into frontier AI development processes, and (3) mechanisms to ensure compliance with safety standards for the development and deployment of frontier AI models. Industry self-regulation is an important first step. However, wider societal discussions and government intervention will be needed to create standards and to ensure compliance with them. We consider several options to this end, including granting enforcement powers to supervisory authorities and licensure regimes for frontier AI models. Finally, we propose an initial set of safety standards. These include conducting pre-deployment risk assessments; external scrutiny of model behavior; using risk assessments to inform deployment decisions; and monitoring and responding to new information about model capabilities and uses post-deployment. We hope this discussion contributes to the broader conversation on how to balance public safety risks and innovation benefits from advances at the frontier of AI development.
Comment: Update July 11th: added missing footnote back in; adjusted author order (mistakenly non-alphabetical among the first 6 authors) and adjusted affiliations (Jess Whittlestone's affiliation was mistagged and Gillian Hadfield had SRI added to her affiliations). Updated September 4th: various typos.
Published: 2023

7. Model evaluation for extreme risks

Author: Shevlane, Toby, Farquhar, Sebastian, Garfinkel, Ben, Phuong, Mary, Whittlestone, Jess, Leung, Jade, Kokotajlo, Daniel, Marchal, Nahema, Anderljung, Markus, Kolt, Noam, Ho, Lewis, Siddarth, Divya, Avin, Shahar, Hawkins, Will, Kim, Been, Gabriel, Iason, Bolina, Vijay, Clark, Jack, Bengio, Yoshua, Christiano, Paul, and Dafoe, Allan
Subjects: Computer Science - Artificial Intelligence, K.4.1
Abstract: Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further progress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify dangerous capabilities (through "dangerous capability evaluations") and the propensity of models to apply their capabilities for harm (through "alignment evaluations"). These evaluations will become critical for keeping policymakers and other stakeholders informed, and for making responsible decisions about model training, deployment, and security.
Comment: Fixed typos; added citation
Published: 2023

8. Black-Box Access is Insufficient for Rigorous AI Audits

Author: Casper, Stephen, primary, Ezell, Carson, additional, Siegmann, Charlotte, additional, Kolt, Noam, additional, Curtis, Taylor Lynn, additional, Bucknall, Benjamin, additional, Haupt, Andreas, additional, Wei, Kevin, additional, Scheurer, Jérémy, additional, Hobbhahn, Marius, additional, Sharkey, Lee, additional, Krishna, Satyapriya, additional, Von Hagen, Marvin, additional, Alberti, Silas, additional, Chan, Alan, additional, Sun, Qinyi, additional, Gerovitch, Michael, additional, Bau, David, additional, Tegmark, Max, additional, Krueger, David, additional, and Hadfield-Menell, Dylan, additional
Published: 2024
Full Text: View/download PDF

9. Visibility into AI Agents

Author: Chan, Alan, primary, Ezell, Carson, additional, Kaufmann, Max, additional, Wei, Kevin, additional, Hammond, Lewis, additional, Bradley, Herbie, additional, Bluemke, Emma, additional, Rajkumar, Nitarshan, additional, Krueger, David, additional, Kolt, Noam, additional, Heim, Lennart, additional, and Anderljung, Markus, additional
Published: 2024
Full Text: View/download PDF

10. Governing AI Agents

Author: Kolt, Noam, primary
Published: 2024
Full Text: View/download PDF

11. Return on Data: Personalizing Consumer Guidance in Data Exchanges

Author: Kolt, Noam
Published: 2019
Full Text: View/download PDF

12. ALGORITHMIC BLACK SWANS.

Author: KOLT, NOAM
Subjects: HATE speech, BLACK swan theory, PRIVACY, SOCIAL values, ARTIFICIAL intelligence
Abstract: From biased lending algorithms to chatbots that spew violent hate speech, AI systems already pose many risks to society. While policymakers have a responsibility to tackle pressing issues of algorithmic fairness, privacy, and accountability, they also have a responsibility to consider broader, longer-term risks from AI technologies. In public health, climate science, and financial markets, anticipating and addressing societal-scale risks is crucial. As the COVID-19 pandemic demonstrates, overlooking catastrophic tail events--or "black swans"--is costly. The prospect of automated systems manipulating our information environment, distorting societal values, and destabilizing political institutions is increasingly palpable. At present, it appears unlikely that market forces will address this class of risks. Organizations building AI systems do not bear the costs of diffuse societal harms and have limited incentive to install adequate safeguards. Meanwhile, current regulatory proposals such as the EU AI Act primarily target the immediate risks from AI, rather than broader, longer-term risks. To fill this governance gap, this Article offers a roadmap for "algorithmic preparedness"--a set of five forward-looking principles to guide the development of regulations that confront the prospect of algorithmic black swans and mitigate the harms they pose to society. [ABSTRACT FROM AUTHOR]
Published: 2024

13. Cosmopolitan originalism: Revisiting the role of international law in constitutional interpretation

Author: Kolt, Noam
Published: 2017

14. Regulating advanced artificial agents.

Author: Cohen, Michael K., Kolt, Noam, Bengio, Yoshua, Hadfield, Gillian K., and Russell, Stuart
Subjects: *ARTIFICIAL intelligence, *INTELLIGENT agents, *ALGORITHMIC bias, *TURING test, *LAW reviews
Abstract: The article focuses on the growing concern about the potential dangers posed by advanced artificial intelligence (AI) systems, particularly long-term planning agents (LTPAs), which could evade human control and present existential risks. Topics discussed include the incentive for AI systems to deceive humans, the necessity of regulatory frameworks to address these risks, and the proposal for mandatory reporting and production controls for LTPAs to prevent their unlawful development.
Published: 2024
Full Text: View/download PDF

15. Legalbench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

Author: Guha, Neel, primary, Nyarko, Julian, additional, Ho, Daniel E., additional, Ré, Christopher, additional, Chilton, Adam, additional, Narayana, Aditya, additional, Chohlas-Wood, Alex, additional, Peters, Austin, additional, Waldon, Brandon, additional, Rockmore, Daniel, additional, Zambrano, Diego, additional, Talisman, Dmitry, additional, Hoque, Enam, additional, Surani, Faiz, additional, Fagan, Frank, additional, Sarfaty, Galit, additional, Dickinson, Gregory M., additional, Porat, Haggai, additional, Hegland, Jason, additional, Wu, Jessica, additional, Nudell, Joe, additional, Niklaus, Joel, additional, Nay, John, additional, Choi, Jonathan H., additional, Tobia, Kevin, additional, Hagan, Margaret, additional, Ma, Megan, additional, Livermore, Michael A., additional, Rasumov-Rahe, Nikon, additional, Holzenberger, Nils, additional, Kolt, Noam, additional, Henderson, Peter, additional, Rehaag, Sean, additional, Goel, Sharad, additional, Gao, Shang, additional, Williams, Spencer, additional, Gandhi, Sunny, additional, Zur, Tom, additional, Iyer, Varun, additional, and Li, Zehua, additional
Published: 2023
Full Text: View/download PDF

16. Predicting Consumer Contracts

Author: Kolt, Noam
Published: 2022
Full Text: View/download PDF

17. Populist rhetoric, false mirroring, and the courts

Author: Harel, Alon, primary and Kolt, Noam, additional
Published: 2020
Full Text: View/download PDF

18. Populist Rhetoric, False Mirroring, and the Courts

Author: Harel, Alon, primary and Kolt, Noam, additional
Published: 2019
Full Text: View/download PDF
