Multiparty Dynamics and Failure Modes for Machine Learning and Artificial Intelligence

Resumen

An important challenge for safety in machine learning and artificial intelligence systems is a set of related failures involving specification gaming, reward hacking, fragility to distributional shifts, and Goodhart’s or Campbell’s law. This paper presents additional failure modes for interactions within multi-agent systems that are closely related. These multi-agent failure modes are more complex, more problematic, and less well understood than the single-agent case, and are also already occurring, largely unnoticed. After motivating the discussion with examples from poker-playing artificial intelligence (AI), the paper explains why these failure modes are in some senses unavoidable. Following this, the paper categorizes failure modes, provides definitions, and cites examples for each of the modes: accidental steering, coordination failures, adversarial misalignment, input spoofing and filtering, and goal co-option or direct hacking. The paper then discusses how extant literature on multi-agent AI fails to address these failure modes, and identifies work which may be useful for the mitigation of these failure modes.

Palabras claves

multi-agent systems - specification gaming - artificial intelligence safety - Goodhart?s Law

Acceso

P�GINAS

N�MERO

Volumen: 3 N�mero: 2 Parte: June (2019)

MATERIAS

INFRAESTRUCTURA

REVISTAS SIMILARES

Big Data and Cognitive Computing
ISPRS International Journal of Geo-Information
Buildings

Art�culos similares

Systematic Calculation of Yield and Failure Curvatures of Reinforced Concrete Cross-Sections

Acceso

John Bellos and Apostolos Konstantinidis

This paper examines and provides a robust solution to the problem of yield and failure curvatures of reinforced concrete (RC) cross-sections, taking into account cracking. At the same time, it calculates the corresponding necessary reinforcement or the m... ver m�s

Revista: Buildings

An Improved BLG Tree for Trajectory Compression with Constraints of Road Networks

Acceso

Minshi Liu, Ling Zhang, Yi Long, Yong Sun and Mingwei Zhao

With the rising popularity of portable mobile positioning equipment, the volume of mobile trajectory data is increasing. Therefore, trajectory data compression has become an important basis for trajectory data processing, analysis, and mining. According ... ver m�s

Revista: ISPRS International Journal of Geo-Information

Micro-FL: A Fault-Tolerant Scalable Microservice-Based Platform for Federated Learning

Acceso

Mikael Sabuhi, Petr Musilek and Cor-Paul Bezemer

As the number of machine learning applications increases, growing concerns about data privacy expose the limitations of traditional cloud-based machine learning methods that rely on centralized data collection and processing. Federated learning emerges a... ver m�s

Revista: Future Internet

Integrating Digital Twins with BIM for Enhanced Building Control Strategies: A Systematic Literature Review Focusing on Daylight and Artificial Lighting Systems

Acceso

Martin Hauer, Sascha Hammes, Philipp Zech, David Geisler-Moroder, Daniel Pl�rer, Josef Miller, Vincent van Karsbergen and Rainer Pfluger

In the architecture, engineering, and construction industries, the integration of Building Information Modeling (BIM) has become instrumental in shaping the design and commissioning of smart buildings. At the center of this development is the pursuit of ... ver m�s

Revista: Buildings

Study on Axial Compression Performance of Corroded Reinforced Concrete Columns Strengthened by Concrete Canvas and Carbon Fiber Reinforced Plastic under Secondary Corrosion

Acceso

Fengge Li, Chen Chen and Zehui Xiang

To investigate the effects of concrete canvas (CC) and carbon fiber reinforced plastic (CFRP) reinforcement on the mechanical properties of corroded reinforced concrete columns (compressive strength, flexure strength, strength of extension, and so on), 4... ver m�s

Revista: Buildings

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas disponibles