ARTÍCULO
TITULO

Multiparty Dynamics and Failure Modes for Machine Learning and Artificial Intelligence

Resumen

An important challenge for safety in machine learning and artificial intelligence systems is a set of related failures involving specification gaming, reward hacking, fragility to distributional shifts, and Goodhart’s or Campbell’s law. This paper presents additional failure modes for interactions within multi-agent systems that are closely related. These multi-agent failure modes are more complex, more problematic, and less well understood than the single-agent case, and are also already occurring, largely unnoticed. After motivating the discussion with examples from poker-playing artificial intelligence (AI), the paper explains why these failure modes are in some senses unavoidable. Following this, the paper categorizes failure modes, provides definitions, and cites examples for each of the modes: accidental steering, coordination failures, adversarial misalignment, input spoofing and filtering, and goal co-option or direct hacking. The paper then discusses how extant literature on multi-agent AI fails to address these failure modes, and identifies work which may be useful for the mitigation of these failure modes.

 Artículos similares

       
 
John Bellos and Apostolos Konstantinidis    
This paper examines and provides a robust solution to the problem of yield and failure curvatures of reinforced concrete (RC) cross-sections, taking into account cracking. At the same time, it calculates the corresponding necessary reinforcement or the m... ver más
Revista: Buildings

 
Minshi Liu, Ling Zhang, Yi Long, Yong Sun and Mingwei Zhao    
With the rising popularity of portable mobile positioning equipment, the volume of mobile trajectory data is increasing. Therefore, trajectory data compression has become an important basis for trajectory data processing, analysis, and mining. According ... ver más

 
Mikael Sabuhi, Petr Musilek and Cor-Paul Bezemer    
As the number of machine learning applications increases, growing concerns about data privacy expose the limitations of traditional cloud-based machine learning methods that rely on centralized data collection and processing. Federated learning emerges a... ver más
Revista: Future Internet

 
Martin Hauer, Sascha Hammes, Philipp Zech, David Geisler-Moroder, Daniel Plörer, Josef Miller, Vincent van Karsbergen and Rainer Pfluger    
In the architecture, engineering, and construction industries, the integration of Building Information Modeling (BIM) has become instrumental in shaping the design and commissioning of smart buildings. At the center of this development is the pursuit of ... ver más
Revista: Buildings

 
Fengge Li, Chen Chen and Zehui Xiang    
To investigate the effects of concrete canvas (CC) and carbon fiber reinforced plastic (CFRP) reinforcement on the mechanical properties of corroded reinforced concrete columns (compressive strength, flexure strength, strength of extension, and so on), 4... ver más
Revista: Buildings