Aerospace, Vol. 10, No. 10 (2023)
TITLE

Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding

Juan Zuluaga-Gomez    
Iuliia Nigmatulina    
Amrutha Prasad    
Petr Motlicek    
Driss Khalil    
Srikanth Madikeri    
Allan Tart    
Igor Szoke    
Vincent Lenders    
Mickael Rigault and Khalid Choukri    

Abstract

Voice communication between air traffic controllers (ATCos) and pilots is critical for ensuring safe and efficient air traffic control (ATC). Handling these voice communications requires high levels of awareness from ATCos and can be tedious and error-prone. Recent work aims to integrate artificial intelligence (AI) into ATC communications in order to lessen ATCos' workload. However, developing data-driven AI systems that understand spoken ATC communications demands large-scale annotated datasets, which are currently lacking in the field. This paper explores the lessons learned from the ATCO2 project, which aimed to develop a unique platform to collect, preprocess, and transcribe large amounts of ATC audio data from airspace in real time. This paper reviews (i) robust automatic speech recognition (ASR), (ii) natural language processing, (iii) English language identification, and (iv) contextual ASR biasing with surveillance data. The pipeline developed during the ATCO2 project, along with the open-sourcing of its data, encourages research in the ATC field, while the full corpus can be purchased through ELDA. The ATCO2 corpora are suitable for developing ASR systems when little to no transcribed ATC audio data is available. For instance, the proposed ASR system trained with ATCO2 reaches a word error rate (WER) as low as 17.9% on public ATC datasets, which is 6.6% absolute WER better than a system trained with "out-of-domain" but gold transcriptions. Finally, the release of 5000 h of ASR-transcribed speech, covering more than 10 airports worldwide, is a step towards more robust automatic speech understanding systems for ATC communications.
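The WER figures quoted in the abstract can be made concrete with a minimal sketch of the standard word-level edit-distance definition of WER (substitutions, deletions, and insertions divided by the number of reference words). This is not the scoring code used in the ATCO2 project; the function name `word_error_rate` and the example transcripts are hypothetical and invented purely for illustration. The last lines also work out the baseline WER implied by the abstract's numbers, assuming "6.6% absolute WER better" means a plain difference of percentage points.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missed)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Invented ATC-style utterance (not taken from the ATCO2 corpus).
reference = "lufthansa three two one descend flight level eight zero"
hypothesis = "lufthansa three two one descend level eight zero"  # "flight" dropped
example_wer = word_error_rate(reference, hypothesis)

# Baseline implied by the abstract, assuming a percentage-point difference:
# 17.9% WER is 6.6 points better than the out-of-domain system.
atco2_wer = 17.9
absolute_gain = 6.6
implied_out_of_domain_wer = round(atco2_wer + absolute_gain, 1)
```

Here the hypothesis drops one of nine reference words, so `example_wer` is 1/9 (about 11%). Under the stated assumption, the out-of-domain system would sit at roughly 24.5% WER on the same public datasets.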
