Inicio  /  Future Internet  /  Vol: 15 Par: 5 (2023)  /  Artículo
ARTÍCULO
TITULO

Analysis of Digital Information in Storage Devices Using Supervised and Unsupervised Natural Language Processing Techniques

Luis Alberto Martínez Hernández    
Ana Lucila Sandoval Orozco and Luis Javier García Villalba    

Resumen

Due to the advancement of technology, cybercrime has increased considerably, making digital forensics essential for any organisation. One of the most critical challenges is to analyse and classify the information on devices, identifying the relevant and valuable data for a specific purpose. This phase of the forensic process is one of the most complex and time-consuming, and requires expert analysts to avoid overlooking data relevant to the investigation. Although tools exist today that can automate this process, they will depend on how tightly their parameters are tuned to the case study, and many lack support for complex scenarios where language barriers play an important role. Recent advances in machine learning allow the creation of new architectures to significantly increase the performance of information analysis and perform the intelligent search process automatically, reducing analysis time and identifying relationships between files based on initial parameters. In this paper, we present a bibliographic review of artificial intelligence algorithms that allow an exhaustive analysis of multimedia information contained in removable devices in a forensic process, using natural language processing and natural language understanding techniques for the automatic classification of documents in seized devices. Finally, some of the open challenges technology developers face when generating tools that use artificial intelligence techniques to analyse the information contained in documents on seized devices are reviewed.

 Artículos similares

       
 
Dominik Warch, Patrick Stellbauer and Pascal Neis    
In the digital transformation era, video media libraries? untapped potential is immense, restricted primarily by their non-machine-readable nature and basic search functionalities limited to standard metadata. This study presents a novel multimodal metho... ver más
Revista: Future Internet

 
Andreas F. Gkontzis, Sotiris Kotsiantis, Georgios Feretzakis and Vassilios S. Verykios    
Smart cities, leveraging advanced data analytics, predictive models, and digital twin techniques, offer a transformative model for sustainable urban development. Predictive analytics is critical to proactive planning, enabling cities to adapt to evolving... ver más
Revista: Future Internet

 
Mingwei Zhao, Xiaoxiao Ju, Ni Wang, Chun Wang, Weibo Zeng and Yan Xu    
Extracting a channel network based on the Digital Elevation Model (DEM) is one of the key research topics in digital terrain analysis. However, when the channel area is wide and flat, it is easy to form parallel channels, which seriously affect the accur... ver más

 
Ge Yan, Guoan Tang, Dingyang Lu, Junfei Ma, Xin Yang and Fayuan Li    
The intervalley plain is an important type of landform for mapping, and it has good connectivity for urban construction and development on the Loess Plateau. During the global landform mapping of the Deep-time Digital Earth (DDE) Big Science Program, it ... ver más

 
Thoralf Reis, Lukas Dumberger, Sebastian Bruchhaus, Thomas Krause, Verena Schreyer, Marco X. Bornschlegl and Matthias L. Hemmje    
Manual labeling and categorization are extremely time-consuming and, thus, costly. AI and ML-supported information systems can bridge this gap and support labor-intensive digital activities. Since it requires categorization, coding-based analysis, such a... ver más