REVISTA
Information

TODAS

Redirigiendo al acceso original de articulo en 19 segundos...

Inicio / Information / Vol: 15 Par: 2 (2024) / Art�culo

ART�CULO

TITULO

Leveraging Semantic Text Analysis to Improve the Performance of Transformer-Based Relation Extraction

Marie-Therese Charlotte Evans

Majid Latifi

Mominul Ahsan and Julfikar Haider

Resumen

Keyword extraction from Knowledge Bases underpins the definition of relevancy in Digital Library search systems. However, it is the pertinent task of Joint Relation Extraction, which populates the Knowledge Bases from which results are retrieved. Recent work focuses on fine-tuned, Pre-trained Transformers. Yet, F1 scores for scientific literature achieve just 53.2, versus 69 in the general domain. The research demonstrates the failure of existing work to evidence the rationale for optimisations to finetuned classifiers. In contrast, emerging research subjectively adopts the common belief that Natural Language Processing techniques fail to derive context and shared knowledge. In fact, global context and shared knowledge account for just 10.4% and 11.2% of total relation misclassifications, respectively. In this work, the novel employment of semantic text analysis presents objective challenges for the Transformer-based classification of Joint Relation Extraction. This is the first known work to quantify that pipelined error propagation accounts for 45.3% of total relation misclassifications, the most poignant challenge in this domain. More specifically, Part-of-Speech tagging highlights the misclassification of complex noun phrases, accounting for 25.47% of relation misclassifications. Furthermore, this study identifies two limitations in the purported bidirectionality of the Bidirectional Encoder Representations from Transformers (BERT) Pre-trained Language Model. Firstly, there is a notable imbalance in the misclassification of right-to-left relations, which occurs at a rate double that of left-to-right relations. Additionally, a failure to recognise local context through determiners and prepositions contributes to 16.04% of misclassifications. Furthermore, it is highlighted that the annotation scheme of the singular dataset utilised in existing research, Scientific Entities, Relations and Coreferences (SciERC), is marred by ambiguity. Notably, two asymmetric relations within this dataset achieve recall rates of only 10% and 29%.

Palabras claves

Joint Relation Extraction (JRE) - digital libraries - Named Entity Recognition (NER) - Relation Extraction (RE) - Pre-trained Language Model - transformer - SCIBERT - Scientific Entity Relation and Coreferences (SciERC) - PL-Marker - semantic text analysis - global context

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 15 Parte: 2 (2024)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Information
Algorithms
Journal of Marine Science and Engineering

DOI

https://doi.org/10.3390/info15020091

Art�culos similares

Nemesis: Neural Mean Teacher Learning-Based Emotion-Centric Speaker

Acceso

Aryan Yousefi and Kalpdrum Passi

Image captioning is the multi-modal task of automatically describing a digital image based on its contents and their semantic relationship. This research area has gained increasing popularity over the past few years; however, most of the previous studies... ver m�s

Revista: Algorithms

Automatic Construction of Educational Knowledge Graphs: A Word Embedding-Based Approach

Acceso

Qurat Ul Ain, Mohamed Amine Chatti, Komlan Gluck Charles Bakar, Shoeb Joarder and Rawaa Alatrash

Knowledge graphs (KGs) are widely used in the education domain to offer learners a semantic representation of domain concepts from educational content and their relations, termed as educational knowledge graphs (EduKGs). Previous studies on EduKGs have i... ver m�s

Revista: Information

Linked Data Interfaces: A Survey

Acceso

Eleonora Bernasconi, Miguel Ceriani, Davide Di Pierro, Stefano Ferilli and Domenico Redavid

In the era of big data, linked data interfaces play a critical role in enabling access to and management of large-scale, heterogeneous datasets. This survey investigates forty-seven interfaces developed by the semantic web community in the context of the... ver m�s

Revista: Information

Bi-LS-AttM: A Bidirectional LSTM and Attention Mechanism Model for Improving Image Captioning

Acceso

Tian Xie, Weiping Ding, Jinbao Zhang, Xusen Wan and Jiehua Wang

The discipline of automatic image captioning represents an integration of two pivotal branches of artificial intelligence, namely computer vision (CV) and natural language processing (NLP). The principal functionality of this technology lies in transmuti... ver m�s

Revista: Applied Sciences

Re-Engineered Word Embeddings for Improved Document-Level Sentiment Analysis

Acceso

Su Yang and Farzin Deravi

In this paper, a novel re-engineering mechanism for the generation of word embeddings is proposed for document-level sentiment analysis. Current approaches to sentiment analysis often integrate feature engineering with classification, without optimizing ... ver m�s

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas disponibles