ARTICLE

Do Large Language Models Show Human-like Biases? Exploring Confidence-Competence Gap in AI

Aniket Kumar Singh, Bishal Lamichhane, Suman Devkota, Uttam Dhakal and Chandra Dhakal

Abstract

This study investigates self-assessment tendencies in Large Language Models (LLMs), examining whether their patterns resemble human cognitive biases such as the Dunning-Kruger effect. LLMs, including GPT, BARD, Claude, and LLaMA, are evaluated using confidence scores on reasoning tasks. The models provide self-assessed confidence levels before and after responding to different questions. The results show cases where high confidence does not correlate with correctness, suggesting overconfidence. Conversely, low confidence despite accurate responses indicates potential underestimation. Confidence scores vary across problem categories and difficulty levels, with lower confidence on more complex queries. GPT-4 displays consistent confidence, while LLaMA and Claude show more variation. Some of these patterns resemble the Dunning-Kruger effect, where incompetence leads to inflated self-evaluations. While not conclusively evident, these observations parallel this phenomenon and provide a foundation for further exploring the alignment of competence and confidence in LLMs. As LLMs continue to expand their societal roles, further research into their self-assessment mechanisms is warranted to fully understand their capabilities and limitations.
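
The abstract describes a simple elicitation protocol: each model rates its confidence before and after answering, and those ratings are then compared with correctness. The Python sketch below is one possible reading of that protocol, not the authors' code; query_model is a hypothetical placeholder for a real LLM API call, and the prompt wording and exact-match scoring are illustrative assumptions.

# Minimal sketch of a pre-/post-answer confidence protocol (illustrative only).
# query_model is a hypothetical placeholder; wire it to an actual LLM API.
from statistics import mean
from typing import Optional

def query_model(prompt: str) -> str:
    """Placeholder for a call to one of the evaluated models (e.g., GPT, Claude, LLaMA)."""
    raise NotImplementedError("Connect this to a real model API.")

def elicit_confidence(question: str, answer: Optional[str] = None) -> float:
    """Ask the model for a 0-100 confidence rating, before or after it answers."""
    if answer is None:
        prompt = (f"Question: {question}\n"
                  "Before answering, rate your confidence (0-100) that you will "
                  "answer correctly. Reply with a number only.")
    else:
        prompt = (f"Question: {question}\nProposed answer: {answer}\n"
                  "Rate your confidence (0-100) that this answer is correct. "
                  "Reply with a number only.")
    return float(query_model(prompt))

def evaluate(items):
    """items: iterable of (question, gold_answer) pairs. Returns per-item records."""
    records = []
    for question, gold in items:
        pre = elicit_confidence(question)                         # confidence before answering
        answer = query_model(f"Question: {question}\nAnswer concisely.")
        post = elicit_confidence(question, answer)                # confidence after answering
        correct = answer.strip().lower() == gold.strip().lower()  # naive exact-match scoring
        records.append({"pre": pre, "post": post, "correct": correct})
    return records

def confidence_gap(records):
    """Mean post-answer confidence minus accuracy, both on a 0-100 scale.
    A large positive gap suggests overconfidence; a negative gap, underestimation."""
    accuracy = 100.0 * mean(r["correct"] for r in records)
    return mean(r["post"] for r in records) - accuracy

Computing this gap separately per problem category and difficulty level would surface the kind of patterns the abstract reports, such as lower confidence on more complex queries.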

Similar articles

       
 
Torrey Wagner, Dennis Guhl and Brent Langhals    
Given the emergence of China as a political and economic power in the 21st century, there is increased interest in analyzing Chinese news articles to better understand developing trends in China. Because of the volume of the material, automating the cate...
Journal: Algorithms

 
Miloš Bogdanovic, Jelena Kocic and Leonid Stoimenov
Language is a unique ability of human beings. Although relatively simple for humans, the ability to understand human language is a highly complex task for machines. For a machine to learn a particular language, it must understand not only the words and r...
Journal: Information

 
Malinka Ivanova, Gabriela Grosseck and Carmen Holotescu    
The penetration of intelligent applications in education is rapidly increasing, posing a number of questions of a different nature to the educational community. This paper is coming to analyze and outline the influence of artificial intelligence (AI) on ...
Journal: Informatics

 
Jiahao Fan and Weijun Pan    
In recent years, automatic speech recognition (ASR) technology has improved significantly. However, the training process for an ASR model is complex, involving large amounts of data and a large number of algorithms. The task of training a new model for a...
Journal: Aerospace

 
Xin Tian and Yuan Meng    
The judicious configuration of predicates is a crucial but often overlooked aspect in the field of knowledge graphs. While previous research has primarily focused on the precision of triples in assessing knowledge graph quality, the rationality of predic...
Journal: Algorithms