|
|
|
Jiajia Peng and Tianbing Tang
Image captioning, also recognized as the challenge of transforming visual data into coherent natural language descriptions, has persisted as a complex problem. Traditional approaches often suffer from semantic gaps, wherein the generated textual descript...
ver más
|
|
|
|
|
|
|
Jin-Woo Kong, Byoung-Doo Oh, Chulho Kim and Yu-Seop Kim
Intracerebral hemorrhage (ICH) is a severe cerebrovascular disorder that poses a life-threatening risk, necessitating swift diagnosis and treatment. While CT scans are the most effective diagnostic tool for detecting cerebral hemorrhage, their interpreta...
ver más
|
|
|
|
|
|
|
Oscar Ondeng, Heywood Ouma and Peter Akuon
Visual understanding is a research area that bridges the gap between computer vision and natural language processing. Image captioning is a visual understanding task in which natural language descriptions of images are automatically generated using visio...
ver más
|
|
|
|
|
|
|
Tian Xie, Weiping Ding, Jinbao Zhang, Xusen Wan and Jiehua Wang
The discipline of automatic image captioning represents an integration of two pivotal branches of artificial intelligence, namely computer vision (CV) and natural language processing (NLP). The principal functionality of this technology lies in transmuti...
ver más
|
|
|
|
|
|
|
Sen Du, Hong Zhu, Guangfeng Lin, Dong Wang and Jing Shi
Automatically describing the content of an image is a challenging task that is on the edge between natural language and computer vision. The current image caption models can describe the objects that are frequently seen in the training set very well, but...
ver más
|
|
|
|
|
|
|
Aryan Yousefi and Kalpdrum Passi
Image captioning is the multi-modal task of automatically describing a digital image based on its contents and their semantic relationship. This research area has gained increasing popularity over the past few years; however, most of the previous studies...
ver más
|
|
|
|
|
|
|
Jianying Li and Xiangjun Shao
Image captioning is a challenging task, which generates a sentence for a given image. The earlier captioning methods mainly decode the visual features to generate caption sentences for the image. However, the visual features lack the context semantic inf...
ver más
|
|
|
|
|
|
|
Fariba Lotfi, Amin Beheshti, Helia Farhood, Matineh Pooshideh, Mansour Jamzad and Hamid Beigy
In our digital age, data are generated constantly from public and private sources, social media platforms, and the Internet of Things. A significant portion of this information comes in the form of unstructured images and videos, such as the 95 million d...
ver más
|
|
|
|
|
|
|
Swarnendu Ghosh, Teresa Gonçalves and Nibaran Das
Conceptual representations of images involving descriptions of entities and their relations are often represented using scene graphs. Such scene graphs can express relational concepts by using sets of triplets ⟨subject—predicate&...
ver más
|
|
|
|
|
|
|
Viktar Atliha and Dmitrij ?e?ok
Image captioning is a very important task, which is on the edge between natural language processing (NLP) and computer vision (CV). The current quality of the captioning models allows them to be used for practical tasks, but they require both large compu...
ver más
|
|
|
|