已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

Automated Image Annotation With Novel Features Based on Deep ResNet50-SLT

图像自动标注 计算机科学 注释 图像检索 语义鸿沟 情报检索 语义学(计算机科学) 搜索引擎索引 人工智能 领域(数学) 数字图像 图像(数学) 自然语言处理 图像处理 数学 程序设计语言 纯数学
作者
Myasar Mundher Adnan,Mohd Shafry Mohd Rahim,Azmat Ullah Khan,Ahmed Alkhayyat,Faten S. Alamri,Tanzila Saba,Saeed Ali Bahaj
出处
期刊:IEEE Access [Institute of Electrical and Electronics Engineers]
卷期号:11: 40258-40277 被引量:3
标识
DOI:10.1109/access.2023.3266296
摘要

Due to their vast size, the growing number of digital images found in personal archives and on websites has become unmanageable, making it challenging to accurately retrieve images from these large databases. While these collections are popular due to their convenience, they are often not equipped with proper indexing information, making it difficult for users to find what they need. One of the most significant challenges in the field of computer vision and multimedia is image annotation, which involves labeling images with descriptive keywords. However, computers do not possess the capability to understand the essence of images in the same way that humans do, and people can only identify images based on their visual attributes, not their deeper semantic meaning. Therefore, image annotation requires the use of keywords to effectively communicate the contents of an image to a computer system. However, raw pixels in an image do not provide enough information to generate semantic concepts, making image annotation a complex task. Unlike text annotation, where the dictionary linking words to semantics is well established, image annotation lacks a clear definition of "words" or "sentences" that can be associated with the meaning of the image, known as the semantic gap. To address this challenge, this study aimed to characterize image content meaningfully to make information retrieval easier. An improved automatic image annotation (AIA) system was proposed to bridge the semantic gap between low-level computer features and human interpretation of images by assigning one or multiple labels to images. The proposed AIA system can convert raw image pixels into semantic-level concepts, providing a clearer representation of the image content. The study combined the ResNet50 and slantlet transform with word2vec and principal component analysis with t-distributed stochastic neighbor embedding to balance precision and recall. This allowed the researchers to determine the optimal model for the proposed ResNet50-SLT AIA framework. A Word2vec model with ResNet50-SLT was used with principal component analysis and t-distributed stochastic neighbor embedding to improve IA prediction accuracy. The distributed representation approach involved encoding and storing information about image features. The proposed AIA system utilized seq2seq to generate sentences depending on feature vectors. The system was implemented on the most popular datasets (Flickr8k, Corel-5k, ESP-Game). The results showed that the newly developed AIA scheme overcame the computational time complexity associated with most existing image annotation models during the training phase for large datasets. The performance evaluation of the AIA scheme showed its excellent flexibility of annotation, improved accuracy, and reduced computational costs, thus outperforming the existing state-of-the-art methods. In conclusion, this AIA framework can provide immense benefits in accurately selecting and extracting image features and easily retrieving images from large databases. The extracted features can effectively be used to represent the image, thus accelerating the annotation process and minimizing the computational complexity.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
till发布了新的文献求助10
刚刚
刚刚
刚刚
3秒前
3秒前
思源应助百宝采纳,获得20
3秒前
小爱应助zhancon采纳,获得10
4秒前
4秒前
5秒前
玄同发布了新的文献求助10
5秒前
6秒前
6秒前
葱葱完成签到,获得积分10
6秒前
9秒前
JazzWon发布了新的文献求助10
9秒前
搜集达人应助科研通管家采纳,获得10
10秒前
nanfang应助科研通管家采纳,获得10
10秒前
nanfang应助科研通管家采纳,获得10
10秒前
顾矜应助科研通管家采纳,获得30
10秒前
10秒前
guo发布了新的文献求助10
11秒前
科目三应助Cc采纳,获得10
13秒前
隐形曼青应助大大采纳,获得50
13秒前
小二郎应助无邪采纳,获得10
14秒前
nhzz2023完成签到 ,获得积分10
14秒前
JazzWon完成签到,获得积分10
16秒前
19秒前
21秒前
自行者发布了新的文献求助80
21秒前
23秒前
刘小源完成签到 ,获得积分10
23秒前
23秒前
tianyue发布了新的文献求助10
24秒前
棠臻完成签到 ,获得积分10
27秒前
于归发布了新的文献求助10
27秒前
ng发布了新的文献求助10
28秒前
28秒前
曲初雪发布了新的文献求助10
30秒前
外向的涛完成签到,获得积分10
30秒前
Michelle发布了新的文献求助10
31秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Kinesiophobia : a new view of chronic pain behavior 2000
Psychology and Work Today 1000
Research for Social Workers 1000
Mastering New Drug Applications: A Step-by-Step Guide (Mastering the FDA Approval Process Book 1) 800
Signals, Systems, and Signal Processing 510
Discrete-Time Signals and Systems 510
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5900006
求助须知:如何正确求助?哪些是违规求助? 6734556
关于积分的说明 15745321
捐赠科研通 5022921
什么是DOI,文献DOI怎么找? 2704808
邀请新用户注册赠送积分活动 1652191
关于科研通互助平台的介绍 1599774