ICCK Transactions on Machine Intelligence | Volume 1, Issue 2: 52-68, 2025 | DOI: 10.62762/TMI.2025.886122
Abstract
The capacity of AI systems to generate captions for images autonomously represents a significant advancement in artificial intelligence and language understanding. This paper presents an advanced image captioning system that employs deep learning techniques, particularly convolutional neural networks (CNNs) and recurrent neural networks (RNNs), to produce contextually appropriate and meaningful descriptions of visual content. The proposed method extracts features using the DenseNet201 model, enabling a more comprehensive and hierarchical understanding of image components. These extracted features are then fed into a long short-term memory (LSTM) network, a specialized RNN variant designed to... More >
Graphical Abstract