Captioning Deep Learning Based Encoder-Decoder through Long Short-Term Memory (LSTM)

Grimsby Chelsea

International Journal of Scientific Innovation (forthcoming)

Abstract

This work demonstrates the implementation and use of an encoder-decoder model that performs a many-to-many mapping from video data to text captions. The mapping takes a temporal sequence of video frames as input and produces a sequence of words that forms a caption sentence as output. Data preprocessing, model construction, and model training are discussed. Caption correctness is evaluated using 2-gram BLEU scores across the different splits of the dataset. Specific examples of output captions are shown to demonstrate model generality over the video temporal dimension. Predicted captions are shown to generalize over video action, even in instances where the video scene changes dramatically. Model architecture changes that could improve sentence grammar and correctness are also discussed.
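As an illustration of the evaluation metric named in the abstract, the following is a minimal sketch of a sentence-level 2-gram BLEU score. It assumes the cumulative form (geometric mean of clipped 1-gram and 2-gram precision, multiplied by a brevity penalty), a single reference caption, and no smoothing; the abstract does not say which BLEU variant the paper uses. The function names and the example captions are hypothetical and are not taken from the paper or its dataset.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of the candidate against one reference."""
    cand = ngram_counts(candidate, n)
    ref = ngram_counts(reference, n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
    return clipped / total


def bleu2(candidate, reference):
    """Cumulative BLEU-2 with uniform weights and a brevity penalty (assumed variant)."""
    p1 = modified_precision(candidate, reference, 1)
    p2 = modified_precision(candidate, reference, 2)
    if p1 == 0.0 or p2 == 0.0:
        return 0.0  # no smoothing in this sketch
    if len(candidate) >= len(reference):
        bp = 1.0
    else:
        bp = math.exp(1 - len(reference) / len(candidate))
    return bp * math.sqrt(p1 * p2)


# Hypothetical captions, for illustration only.
predicted = "a man is playing a guitar".split()
truth = "a man is playing guitar".split()
score = bleu2(predicted, truth)
```

In this sketch a predicted caption that repeats or inserts words is penalized through the clipped precisions, while a caption shorter than the reference is penalized through the brevity penalty. A corpus-level score over a dataset split would pool n-gram counts across all captions rather than average per-sentence scores.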

Author's Profile

Chelsea Grimsby

ESPAM FORMATION UNIVERSITY

Keywords

BLEU captioning decoder encoder many-to-many mapping sequence

Analytics

Added to PP
2024-03-13

Downloads
118 (#42,592)

6 months
118 (#149,461)
