Rodyti trumpą aprašą

dc.rights.licenseVisos teisės saugomos / All rights reserveden_US
dc.contributor.authorAtliha, Viktar
dc.contributor.authorŠešok, Dmitrij
dc.date.accessioned2025-12-15T11:12:51Z
dc.date.available2025-12-15T11:12:51Z
dc.date.issued2020
dc.identifier.isbn9781728197807en_US
dc.identifier.urihttps://etalpykla.vilniustech.lt/handle/123456789/159547
dc.description.abstractRecent models for image captioning are usually based on an encoder-decoder framework. Large pre-trained convolutional neural networks are often used as encoders. However, different authors use different encoder architectures for their image captioning models. This makes it more difficult to determine the effect that the encoder has on the overall model performance. In this paper we compare two popular convolution networks architectures – VGG and ResNet – as encoders for the same image captioning model in order to find out which method is the best at image representation used for caption generation. The results show that the ResNet outperforms VGG allowing image captioning model achieve higher BLEU-4 score. Furthermore, the results show that the ResNet allows model to achieve a score comparable with the VGG-based model with a less amount of training epochs. Based on this data we can state that encoder plays a big role and can significantly improve model without changing a decoder architecture.en_US
dc.format.extent4 p.en_US
dc.format.mediumTekstas / Texten_US
dc.language.isoenen_US
dc.relation.urihttps://etalpykla.vilniustech.lt/handle/123456789/159395en_US
dc.source.urihttps://ieeexplore.ieee.org/document/9108880en_US
dc.subjectimage captioningen_US
dc.subjectencoder-decoder frameworken_US
dc.subjectconvolutional neural networksen_US
dc.subjectVGGen_US
dc.subjectResNeten_US
dc.titleComparison of VGG and ResNet used as Encoders for Image Captioningen_US
dc.typeKonferencijos publikacija / Conference paperen_US
dcterms.accrualMethodRankinis pateikimas / Manual submissionen_US
dcterms.issued2020-06-05
dcterms.references25en_US
dc.description.versionTaip / Yesen_US
dc.contributor.institutionVilniaus Gedimino technikos universitetasen_US
dc.contributor.institutionVilnius Gediminas Technical Universityen_US
dc.contributor.departmentInformacinių technologijų katedra / Department of Information Technologiesen_US
dcterms.sourcetitle2020 IEEE Open Conference of Electrical, Electronic and Information Sciences (eStream), April 30, 2020, Vilnius, Lithuaniaen_US
dc.identifier.eisbn9781728197791en_US
dc.publisher.nameIEEEen_US
dc.publisher.countryUnited States of Americaen_US
dc.publisher.cityNew Yorken_US
dc.identifier.doihttps://doi.org/10.1109/eStream50540.2020.9108880en_US


Šio įrašo failai

FailaiDydisFormatasPeržiūra

Su šiuo įrašu susijusių failų nėra.

Šis įrašas yra šioje (-se) kolekcijoje (-ose)

Rodyti trumpą aprašą