Rodyti trumpą aprašą

dc.contributor.authorKolesau, Aliaksei
dc.contributor.authorŠešok, Dmitrij
dc.date.accessioned2023-09-18T20:29:21Z
dc.date.available2023-09-18T20:29:21Z
dc.date.issued2020
dc.identifier.urihttps://etalpykla.vilniustech.lt/handle/123456789/150331
dc.description.abstractIn this paper we examine the results of using different acoustic feature computation pipelines for classifying audio keywords with a convolutional neural network (CNN). We compare the use of Mel-frequency cepstral coefficients (MFCCs) and a simple filterbank averaging technique. Also we examined the influence of MFCCs computation parameters on the resulting quality. The results show that CNNs benifit from using prior knowledge in acoustic feature computation. In our experiments we got 30% drop in accuracy while switching from MFCC to filterbank averaging. Furthemore, the default values of MFCCs parameters that are used in many libraries might not be the best for voice activation problem: frame length of 55 ms showed better results than default length of 20 ms.eng
dc.formatPDF
dc.format.extentp. 1-4
dc.format.mediumtekstas / txt
dc.language.isoeng
dc.relation.isreferencedbyScopus
dc.relation.isreferencedbyIEEE Xplore
dc.source.urihttps://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9108867
dc.titleInvestigation of acoustic features for voice activation problem
dc.typeStraipsnis konferencijos darbų leidinyje Scopus DB / Paper in conference publication in Scopus DB
dcterms.references21
dc.type.pubtypeP1b - Straipsnis konferencijos darbų leidinyje Scopus DB / Article in conference proceedings Scopus DB
dc.contributor.institutionVilniaus Gedimino technikos universitetas
dc.contributor.facultyFundamentinių mokslų fakultetas / Faculty of Fundamental Sciences
dc.subject.researchfieldT 007 - Informatikos inžinerija / Informatics engineering
dc.subject.vgtuprioritizedfieldsIK0303 - Dirbtinio intelekto ir sprendimų priėmimo sistemos / Artificial intelligence and decision support systems
dc.subject.ltspecializationsL106 - Transportas, logistika ir informacinės ir ryšių technologijos (IRT) / Transport, logistic and information and communication technologies
dc.subject.envoice activation
dc.subject.enconvolutional neural network
dc.subject.enMFCC
dcterms.sourcetitle2020 IEEE Open Conference of Electrical, Electronic and Information Sciences (eStream), 30 April 2020, Vilnius, Lithuania: proceedings of the conference / organized by Vilnius Gediminas Technical University
dc.publisher.nameIEEE
dc.publisher.cityNew York
dc.identifier.doi10.1109/eStream50540.2020.9108867
dc.identifier.elaba62634235


Šio įrašo failai

FailaiDydisFormatasPeržiūra

Su šiuo įrašu susijusių failų nėra.

Šis įrašas yra šioje (-se) kolekcijoje (-ose)

Rodyti trumpą aprašą