Rodyti trumpą aprašą

dc.contributor.authorTamulionis, Mantas
dc.contributor.authorSledevič, Tomyslav
dc.contributor.authorSerackis, Artūras
dc.date.accessioned2023-09-18T16:40:53Z
dc.date.available2023-09-18T16:40:53Z
dc.date.issued2023
dc.identifier.other(WOS_ID)000987245700001
dc.identifier.urihttps://etalpykla.vilniustech.lt/handle/123456789/115926
dc.description.abstractThis paper discusses an algorithm that attempts to automatically calculate the effect of room reverberation by training a mathematical model based on a recurrent neural network on anechoic and reverberant sound samples. Modelling the room impulse response (RIR) recorded at a 44.1 kHz sampling rate using a system identification-based approach in the time domain, even with deep learning models, is prohibitively complex and it is almost impossible to automatically learn the parameters of the model for a reverberation time longer than 1 s. Therefore, this paper presents a method to model a reverberated audio signal in the frequency domain. To reduce complexity, the spectrum is analyzed on a logarithmic scale, based on the subjective characteristics of human hearing, by calculating 10 octaves in the range 20–20,000 Hz and dividing each octave by 1/3 or 1/12 of the bandwidth. This maintains equal resolution at high, mid, and low frequencies. The study examines three different recurrent network structures: LSTM, BiLSTM, and GRU, comparing the different sizes of the two hidden layers. The experimental study was carried out to compare the modelling when each octave of the spectrum is divided into a different number of bands, as well as to assess the feasibility of using a single model to predict the spectrum of a reverberated audio in adjacent frequency bands. The paper also presents and describes in detail a new RIR dataset that, although synthetic, is calibrated with recorded impulses.eng
dc.formatPDF
dc.format.extentp. 1-12
dc.format.mediumtekstas / txt
dc.language.isoeng
dc.relation.isreferencedbyScience Citation Index Expanded (Web of Science)
dc.relation.isreferencedbyScopus
dc.source.urihttps://www.mdpi.com/2076-3417/13/9/5604
dc.titleInvestigation of machine learning model flexibility for automatic application of reverberation effect on audio signal
dc.typeStraipsnis Web of Science DB / Article in Web of Science DB
dcterms.accessRightsThis article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/).
dcterms.licenseCreative Commons – Attribution – 4.0 International
dcterms.references28
dc.type.pubtypeS1 - Straipsnis Web of Science DB / Web of Science DB article
dc.contributor.institutionVilniaus Gedimino technikos universitetas
dc.contributor.facultyElektronikos fakultetas / Faculty of Electronics
dc.subject.researchfieldT 001 - Elektros ir elektronikos inžinerija / Electrical and electronic engineering
dc.subject.researchfieldT 007 - Informatikos inžinerija / Informatics engineering
dc.subject.vgtuprioritizedfieldsIK05 - Virtuali ir pridėtinė realybė / Virtual and augmented reality
dc.subject.ltspecializationsL106 - Transportas, logistika ir informacinės ir ryšių technologijos (IRT) / Transport, logistic and information and communication technologies
dc.subject.enroom reverberation
dc.subject.enroom impulse response
dc.subject.enrecurrent neural networks
dc.subject.enaudio signal spectrum
dc.subject.enfilter bank
dcterms.sourcetitleApplied sciences: New Advances in Audio Signal Processing
dc.description.issueiss. 9
dc.description.volumevol. 13
dc.publisher.nameMDPI
dc.publisher.cityBasel
dc.identifier.doi000987245700001
dc.identifier.doi2-s2.0-85159261976
dc.identifier.doi85159261976
dc.identifier.doi1
dc.identifier.doi147463146
dc.identifier.doi10.3390/app13095604
dc.identifier.elaba167553630


Šio įrašo failai

FailaiDydisFormatasPeržiūra

Su šiuo įrašu susijusių failų nėra.

Šis įrašas yra šioje (-se) kolekcijoje (-ose)

Rodyti trumpą aprašą