Show simple item record

dc.rights.licenseVisos teisės saugomos / All rights reserveden_US
dc.contributor.authorKrishna, N. Sri
dc.contributor.authorPavan Kumar, Y. V.
dc.contributor.authorPrakash, K. Purna
dc.contributor.authorPradeep Reddy, G.
dc.date.accessioned2026-01-05T10:34:53Z
dc.date.available2026-01-05T10:34:53Z
dc.date.issued2024
dc.identifier.isbn9798350352429en_US
dc.identifier.issn2831-5634en_US
dc.identifier.urihttps://etalpykla.vilniustech.lt/handle/123456789/159655
dc.description.abstractDue to the continuous increase of smart home culture worldwide, large volumes of energy consumption data gained the attention of data scientists. Smart meters capture the energy consumption readings at a predefined rate and store them as a database. The quality of these databases is highly desired to have accurate analysis and decision-making. But, these readings often have anomalies namely missingness, redundancy, and outliers due to the issues present in meter/data communication networks. Among these, outlier readings indicate an abnormality of the load behavior (e.g.: nonlinearity, unpredicted load switching, system faults, etc.). Hence, it is essential to detect and visualize such anomalies for the necessary treatment. With this motivation, this paper implements various key machine learning and statistical techniques namely autoregressive integrated moving average (ARIMA), autoencoder, density-based spatial clustering of applications with noise (DBSCAN), isolation forest, k-means, hierarchical density-based spatial clustering of applications with noise (HDBSCAN), one-class support vector machine (SVM), local outlier factor (LOF), long short-term memory (LSTM), winsorization, interquartile range (IQR), and Z-score. The results revealed that DBSCAN consistently demonstrated the most accurate performance in detecting outliers in energy data, while, Z-score, IQR, and winsorization provided reasonable outcomes but were limited in handling complex and non-linear data patterns. Autoencoder, Isolation forest, and One-class SVM showed moderate success, but their performance depended on the specific dataset characteristics. Kmeans exhibited mixed results. ARIMA, LOF, LSTM, and HDBSCAN had limited success in outlier detection in the timeseries data. Thus, this analysis finally recommends DBSCAN as the best technique as it consistently outperformed other machine learning and statistical techniques in accurately detecting outliers in smart home energy consumption data.en_US
dc.format.extent6 p.en_US
dc.format.mediumTekstas / Texten_US
dc.language.isoenen_US
dc.relation.urihttps://etalpykla.vilniustech.lt/handle/123456789/159404en_US
dc.source.urihttps://ieeexplore.ieee.org/document/10542609en_US
dc.subjectData anomalyen_US
dc.subjectEnergy consumption readingsen_US
dc.subjectMachine learningen_US
dc.subjectOutliersen_US
dc.subjectSmart homeen_US
dc.titleMachine Learning and Statistical Techniques for Outlier Detection in Smart Home Energy Consumptionen_US
dc.typeKonferencijos publikacija / Conference paperen_US
dcterms.accrualMethodRankinis pateikimas / Manual submissionen_US
dcterms.issued2024-06-05
dcterms.references18en_US
dc.description.versionTaip / Yesen_US
dc.contributor.institutionVIT-AP Universityen_US
dc.contributor.institutionKoneru Lakshmaiah Education Foundationen_US
dc.contributor.institutionKookmin Universityen_US
dcterms.sourcetitle2024 IEEE Open Conference of Electrical, Electronic and Information Sciences (eStream), April 25, 2024, Vilnius, Lithuaniaen_US
dc.identifier.eisbn9798350352412en_US
dc.identifier.eissn2690-8506en_US
dc.publisher.nameIEEEen_US
dc.publisher.countryUnited States of Americaen_US
dc.publisher.cityNew Yorken_US
dc.identifier.doihttps://doi.org/10.1109/eStream61684.2024.10542609en_US


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record