A Roadmap on Developing a Taxonomy for Text Data Mining

Date
2024
Author
Pokusajev, Sergej
Stefanovič, Pavel
Abstract
Over the past decade, the volume of unstructured text data has increased significantly. Text data are used in many research tasks, such as sentiment analysis, semantic analysis, context extraction, and named-entity recognition. Widely used Large Language Models (LLMs) are also built on text data. Depending on the task, different approaches can be used to analyze text data, such as classification, clustering, or recent transformer models. In this paper, a systematic literature review of text data mining is performed. Scientific articles were collected and analyzed from two databases: Web of Science and Google Scholar. The main aim of the research was to summarize the results, tasks, and methods reported in studies of text data analysis. The types of datasets and the languages of the texts used in those studies were also analyzed. Furthermore, the results of the systematic literature review allowed us to build a taxonomy of text data mining that can be helpful to other researchers.
URI
https://etalpykla.vilniustech.lt/handle/123456789/159658
Collections
  • 2024 International Conference "Electrical, Electronic and Information Sciences" (eStream)

 

 
