Academic Text Clustering Using Natural Language Processing
No Thumbnail Available
Date
2022
Authors
Kaya, Ersin
Journal Title
Journal ISSN
Volume Title
Publisher
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Accessing data is very easy nowadays. However, to use these data in an efficient way, it is necessary to get the right information from them. Categorizing these data in order to reach the needed information in a short time provides great convenience. All the more, while doing research in the academic field, text-based data such as articles, papers, or thesis studies are generally used. Natural language processing and machine learning methods are used to get the right information we need from these text-based data. In this study, abstracts of academic papers are clustered. Text data from academic paper abstracts are preprocessed using natural language processing techniques. A vectorized word representation extracted from preprocessed data with Word2Vec and BERT word embeddings and representations are clustered with four clustering algorithms.
Description
Keywords
Natural Language Processing, Machine Learning, Text Representation
Turkish CoHE Thesis Center URL
Fields of Science
0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology
Citation
WoS Q
Q4
Scopus Q
N/A

OpenCitations Citation Count
N/A
Source
Konya mühendislik bilimleri dergisi (Online)
Volume
10
Issue
0
Start Page
41
End Page
51
PlumX Metrics
Captures
Mendeley Readers : 10
Web of Science™ Citations
1
checked on Feb 03, 2026
Google Scholar™


