Academic Text Clustering Using Natural Language Processing

No Thumbnail Available

Date

2022

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

Accessing data is very easy nowadays. However, to use these data in an efficient way, it is necessary to get the right information from them. Categorizing these data in order to reach the needed information in a short time provides great convenience. All the more, while doing research in the academic field, text-based data such as articles, papers, or thesis studies are generally used. Natural language processing and machine learning methods are used to get the right information we need from these text-based data. In this study, abstracts of academic papers are clustered. Text data from academic paper abstracts are preprocessed using natural language processing techniques. A vectorized word representation extracted from preprocessed data with Word2Vec and BERT word embeddings and representations are clustered with four clustering algorithms.

Description

Keywords

Natural Language Processing, Machine Learning, Text Representation

Turkish CoHE Thesis Center URL

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q4

Scopus Q

N/A
OpenCitations Logo
OpenCitations Citation Count
N/A

Source

Konya mühendislik bilimleri dergisi (Online)

Volume

10

Issue

0

Start Page

41

End Page

51
PlumX Metrics
Captures

Mendeley Readers : 10

Web of Science™ Citations

1

checked on Feb 03, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.19579882

Sustainable Development Goals

SDG data is not available