Price Rank Prediction of a Company by Utilizing Data Mining Methods on Financial Disclosures

dc.contributor.author Kaçar, Mustafa Sami
dc.contributor.author Yumuşak, Semih
dc.contributor.author Kodaz, Halife
dc.date.accessioned 2023-11-11T09:03:38Z
dc.date.available 2023-11-11T09:03:38Z
dc.date.issued 2023
dc.description.abstract The use of reports in action has grown significantly in recent decades as data has become digitized. However, traditional statistical methods no longer work due to the uncontrollable expansion and complexity of raw data. Therefore, it is crucial to clean and analyze financial data using modern machine learning methods. In this study, the quarterly reports (i.e. 10Q filings) of publicly traded companies in the United States were analyzed by utilizing data mining methods. The study used 8905 quarterly reports of companies from 2019 to 2022. The proposed approach consists of two phases with a combination of three different machine learning methods. The first two methods were used to generate a dataset from the 10Q filings with extracting new features, and the last method was used for the classification problem. Doc2Vec method in Gensim framework was used to generate vectors from textual tags in 10Q filings. The generated vectors were clustered using the K-means algorithm to combine the tags according to their semantics. By this way, 94000 tags representing different financial items were reduced to 20000 clusters consisting of these tags, making the analysis more efficient and manageable. The dataset was created with the values corresponding to the tags in the clusters. In addition, PriceRank metric was added to the dataset as a class label indicating the price strength of the companies for the next financial quarter. Thus, it is aimed to determine the effect of a company's quarterly reports on the market price of the company for the next period. Finally, a Convolutional Neural Network model was utilized for the classification problem. To evaluate the results, all stages of the proposed hybrid method were compared with other machine learning techniques. This novel approach could assist investors in examining companies collectively and inferring new, significant insights. The proposed method was compared with different approaches for creating datasets by extracting new features and classification tasks, then eventually tested with different metrics. The proposed approach performed comparatively better than the other machine learning methods to predict future price strength based on past reports with an accuracy of 84% on the created 10Q filings dataset. en_US
dc.identifier.doi 10.1587/transinf.2022OFP0002
dc.identifier.issn 0916-8532
dc.identifier.issn 1745-1361
dc.identifier.scopus 2-s2.0-85173218950
dc.identifier.uri https://doi.org/10.1587/transinf.2022OFP0002
dc.identifier.uri https://hdl.handle.net/20.500.13091/4749
dc.language.iso en en_US
dc.publisher IEICE-INST ELECTRONICS INFORMATION COMMUNICATION ENGINEERS en_US
dc.relation.ispartof Ieice Transactions On Information and Systems en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject 10-Q Filings en_US
dc.subject PriceRank XBRL en_US
dc.subject Doc2Vec en_US
dc.subject K means en_US
dc.subject CNN en_US
dc.subject Convolutional Neural-Network en_US
dc.subject K-Means en_US
dc.subject Language en_US
dc.subject Filings en_US
dc.title Price Rank Prediction of a Company by Utilizing Data Mining Methods on Financial Disclosures en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional
gdc.author.scopusid 58632181400
gdc.author.scopusid 56814988500
gdc.author.scopusid 8945093700
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.description.department KTÜN en_US
gdc.description.departmenttemp [Kacar, Mustafa Sami; Kodaz, Halife] Konya Tech Univ, Comp Engn Dept, Konya, Turkiye; [Yumusak, Semih] KTO Karatay Univ, Comp Engn Dept, Konya, Turkiye en_US
gdc.description.endpage 1471 en_US
gdc.description.issue 9 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q3
gdc.description.startpage 1461 en_US
gdc.description.volume E106D en_US
gdc.description.wosquality Q4
gdc.identifier.openalex W4386315359
gdc.identifier.wos WOS:001065284900015
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.7083193E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 3.7238346E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration National
gdc.openalex.fwci 0.64697809
gdc.openalex.normalizedpercentile 0.71
gdc.opencitations.count 2
gdc.plumx.crossrefcites 2
gdc.plumx.mendeley 5
gdc.plumx.scopuscites 2
gdc.scopus.citedcount 2
gdc.virtual.author Kodaz, Halife
gdc.wos.citedcount 0
relation.isAuthorOfPublication 8a1e6584-7869-499d-81a8-a73a0c428829
relation.isAuthorOfPublication.latestForDiscovery 8a1e6584-7869-499d-81a8-a73a0c428829

Files