Content area

Abstract

This paper explores articles hosted on the arXiv preprint server with the aim of uncovering valuable insights hidden in this vast collection of research. Employing text mining techniques and through the application of natural language processing methods, I xamine the contents of quantitative finance papers posted in arXiv from 1997 to 2022. I extract and analyze, without relying on ad hoc software or proprietary databases, crucial information from the entire documents, including the references, to understand the topic trends over time and to find out the most cited researchers and journals in this domain. Additionally, I compare numerous algorithms for performing topic modeling, including state-of-the-art approaches.

Details

1009240
Title
Text Mining arXiv: A Look Through Quantitative Finance Papers
Publication title
Volume
13
Issue
9
First page
1375
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
22277390
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-04-23
Milestone dates
2025-02-24 (Received); 2025-04-18 (Accepted)
Publication history
 
 
   First posting date
23 Apr 2025
ProQuest document ID
3203209797
Document URL
https://www.proquest.com/scholarly-journals/text-mining-arxiv-look-through-quantitative/docview/3203209797/se-2?accountid=208611
Copyright
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-05-13
Database
2 databases
  • Coronavirus Research Database
  • ProQuest One Academic