Predictive Modeling for Detecting Fraudulent Insurance Claims

Abstract

The objective of this thesis is to create a framework for detecting fraudulent vehicle insurance claims using binary regression models (logistic, probit, and complementary log-log), machine learning binary classifiers (decision tree, random forest, and naïve Bayes), and optimization-based machine learning techniques (k-nearest neighbor, gradient boosting, support vector machine, and artificial neural network). The study utilizes a dataset consisting of 16,100 observations, which includes prediction variables such as gender, marital status, age, whether a police report was filed, whether there were witnesses to the accident, and the number of cars involved in the accident. Approximately 30% of the response variable’s values (indicating whether fraud was detected) are affirmative cases. The dataset is divided into 80% for training and 20% for testing. The models are trained on the training data, and fraud probability is predicted for each row in the testing data. Model performances are evaluated using various criteria. The programming language R (version 4.3.2) is used throughout the thesis due to its capabilities to implement the aforementioned techniques.

Details

Subject

Statistics;
Computer science;
Mathematics

Classification

0463: Statistics
0984: Computer science
0405: Mathematics

Identifier / keyword

Insurance claims; Machine learning; Artificial neural network; Fraud

Title

Predictive Modeling for Detecting Fraudulent Insurance Claims

Author

Mora, Katya

Number of pages

136

Publication year

2024

Degree date

2024

School code

6080

Source

MAI 86/4(E), Masters Abstracts International

ISBN

9798384481232

Advisor

Korosteleva, Olga

Committee member

Suaray, Kagba; Le, Rebecca

University/institution

California State University, Long Beach

Department

Mathematics and Statistics

University location

United States -- California

Degree

M.S.

Source type

Dissertation or Thesis

Language

English

Document type

Dissertation/Thesis

Dissertation/thesis number

31336275

ProQuest document ID

3116494192

Document URL

https://www.proquest.com/dissertations-theses/predictive-modeling-detecting-fraudulent/docview/3116494192/se-2?accountid=208611

Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.

Database

ProQuest One Academic

Predictive Modeling for Detecting Fraudulent Insurance Claims

Content area

Abstract

Details