eScholarship
Open Access Publications from the University of California

UCLA

UCLA Electronic Theses and Dissertations

Fraud Detection in Vehicle Insurance Claims using Machine Learning

Abstract

Insurance fraud places a significant financial burden on the industry, and fraudulent vehicle insurance claims are a major contributor. This study applies machine learning techniques to the detection of fraudulent vehicle insurance claims. Six models (Logistic Regression, Random Forest, Gaussian Naive Bayes, Decision Tree, XGBoost, and Gradient Boosting classifiers) are evaluated on an imbalanced dataset. To address the class imbalance, oversampling techniques such as SMOTE, Borderline SMOTE, and ADASYN are employed. Performance is assessed using metrics such as F1 score, recall, and AUC. The results indicate that the XGBoost and Gradient Boosting models deliver the best overall performance, balancing precision and recall effectively. The Gaussian Naive Bayes model exhibits exceptionally high recall, making it suitable when the priority is minimizing missed fraud cases.
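
To make the trade-off between precision and recall concrete, the following minimal sketch computes precision, recall, and F1 score for two hypothetical classifiers on an imbalanced toy set of 100 claims, 10 of them fraudulent. The labels, predictions, and function names are assumptions made up for illustration. They are not the thesis data, results, or code, and the "high-recall" and "balanced" predictors only mimic in spirit the behaviours described for the models above.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn), treating label 1 as the fraud class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def precision_recall_f1(y_true, y_pred):
    """Compute precision, recall, and F1 for the fraud class (label 1)."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical toy data: 10 fraudulent claims followed by 90 legitimate ones.
y_true = [1] * 10 + [0] * 90

# Hypothetical "high-recall" predictor: flags every fraud case,
# but also flags 30 legitimate claims.
high_recall_pred = [1] * 10 + [1] * 30 + [0] * 60

# Hypothetical "balanced" predictor: catches 8 of 10 fraud cases
# and raises only 2 false alarms.
balanced_pred = [1] * 8 + [0] * 2 + [1] * 2 + [0] * 88

for name, y_pred in [("high-recall", high_recall_pred),
                     ("balanced", balanced_pred)]:
    p, r, f1 = precision_recall_f1(y_true, y_pred)
    print(f"{name:12s} precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
```

In this toy setting the high-recall predictor misses no fraud but has a precision of 0.25 and an F1 score of 0.40, while the balanced predictor misses two fraud cases and scores 0.80 on all three metrics. Which profile is preferable depends on the relative cost of a missed fraud case versus a false alarm.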
