Synthetic Data Generation for Fraud Detection
Skip to main content
eScholarship
Open Access Publications from the University of California

UCLA

UCLA Electronic Theses and Dissertations bannerUCLA

Synthetic Data Generation for Fraud Detection

Abstract

This paper applies various synthetic data generation techniques to create synthetic fraud datafor buy now, pay later (BNPL) financial institutions that mimic the statistical properties of real data. We utilize both statistical and deep learning methods to accomplish this task, contrasting each different framework’s respective qualities. We evaluate the efficacy of our approaches by using our generated data to enhance the training sets of a fraud detection model and analyze the effects on validation results. Our results show that including synthetic data in existing datasets can improve the accuracy of fraud detection systems.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View