Content area
Credit card fraud has a significant impact on the financial industry and is now a growing concern. Machine learning can minimize bias against legitimate transactions and enable accurate identification of fraud. This study explores machine learning techniques to address category imbalances in credit card fraud detection datasets to mitigate economic losses while improving model performance. The results show that logistic regression outperforms other classifiers, including support vector classifiers (SVC), K-nearest neighbor classifiers (KNN), and decision trees, achieving an optimal balance between fraud detection and minimizing false positives. By conducting data processing techniques such as feature scaling and dataset balancing, the model shows an effective identification of fraudulent transactions that rarely exist in a vast number of legitimate transactions. In addition, simple neural networks trained on oversampled data reveal higher recall rates but at the cost of higher false positives, highlighting the tradeoff between accuracy and fraud detection sensitivity. These findings underscore the importance of choosing models that can both effectively detect fraud and minimize disruption to legitimate transactions, which also provide valuable insights for financial institutions seeking to enhance their fraud detection systems.