Publication Date
Spring 2017
Degree Type
Master's Project
Degree Name
Master of Science (MS)
Department
Computer Science
First Advisor
Leonard Wesley
Second Advisor
Robert Chun
Third Advisor
Raghavendra Keshavamurthy
Keywords
Logistic Regression, SVD, Credit Scores, Machine Learning
Abstract
This report presents an approach to predict the credit scores of customers using the Logistic Regression machine learning algorithm. The research objective of this project is to perform a comparative study between feature selection and feature extraction, against the same dataset using the Logistic Regression machine learning algorithm. For feature selection, we have used Stepwise Logistic Regression. For feature extraction, we have used Singular Value Decomposition (SVD) and Weighted Singular Value Decomposition (SVD). In order to test the accuracy obtained using feature selection and feature extraction, we used a public credit dataset having 11 features and 150,000 records. After performing feature reduction, Logistic Regression algorithm was used for classification. In our results, we observed that Stepwise Logistic Regression gave a 14% increase in accuracy as compared to Singular Value Decomposition (SVD) and a 10% increase in accuracy as compared to Weighted Singular Value Decomposition (SVD). Thus, we can conclude that Stepwise Logistic Regression performed significantly better than both Singular Value Decomposition (SVD) and Weighted Singular Value Decomposition (SVD). The benefit of using feature selection was that it helped us in identifying important features, which improved the prediction accuracy of the classifier.
Recommended Citation
Mathew, Ansen, "CREDIT SCORING USING LOGISTIC REGRESSION" (2017). Master's Projects. 532.
DOI: https://doi.org/10.31979/etd.3czc-rhe3
https://scholarworks.sjsu.edu/etd_projects/532