Credit Score Classification

Financial institutions assess customers' credit risk and repayment ability based on their credit scores. In this project, I used Python for credit score classification, selecting Random Forest as the model. The classification results are divided into three categories: High, Average, and Low. First, I performed exploratory data analysis (EDA) to understand the distribution of each feature, followed by training the Random Forest model using K-fold cross-validation. Afterward, I evaluated the model's classification performance using metrics such as confusion matrix, accuracy, precision, recall, and F1-score. However, due to the small dataset, the model may face potential issues of insufficient generalization or overfitting.

Click to view the project notes on Kaggle

金融機構會基於客戶的信用評分來衡量其信用風險和還款能力。在這個專案中，我使用Python進行信用評分分類，並選用了隨機森林作為模型。分類結果分為三個等級：高、中、低。首先，我透過探索性資料分析來了解各個特徵的分佈情況，然後用K折交叉驗證訓練隨機森林模型。之後，我使用混淆矩陣、準確率、精確率、召回率和F1-Score等指標來評估模型的分類表現。不過，由於數據量較少，模型可能面臨泛化能力不足或過度擬合的潛在問題。

點擊查看Kaggle上的專案筆記

Credit Score Classification

Life Expectancy Prediction

Contact