Credit Score Classification
Financial institutions assess customers' credit risk and repayment ability based on their credit scores. In this project, I used Python for credit score classification, selecting Random Forest as the model. The classification results are divided into three categories: High, Average, and Low. First, I performed exploratory data analysis (EDA) to understand the distribution of each feature, followed by training the Random Forest model using K-fold cross-validation. Afterward, I evaluated the model's classification performance using metrics such as confusion matrix, accuracy, precision, recall, and F1-score. However, due to the small dataset, the model may face potential issues of insufficient generalization or overfitting.
金融機構會基於客戶的信用評分來衡量其信用風險和還款能力。在這個專案中,我使用Python進行信用評分分類,並選用了隨機森林作為模型。分類結果分為三個等級:高、中、低。首先,我透過探索性資料分析來了解各個特徵的分佈情況,然後用K折交叉驗證訓練隨機森林模型。之後,我使用混淆矩陣、準確率、精確率、召回率和F1-Score等指標來評估模型的分類表現。不過,由於數據量較少,模型可能面臨泛化能力不足或過度擬合的潛在問題。