| # Customer Churn Prediction β Project Documentation |
|
|
| π **Full Document:** [Customer_Churn_Prediction_Model_Document.md](./Customer_Churn_Prediction_Model_Document.md) |
|
|
| ## Overview |
| Comprehensive project document for building a Customer Churn Prediction system using Adaptive Ensemble Machine Learning with Explainable AI. |
|
|
| ## Document Sections |
| 1. **Title** β Project framing and subtitle |
| 2. **Problem Statement** β Business context, technical challenges, and gaps |
| 3. **Idea of Solution** β Stacking ensemble architecture with 5 base models |
| 4. **Objectives** β Primary/secondary goals and success criteria |
| 5. **Literature Review & References** β 21 cited papers spanning 2016β2024 |
| 6. **Dataset Understanding** β Audit of Telco (52 features) and Bank churn datasets |
| 7. **Proposed Methodology** β 7-phase pipeline from preprocessing to CLV scoring |
| 8. **Implementation Strategy** β Tech stack, 4-week timeline, code architecture |
| 9. **Experimental Design** β 5 experiments, 10 metrics, statistical rigor |
| 10. **Result Analysis** β Expected performance, SHAP analysis, business impact |
| 11. **Iterative Improvement** β 6 iterations from feature engineering to production |
|
|
| ## Key Datasets |
| - [Telco Customer Churn](https://hf.co/datasets/aai510-group1/telco-customer-churn) β 7,043 customers, 52 features |
| - [Bank Customer Churn](https://hf.co/datasets/tayaee/bank-customer-churn-prediction) β 12 features |
|
|
| ## Key Papers |
| - Stacking Ensemble (99.28% acc): [arXiv:2408.16284](https://hf.co/papers/2408.16284) |
| - XGBoost Temporal (1st/575 teams): [arXiv:1802.03396](https://hf.co/papers/1802.03396) |
| - Transformer Time-Series (AUC=0.858): [arXiv:2309.14390](https://hf.co/papers/2309.14390) |
|
|