Skip to content
Go back

Churn Prediction Using Random Forest: A Business Value Perspective

Churn Prediction Using Random Forest: A Business Value Perspective

In a data analytics project during my postgraduate program (2024–2025), our team explored how to maximize business value through machine learning. The challenge: identify customers likely to churn, and do it in a way that delivers actionable insights—not just accuracy scores.

We chose to focus on Random Forest, a robust and interpretable model, and applied permutation importance to understand which features most impacted predictions. But rather than optimizing for F1 or AUC alone, we introduced a novel angle: Business Value.


Methodology

We worked with a telecom churn dataset. Our pipeline included:

Instead of traditional model metrics, we assigned financial value to different outcomes in the confusion matrix:

PredictionActual Positive (Churn)Actual Negative
True Positive+$150-$10
False Positive-$20-$10

Results

Each hypothesis (H6, H6_N1…N4) was evaluated using cross-validation and simulated business value. Our key findings:

Here’s a simplified comparison:

ModelBusiness Value
H0$2,340
H6$3,050
H6_N2$3,480

Lessons Learned


Team & Acknowledgments

This was a group project developed during the PUCP Data Analytics program, with:


Source Code

Repository available at:
🔗 https://github.com/milkreator/churn-prediction


Share this post on:

Previous Post
SRT-H: When Surgical Robots Finally Start Speaking Our Language
Next Post
Phishing Website Detection Using Machine Learning