SQL Case Study: Analyzing Customer Churn for an E-Commerce Business
Introduction
If you're a beginner in SQL and data analysis, chances are you've run into datasets but struggled with what to analyze and how to approach the problem. So, let's dive into a real-world case study and break it down step by step.
We're going to analyze customer churn for an e-commerce business. The goal? Figure out why customers are leaving and provide insights to improve retention.

Customer Churn: SQL Case Study
Understanding the Problem Statement
Business Scenario
An e-commerce company noticed a decline in returning customers and suspects that many are churning (i.e., leaving the platform after their initial purchases). As a data analyst, your job is to identify the key factors behind customer churn using SQL and help the business make data-driven decisions.
Dataset Overview
We have the following tables in our PostgreSQL database:
customers
(Customer details)
customer_id | name | signup_date | country |
---|---|---|---|
101 | Alice | 2023-01-15 | USA |
102 | Bob | 2023-02-20 | Canada |
103 | Charlie | 2023-03-10 | India |
orders
(Customer order details)
order_id | customer_id | order_date | total_amount | status |
---|---|---|---|---|
201 | 101 | 2023-02-01 | 150.00 | Completed |
202 | 102 | 2023-03-05 | 200.00 | Completed |
203 | 103 | 2023-03-20 | 80.00 | Cancelled |
Step 1: Defining Churn
Before writing SQL queries, we must first define churn in a way that makes sense for the business. Here’s a common definition:
A customer is considered churned if they have not placed an order in the last 6 months.
We will write an SQL query to identify customers who haven’t made a purchase in the last 6 months from today’s date.
SQL Query to Identify Churned Customers
SELECT c.customer_id, c.name, c.signup_date,
MAX(o.order_date) AS last_order_date
FROM customers c
LEFT JOIN orders o ON c.customer_id = o.customer_id
GROUP BY c.customer_id, c.name, c.signup_date
HAVING MAX(o.order_date) < NOW() - INTERVAL '6 months' OR MAX(o.order_date) IS NULL;
Explanation
-
We join
customers
withorders
oncustomer_id
. -
Use
MAX(order_date)
to find the last order date of each customer. -
Use HAVING to filter customers whose last purchase was more than 6 months ago (or who never made a purchase).
-
LEFT JOIN
ensures we don’t lose customers who signed up but never ordered.
Step 2: Identifying Patterns in Churned Customers
Now that we have the list of churned customers, let’s dig deeper to find patterns:
1. Do churned customers have smaller average order values?
SELECT
CASE
WHEN o.customer_id IS NULL THEN 'No Orders'
ELSE 'Churned Customers'
END AS customer_type,
ROUND(AVG(o.total_amount),2) AS avg_order_value
FROM customers c
LEFT JOIN orders o ON c.customer_id = o.customer_id
GROUP BY customer_type;
Insights:
-
If churned customers have lower average order values, we might need better discounts or incentives.
2. Are there specific months where churn is higher?
SELECT
DATE_TRUNC('month', signup_date) AS signup_month,
COUNT(DISTINCT customer_id) AS churned_customers
FROM customers c
LEFT JOIN orders o ON c.customer_id = o.customer_id
WHERE (MAX(o.order_date) < NOW() - INTERVAL '6 months' OR MAX(o.order_date) IS NULL)
GROUP BY signup_month
ORDER BY signup_month;
Insights:
-
If more customers churn in specific months, there might be seasonal trends.
3. Does churn vary by country?
SELECT c.country, COUNT(DISTINCT c.customer_id) AS churned_customers
FROM customers c
LEFT JOIN orders o ON c.customer_id = o.customer_id
WHERE (MAX(o.order_date) < NOW() - INTERVAL '6 months' OR MAX(o.order_date) IS NULL)
GROUP BY c.country
ORDER BY churned_customers DESC;
Insights:
-
If churn is higher in a particular country, we may need better localized marketing.
Step 3: Recommendations for the Business
After analyzing the data, here are some actionable recommendations:
-
Introduce Loyalty Programs: If we find that churned customers generally spend less, a loyalty program might incentivize repeat purchases.
-
Retargeting Campaigns: If a specific country has high churn, we should launch a targeted campaign.
-
Seasonal Promotions: If churn is seasonal, we can prepare discounts or email reminders before high-churn months.
-
Personalized Offers: If customers with a specific order history are churning, personalized discount emails might help retain them.
Conclusion
SQL is an incredibly powerful tool for understanding customer behavior. By analyzing churn, we identified insights that could reduce customer loss and increase retention. The key takeaway? Data tells a story—you just need the right SQL queries to uncover it!
If you found this useful, let me know! What other SQL case studies would you like me to break down?
Comments
Post a Comment