Skip to main content

2 posts tagged with "personalization"

View All Tags

Measuring the Impact of Recommendation Systems

· 3 min read
Eric Ngo
ML Engineer @dodo, ex-ML engineers @Meta/Samsung-Research/7-11

Building a recommendation system is only half the battle. To understand its true value, you must be able to measure its impact on both user behavior and business health. Measuring a RecSys requires a combination of online and offline metrics.

1. Business & North Star Metrics

These are the top-level indicators of whether your RecSys is actually helping the business grow.

  • Conversion Rate (CVR): The percentage of users who take a desired action (e.g., purchase, sign-up) after interacting with a recommendation.
  • Click-Through Rate (CTR): Measures the immediate relevance of your recommendations. A high CTR suggests your "Discovery" layer is working.
  • Average Order Value (AOV): Personalized cross-selling often leads to users adding more items to their cart, increasing the total value per session.
  • Customer Lifetime Value (CLV): Long-term measurement of how personalized experiences improve retention and total spend over time.
  • Revenue Lift: The delta in revenue between a personalized experience and a baseline (e.g., popularity-based).

2. Engagement Metrics

These help you understand how users are interacting with the system on a daily basis.

  • Time Spent on Platform: Personalized feeds (like TikTok or Netflix) are designed to maximize this metric.
  • Retention Rate: Are users coming back? High-quality recommendations are one of the strongest drivers of Day-7 and Day-30 retention.
  • Session Depth: How many items a user views or interacts with in a single session.

3. Machine Learning (Offline) Metrics

Before deploying a model, data scientists use these to evaluate its mathematical performance on historical data.

  • Recall@K: Out of all the items a user actually interacted with, how many did the model correctly predict in the top $K$ results?
  • Precision@K: Out of the top $K$ items recommended, how many were actually relevant?
  • NDCG (Normalized Discounted Cumulative Gain): Measures the quality of the ranking. It rewards the model for placing the most relevant items at the very top of the list.
  • MRR (Mean Reciprocal Rank): Specifically focuses on the position of the first relevant item.

4. Ecosystem Health Metrics

A healthy RecSys doesn't just show the same 5 popular items. It must maintain a diverse and fresh ecosystem.

  • Coverage: The percentage of your total catalog that is actually being recommended to at least one user. High coverage ensures "long-tail" items are discovered.
  • Novelty: Measures how "new" or "surprising" the recommendations are to the user.
  • Diversity: Ensures the list of recommendations isn't too repetitive (e.g., showing 10 identical black t-shirts).

Conclusion

The most successful companies don't just pick one metric. They use a balanced scorecard—combining offline accuracy (NDCG) with online business impact (Revenue Lift) and long-term ecosystem health (Diversity).

To see how these metrics vary across different types of models, check out our next post.

Recommendation Systems 101

· 3 min read
Eric Ngo
ML Engineer @dodo, ex-ML engineers @Meta/Samsung-Research/7-11

What is a Recommendation System?

At its heart, a Recommendation System (RecSys) is an engine for personalization. While traditionally defined as a tool for predicting user preferences, its true purpose in the modern digital landscape is to curate a unique, individualized experience for every user.

Instead of a "one-size-fits-all" interface, a RecSys transforms a platform into a living environment that adapts to each user's specific tastes, behaviors, and evolving needs. It acts as a bridge between a massive, often overwhelming catalog of choices and the unique intent of the person interacting with it.

Why is it Important and Valuable?

In the age of "Infinite Scroll" and "Paradox of Choice," the value of a recommendation system cannot be overstated. It is often the primary driver of both user satisfaction and business growth.

1. Delivering Human-Centric Value

  • Reducing Cognitive Load: By surfacing what's relevant, a RecSys saves users from the "choice paralysis" that comes with too many options.
  • Discovery and Serendipity: A great system doesn't just show you what you know you like; it introduces you to "the next big thing" you didn't even know existed.
  • Contextual Relevance: Modern systems understand that a user's needs change based on time, location, and device, providing the right recommendation at the right moment.

2. Driving Business Impact

The implementation of an effective recommendation engine often results in a massive shift in key business performance indicators:

  • Revenue and Conversion: personalization directly correlates with higher conversion rates. When users see products they actually want, they are significantly more likely to purchase.
  • Customer Lifetime Value (CLV): By consistently providing value through personalization, businesses build deep trust and loyalty, keeping users on the platform for years rather than months.
  • Operational Efficiency:
    • Inventory Liquidity: Surfacing the "long tail" of products reduces dead stock and ensures a more diverse range of items are sold.
    • Automated Curation: Reduces the need for manual, rule-based merchandising which is impossible to scale at the individual level.
  • Data-Driven Insights: The feedback loop from a RecSys provides invaluable data on market trends and user preferences, informing future product development and marketing strategies.

The impact is measurable and profound: industry leaders often attribute 30% to 75% of their total revenue to personalized recommendations.

Simple Rec-Sys Algorithms

Starting with recommendation systems doesn't always require complex deep learning. Here are the foundational approaches:

1. Collaborative Filtering (CF)

This approach is based on the idea that "if User A and User B have similar tastes in the past, they will likely have similar tastes in the future."

  • User-Based: Find similar users and recommend what they liked.
  • Item-Based: Find items similar to those the user has interacted with based on other users' behaviors.

2. Content-Based Filtering

This method uses item features (e.g., category, price, description) and user profiles to make recommendations. If you like "Action Movies," the system will recommend more "Action Movies" regardless of what other users do.

3. Popularity-Based (Heuristics)

The simplest form of recommendation: show everyone the "Trending" or "Best Selling" items. While not personalized, it's a strong baseline and effective for new users (the "Cold Start" problem).

Next read