Etsy

Introduction

Etsy is a two-sided marketplace where sellers sell handmade items, and where buyers go to purchase personalized handmade items. One feature of Etsy, is that it provides personalized item recommendations to buyers based on their previous interactions on the site.

Implicit feedback includes all of a users interactions on a site including the items that they purchased, items that they favorited, items that they clicked on and even items that they saw but made no interaction with. Etsy wanted to use this kind of information in combination with user and listing features to train their machine learning models and predict the probability of purchase as their recommender system. With these probabilities of purchase, they could also use it to infer how new users might make purchases.

A problem that Etsy faced is that they had too many users and too many product listings to compute all of these purchase probabilities. Therefore, they did it in a two stage process: 1) ā€œCandidate Selectionā€ in which millions of listings are filtered down to hundreds of relevant items and 2) ā€œRankerā€ in which those items are then ranked based on relevance using more precision.

Originally, the second stage ā€œRankerā€ used a linear model, but more recently there has been experimentation using non-linear models and deep neural networks (DNNs); a DNN is better in this case because it provides a better fit of the model. After experimentation, it was found that a 3-layer neural network was optimal for predicting the likelihood of purchase.

Once Etsy developed their recommender system for relevant items, they than wanted to see if they could optimize for both relevance and profit at the same time. So, instead of only showing users relevant items, or showing them items priced from highest to lowest, they wanted to find an optimum where they could show a relevant item while not compromising on revenue. Therefore, they introduced a revenue term into their model, and were successful at optimizing for revenue without compromising on relevance.

To help avoid the computational workload from serving recommendation from enormous matrices for each of the more than 50 million users, Etsy useĀ locality sensitive hashingĀ to pool similar items into buckets which assumes that users will fall into the same bucket if they have similar tastes.

Etsy also useĀ Latent Dirichlet AllocationĀ to generate style profiles for all items and builds a distribution for each user amongst the style profile. Aryafar says that ā€˜LDA essentially gives you these style profiles and it also gives you the distribution of users preference in terms of these style profiles’. These distributions then allow users to be positioned near users who have a similar style profiles to serve recommendations.

When it came to evaluating the model, metrics used included Area under the Curve (AUC) for relevance, Normalized Discounted Cumulative Gain (NDCG) used for ranking in terms of both relevance and price and then price based metrics like Profit. When compared against linear regression, and logistic and weighted logistic regressions for baseline performance, the new revenue-relevance model was in fact able to attain the highest Profit and AUC metrics among these.

The value of having a data science team for recommender systems at Etsy is tangible through comprehensive A/B testingĀ and in summary Aryafar says that personalized content at Etsy has been found to engage users and enhance business metrics.

Images

References

  1. https://journeytodatascientist.blog/2020/04/18/recommender-systems-at-etsy/
  2. https://youtu.be/jMaa45fw3dQ
  3. https://www.dsml.etsy.com/publications
  4. https://www.hongliangjie.com/data-science-at-etsy/
  5. https://youtu.be/UbytXZLqezo?list=PLN7ADELDRRhgecE5dNlvs5ej5UB6ia-dv
  6. Presentation PDF
  7. https://arxiv.org/abs/1812.04407
  8. https://arxiv.org/abs/1905.06452