Recommender System Seminar

RECOMMENDER SYSTEM

Building Recommender Systems in Industry

Content
1. Background: Definition, Problem Construction, and Possible Solutions
2. SVD-like Recommender System: Matrix Factorization to model the matrix completion problem
3. Implicit and Explicit Signals: How to model explicit and implicit signals with an SVD-like RecSys
4. Recommender System Workflow: Estimate, Filtering, Ranking, and Randomization
5. Recommender System Cases

Background: Recommender Systems in Industry

• Chris Anderson in “The Long Tail”:
  ◦ “We are leaving the age of information and entering the age of recommendation”
• CNN Money, “The race to create a 'smart' Google”:
  ◦ “The Web, they say, is leaving the era of search and entering one of discovery. What's the difference? Search is what you do when you're looking for something. Discovery is when something wonderful that you didn't know existed, or didn't know how to ask for, finds you.”
Source: compiled from Xavier Amatriain MLSS slides (2014)


• Value of Recommendation
  ◦ Netflix: 2/3 of the movies watched are recommended
  ◦ Google News: recommendations generate 38% more click-through
  ◦ Amazon: 35% of sales come from recommendations
• Metrics Affected by Recommendation
  ◦ Activity Metrics: increase in user retention
  ◦ Financial Metrics: increase in sales
  ◦ Product Activity: increase in the number of unique items bought
Source: compiled from Xavier Amatriain MLSS slides (2014)

• The “Recommender problem”
  ◦ Estimate a utility function that automatically predicts how much a user will like an item.
  ◦ Based on:
    ▪ Past behavior
    ▪ Relations to other users
    ▪ Item similarity
    ▪ Context
Source: compiled from Xavier Amatriain MLSS slides (2014)

The “Recommender problem”
• Let C be the set of all users and let S be the set of all possible recommendable items.
• Let u be a utility function measuring the usefulness of item s to user c, i.e., u : C × S → R, where R is a totally ordered set.
• For each user c ∈ C, we want to choose the items s ∈ S that maximize u(c, s).
Utility is usually represented by a rating but can be any function.
Source: compiled from Xavier Amatriain MLSS slides (2014)
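The definition above can be sketched in a few lines, assuming the utility function is already known and stored as a table. The names `utility` and `recommend`, the users, and the ratings are hypothetical and only illustrate the "choose s that maximizes u(c, s)" step.

```python
# Hypothetical toy data: utility[user][item] holds a known or estimated rating.
utility = {
    "alice": {"item_a": 4.0, "item_b": 2.5, "item_c": 5.0},
    "bob": {"item_a": 1.0, "item_b": 3.5, "item_c": 2.0},
}


def recommend(user, utility, k=1):
    """Return the k items s in S with the highest utility u(c, s) for user c."""
    scores = utility[user]
    return sorted(scores, key=scores.get, reverse=True)[:k]


top_for_alice = recommend("alice", utility, k=2)
top_for_bob = recommend("bob", utility, k=1)
```

In practice most entries of the table are unknown, which is why the rest of the deck is about estimating u rather than looking it up.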

Possible Solutions Source: compiled from Xavier Amatriain MLSS slides (2014)

Problems in Recommendation Systems: Long Tail Sales

Problems in Recommendation Systems
• Some users never use some items

SVD-like Models for Recommender Systems: Matrix Factorization to Model RecSys

Non-Personalized Recommendation

Before delving into SVD-like collaborative filtering (CF), let's look at another approach:
1. Non-Personalized Recommendation
   a. For example, content-based filtering: recommend similar items, e.g. using product-name embeddings from Word2Vec.


1. Non-Personalized Recommendation
   a. Recommend items based on similarity.

1. Non-Personalized Recommendation
   a. Pros:
      i. Can be used when you don't have any transaction history at the beginning.
      ii. Can sometimes beat the popular-items benchmark.
   b. Cons:
      i. Low diversity metrics: after buying mie-ayam (chicken noodles), the next recommendation is mie-ayam again.


1. Non-Personalized Recommendation: arg max similarity(item1, item2)
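The arg max over item similarity can be sketched as follows, assuming each item already has an embedding vector (for example from a product-name Word2Vec model) and that cosine similarity is the chosen measure. The vectors and the item names other than mie-ayam are made up for illustration.

```python
import math

# Hypothetical item embeddings; real ones would come from a trained model.
item_vectors = {
    "mie-ayam": [0.9, 0.1, 0.0],
    "bakso": [0.8, 0.3, 0.1],
    "es-teh": [0.0, 0.2, 0.9],
}


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def most_similar(item, vectors):
    """arg max over all other items of similarity(item, other)."""
    candidates = [other for other in vectors if other != item]
    return max(candidates, key=lambda other: cosine(vectors[item], vectors[other]))


nearest = most_similar("mie-ayam", item_vectors)
```

Because the query item is excluded from the candidates, this sketch at least avoids recommending the exact item that was just bought, although near-duplicates would still rank first.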

Personalized Recommendation

2. Personalized Recommendation: arg max Probability(item, user)

2. Personalized Recommendation
   a. Based on items, users, and item-user interactions
Source: Alex Smola slides from Berkeley ML class (2012)

How to model matrix completion:
• Matrix Factorization
Source: Alex Smola slides from Berkeley ML class (2012)

Matrix Factorization
• Latent Variable
Source: Alex Smola slides from Berkeley ML class (2012)

Matrix Factorization
• Unfortunately, we can't use standard matrix factorization directly on sparse data, where most entries are unobserved.
Source: Alex Smola slides from Berkeley ML class (2012)

Funk SVD
Source: Alex Smola slides from Berkeley ML class (2012)

Funk SVD
Source: compiled from Alex Lin slides, CF with MF (2011)

SVD-like Recommender System
Source: compiled from Alex Lin slides, CF with MF (2011)
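A minimal sketch of a Funk-SVD-style factorization is given below, assuming the usual formulation: user and item factor vectors are fitted by stochastic gradient descent on the observed ratings only, with L2 regularization. The function names, hyperparameters, and ratings are illustrative choices, not taken from the slides.

```python
import random


def funk_svd(ratings, n_users, n_items, n_factors=2, lr=0.01, reg=0.02,
             n_epochs=500, seed=0):
    """Fit user factors P and item factors Q on observed (user, item, rating) triples."""
    rng = random.Random(seed)
    P = [[rng.uniform(-0.1, 0.1) for _ in range(n_factors)] for _ in range(n_users)]
    Q = [[rng.uniform(-0.1, 0.1) for _ in range(n_factors)] for _ in range(n_items)]
    for _ in range(n_epochs):
        for u, i, r in ratings:
            pred = sum(pu * qi for pu, qi in zip(P[u], Q[i]))
            err = r - pred
            for f in range(n_factors):
                pu, qi = P[u][f], Q[i][f]
                # Gradient step on the squared error plus L2 penalty.
                P[u][f] += lr * (err * qi - reg * pu)
                Q[i][f] += lr * (err * pu - reg * qi)
    return P, Q


def predict(P, Q, u, i):
    """Predicted rating is the dot product of user and item factors."""
    return sum(pu * qi for pu, qi in zip(P[u], Q[i]))


def rmse(ratings, P, Q):
    """Root mean squared error over the given observed ratings."""
    return (sum((r - predict(P, Q, u, i)) ** 2 for u, i, r in ratings) / len(ratings)) ** 0.5


# Hypothetical sparse ratings: 6 of the 9 (user, item) cells are observed.
observed = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 1.0), (2, 1, 2.0), (2, 2, 5.0)]
P0, Q0 = funk_svd(observed, n_users=3, n_items=3, n_epochs=0)  # untrained baseline
P, Q = funk_svd(observed, n_users=3, n_items=3)
```

After training, `predict(P, Q, u, i)` can also be called for the unobserved cells, which is the matrix completion step. The error reported by `rmse` here is training error only; a real evaluation would hold out some ratings.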

Modeling Implicit and Explicit Feedback

Two Types of Feedback: Explicit and Implicit
Source: compiled from Li Yen Kuo slides, Implicit RecSys (2019)

In rating systems, such as MovieLens and Allmusic, the value of an entry in the user-item matrix denotes the rating the user gave to the item. Ratings explicitly reflect the preference of an individual.
Source: compiled from Li Yen Kuo slides, Implicit RecSys (2019)

With implicit feedback, for instance in a music podcast service, the value of an entry may denote a subscription or a play count.
Source: compiled from Li Yen Kuo slides, Implicit RecSys (2019)


An SVD-like model is easy to fit to any feedback type:
1. Explicit
   a. Rating (Netflix Competition): change the loss function to RMSE; this is a regression task.
Source: compiled from Li Yen Kuo slides, Implicit RecSys (2019)

1. Explicit
   b. Like/Dislike (Netflix now): change the loss function to log-loss with a sigmoid activation.
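For the like/dislike case, a minimal sketch of the per-example loss is shown here, assuming the model score is the dot product of user and item factors passed through a sigmoid. The function names and the factor values are hypothetical.

```python
import math


def sigmoid(x):
    """Map a raw score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def log_loss(label, user_vec, item_vec):
    """Binary cross-entropy for one like (1) / dislike (0) label."""
    score = sum(p * q for p, q in zip(user_vec, item_vec))
    prob = sigmoid(score)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


# Hypothetical factors: the dot product is 2.0, so the model leans towards "like".
user_vec = [1.0, 2.0]
item_vec = [2.0, 0.0]
loss_if_liked = log_loss(1, user_vec, item_vec)
loss_if_disliked = log_loss(0, user_vec, item_vec)
```

The loss is small when the label agrees with the predicted probability and large when it does not, which is what drives the factor updates during training.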

2. Implicit
   a. Read/Not Read (Quora): change the loss function to BPR loss.
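A sketch of the BPR (Bayesian Personalized Ranking) loss for one training triple follows, assuming the standard form -log(sigmoid(score of positive item - score of negative item)) without the regularization term. The vectors are hypothetical.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def bpr_loss(user_vec, pos_item_vec, neg_item_vec):
    """-log(sigmoid(x_ui - x_uj)): small when the positive item outscores the negative."""
    diff = dot(user_vec, pos_item_vec) - dot(user_vec, neg_item_vec)
    return math.log1p(math.exp(-diff))  # equal to -log(sigmoid(diff))


# Hypothetical factors for one user, one item they read, and one they did not.
user_vec = [1.0, 0.0]
read_item = [2.0, 0.0]
unread_item = [0.0, 1.0]
loss_correct_order = bpr_loss(user_vec, read_item, unread_item)
loss_wrong_order = bpr_loss(user_vec, unread_item, read_item)
```

Unlike log-loss, this objective never asks for an absolute probability. It only cares that the item the user interacted with ranks above one they did not.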

2. Implicit
   b. Purchase frequency (retail): change the loss function to a regression loss with alpha weighting, Koren (2009).
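One common reading of "regression loss with alpha" is the confidence-weighted squared error used for implicit-feedback matrix factorization, where a purchase count r is turned into a binary preference and a confidence of 1 + alpha * r. The sketch below assumes that form; the function name and the default alpha are illustrative, and the regularization term is omitted.

```python
def weighted_implicit_loss(count, user_vec, item_vec, alpha=40.0):
    """Squared error on a binary preference, weighted by confidence 1 + alpha * count."""
    preference = 1.0 if count > 0 else 0.0
    confidence = 1.0 + alpha * count
    pred = sum(p * q for p, q in zip(user_vec, item_vec))
    return confidence * (preference - pred) ** 2


# Hypothetical factors whose dot product is 0.5.
never_bought = weighted_implicit_loss(0, [0.5], [1.0])
bought_three_times = weighted_implicit_loss(3, [0.5], [1.0], alpha=1.0)
```

The more often an item was bought, the more the model is penalized for predicting a low preference, while unobserved pairs still contribute with a small baseline weight.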

Recommender System Workflow: Estimate, Ranking and Randomization

Recommender System Workflow
Source: compiled from Xavier Amatriain MLSS slides (2014)

Modeling
1. Build a recommender system based on Watch/Not Watch
   a. Implicit model, maximize click-through rate
2. Build a recommender system based on Rating
   a. Explicit model, maximize preference
3. Build a recommender system based on Churn
   a. Implicit model, maximize retention
Source: compiled from Xavier Amatriain MLSS slides (2014)

Recommender System Workflow: Ranking
• Most recommendations are presented in a sorted list
• Recommendation can be understood as a ranking problem
• Popularity is the obvious baseline
• Rating prediction is a clear secondary data input that allows for personalization
• Many other features can be added
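The ranking bullets can be illustrated with a linear scoring function that blends popularity with a predicted rating. This is only a sketch: the weights, the feature scaling to [0, 1], and the candidate items are assumptions, and a production ranker would typically learn the weights and use many more features.

```python
def rank_items(candidates, w_popularity=0.3, w_rating=0.7):
    """Sort items by a linear blend of popularity and predicted rating (both in [0, 1])."""
    def score(item):
        return w_popularity * item["popularity"] + w_rating * item["predicted_rating"]
    return [item["name"] for item in sorted(candidates, key=score, reverse=True)]


# Hypothetical candidates for one user.
candidates = [
    {"name": "blockbuster", "popularity": 0.9, "predicted_rating": 0.4},
    {"name": "niche_pick", "popularity": 0.2, "predicted_rating": 0.95},
    {"name": "average", "popularity": 0.5, "predicted_rating": 0.5},
]
personalized = rank_items(candidates)
popularity_only = rank_items(candidates, w_popularity=1.0, w_rating=0.0)
```

Setting the rating weight to zero recovers the popularity baseline, so the two outputs show how much the personalization signal reorders the list.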

Recommender System Workflow: Ranking
Source: compiled from Xavier Amatriain MLSS slides (2014)

Randomization
• We will focus on Diversity Metrics vs Accuracy Metrics.
• More diverse recommendations will increase Netflix CTR.
• More accurate recommendations will increase Netflix CTR.
• Diversity and accuracy are negatively correlated, so the two have to be traded off.
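One simple way to trade accuracy for diversity is to inject randomness when filling the recommendation slots. The sketch below uses an epsilon-greedy rule, which is an assumption about how randomization might be done and not necessarily the scheme used in the slides. The function name and parameters are hypothetical.

```python
import random


def randomized_top_k(ranked_items, k, epsilon=0.2, seed=None):
    """Fill k slots from a ranking, picking a random remaining item with probability epsilon."""
    rng = random.Random(seed)
    remaining = list(ranked_items)
    chosen = []
    while remaining and len(chosen) < k:
        if rng.random() < epsilon:
            pick = rng.choice(remaining)  # explore: any remaining item
        else:
            pick = remaining[0]           # exploit: best remaining item
        remaining.remove(pick)
        chosen.append(pick)
    return chosen


ranked = ["a", "b", "c", "d", "e"]  # hypothetical items, best first
pure_ranking = randomized_top_k(ranked, 3, epsilon=0.0)
fully_random = randomized_top_k(ranked, 3, epsilon=1.0, seed=1)
```

With epsilon at 0 the output is the plain top-k list. Raising epsilon lets lower-ranked items surface more often, at the cost of showing some items the model considers less relevant.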


Recommender System Cases

Questions Pair: Similarity Task

Questions Pair: Similarity Task
(Question 1, Question 2) → {0, 1}

Questions Pair: Similarity Task
• Is Item A similar to Item B?

Questions Pair: Similarity Task
(Input 1, Input 2) → Neural Networks → {0, 1}

Temporal Recommendation Task

Temporal Recommendation Task
[Figure: sequences of items (Item 1, Item 4, Item 2, Item 1, Item 2, Item 3, Item 2, Item 1, Item 5) as input to an RNN]

Recommendation Task

Recommendation Task
Click data: Hotel 33, Hotel 77, ..., Hotel 44

Searching Task
Pseudocode for BPR. Repeat K times:
1. A = sample a random item with a positive label
2. B = sample a random item with a negative label
3. Dist = the model's predicted distance(A, B)
4. Update the model to make the distance larger.
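The four steps above can be turned into a small runnable training loop. This is a sketch under assumptions: the model is a plain matrix factorization, the "distance" is the score gap between the positive and the negative item, and all names, hyperparameters, and data are hypothetical.

```python
import math
import random


def score(U, V, u, i):
    """Model score for user u and item i: dot product of their factors."""
    return sum(a * b for a, b in zip(U[u], V[i]))


def train_bpr(positives, n_users, n_items, n_factors=4, lr=0.05, reg=0.01,
              n_steps=5000, seed=0):
    """SGD on sampled (user, positive item, negative item) triples."""
    rng = random.Random(seed)
    U = [[rng.gauss(0, 0.1) for _ in range(n_factors)] for _ in range(n_users)]
    V = [[rng.gauss(0, 0.1) for _ in range(n_factors)] for _ in range(n_items)]
    # Only users with at least one positive and one negative item can be sampled.
    users = [u for u in positives if 0 < len(positives[u]) < n_items]
    for _ in range(n_steps):
        u = rng.choice(users)
        i = rng.choice(sorted(positives[u]))                                   # 1. positive
        j = rng.choice([x for x in range(n_items) if x not in positives[u]])  # 2. negative
        diff = score(U, V, u, i) - score(U, V, u, j)                           # 3. score gap
        g = 1.0 / (1.0 + math.exp(diff))  # sigmoid(-diff), gradient of log sigmoid(diff)
        for f in range(n_factors):                                             # 4. widen gap
            uf, vi, vj = U[u][f], V[i][f], V[j][f]
            U[u][f] += lr * (g * (vi - vj) - reg * uf)
            V[i][f] += lr * (g * uf - reg * vi)
            V[j][f] += lr * (-g * uf - reg * vj)
    return U, V


# Hypothetical interactions: user 0 clicked items 0 and 1, user 1 clicked items 2 and 3.
positives = {0: {0, 1}, 1: {2, 3}}
U, V = train_bpr(positives, n_users=2, n_items=4)
```

After training, each user's clicked items should score above the items they did not click, which is the ordering BPR optimizes for.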

Searching Task Loss Function:

Searching Task
• Pointwise
  ◦ For each item, train a classifier / regressor to predict how relevant it is
• Pairwise
  ◦ Given a pair of documents, predict which one should rank higher
• Listwise
  ◦ Sort the entire list of documents
    ▪ Direct optimization of IR measures such as NDCG
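Since the listwise approach targets measures such as NDCG, a short sketch of how NDCG is computed may help. It assumes the linear-gain variant (gain equal to the relevance label) with a log2 position discount. Other variants use 2^rel - 1 as the gain.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain of relevance labels listed in ranked order."""
    return sum(rel / math.log2(pos + 2) for pos, rel in enumerate(relevances))


def ndcg(relevances):
    """DCG divided by the DCG of the ideal (descending) ordering."""
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0


# Hypothetical relevance labels of the documents in the order a ranker returned them.
perfect_order = ndcg([3, 2, 1])
reversed_order = ndcg([1, 2, 3])
```

A ranking that places the most relevant documents first scores 1.0, and any other ordering of the same documents scores lower.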


Matching Task
[Figure: four guys (Guy 1 to Guy 4) and four girls (Girl A to Girl D) to be matched]

Matching Task
[Figure: preference lists. Guy 1: Girl C, Girl D, Girl A, Girl B. Girl A: Guy 4, Guy 2, Guy 1, Guy 3]

Matching Task
(Girl 1, Guy 2) → {0, 1}

Matching Task Loss Function:

Matching Task
Matching Algorithm: Gale-Shapley
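A compact sketch of the Gale-Shapley (deferred acceptance) algorithm is given below, assuming both sides have the same size and every participant ranks everyone on the other side. The names and preference lists are hypothetical, and in a recommender setting the rankings would come from model scores.

```python
def gale_shapley(proposer_prefs, receiver_prefs):
    """Return a stable matching as {receiver: proposer}; proposers go down their lists."""
    # rank[r][p] is the position of proposer p in receiver r's list (lower is better).
    rank = {r: {p: pos for pos, p in enumerate(prefs)}
            for r, prefs in receiver_prefs.items()}
    next_choice = {p: 0 for p in proposer_prefs}
    engaged = {}  # receiver -> proposer
    free = list(proposer_prefs)
    while free:
        p = free.pop()
        r = proposer_prefs[p][next_choice[p]]
        next_choice[p] += 1
        current = engaged.get(r)
        if current is None:
            engaged[r] = p             # receiver was free: accept
        elif rank[r][p] < rank[r][current]:
            engaged[r] = p             # receiver prefers the new proposer
            free.append(current)
        else:
            free.append(p)             # rejected: p will try the next choice
    return engaged


# Hypothetical preferences, most preferred first.
guy_prefs = {"Guy 1": ["Girl A", "Girl B"], "Guy 2": ["Girl A", "Girl B"]}
girl_prefs = {"Girl A": ["Guy 2", "Guy 1"], "Girl B": ["Guy 1", "Guy 2"]}
matching = gale_shapley(guy_prefs, girl_prefs)
```

The result is stable in the sense that no guy and girl would both prefer each other over their assigned partners. The proposing side gets its best achievable stable outcome, so which side proposes is a design decision.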


You can learn all of these in our Advanced ML Class
Contact: [email protected]

We believe everyone can build cool Recommendation Systems

Upcoming Advanced ML Class

Check our website: www.pacmann.ai

Past Classes

Pacmann AI Classes
• Quality: State of the Art of Machine Learning Research, Practical Skills, Theoretical Understanding
• > 50 institutions
• 400++ alumni
• 6 classes in the past
https://pacmann.ai

Previous Classes
• 61 participants
• 8 weeks
• 48 institutions

Previous Classes Facts & Figures: Field of Work

Previous Participants

Contact Email: [email protected] Whatsapp Business: +62 812-8122-1707
