3. Feedforward Network
• An artificial neuron
  • Receives inputs
  • Computes a weighted sum followed by a nonlinear activation
  • Produces an output for the next layer's neurons
  • Weights are learnt through backpropagation
Source: https://en.wikipedia.org/wiki/Connectionism
5. Multilayer Perceptron: Loss Functions
[Figure: an MLP mapping inputs x1…xn through weights W[1] (pre-activations z[1], activations a[1]) and W[2] to softmax outputs o1…oK]
Learn the weights by minimizing the loss.
6. DL in Recommendation Systems
• Deep learning based models have achieved the best performance and are a promising tool for recommender problems
• Representation plays an important role in recommender systems; multiple sources of data may be leveraged for rich representations
• Representation of users and items
  • Transactions
  • Content: product description, metadata, reviews, product image
  • User demography
  • Product ontology
7. Using All Embeddings
• Multiple ways of learning/combining representations
• Example 1 (adapted from an experiment in the Flipkart recommender system):
  • Concatenate different pre-trained embeddings into a single vector
  • Learn a simple logistic regression, OR
  • Feed into another ANN with a softmax output (corresponding to purchases or clicks)
• Example 2: joint learning of the embeddings
• Similar products: Locality Sensitive Hashing (LSH) on the embeddings
[Figure: CF, image, and text embeddings concatenated and fed through a hidden layer to an output layer]
8. Representation of Items, Users, Reviews
Representation of items and/or users is necessary for getting a recommender system to work.
• One-hot representation
• Embedding as vectors: pre-trained
• Embedding can be learned on the task
• Item representation
  1. Based on item interaction sequences and item properties
     • Prod2vec, Meta-Prod2vec, etc.
  2. Based on user-item interaction
     • Neural collaborative filtering: model user-item interactions
  3. User and item information (content)
• Review representation
• User representation
9. Product Embedding
Prod2vec or Item2vec: product embedding
• Based on item-item co-occurrence from transaction sequences (co-purchased products)
• Uses the method of word embedding: low-dimensional, distributed embeddings of words based on word sequences in text documents
1. Barkan, Oren, and Noam Koenigstein. "Item2vec: neural item embedding for collaborative filtering." Machine Learning for Signal Processing (MLSP), 2016 IEEE 26th International Workshop on. IEEE, 2016.
2. Grbovic, Mihajlo, et al. "E-commerce in your inbox: Product recommendations at scale." Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2015.
10. Word2vec
• Representation of words
"Similar words have similar contexts"
1. CBOW: P(Word | Context)
2. Skipgram: P(Context | Word)
[Figure: CBOW and Skipgram architectures, each with input, projection, and output layers]
11. Skipgram Model
• Input: central word w_t
• Output: words in its context: w_{t−c}, …, w_{t−1}, w_{t+1}, …, w_{t+c}
• Each input word is represented by a one-hot encoding of size V
Source text:
Deep Learning attempts to learn multiple levels of representation from data.
Input-output pairs:
Positive samples:
• (representation, levels)
• (representation, of)
• (representation, from)
• (representation, data)
Negative samples:
• (representation, x) [x: all other words except the 4 positive]
[Figure: skipgram network predicting w_{t−c}, …, w_{t−1}, w_{t+1}, …, w_{t+c} from w_t through input, projection, and output layers]
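The pair generation above can be sketched in a few lines. This is a minimal illustration, not the slides' implementation; the window size and the helper names (`skipgram_pairs`, `negative_samples`) are assumptions, and real word2vec draws negatives from a unigram distribution rather than uniformly.

```python
import random

def skipgram_pairs(tokens, window):
    """All (center, context) positive pairs within a +/- window."""
    pairs = []
    for t, center in enumerate(tokens):
        for j in range(max(0, t - window), min(len(tokens), t + window + 1)):
            if j != t:
                pairs.append((center, tokens[j]))
    return pairs

def negative_samples(tokens, center, context, k):
    """Sample k words that are neither the center word nor in its context."""
    candidates = [w for w in set(tokens) if w != center and w not in context]
    return random.sample(candidates, min(k, len(candidates)))

sentence = ("deep learning attempts to learn multiple "
            "levels of representation from data").split()
pairs = skipgram_pairs(sentence, window=2)
context = {c for w, c in pairs if w == "representation"}
# context of "representation" with window 2: levels, of, from, data
```

With a window of 2, "representation" yields exactly the four positive pairs listed on the slide.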
12. Prod2vec
Use word2vec on co-purchased products.
Purchase sequence of user u: p_{u,1}, p_{u,2}, …, p_{u,m}
Skipgram applied on the transaction sequence
[Figure: skipgram network predicting p_{t−c}, …, p_{t−1}, p_{t+1}, …, p_{t+c} from p_t through input, projection, and output layers]
Positive samples:
• (p_{u,t}, p_{u,t−c}), …, (p_{u,t}, p_{u,t−1}), (p_{u,t}, p_{u,t+1}), …, (p_{u,t}, p_{u,t+c})
Negative samples:
• (p_{u,t}, p), where p is any product other than the context products
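A minimal sketch of skip-gram with negative sampling over purchase sequences, treating each user session like a sentence. The session data, dimensions, and hyperparameters are made up for illustration, and negatives are drawn uniformly for simplicity; this is not the Prod2vec authors' code.

```python
import numpy as np

def train_prod2vec(sessions, dim=8, window=2, neg=3, lr=0.05, epochs=20, seed=0):
    """Skip-gram with negative sampling over purchase sequences.
    Each session is one user's ordered list of product IDs."""
    rng = np.random.default_rng(seed)
    vocab = sorted({p for s in sessions for p in s})
    idx = {p: i for i, p in enumerate(vocab)}
    W_in = rng.normal(0, 0.1, (len(vocab), dim))   # product embeddings
    W_out = rng.normal(0, 0.1, (len(vocab), dim))  # context embeddings
    sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))
    for _ in range(epochs):
        for s in sessions:
            for t, p in enumerate(s):
                for j in range(max(0, t - window), min(len(s), t + window + 1)):
                    if j == t:
                        continue
                    # one positive target plus `neg` random negative products
                    targets = [(idx[s[j]], 1.0)] + [
                        (int(rng.integers(len(vocab))), 0.0) for _ in range(neg)]
                    for o, label in targets:
                        v, u = W_in[idx[p]].copy(), W_out[o].copy()
                        g = sigmoid(v @ u) - label  # gradient of the SGNS loss
                        W_in[idx[p]] -= lr * g * u
                        W_out[o] -= lr * g * v
    return vocab, W_in

sessions = [["phone", "case", "charger"], ["phone", "case"],
            ["tv", "hdmi_cable"], ["tv", "hdmi_cable", "soundbar"]]
vocab, emb = train_prod2vec(sessions)
```

In practice one would use an off-the-shelf word2vec implementation with product-ID "sentences" rather than hand-rolling the update.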
13. Extensions of Prod2vec
1. Meta-Prod2vec
  • Use product metadata in addition to the transaction sequence
    • Category, brand, description, etc.
    • Embedded metadata is added both to the input and to the context
2. Several other *2vec models
[Figure: skipgram network in which metadata embeddings m_t accompany the product inputs p_t and the context outputs p_{t−1}, p_{t+1}]
14. Review Representation
• Word2vec for word representation (concept representation)
• Document or paragraph representation
  • Averaging or summing of word vectors
  • Paragraph or document vectors using CNN, RNN, etc.
15. Convolution for Text
[Figure: the sentence "this sequence of word vectors represents a text" as an s × d matrix of word vectors x_1, x_2, …, x_s; a filter of size 2 × d slides over it, producing the feature map of one filter]
c_i = f(w · x_{i:i+h−1} + b)
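The feature-map formula can be made concrete with numpy. This is an illustrative sketch, assuming ReLU for the nonlinearity f and a toy sentence matrix; the function name is hypothetical.

```python
import numpy as np

def text_convolution(X, W, b):
    """Slide a filter W of h rows over an (s, d) sentence matrix X.
    Returns the feature map c, where c_i = f(w . x_{i:i+h-1} + b)."""
    s, _ = X.shape
    h = W.shape[0]
    c = np.array([np.sum(W * X[i:i + h]) + b for i in range(s - h + 1)])
    return np.maximum(c, 0.0)  # ReLU as the nonlinearity f

# toy sentence of 4 "words" with d = 3, filter of size 2 x 3
X = np.arange(12, dtype=float).reshape(4, 3)
W = np.ones((2, 3))
fmap = text_convolution(X, W, b=0.0)
# fmap has length s - h + 1 = 3: [15., 33., 51.]
```

Each of the s − h + 1 window positions contributes one entry to the feature map of that filter.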
17. CNN for Text: Kim's CNN
Kim, Yoon. "Convolutional neural networks for sentence classification." arXiv preprint arXiv:1408.5882 (2014).
18. Kim's CNN
• Word embedding
• Padding to size n
1. Convolution layer
2. Max-pooling layer: ĉ = max(c)
3. Fully connected layer with softmax output
• Regularization: dropout, and rescaling the weight vectors of each class to a fixed number s
19. Dynamic k-max Pooling
Handles sentences of varying length; k is a function of the sentence length and the network depth:
k_l = max(k_top, ⌈(L − l)/L · s⌉)
• l: current convolution layer
• L: number of convolutional layers
• s: sentence length
• k_top: fixed pooling parameter for the topmost convolutional layer
Kalchbrenner, Nal, Edward Grefenstette, and Phil Blunsom. "A Convolutional Neural Network for Modelling Sentences." arXiv, 2014.
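The k schedule and the pooling step can be sketched directly from the formula above; the function names here are illustrative, not from the paper.

```python
import math

def dynamic_k(l, L, s, k_top):
    """k for convolutional layer l of L, given sentence length s:
    k_l = max(k_top, ceil((L - l) / L * s))."""
    return max(k_top, math.ceil((L - l) / L * s))

def k_max_pool(values, k):
    """Keep the k largest activations, preserving their original order."""
    if len(values) <= k:
        return list(values)
    keep = sorted(sorted(range(len(values)), key=lambda i: values[i])[-k:])
    return [values[i] for i in keep]

# with L = 3 layers, sentence length s = 18, k_top = 3:
# layer 1 pools to k = 12, layer 2 to k = 6, the top layer to k_top = 3
```

Note that k-max pooling keeps the selected activations in sentence order, unlike plain max pooling.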
21. Recurrent Neural Networks: make use of sequential information
[Figure: an unrolled RNN showing the input at time step t−1, the hidden state and output at time step t, and the activation function]
25. Long Short-Term Memory (LSTM)
• Provides a highway to pass the cell state (or memory).
• An LSTM can add or remove information to the cell state through regulated structures called gates. An LSTM has three gates to protect and control the cell state.
26. LSTMs: Details
[Figure: an LSTM cell with forget (f), update (u), and output (o) gates; inputs C_{t−1}, h_{t−1}, x_t; candidate C̃_t computed with tanh; outputs C_t, h_t, and y_t via softmax]
27. Forget Gate
• The forget gate layer decides what information will be thrown away
• Looks at h_{t−1} and x_t and outputs a number between 0 and 1
  • 1 represents "completely keep this"; 0 represents "completely get rid of this"
• Example: forget the gender of the old subject when we see a new subject
28. Update
• A sigmoid layer (the input gate layer) decides what values we'll update
• A tanh layer creates a vector of new candidate values, C̃_t
29. Update
• Update the old state C_{t−1} into the new cell state C_t
  • Multiply the old state by f_t, forgetting the things we decided to forget earlier
  • Then add i_t ∗ C̃_t
30. Output
• A sigmoid layer decides what parts of the cell state to output
• Put the cell state through tanh and multiply it by the output of the sigmoid gate
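The three gates described above fit in one short step function. This is a minimal numpy sketch with assumed shapes (weights act on the concatenation of h_{t−1} and x_t); the sanity check at the end picks gate biases that make the forget gate saturate near 1 and the input gate near 0, so the cell state is carried through almost unchanged.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x_t, h_prev, c_prev, params):
    """One LSTM step with forget, input, and output gates."""
    Wf, bf, Wi, bi, Wc, bc, Wo, bo = params
    z = np.concatenate([h_prev, x_t])
    f = sigmoid(Wf @ z + bf)        # forget gate: what to erase
    i = sigmoid(Wi @ z + bi)        # input gate: what to write
    c_tilde = np.tanh(Wc @ z + bc)  # candidate values C~_t
    c_t = f * c_prev + i * c_tilde  # new cell state
    o = sigmoid(Wo @ z + bo)        # output gate: what to expose
    h_t = o * np.tanh(c_t)
    return h_t, c_t

# sanity check: a gate configuration that keeps the cell state
n, m = 2, 3                          # hidden size, input size (illustrative)
Z = np.zeros((n, n + m))
params = (Z, np.full(n, 10.0),       # f ~ 1: keep everything
          Z, np.full(n, -10.0),      # i ~ 0: write nothing
          Z, np.zeros(n),
          Z, np.zeros(n))
h, c = lstm_step(np.ones(m), np.zeros(n), np.array([0.5, -0.5]), params)
```

This is the "highway" property of the slides: with f ≈ 1 and i ≈ 0, C_t ≈ C_{t−1} regardless of the input.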
31. Gated Recurrent Unit
• Replaces the forget (f) and input (i) gates with an update gate (z)
• Introduces a reset gate (r) that modifies h_{t−1}
• Eliminates the internal memory c_t
Source: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
35. Sentence Modeling
• Word average
• CNN
• LSTM: a sentence s is transformed into a fixed-length vector v_s by recursively applying an LSTM unit to each word embedding e_{w_t} and the previous hidden state h_{t−1}
• BiLSTM: a bi-directional LSTM captures both the left and the right context
  • The forward and backward hidden states h_t(forward) and h_t(backward) are concatenated into a final hidden state h_t(BiLSTM)
36. Joint Deep Modeling of Users and Items Using Reviews for Recommendation
• A shared layer couples the latent factors learned for user behavior and item properties from reviews by two parallel networks
Zheng, Lei, Vahid Noroozi, and Philip S. Yu. "Joint deep modeling of users and items using reviews for recommendation." WSDM 2017.
37. DeepCoNN Architecture
[Figure: two parallel networks, a user network (TCNN_u over the user's review text, producing x_u) and an item network (TCNN_i over the item's review text, producing y_i), joined by a factorization machine]
38. Abstractive Tips Generation
• Gated recurrent neural networks translate user and item latent representations into concise abstractive tips with good linguistic quality, simulating user experience and feelings
• Deep composite model: MLP and RNN
• Two tasks:
  1. Rating prediction with MLP
  2. Tips generation
Li, P., Z. Wang, Z. Ren, L. Bing, and W. Lam. "Neural rating regression with abstractive tips generation for recommendation." SIGIR 2017.
40. Neural Rating Tips
• Examples of the predicted ratings and the generated tips
• The first line of each group shows the generated rating and tips
• The second line shows the ground truth
41. Unified Representation
By translating various sources (e.g., reviews, ratings) into a unified representation space, heterogeneous information can be integrated for informed recommendation.
Zhang, Y., Q. Ai, X. Chen, and W. B. Croft. "Joint representation learning for top-n recommendation with heterogeneous information sources." CIKM 2017.
42. NARRE: Neural Attentional Rating Regression with Review-level Explanations
• Predicts a rating given a user and an item, and selects reviews that are both useful and representative
• Useful reviews are obtained through an attention mechanism and provide explanations that help users make better and faster decisions
• NARRE learns the usefulness of each review
Chen, C., M. Zhang, Y. Liu, and S. Ma. "Neural Attentional Rating Regression with Review-level Explanations." WWW 2018.
43. TextCNN
• CNN text processor: takes a sequence of words as input and outputs an n-dimensional vector representation of the input
44. NARRE
• Utilizes the attention mechanism to assign weights to reviews when modeling users and items
• Two parallel neural networks, one for user modeling and one for item modeling
• A prediction layer lets the hidden latent factors of the user and item interact
• The training data consists of users, items, and text reviews
• At test time, only users and items are available
45. Item Modelling
• The CNN text processor is applied to the textual reviews of item i
• Each review of i is transformed into a matrix of word vectors (V_i1, V_i2, …, V_ik)
• These matrices are sent to the convolutional layer, and their feature vectors are obtained from the output as (O_i1, O_i2, …, O_ij)
• An attention mechanism is used to learn the weight of each review
46. Attention-based Review Pooling
• Selects reviews that are representative of item i's features
• Aggregates the representations of informative reviews to characterize item i
• A two-layer network computes the attention score a_il
• The input contains the feature vector of the l-th review of item i (O_il) and the ID embedding of the user who wrote it (u_il)
• The ID embedding u_il models the quality of users and helps identify users who always write less-useful reviews
47. Attention-based Review Pooling
• The attention network: a*_il = h^T ReLU(W_O O_il + W_u u_il + b_1) + b_2
• Normalize: a_il = exp(a*_il) / Σ_l exp(a*_il)
• The feature vector of item i: O_i = Σ_{l=1…k} a_il O_il
• The output of the attention-based pooling layer is a k_1-dimensional vector
• A fully connected layer computes the final representation of item i: X_i = W_0 O_i + b_0
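The pooling step can be sketched directly from these equations. This is an illustrative numpy version, not the authors' code; the dimensions and random inputs are assumptions for the demonstration.

```python
import numpy as np

def attention_pool(O, U, W_O, W_u, b1, h, b2):
    """Attention-based review pooling.
    O: (k, k1) review feature vectors; U: (k, k2) reviewer ID embeddings."""
    scores = np.array([h @ np.maximum(W_O @ O[l] + W_u @ U[l] + b1, 0.0) + b2
                       for l in range(len(O))])  # a*_il
    a = np.exp(scores - scores.max())
    a = a / a.sum()                              # softmax-normalized a_il
    return a, a @ O                              # weights and item feature O_i

rng = np.random.default_rng(0)
k, k1, k2, t = 4, 5, 3, 6      # reviews, feature dims, attention width
O = rng.normal(size=(k, k1))
U = rng.normal(size=(k, k2))
W_O, W_u = rng.normal(size=(t, k1)), rng.normal(size=(t, k2))
b1, h, b2 = rng.normal(size=t), rng.normal(size=t), 0.1
a, O_i = attention_pool(O, U, W_O, W_u, b1, h, b2)
```

The attention weights are exactly what the paper exposes as review-level explanations: the higher a_il, the more that review contributes to item i's representation.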
48. NARRE: Prediction Layer
• NARRE extends the user preferences and item features of the LFM model to two components: one based on ratings and the other based on reviews
• A neural form of LFM is used for predicting ratings
• The latent factors of the user and item are mapped to a shared hidden space
• The interaction between u and i is modelled as:
h_0 = (q_u + x_u) ⊙ (p_i + y_i)
49. NARRE: Prediction Layer
• The interaction between u and i is modelled as: h_0 = (q_u + x_u) ⊙ (p_i + y_i)
• q_u and p_i are the user preferences and item features based on ratings, from LFM
• x_u and y_i are the user preferences and item features obtained from the review-based method above
• ⊙ denotes the element-wise product of vectors
• The output is an n-dimensional vector, passed to the prediction layer:
r̂_{u,i} = w_1^T h_0 + b_u + b_i + μ
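The prediction layer is a one-liner once the factors are available. A toy sketch with made-up vectors, assuming scalar user/item biases b_u, b_i and global offset μ as in the equation above:

```python
import numpy as np

def narre_predict(q_u, x_u, p_i, y_i, w1, b_u, b_i, mu):
    """r_hat = w1^T h0 + b_u + b_i + mu,
    with h0 = (q_u + x_u) * (p_i + y_i) (element-wise product)."""
    h0 = (q_u + x_u) * (p_i + y_i)
    return float(w1 @ h0 + b_u + b_i + mu)

# toy example: rating- and review-based factors of matching dimension
q_u, x_u = np.array([1.0, 0.0]), np.array([0.0, 1.0])
p_i, y_i = np.array([2.0, 2.0]), np.array([0.0, 0.0])
r_hat = narre_predict(q_u, x_u, p_i, y_i,
                      w1=np.array([0.5, 0.5]), b_u=0.1, b_i=0.2, mu=3.0)
# h0 = [2, 2]; r_hat = 0.5*2 + 0.5*2 + 0.1 + 0.2 + 3.0 = 5.3
```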
51. Aspect Similarity Recognition Using Deep Learning
• Two sentences are identified as aspect-similar if they mention at least one aspect in common
Nguyen, Huy-Tien, Quan-Hoang Vo, and Minh-Le Nguyen. "A Deep Learning Study of Aspect Similarity Recognition." 2018.
52. Recommendation of High-Quality Representative Reviews in E-commerce
RecSys 2017. Debanjan Paul, Sudeshna Sarkar (IIT Kharagpur); Muthusamy Chelliah, Chetan Kalyan, Prajit Nadkarni (Flipkart)
54. System Pipeline
[Figure: product reviews flow through aspect and sentiment extraction, aspect synonymy detection, and review quality estimation into review selection, which preserves the statistical distribution of aspects and sentiments and outputs a subset of k reviews]
• Aspect and sentiment extraction: extracts product aspect phrases along with the corresponding sentiments from each review of the item. E.g., "The lens of this camera is great."
• Aspect synonymy detection: considers semantic similarity between aspects to merge similar aspects. E.g., "photo", "picture", and "camera" refer to the same feature of a mobile phone.
• Review quality estimation: estimates a review quality score from the review's textual content.
• Representative reviews: cover different product aspects and sentiments, and preserve the statistical distribution of product aspects and opinions.
55. Aspect Extraction
Extracts product aspect phrases from each review of the item, along with the sentiments expressed about them.
Sample input: "The picture quality is great. It also has cool looks. But the phone hangs sometimes, if you install too many apps."
Output: picture quality, +1; looks, +1; processor, −1
• Uses a semi-supervised technique based on bootstrapping from a small opinion lexicon.
• Utilizes several syntactic relations that link opinion words and targets to expand the initial opinion lexicon and to extract targets.
Qiu, Guang, et al. "Opinion word expansion and target extraction through double propagation." Computational Linguistics 37.1 (2011).
56. Evaluation of the Aspect and Sentiment Extraction Module
• Used 300 reviews of mobile phones, cameras, and Bluetooth headsets from the Amazon dataset
• Manually annotated to find the correct aspect phrases and corresponding sentiments

No. of reviews: 300
Actual no. of aspect phrases: 660
No. of extracted aspect phrases: 630
Correctly extracted aspect phrases: 468
Precision: 70.90%
57. Aspect Synonymy Detection
• 125 reviews of an "Asus Zenfone Max".
• 35 aspects before merging: audio quality, video recording, sound, photo, camera, bluetooth headset, RAM, picture quality, touch screen, battery life, picture, clock speed, frequency, OS, SIM, packaging, network, model, ROM, pixel, LED, button, power, price, formats, display, disk, processing, configuration, thing, smartphone, support, resolution.
  (e.g., audio quality 2%, video recording 0%, sound quality 1%, photo 1%, camera 10%, bluetooth headset 6%, RAM 0%, picture quality 2%, touch screen 1%, battery life 7%, …)
• Use word embeddings of words and phrases; merge phrases with similar word vectors.
• 8 aspects obtained: General 56%, Display 47%, Processor 16%, Memory 6%, Camera 29%, Connectivity 49%, Multimedia 34%, Charging 36%.
58. Evaluation of the Aspect Similarity Detection Module
• We extracted aspect phrases for 10 products in the mobile phone domain and mapped the extracted aspect phrases to the Flipkart catalogue aspects.
• We manually evaluated the correctness of the mappings obtained.
• The accuracy of our aspect similarity detection module is 83.75%.

Aspect Category | #Extracted aspect phrases | Correctly mapped | Incorrectly mapped
Camera | 360 | 342 | 18
Connectivity | 135 | 63 | 72
Dimension | 99 | 45 | 54
Display | 306 | 198 | 108
General | 423 | 333 | 90
Memory | 27 | 27 | 0
Multimedia | 45 | 27 | 18
OS | 207 | 207 | 0
Others | 1062 | 963 | 99
Not Mapped | 216 | 207 | 9
Total | 2880 | 2412 | 468
59. Review Quality Estimation
• Some review recommendation systems use the review helpfulness score as the parameter to judge review quality.
• New reviews do not have any helpfulness score (cold start).
• Not all sites capture a helpfulness score.
• We develop a CNN (convolutional neural network) to estimate the quality score of textual reviews:
  • Simple CNN
  • Dynamic CNN with k-max pooling
60. Dynamic Convolutional Neural Network Architecture
Input layer
• All reviews of a product (length s)
• Each word is embedded as a d-dimensional vector.
Convolutional layer
• A convolution matrix is applied to the input matrix S (filter size m).
Dynamic k-max pooling layer
• Selects the K most active features.
• K is determined dynamically as a function of the length of the sentence and the depth of the network:
K_l = max(K_top, ⌈(L − l)/L · s⌉)
61. Dynamic Convolutional Neural Network Architecture
Folding
• Elements in two rows of the matrix are added component-wise to obtain a reduced matrix of dimension d/2.
Fully connected layer
• Fully connected softmax layer.
62. Experiments
• Amazon dataset.
• Reviews from six product domains: electronics, clothing, books, shoes, music, watches.
• Accuracy:
  • CNN: 71%
  • DCNN: 83%

Number of reviews: 34,686,770
Number of users: 6,643,669
Number of products: 2,441,053
Users with > 50 reviews: 56,772
Median number of words per review: 82
Timespan: Jun 1995 to Mar 2013
63. Review Selection
• Given a corpus of reviews R on an item, find a k-size subset S ⊆ R of reviews that accurately captures the proportion of opinions on the item's features.
• For t = 1 to k:
  1. Pick one review r ∈ R to form the subset S(t) ⊆ R,
  2. such that the distance D(π(S(t−1) ∪ {r}), π) is minimized, where π(S) is the percentage distribution of the aspects in the set S and D is the L2 norm of their difference, i.e. D(π, π′) = ‖π − π′‖₂ = sqrt(Σ_{i=1}^{m} (π_i − π′_i)²), where m is the total number of extracted aspects.
Lappas, Theodoros, Mark Crovella, and Evimaria Terzi. "Selecting a characteristic set of reviews." Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2012.
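The greedy loop above can be sketched compactly. This is an illustrative version under a simplifying assumption: each review is reduced to a list of aspect indices (sentiment is ignored), and the function names are made up.

```python
import numpy as np

def aspect_distribution(reviews, m):
    """Percentage distribution pi(S) of aspects over a set of reviews.
    Each review is a list of aspect indices in [0, m)."""
    counts = np.zeros(m)
    for r in reviews:
        for a in r:
            counts[a] += 1
    return counts / counts.sum() if counts.sum() else counts

def select_reviews(R, k, m):
    """Greedily grow S; each step adds the review that brings
    pi(S) closest (in L2 distance) to the corpus distribution pi(R)."""
    target = aspect_distribution(R, m)
    S, remaining = [], list(range(len(R)))
    for _ in range(k):
        best = min(remaining, key=lambda i: np.linalg.norm(
            aspect_distribution([R[j] for j in S + [i]], m) - target))
        S.append(best)
        remaining.remove(best)
    return S

# 4 reviews over m = 2 aspects; corpus distribution is [0.6, 0.4],
# so the single best review is the one mentioning both aspects
R = [[0], [1], [0, 1], [0]]
```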
64. Sample Output
• Product name: Plantronics Voyager 510 Bluetooth Headset
• Number of reviews: 465
• Number of attributes (catalogue attributes): 8
• Original attribute distribution:
General 60%, Display 60%, Processor 20%, Memory & Storage 0%, Camera 20%, Connectivity 40%, Multimedia 40%, Charging 40%
• Resulting attribute distribution in the recommended reviews:
General 56%, Display 47%, Processor 16%, Memory & Storage 6%, Camera 29%, Connectivity 49%, Multimedia 34%, Charging 36%
65. System Evaluation
We recommended sets of reviews for 8 products based on:
• RQS: explicit review helpfulness score
• CNN: quality score of the review estimated by a CNN
• DCNN: quality score of the review estimated by a DCNN
• CRS: characteristic review selection algorithm
• COMB: combined approach
• 3 sets of 5 reviews from the methods were given to 10 human annotators to rank the methods from 1 to 3.
• Ranking was done on the basis of which set of reviews defines the product best.
66. Comparison of the RQS, CNN, and DCNN Methods
• DCNN has an accuracy of 83% w.r.t. RQS; plain CNN has 70%.
• COMB was declared the best method in 60% of the reports obtained from annotators.
• The COMB approach has the lowest (best) average rank.

Method | % best | Average rank
RQS | 35 | 1.925
CNN | 20 | 2.2625
DCNN | 45 | 1.7875

Method | % best | Average rank
CRS | 15 | 2.175
CNN | 25 | 2.2125
COMB | 60 | 1.6125
67. Live Evaluation of the System at Flipkart
Live evaluation of the COMB method against the default recommendation system at Flipkart:
• 100 sorted reviews recommended by the COMB method for 24 random products in the mobile phone category.
• 25% of users experience the reviews recommended by our method; the rest experience the default behavior.
• Impact is measured in the number of transactions (Txns) per product page visit (PPV).
[Figure: bar chart of Txns per PPV for Default vs. COMB across the 24 products]
• COMB shows an increase in product purchases for 10 products.
• The default recommendation system performs better for 5 products.
• Performance is similar for 9 products.
68. Live Evaluation of the System at Flipkart
• Net product transactions increased by 1.85% out of 1 million product units by using COMB.
• COMB combines the quality score of reviews with the product aspects and sentiments, while maintaining the statistical distribution of opinions found in the underlying corpus.
• COMB performed better than both CNN and the default recommendation system in live testing at Flipkart.

Algorithm | Txns | PPV
Default | 1777 | 313776
COMB | 1781 | 308782
Increase in conversion: 1.85%
69. Sample Reviews: CRS Method vs. COMB Method
[Table: reviews recommended by the CRS method alongside those recommended by COMB; excerpts below]
I love my headset. It is clear and comfortable. I could
wear it all day.
Works well, good audio quality, battery lasts a long
time.
I bought this model for my wife after her last one died.
It was her third or fourth..? I've lost track. This model
does not compare to the others; it's much better. It
hooked up easy to her Verizon phone and sounds
clear on BOTH ends. I hear her fine and her voice is
crisp and clear. What else is there? Oh yes, price....
around $40...Buy one, you will not be sorry.
I have been using this headset for about a month. The
sound quality is better than most headsets that I have
used. One day when it was very windy there was a
little too much wind noise in the mic for the person that
I was speaking to. The device automatically connected
to my RAVR phone so connection was not an issue. I
am not crazy about the connect button on the mic arm,
and open the phone to connect to calls The set comes
with three different sizes of ear pieces which makes it
fit well for almost everyone. The battery life on this
device is amazing
Doesn't work. Will be sending back---home. Tried
pairing it up over 10 times with Iphone, which is with
Verizon, and it did not work with phone. Expecting a
refund back when I get the CD back to send with the
rest of items. Thank you anyway.
I bought the 510 based on the volume of positive
reviews. Well if this truly is the best Bluetooth headset
then I'm going back to using a wired headset. Sound
quality was poor, people couldn't hear me if there was
any background noise and the ear pieces were all
uncomfortable.
70. Personalized Review Recommendation in E-commerce
M.Tech thesis of Surjodoy Ghosh Dastider, IIT Kharagpur, 2018
Product: Mobile Phone XYZ
• Review 1: Good UI, user-friendly, sturdy
• Review 2: Poor battery life and performance of apps
• ...
User 1: prefers user-friendly phones
User 2: prefers high-performance phones
71. Methodology
Our proposed review recommendation system performs the following tasks:
1. Extract aspects and corresponding sentiments from all reviews and questions in a given product domain.
2. Represent each review, product, and user as a probability distribution over topics.
3. Generate a helpfulness/quality score for each review.
4. Generate the desired aspect distribution in the recommended set of reviews, using product and user profiles.
5. Select an optimal subset of reviews to recommend to a user.
6. Use user-cluster profiles instead of individual user profiles.
73. Domain Topic Identification
• We use a topic model, Sentiment-LDA (Li et al., 2010), for aspect-based sentiment analysis (ABSA), to extract product aspects and corresponding sentiments.
• Given a product domain, we discover the optimal number of global topics T for all product reviews in that domain.
• For each topic-sentiment pair, we retrieve the top 30 words.
75. Representation of Reviews and Products
We analyze each review r to get a vector V(r) of size 2T. Similarly, we find vectors for the questions available on the e-commerce platform.
For each product P, we aggregate the vectors of all reviews Rev(P) written for that product to generate the product vector V(P):
V(P) = Σ_{r ∈ Rev(P)} V(r)
76. Representation of Users
For each user u, we use three sources of information:
• Reviews written by that user, Rev_u(u): UR(u) = Σ_{r ∈ Rev_u(u)} V(r)
• Questions asked by that user, Ques(u): UQ(u) = Σ_{q ∈ Ques(u)} V(q)
• Reviews up-voted/down-voted by that user, Vote(u): UV(u) = Σ_{r ∈ Vote(u)} V(r)
Each such user vector gives the distribution of topics for that user, i.e., the aspects that the user is interested in. Some or all of them can be used to build the user profile U(u).
77. Personalization Strategy
The base review recommendation model considers the product vector V(P) as the desired distribution of topics with sentiments in the recommended review set.
A personalized system needs to take the user's interests into consideration. For this we use the user profile U(u).
It is important to balance the inputs from U(u) and V(P) to form the desired topic distribution in the recommended review set.
Thus the desired distribution is given by D(p, u) = f(V(p), U(u)).
78. Personalization Strategy (contd.)
We considered the following forms for the composition D(p, u) = V(p)^α · U(u)^β:
1. V(p) only: α = 1, β = 0.
2. U(u) only: α = 0, β = 1.
3. A combination of both terms, obtained by keeping α = 1 and varying β. A higher value of β corresponds to more aggressive personalization.
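A small sketch of the composition, with an assumption the slides leave implicit: the element-wise power product is renormalized so the result is again a distribution. The vectors and the default β are made up for illustration.

```python
import numpy as np

def desired_distribution(Vp, Uu, alpha=1.0, beta=0.5):
    """D(p, u) = V(p)^alpha * U(u)^beta (element-wise), renormalized
    so it remains a distribution over topic-sentiment pairs."""
    D = np.power(Vp, alpha) * np.power(Uu, beta)
    return D / D.sum()

Vp = np.array([1.0, 3.0])   # product topic vector (unnormalized)
Uu = np.array([5.0, 5.0])   # user profile vector

# beta = 0 ignores the user: D is just the normalized product vector
base = desired_distribution(Vp, Uu, alpha=1.0, beta=0.0)
# base == [0.25, 0.75]
```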
80. User Clustering
Drawback: latency issues when used in an online e-commerce platform. The reviews to be recommended to a user must be pre-computed in order to serve users in real time.
Workaround: group users based on the similarity of their user profiles, so that a different review recommendation can be associated with each group.
We used a clustering algorithm (k-means) to group the users based on their aspect preferences for each domain and created c groups of users.
We computed the mean vector U(C_i) of each group, and replaced the user's own vector U(u) with the mean vector U(C_i) of the user's group.
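The grouping step can be sketched with a minimal k-means over user profile vectors; in practice a library implementation would be used, and the toy data here is an assumption for illustration.

```python
import numpy as np

def kmeans(X, c, iters=20, seed=0):
    """Minimal k-means: cluster user profile vectors X (n, d) into c groups."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=c, replace=False)].astype(float)
    for _ in range(iters):
        # assign each user to the nearest center
        labels = np.argmin(((X[:, None, :] - centers[None]) ** 2).sum(-1), axis=1)
        for j in range(c):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels, centers

# two well-separated "preference" groups of users
X = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
labels, centers = kmeans(X, c=2)
# each user's U(u) is then replaced by the group mean centers[labels[u]]
```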
81. Dataset
• Used reviews from the product review dataset of Flipkart.
• Each review item includes the product title, user info, review rating, and plaintext review.
• Reviews were chosen from two popular product domains: mobile phones and computers.

Mobile Phones: 100,000 reviews, 5,987 products, 93,434 users
Computers: 40,958 reviews, 3,455 products, 39,336 users
82. Evaluation Strategy
One way to evaluate the system is to check whether the reviews recommended by the system are found to be helpful by the user.
We can check whether the reviews of a product that a user actually up-voted were recommended by the review recommendation algorithm. We used offline review up-vote data for this purpose.
83. Evaluation Strategy
Suppose that from the review set R = Rev(p), RU is the set up-voted by the user, and S is the set recommended. We evaluate the performance of our system using average precision and average recall.
Precision: percentage of reviews in the recommended set S which have been up-voted.
Recall: percentage of up-voted reviews RU returned in the recommended set S.
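The two metrics are set intersections over review IDs; a minimal sketch with illustrative IDs:

```python
def precision_recall(S, RU):
    """Precision: fraction of recommended reviews S that were up-voted.
    Recall: fraction of up-voted reviews RU that appear in S."""
    S, RU = set(S), set(RU)
    hits = len(S & RU)
    precision = hits / len(S) if S else 0.0
    recall = hits / len(RU) if RU else 0.0
    return precision, recall

# K = 5 recommended reviews; 3 of the product's reviews were up-voted
p, r = precision_recall([1, 2, 3, 4, 5], [2, 5, 9])
# p = 2/5 = 0.4, r = 2/3
```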
84. Experiments
1. For aspect-sentiment extraction, 6 topics were taken for cell phones and 8 topics for computers. The number of sentiments was taken as 2.
2. We used the following Dirichlet priors for our topic model: α = 0.1, β = 0.01, γ = 1.
3. Using the resulting global topics, we created vectors of size 6×2 = 12 for each mobile phone review, and of size 8×2 = 16 for each computer review. Based on these we created product profiles for each product using Rev(p), and user profiles for each user using three sources of information: UR(u), UQ(u), and UV(u).
85. Experiments
4. We used the following combinations of the aforementioned user vectors to obtain the final user vector U(u) for our personalization step:
   a. UR = UR(u)
   b. URQ = UR(u) + UQ(u)
   c. URV = UR(u) + UV(u)
   d. URQV = UR(u) + UQ(u) + UV(u)
5. Given a user u, a product p, and its reviews Rev(p), we recommended a subset S^K_{u,p} of K reviews using our algorithm, based on the desired distribution D(p, u).
86. Testing
We took the N = 1000 products from each domain with the highest review counts.
For each product, we randomly selected one user who had up-voted more than three reviews of that product, thus creating N product-user pairs.
For each product-user pair, we generated K recommendations. We experimented with the recommendation set size K, using values of 5, 10, and 15.
87. Results
• The system's recall for both domains increases as K increases.
• The user profile URQV, computed using all three information sources, works best.
• We next ran the grouping-based recommendation, experimenting with 5, 10, and 15 groups. The results using the group profile are better than the non-personalized case even with 5 groups, and in most cases improve with more groups (c).
• For the mobile phone domain with K = 5, the best precision was 11.57% for c = 15, compared to 12.04% for the individual approach.
• The best recall obtained for K = 15 is 26.76% for mobile phones and 23.85% for computers.
91. Conclusions and Future Work
• Our system for personalized review recommendation using individual profiles performs better than other proposed systems addressing the same problem.
• The group-based implementation is novel and makes our system practical.
• We plan to perform live testing to check the effectiveness of our method in increasing product purchases.