SlideShare a Scribd company logo
1 of 49
Download to read offline
Towards the Recommendation of
Personalised Activity Sequences in the
Tourism Domain
Gunjan Kumar, Houssem Jerbi, and Michael P. O’Mahony
Insight Centre for Data Analytics
University College Dublin
RecTour 2017, Como
Aug 27, 2017
e-tourism
e-tourism
Sensors: Rich User Activity Data
Sensors: Rich User Activity Data
• Sequential nature of user activities
• Activities have associated
features/context, e.g. location, time,
weather, etc.
Rich User Activity Data
For Recommender Systems
Facilitates real time recommendations for a given user and context
(e.g. time, location, weather, etc.)
Previous work:
A framework for sequence- and context-based recommendation of
next activity (lifelogging/modes of transport) to perform. [Kumar
et al., 2014, 2016]
Current Research Problem:
Recommending the next sequence of activities to users.
Insight Centre for Data Analytics RecTour 2017 Slide 5
Rich User Activity Data
For Recommender Systems
Facilitates real time recommendations for a given user and context
(e.g. time, location, weather, etc.)
Previous work:
A framework for sequence- and context-based recommendation of
next activity (lifelogging/modes of transport) to perform. [Kumar
et al., 2014, 2016]
Current Research Problem:
Recommending the next sequence of activities to users.
e.g. visiting a museum, having Italian food, and going to a theatre.
Insight Centre for Data Analytics RecTour 2017 Slide 5
Our Contribution
• A generic activity recommendation framework to recommend
the next sequence of activities to users based on past activity
patterns and context (extending [Kumar et al., 2014, 2016]).
• Application of the proposed approach in the tourism domain.
Experiments using a location checkin dataset.
Insight Centre for Data Analytics RecTour 2017 Slide 6
Related Work
Capturing Sequence
• Hierarchical-graph-based model:
- [Li et al., 2008; Zheng et al., 2009; Yoon et al., 2010]
• All-kth
-order Markov models:
- [Bohnenberger and Jameson, 2001; Deshpande and Karypis, 2004; Shani
et al., 2005]
Capturing Context
• Tensor and matrix factorization models:
- [Zheng et al., 2010, 2012; Wang et al., 2010; Symeonidis et al., 2011; ?;
Adomavicius et al., 2011; Braunhofer et al., 2013]
Related Work
Capturing Both Sequence & Context
• To improve recommendations
- [Adomavicius and Tuzhilin, 2005; Zheng et al., 2012]
• Content-based Activity Recommendation Framework
- [Kumar et al., 2014, 2016]
• Stochastic Modelling
- [Sun et al., 2016]
Framework Overview
Ranking
User Data TimelinesData Modelling
Timeline Matching
Append top ranked activity
to timeline/RS
Similarity Assesment
Recommended Sequence (RS)
Data Model
Activity Object
A single occurrence of an activity and consists of a set of features
describing the activity or the context.
Activity Timeline
A chronological sequence of n activity objects performed by the
user during a time interval δ:
T =< ao1, ao2, ..., aon >
Museum, 10:15, (53.343N,-6.317W), 11015
Theatre, 16:50, (53.340,-6.263W), 2008
Italian-Food, 12:31, (53.339N,-6.277W), 128
ao1 ao2 ao3
Recommendation Algorithm
User Timeline
Time
00 hrs 00 hrs 00 hrs
Insight Centre for Data Analytics RecTour 2017 Slide 11
Recommendation Algorithm
User Timeline
Time
00 hrs 00 hrs 00 hrs Target Activity Seq
< aot1
, aot2
, aot3
>
Current
Activity
(aoc)
? ? ?
Insight Centre for Data Analytics RecTour 2017 Slide 11
Recommendation Algorithm
00 hrs Target Activity Seq
< aot1
, aot2
, aot3
>
Current
Activity
(aoc)
? ? ?
Current Timeline
Insight Centre for Data Analytics RecTour 2017 Slide 11
Recommendation Algorithm
00 hrs Target Activity Seq
< aot1
, aot2
, aot3
>
Current
Activity
(aoc)
? ? ?
Current Timeline
Candidate Timeline #2
Candidate Timeline #1
Candidate Timeline #j
...
Insight Centre for Data Analytics RecTour 2017 Slide 11
Recommendation Algorithm
Target Activity Seq
< aot1
, aot2
, aot3
>
Current
Activity (aoc)
? ? ?
Current Timeline
Candidate Timeline #2
Candidate Timeline #1
Candidate Timeline #j
... N-count matching (N = 4)
124 3
124 3
124 3
124 3
Matching unit
Determines the length of the subsequences to be compared.
Insight Centre for Data Analytics RecTour 2017 Slide 11
Similarity Assessment
Target Activity Seq
< aot1
, aot2
, aot3
>
Current
Activity (aoc)
? ? ?
Current Timeline
Candidate Timeline #2
Candidate Timeline #1
Candidate Timeline #j
...
124 3
124 3
124 3
124 3
Two-level Edit Distance
[Kumar et al., 2014]
Insight Centre for Data Analytics RecTour 2017 Slide 12
Ranking
Target Activity Seq
< aot1
, aot2
, aot3
>
Current
Activity (aoc)
? ? ?
Current Timeline
Candidate Timeline #2
Candidate Timeline #1
Candidate Timeline #j
...
124 3
124 3
124 3
124 3
j
Ranked
1
Score(aoj
rec) = 1 −
d(Tj, Tc) − min
Tp∈T
d(Tp, Tc)
max
Tp∈T
d(Tp, Tc) − min
Tp∈T
d(Tp, Tc)
Insight Centre for Data Analytics RecTour 2017 Slide 13
Recommending
Target Activity Seq
< aot1
, aot2
, aot3
>
Current
Activity (aoc)
? ?
Current Timeline
Candidate Timeline #2
Candidate Timeline #1
Candidate Timeline #j
...
124 3
124 3
124 3
124 3
j
Ranked
1
aorec1
Insight Centre for Data Analytics RecTour 2017 Slide 14
Recommending
Target Activity Seq
< aot2 , aot3>
Current
Activity (aoc)
? ?
Current Timeline
Candidate Timeline #2
Candidate Timeline #1
Candidate Timeline #j
...
124 3
124 3
124 3
124 3
j
Ranked
1
aorec1
Insight Centre for Data Analytics RecTour 2017 Slide 14
Recommending
Target Activity Seq
< aot3>
Current
Activity (aoc)
? ?
Current Timeline
124 3
aorec2
aorec1
Recommendation Algorithm
( SeqNCSeqRec)
Insight Centre for Data Analytics RecTour 2017 Slide 14
Recommending
?
Recommended Sequence
aorec2
Recommendation Algorithm
( SeqNCSeqRec)
aorec1
aorec3
Insight Centre for Data Analytics RecTour 2017 Slide 14
Dataset
• Gowalla checkins dataset [Liu et al., 2013]
• Every checkin is bound to a specific location and timestamp.
• Features: latitude, longitude, number of users checking in to it,
number of photos taken at the location, etc.
• Locations have categories assigned to them, such as, ‘Italian
Food’, ‘Museum’, ‘City Park’, etc.
Food
Outdoors
Shopping
Nightlife
Entertainment
Community
Mexican
Travel
Asian
Italian
Coffee Shop
South American/Latin
Fish & Chips
Starbucks
Dunkin Donuts... ...
Level 1 Level 2 Level 3
(7) (134) (151)
Dataset
• Sampled dataset: 9.2k users , 6 million checkins in total.
• median # checkins per user per day: 10 - 270
• median # of distinct categories per user per day : 2 - 76
0
500
1,000
1,500
2,000
2,500
3,000
3,500
0 25 50 75 100
Median #checkins per user per day
Numofusers
(a)
0
1000
2000
3000
4000
5000
0 10 20 30 40 50 60 70 80
Median #distinct categories per user per day
Numofusers
(b)
Methodology
User Timeline
Time
00 hrs 00 hrs 00 hrs Target Activity Seq
< aot1
, aot2
, aot3
>
Current
Activity
(aoc)
? ? ?
Recommendation time
(RT)
• Leave-one-out evaluation: Each user’s complete timeline is
split into training and test timelines by time.
• Agreement @ k (k = 1, 2, 3): % of RTs for a user where the
first k categories in the recommended sequence and the actual
sequence are an exact match.
• Recommendation algorithms:
• N-count recommendation algorithm (SeqNCSeqRec)
• Bi-gram-based sequence recommender (BiGramSeqRec)
• Popularity-based sequence recommender (PopSeqRec)
Insight Centre for Data Analytics RecTour 2017 Slide 17
Recommendation Performance (Level 2)
Figure: Mean percentage agreements for recommended sequences for
SeqNCSeqRec (top 10% neighbours) and baseline algorithms using
timelines constructed from categories at level 2 in the hierarchy.
Insight Centre for Data Analytics RecTour 2017 Slide 18
Recommendation Performance
• Some level 2 activities are semantically closer than others.
• ‘true’ performance between those at level 2 and level 1.
Food
Outdoors
Shopping
Nightlife
Entertainment
Community
Mexican
Travel
Asian
Italian
Coffee Shop
South American/Latin
Fish & Chips
Starbucks
Dunkin Donuts... ...
Level 1 Level 2 Level 3
(7) (134) (151)
Insight Centre for Data Analytics RecTour 2017 Slide 19
Recommendation Performance
• Some level 2 activities are semantically closer than others.
• ‘true’ performance between those at level 2 and level 1.
Food
Outdoors
Shopping
Nightlife
Entertainment
Community
Live Music
Travel
Casino
Art Museum
Ice Skating
Theatre
Zoo
...
Level 1 Level 2 Level 3
(7) (134) (151)
Insight Centre for Data Analytics RecTour 2017 Slide 20
Recommendation Performance (Level 1)
Figure: Mean percentage agreements for recommended sequences for
SeqNCSeqRec (top 10% neighbours) and baseline algorithms using
timelines constructed from categories at level 1 in the hierarchy.
Insight Centre for Data Analytics RecTour 2017 Slide 21
Conclusion
• A generic activity recommendation framework to recommend
the next sequence of activities to users based on past activity
patterns and context (extending [Kumar et al., 2014, 2016]).
• Experiments demonstrate the efficacy of our approach in
recommending sequences given a diverse variety of activities
and user activity patterns.
Insight Centre for Data Analytics RecTour 2017 Slide 22
Future Work
• Consider alternative approaches to suggest sequences of
activities (for example, using RNNs).
• Introduce new evaluation metrics for evaluating sequence
recommendation.
• Investigate the recommendation of context (for example,
where, when, with whom etc.) associated with each of the
suggested sequence of activities.
• Consider socio-economic characteristics, user demographics,
and travel variables.
Insight Centre for Data Analytics RecTour 2017 Slide 23
Thank You
gunjan.kumar@insight-centre.org
@gunjanthesystem
References I
G. Adomavicius and A. Tuzhilin. Toward the next generation of recommender systems:
A survey of the state-of-the-art and possible extensions. IEEE Trans. on Knowl.
and Data Eng., 17(6):734–749, June 2005. ISSN 1041-4347. doi:
10.1109/TKDE.2005.99. URL http://dx.doi.org/10.1109/TKDE.2005.99.
G. Adomavicius, B. Mobasher, F. Ricci, and A. Tuzhilin. Context-aware recommender
systems. AI Magazine, 32(3), 2011.
T. Bohnenberger and A. Jameson. When policies are better than plans:
Decision-theoretic planning of recommendation sequences. In Proceedings of the
6th International Conference on Intelligent User Interfaces, IUI ’01, pages 21–24,
New York, NY, USA, 2001. ACM. ISBN 1-58113-325-1. doi:
10.1145/359784.359829. URL http://doi.acm.org/10.1145/359784.359829.
M. Braunhofer, M. Elahi, F. Ricci, and T. Schievenin. Context-Aware Points of
Interest Suggestion with Dynamic Weather Data Management, pages 87–100.
Springer-Verlag, Cham, 2013. ISBN 978-3-319-03973-2. doi:
10.1007/978-3-319-03973-2_7. URL
https://doi.org/10.1007/978-3-319-03973-2_7.
M. Deshpande and G. Karypis. Selective Markov models for predicting web page
accesses. ACM Transactions on Internet Technology, 4(2):163–184, May 2004.
ISSN 1533-5399. doi: 10.1145/990301.990304. URL
http://doi.acm.org/10.1145/990301.990304.
G. Dong and J. Pei. Sequence Data Mining (Advances in Database Systems).
Springer-Verlag New York, Inc., Secaucus, NJ, USA, 2007. ISBN 0387699368.
References II
G. Kumar, H. Jerbi, C. Gurrin, and M. P. O’Mahony. Towards activity
recommendation from lifelogs. In Proceedings of the 16th International Conference
on Information Integration and Web-based Applications & Services, iiWAS ’14,
pages 87–96, New York, NY, USA, 2014. ACM. ISBN 978-1-4503-3001-5. doi:
10.1145/2684200.2684298. URL http://doi.acm.org/10.1145/2684200.2684298.
G. Kumar, H. Jerbi, and M. P. O’Mahony. Personalised recommendations for modes
of transport: A sequence-based approach. The 5th ACM SIGKDD International
Workshop on Urban Computing (UrbComp 2016), 2016.
Q. Li, Y. Zheng, X. Xie, Y. Chen, W. Liu, and W.-Y. Ma. Mining user similarity based
on location history. In Proceedings of the 16th ACM SIGSPATIAL International
Conference on Advances in Geographic Information Systems, GIS ’08, pages
34:1–34:10, New York, NY, USA, 2008. ACM. ISBN 978-1-60558-323-5. doi:
10.1145/1463434.1463477. URL http://doi.acm.org/10.1145/1463434.1463477.
X. Liu, Y. Liu, K. Aberer, and C. Miao. Personalized point-of-interest recommendation
by mining users’ preference transition. In Proceedings of the 22nd ACM
International Conference on Information & Knowledge Management, CIKM ’13,
pages 733–738, New York, NY, USA, 2013. ACM. ISBN 978-1-4503-2263-8. doi:
10.1145/2505515.2505639. URL http://doi.acm.org/10.1145/2505515.2505639.
G. Shani, D. Heckerman, and R. I. Brafman. An MDP-based recommender system.
Journal of Machine Learning Research, 6:1265–1295, Dec. 2005. ISSN 1532-4435.
URL http://dl.acm.org/citation.cfm?id=1046920.1088715.
Insight Centre for Data Analytics RecTour 2017 Slide 2
References III
Y. Sun, N. J. Yuan, X. Xie, K. McDonald, and R. Zhang. Collaborative nowcasting for
contextual recommendation. In Proceedings of the 25th International Conference
on World Wide Web, WWW ’16, pages 1407–1418, Republic and Canton of
Geneva, Switzerland, 2016. International World Wide Web Conferences Steering
Committee. ISBN 978-1-4503-4143-1. doi: 10.1145/2872427.2874812. URL
http://dx.doi.org/10.1145/2872427.2874812.
P. Symeonidis, A. Papadimitriou, Y. Manolopoulos, P. Senkul, and I. Toroslu.
Geo-social recommendations based on incremental tensor reduction and local path
traversal. In Proceedings of the 3rd ACM SIGSPATIAL International Workshop on
Location-Based Social Networks, LBSN ’11, pages 89–96, New York, NY, USA,
2011. ACM. ISBN 978-1-4503-1033-8. doi: 10.1145/2063212.2063228. URL
http://doi.acm.org/10.1145/2063212.2063228.
C.-Y. Wang, Y.-H. Wu, and S.-C. T. Chou. Toward a ubiquitous personalized daily-life
activity recommendation service with contextual information: A services science
perspective. Information Systems and E-Business Management, 8(1):13–32,
January 2010.
Z. Xing, J. Pei, and E. Keogh. A brief survey on sequence classification. SIGKDD
Explor. Newsl., 12(1):40–48, Nov. 2010. ISSN 1931-0145. doi:
10.1145/1882471.1882478. URL http://doi.acm.org/10.1145/1882471.1882478.
Insight Centre for Data Analytics RecTour 2017 Slide 3
References IV
H. Yoon, Y. Zheng, X. Xie, and W. Woo. Smart itinerary recommendation based on
user-generated GPS trajectories. In Proceedings of the 7th International
Conference on Ubiquitous Intelligence and Computing, UIC’10, pages 19–34,
Berlin, Heidelberg, 2010. Springer-Verlag. ISBN 3-642-16354-8,
978-3-642-16354-8. URL http://dl.acm.org/citation.cfm?id=1929661.1929669.
V. W. Zheng, B. Cao, Y. Zheng, X. Xie, and Q. Yang. Collaborative filtering meets
mobile recommendation: A user-centered approach. In AAAI 2010. Association for
Computing Machinery, Inc., July 2010. URL
http://research.microsoft.com/apps/pubs/default.aspx?id=143146.
V. W. Zheng, Y. Zheng, X. Xie, and Q. Yang. Towards mobile intelligence: Learning
from GPS history data for collaborative recommendation. Artif. Intell., 184-185:
17–37, June 2012. ISSN 0004-3702. doi: 10.1016/j.artint.2012.02.002. URL
http://dx.doi.org/10.1016/j.artint.2012.02.002.
Y. Zheng, L. Zhang, X. Xie, and W.-Y. Ma. Mining interesting locations and travel
sequences from GPS trajectories. In Proceedings of the 18th International
Conference on World Wide Web, WWW ’09, pages 791–800, New York, NY, USA,
2009. ACM. ISBN 978-1-60558-487-4. doi: 10.1145/1526709.1526816. URL
http://doi.acm.org/10.1145/1526709.1526816.
Insight Centre for Data Analytics RecTour 2017 Slide 4
Some Extra Bits
Insight Centre for Data Analytics RecTour 2017 Slide 5
Trend across matching-unit N
5
6
7
8
0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000 1111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 2222222222222222222222222222222222222222222222222222222222222222222222222222222222222222222222222222 3333333333333333333333333333333333333333333333333333333333333333333333333333333333333333333333333333 4444444444444444444444444444444444444444444444444444444444444444444444444444444444444444444444444444 6666666666666666666666666666666666666666666666666666666666666666666666666666666666666666666666666666 8888888888888888888888888888888888888888888888888888888888888888888888888888888888888888888888888888
N−count
Mean%agreement
Figure: MRR versus matching unit for three user groups for first
sequence index.
Two-level Distance
• Inspired by edit distance
• Adapted for sequence of objects
09:00 10:05 10:35 10:50 11:3211:25
Two-level Distance
• Inspired by edit distance
• Adapted for sequence of objects
09:00 10:05 10:35 10:50 11:32
Step 1
Align activities
11:25
( (
dactivity (T1, T2) =
r
i=1
wobj × cins +
s
j=1
wobj × cdel +
t
k=1
wobj × csub
Two-level Distance
• Inspired by edit distance
• Adapted for sequence of objects
09:00 10:05 10:35 10:50 11:32
Step 1
Step 2
Align activities
Align features
11:25
(
(
(
(
d(T1, T2) = dactivity (T1, T2) +
n
i=1
dfeature(ao1
i , ao2
i ) .
Methodology
Learning Optimal Matching Unit range
• Wrapper attribute selection: C4.5 algorithm, greedy
backward search and area under ROC curve as evaluation
measure.
• Classification: pruned attribute vectors for each user fed into
a C4.5 induction algorithm to predict optimal matching unit
range.
Insight Centre for Data Analytics RecTour 2017 Slide 8
Attribute Extraction: Timeline Decomposition
• Each user represented by an attribute vector.
• For attribute extraction:
timelines are decomposed into features-sequence :
User Timeline
Time
00 hrs 00 hrs 00 hrs
Attribute Extraction: Timeline Decomposition
• Each user represented by an attribute vector.
• For attribute extraction:
timelines are decomposed into features-sequence :
User Timeline
Time
00 hrs 00 hrs 00 hrs
dist-travel
start-geo
start-time
Attribute Extraction: Timeline Decomposition
• Each user represented by an attribute vector.
• For attribute extraction:
timelines are decomposed into features-sequence :
User Timeline
Time
00 hrs 00 hrs 00 hrs
dist-travel
start-geo
dist-travel
start-geo
start-time
start-time
Timeline Attributes
Timeline Attributes
Regularity
Repetition
Timeline Attributes
Regularity Attributes: Sample Entropy
1. SampEnp
z : sample entropy of a feature sequence Sz for epoch
length p,
2. µSampEnp
T : mean sample entropy over all feature sequences
Sz , z = 1, 2, ..., m of timeline T for epoch length p,
3. σSampEnp
T : standard deviation of sample entropy over all
feature sequences Sz , z = 1, 2, ..., m of timeline T for epoch
length p.
Insight Centre for Data Analytics RecTour 2017 Slide 11
Timeline Attributes
Repetition Attributes: k-gram attributes
Previously used for sequence classification, biological sequence
analysis and text classification [Xing et al., 2010; Dong and Pei, 2007].
1. ηk
z : total number of distinct k-grams in feature sequence Sz ,
normalised by total number of k-grams occurring in Sz ,
2. µf k
z : mean frequency of occurrence of distinct k-grams in
feature sequence Sz , normalised by total number of k-grams
occurring in Sz ,
3. σf k
z : standard deviation of frequency of occurrence of distinct
k-grams in feature sequence Sz , normalised by length of Sz .
Insight Centre for Data Analytics RecTour 2017 Slide 12

More Related Content

Similar to Recommending Sequences RecTour 2017

Assessment outcomes from the TENCompetence project
Assessment outcomes from the TENCompetence projectAssessment outcomes from the TENCompetence project
Assessment outcomes from the TENCompetence projectUniversity of Strathclyde
 
MediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
MediaEval 2016 - IR Evaluation: Putting the User Back in the LoopMediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
MediaEval 2016 - IR Evaluation: Putting the User Back in the Loopmultimediaeval
 
Markov chain for the recommendation of
Markov chain for the recommendation ofMarkov chain for the recommendation of
Markov chain for the recommendation ofIJCSEA Journal
 
MARKOV CHAIN FOR THE RECOMMENDATION OF MATERIALIZED VIEWS IN REAL-TIME DATA W...
MARKOV CHAIN FOR THE RECOMMENDATION OF MATERIALIZED VIEWS IN REAL-TIME DATA W...MARKOV CHAIN FOR THE RECOMMENDATION OF MATERIALIZED VIEWS IN REAL-TIME DATA W...
MARKOV CHAIN FOR THE RECOMMENDATION OF MATERIALIZED VIEWS IN REAL-TIME DATA W...IJCSEA Journal
 
Markov Chain for the Recommendation of Materialized Views in Real-Time Data W...
Markov Chain for the Recommendation of Materialized Views in Real-Time Data W...Markov Chain for the Recommendation of Materialized Views in Real-Time Data W...
Markov Chain for the Recommendation of Materialized Views in Real-Time Data W...IJCSEA Journal
 
[AFEL] Neighborhood Troubles: On the Value of User Pre-Filtering To Speed Up ...
[AFEL] Neighborhood Troubles: On the Value of User Pre-Filtering To Speed Up ...[AFEL] Neighborhood Troubles: On the Value of User Pre-Filtering To Speed Up ...
[AFEL] Neighborhood Troubles: On the Value of User Pre-Filtering To Speed Up ...Emanuel Lacić
 
Contextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsContextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsMatthias Braunhofer
 
Geotecs: Exploiting Geographical, temporal, categorical, and social context f...
Geotecs: Exploiting Geographical, temporal, categorical, and social context f...Geotecs: Exploiting Geographical, temporal, categorical, and social context f...
Geotecs: Exploiting Geographical, temporal, categorical, and social context f...rameshraj
 
Classical Approaches in Test Estimation
Classical Approaches in Test EstimationClassical Approaches in Test Estimation
Classical Approaches in Test EstimationGlobalLogic Ukraine
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald
 
Context-Aware Points of Interest Suggestion with Dynamic Weather Data Management
Context-Aware Points of Interest Suggestion with Dynamic Weather Data ManagementContext-Aware Points of Interest Suggestion with Dynamic Weather Data Management
Context-Aware Points of Interest Suggestion with Dynamic Weather Data ManagementMatthias Braunhofer
 
Menno-Jan Kraak - The time wave
Menno-Jan Kraak - The time waveMenno-Jan Kraak - The time wave
Menno-Jan Kraak - The time waveswenney
 
Preservation Planning using Plato, by Hannes Kulovits and Andreas Rauber
Preservation Planning using Plato, by Hannes Kulovits and Andreas RauberPreservation Planning using Plato, by Hannes Kulovits and Andreas Rauber
Preservation Planning using Plato, by Hannes Kulovits and Andreas RauberJISC KeepIt project
 
Predicting Stock Market Price Using Support Vector Regression
Predicting Stock Market Price Using Support Vector RegressionPredicting Stock Market Price Using Support Vector Regression
Predicting Stock Market Price Using Support Vector RegressionChittagong Independent University
 
cloudworkloadanalysisandsimulation-140521153543-phpapp02
cloudworkloadanalysisandsimulation-140521153543-phpapp02cloudworkloadanalysisandsimulation-140521153543-phpapp02
cloudworkloadanalysisandsimulation-140521153543-phpapp02PRIYANKA MEHTA
 
Cloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataCloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataAbhishek M Shivalingaiah
 
Surveying, Planning and Scheduling For A Hill Road Work at Kalrayan Hills by ...
Surveying, Planning and Scheduling For A Hill Road Work at Kalrayan Hills by ...Surveying, Planning and Scheduling For A Hill Road Work at Kalrayan Hills by ...
Surveying, Planning and Scheduling For A Hill Road Work at Kalrayan Hills by ...IJSRD
 

Similar to Recommending Sequences RecTour 2017 (20)

Assessment outcomes from the TENCompetence project
Assessment outcomes from the TENCompetence projectAssessment outcomes from the TENCompetence project
Assessment outcomes from the TENCompetence project
 
MediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
MediaEval 2016 - IR Evaluation: Putting the User Back in the LoopMediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
MediaEval 2016 - IR Evaluation: Putting the User Back in the Loop
 
Markov chain for the recommendation of
Markov chain for the recommendation ofMarkov chain for the recommendation of
Markov chain for the recommendation of
 
MARKOV CHAIN FOR THE RECOMMENDATION OF MATERIALIZED VIEWS IN REAL-TIME DATA W...
MARKOV CHAIN FOR THE RECOMMENDATION OF MATERIALIZED VIEWS IN REAL-TIME DATA W...MARKOV CHAIN FOR THE RECOMMENDATION OF MATERIALIZED VIEWS IN REAL-TIME DATA W...
MARKOV CHAIN FOR THE RECOMMENDATION OF MATERIALIZED VIEWS IN REAL-TIME DATA W...
 
Markov Chain for the Recommendation of Materialized Views in Real-Time Data W...
Markov Chain for the Recommendation of Materialized Views in Real-Time Data W...Markov Chain for the Recommendation of Materialized Views in Real-Time Data W...
Markov Chain for the Recommendation of Materialized Views in Real-Time Data W...
 
[AFEL] Neighborhood Troubles: On the Value of User Pre-Filtering To Speed Up ...
[AFEL] Neighborhood Troubles: On the Value of User Pre-Filtering To Speed Up ...[AFEL] Neighborhood Troubles: On the Value of User Pre-Filtering To Speed Up ...
[AFEL] Neighborhood Troubles: On the Value of User Pre-Filtering To Speed Up ...
 
Contextual information elicitation in travel recommender systems
Contextual information elicitation in travel recommender systemsContextual information elicitation in travel recommender systems
Contextual information elicitation in travel recommender systems
 
Contextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender SystemsContextual Information Elicitation in Travel Recommender Systems
Contextual Information Elicitation in Travel Recommender Systems
 
Geotecs: Exploiting Geographical, temporal, categorical, and social context f...
Geotecs: Exploiting Geographical, temporal, categorical, and social context f...Geotecs: Exploiting Geographical, temporal, categorical, and social context f...
Geotecs: Exploiting Geographical, temporal, categorical, and social context f...
 
Gamifying Research Activity Support System
Gamifying Research Activity Support SystemGamifying Research Activity Support System
Gamifying Research Activity Support System
 
Classical Approaches in Test Estimation
Classical Approaches in Test EstimationClassical Approaches in Test Estimation
Classical Approaches in Test Estimation
 
Dominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender SystemsDominik Kowald PhD Defense Recommender Systems
Dominik Kowald PhD Defense Recommender Systems
 
Context-Aware Points of Interest Suggestion with Dynamic Weather Data Management
Context-Aware Points of Interest Suggestion with Dynamic Weather Data ManagementContext-Aware Points of Interest Suggestion with Dynamic Weather Data Management
Context-Aware Points of Interest Suggestion with Dynamic Weather Data Management
 
Context-Aware Points of Interest Suggestion with Dynamic Weather Data Management
Context-Aware Points of Interest Suggestion with Dynamic Weather Data ManagementContext-Aware Points of Interest Suggestion with Dynamic Weather Data Management
Context-Aware Points of Interest Suggestion with Dynamic Weather Data Management
 
Menno-Jan Kraak - The time wave
Menno-Jan Kraak - The time waveMenno-Jan Kraak - The time wave
Menno-Jan Kraak - The time wave
 
Preservation Planning using Plato, by Hannes Kulovits and Andreas Rauber
Preservation Planning using Plato, by Hannes Kulovits and Andreas RauberPreservation Planning using Plato, by Hannes Kulovits and Andreas Rauber
Preservation Planning using Plato, by Hannes Kulovits and Andreas Rauber
 
Predicting Stock Market Price Using Support Vector Regression
Predicting Stock Market Price Using Support Vector RegressionPredicting Stock Market Price Using Support Vector Regression
Predicting Stock Market Price Using Support Vector Regression
 
cloudworkloadanalysisandsimulation-140521153543-phpapp02
cloudworkloadanalysisandsimulation-140521153543-phpapp02cloudworkloadanalysisandsimulation-140521153543-phpapp02
cloudworkloadanalysisandsimulation-140521153543-phpapp02
 
Cloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big DataCloudera Movies Data Science Project On Big Data
Cloudera Movies Data Science Project On Big Data
 
Surveying, Planning and Scheduling For A Hill Road Work at Kalrayan Hills by ...
Surveying, Planning and Scheduling For A Hill Road Work at Kalrayan Hills by ...Surveying, Planning and Scheduling For A Hill Road Work at Kalrayan Hills by ...
Surveying, Planning and Scheduling For A Hill Road Work at Kalrayan Hills by ...
 

Recently uploaded

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 

Recently uploaded (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

Recommending Sequences RecTour 2017

  • 1. Towards the Recommendation of Personalised Activity Sequences in the Tourism Domain Gunjan Kumar, Houssem Jerbi, and Michael P. O’Mahony Insight Centre for Data Analytics University College Dublin RecTour 2017, Como Aug 27, 2017
  • 4. Sensors: Rich User Activity Data
  • 5. Sensors: Rich User Activity Data • Sequential nature of user activities • Activities have associated features/context, e.g. location, time, weather, etc.
  • 6. Rich User Activity Data For Recommender Systems Facilitates real time recommendations for a given user and context (e.g. time, location, weather, etc.) Previous work: A framework for sequence- and context-based recommendation of next activity (lifelogging/modes of transport) to perform. [Kumar et al., 2014, 2016] Current Research Problem: Recommending the next sequence of activities to users. Insight Centre for Data Analytics RecTour 2017 Slide 5
  • 7. Rich User Activity Data For Recommender Systems Facilitates real time recommendations for a given user and context (e.g. time, location, weather, etc.) Previous work: A framework for sequence- and context-based recommendation of next activity (lifelogging/modes of transport) to perform. [Kumar et al., 2014, 2016] Current Research Problem: Recommending the next sequence of activities to users. e.g. visiting a museum, having Italian food, and going to a theatre. Insight Centre for Data Analytics RecTour 2017 Slide 5
  • 8. Our Contribution • A generic activity recommendation framework to recommend the next sequence of activities to users based on past activity patterns and context (extending [Kumar et al., 2014, 2016]). • Application of the proposed approach in the tourism domain. Experiments using a location checkin dataset. Insight Centre for Data Analytics RecTour 2017 Slide 6
  • 9. Related Work Capturing Sequence • Hierarchical-graph-based model: - [Li et al., 2008; Zheng et al., 2009; Yoon et al., 2010] • All-kth -order Markov models: - [Bohnenberger and Jameson, 2001; Deshpande and Karypis, 2004; Shani et al., 2005] Capturing Context • Tensor and matrix factorization models: - [Zheng et al., 2010, 2012; Wang et al., 2010; Symeonidis et al., 2011; ?; Adomavicius et al., 2011; Braunhofer et al., 2013]
  • 10. Related Work Capturing Both Sequence & Context • To improve recommendations - [Adomavicius and Tuzhilin, 2005; Zheng et al., 2012] • Content-based Activity Recommendation Framework - [Kumar et al., 2014, 2016] • Stochastic Modelling - [Sun et al., 2016]
  • 11. Framework Overview Ranking User Data TimelinesData Modelling Timeline Matching Append top ranked activity to timeline/RS Similarity Assesment Recommended Sequence (RS)
  • 12. Data Model Activity Object A single occurrence of an activity and consists of a set of features describing the activity or the context. Activity Timeline A chronological sequence of n activity objects performed by the user during a time interval δ: T =< ao1, ao2, ..., aon > Museum, 10:15, (53.343N,-6.317W), 11015 Theatre, 16:50, (53.340,-6.263W), 2008 Italian-Food, 12:31, (53.339N,-6.277W), 128 ao1 ao2 ao3
  • 13. Recommendation Algorithm User Timeline Time 00 hrs 00 hrs 00 hrs Insight Centre for Data Analytics RecTour 2017 Slide 11
  • 14. Recommendation Algorithm User Timeline Time 00 hrs 00 hrs 00 hrs Target Activity Seq < aot1 , aot2 , aot3 > Current Activity (aoc) ? ? ? Insight Centre for Data Analytics RecTour 2017 Slide 11
  • 15. Recommendation Algorithm 00 hrs Target Activity Seq < aot1 , aot2 , aot3 > Current Activity (aoc) ? ? ? Current Timeline Insight Centre for Data Analytics RecTour 2017 Slide 11
  • 16. Recommendation Algorithm 00 hrs Target Activity Seq < aot1 , aot2 , aot3 > Current Activity (aoc) ? ? ? Current Timeline Candidate Timeline #2 Candidate Timeline #1 Candidate Timeline #j ... Insight Centre for Data Analytics RecTour 2017 Slide 11
  • 17. Recommendation Algorithm Target Activity Seq < aot1 , aot2 , aot3 > Current Activity (aoc) ? ? ? Current Timeline Candidate Timeline #2 Candidate Timeline #1 Candidate Timeline #j ... N-count matching (N = 4) 124 3 124 3 124 3 124 3 Matching unit Determines the length of the subsequences to be compared. Insight Centre for Data Analytics RecTour 2017 Slide 11
  • 18. Similarity Assessment Target Activity Seq < aot1 , aot2 , aot3 > Current Activity (aoc) ? ? ? Current Timeline Candidate Timeline #2 Candidate Timeline #1 Candidate Timeline #j ... 124 3 124 3 124 3 124 3 Two-level Edit Distance [Kumar et al., 2014] Insight Centre for Data Analytics RecTour 2017 Slide 12
  • 19. Ranking Target Activity Seq < aot1 , aot2 , aot3 > Current Activity (aoc) ? ? ? Current Timeline Candidate Timeline #2 Candidate Timeline #1 Candidate Timeline #j ... 124 3 124 3 124 3 124 3 j Ranked 1 Score(aoj rec) = 1 − d(Tj, Tc) − min Tp∈T d(Tp, Tc) max Tp∈T d(Tp, Tc) − min Tp∈T d(Tp, Tc) Insight Centre for Data Analytics RecTour 2017 Slide 13
  • 20. Recommending Target Activity Seq < aot1 , aot2 , aot3 > Current Activity (aoc) ? ? Current Timeline Candidate Timeline #2 Candidate Timeline #1 Candidate Timeline #j ... 124 3 124 3 124 3 124 3 j Ranked 1 aorec1 Insight Centre for Data Analytics RecTour 2017 Slide 14
  • 21. Recommending Target Activity Seq < aot2 , aot3> Current Activity (aoc) ? ? Current Timeline Candidate Timeline #2 Candidate Timeline #1 Candidate Timeline #j ... 124 3 124 3 124 3 124 3 j Ranked 1 aorec1 Insight Centre for Data Analytics RecTour 2017 Slide 14
  • 22. Recommending Target Activity Seq < aot3> Current Activity (aoc) ? ? Current Timeline 124 3 aorec2 aorec1 Recommendation Algorithm ( SeqNCSeqRec) Insight Centre for Data Analytics RecTour 2017 Slide 14
  • 23. Recommending ? Recommended Sequence aorec2 Recommendation Algorithm ( SeqNCSeqRec) aorec1 aorec3 Insight Centre for Data Analytics RecTour 2017 Slide 14
  • 24. Dataset • Gowalla checkins dataset [Liu et al., 2013] • Every checkin is bound to a specific location and timestamp. • Features: latitude, longitude, number of users checking in to it, number of photos taken at the location, etc. • Locations have categories assigned to them, such as, ‘Italian Food’, ‘Museum’, ‘City Park’, etc. Food Outdoors Shopping Nightlife Entertainment Community Mexican Travel Asian Italian Coffee Shop South American/Latin Fish & Chips Starbucks Dunkin Donuts... ... Level 1 Level 2 Level 3 (7) (134) (151)
  • 25. Dataset • Sampled dataset: 9.2k users , 6 million checkins in total. • median # checkins per user per day: 10 - 270 • median # of distinct categories per user per day : 2 - 76 0 500 1,000 1,500 2,000 2,500 3,000 3,500 0 25 50 75 100 Median #checkins per user per day Numofusers (a) 0 1000 2000 3000 4000 5000 0 10 20 30 40 50 60 70 80 Median #distinct categories per user per day Numofusers (b)
  • 26. Methodology User Timeline Time 00 hrs 00 hrs 00 hrs Target Activity Seq < aot1 , aot2 , aot3 > Current Activity (aoc) ? ? ? Recommendation time (RT) • Leave-one-out evaluation: Each user’s complete timeline is split into training and test timelines by time. • Agreement @ k (k = 1, 2, 3): % of RTs for a user where the first k categories in the recommended sequence and the actual sequence are an exact match. • Recommendation algorithms: • N-count recommendation algorithm (SeqNCSeqRec) • Bi-gram-based sequence recommender (BiGramSeqRec) • Popularity-based sequence recommender (PopSeqRec) Insight Centre for Data Analytics RecTour 2017 Slide 17
  • 27. Recommendation Performance (Level 2) Figure: Mean percentage agreements for recommended sequences for SeqNCSeqRec (top 10% neighbours) and baseline algorithms using timelines constructed from categories at level 2 in the hierarchy. Insight Centre for Data Analytics RecTour 2017 Slide 18
  • 28. Recommendation Performance • Some level 2 activities are semantically closer than others. • ‘true’ performance between those at level 2 and level 1. Food Outdoors Shopping Nightlife Entertainment Community Mexican Travel Asian Italian Coffee Shop South American/Latin Fish & Chips Starbucks Dunkin Donuts... ... Level 1 Level 2 Level 3 (7) (134) (151) Insight Centre for Data Analytics RecTour 2017 Slide 19
  • 29. Recommendation Performance • Some level 2 activities are semantically closer than others. • ‘true’ performance between those at level 2 and level 1. Food Outdoors Shopping Nightlife Entertainment Community Live Music Travel Casino Art Museum Ice Skating Theatre Zoo ... Level 1 Level 2 Level 3 (7) (134) (151) Insight Centre for Data Analytics RecTour 2017 Slide 20
  • 30. Recommendation Performance (Level 1) Figure: Mean percentage agreements for recommended sequences for SeqNCSeqRec (top 10% neighbours) and baseline algorithms using timelines constructed from categories at level 1 in the hierarchy. Insight Centre for Data Analytics RecTour 2017 Slide 21
  • 31. Conclusion • A generic activity recommendation framework to recommend the next sequence of activities to users based on past activity patterns and context (extending [Kumar et al., 2014, 2016]). • Experiments demonstrate the efficacy of our approach in recommending sequences given a diverse variety of activities and user activity patterns. Insight Centre for Data Analytics RecTour 2017 Slide 22
  • 32. Future Work • Consider alternative approaches to suggest sequences of activities (for example, using RNNs). • Introduce new evaluation metrics for evaluating sequence recommendation. • Investigate the recommendation of context (for example, where, when, with whom etc.) associated with each of the suggested sequence of activities. • Consider socio-economic characteristics, user demographics, and travel variables. Insight Centre for Data Analytics RecTour 2017 Slide 23
  • 34. References I G. Adomavicius and A. Tuzhilin. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Trans. on Knowl. and Data Eng., 17(6):734–749, June 2005. ISSN 1041-4347. doi: 10.1109/TKDE.2005.99. URL http://dx.doi.org/10.1109/TKDE.2005.99. G. Adomavicius, B. Mobasher, F. Ricci, and A. Tuzhilin. Context-aware recommender systems. AI Magazine, 32(3), 2011. T. Bohnenberger and A. Jameson. When policies are better than plans: Decision-theoretic planning of recommendation sequences. In Proceedings of the 6th International Conference on Intelligent User Interfaces, IUI ’01, pages 21–24, New York, NY, USA, 2001. ACM. ISBN 1-58113-325-1. doi: 10.1145/359784.359829. URL http://doi.acm.org/10.1145/359784.359829. M. Braunhofer, M. Elahi, F. Ricci, and T. Schievenin. Context-Aware Points of Interest Suggestion with Dynamic Weather Data Management, pages 87–100. Springer-Verlag, Cham, 2013. ISBN 978-3-319-03973-2. doi: 10.1007/978-3-319-03973-2_7. URL https://doi.org/10.1007/978-3-319-03973-2_7. M. Deshpande and G. Karypis. Selective Markov models for predicting web page accesses. ACM Transactions on Internet Technology, 4(2):163–184, May 2004. ISSN 1533-5399. doi: 10.1145/990301.990304. URL http://doi.acm.org/10.1145/990301.990304. G. Dong and J. Pei. Sequence Data Mining (Advances in Database Systems). Springer-Verlag New York, Inc., Secaucus, NJ, USA, 2007. ISBN 0387699368.
  • 35. References II G. Kumar, H. Jerbi, C. Gurrin, and M. P. O’Mahony. Towards activity recommendation from lifelogs. In Proceedings of the 16th International Conference on Information Integration and Web-based Applications & Services, iiWAS ’14, pages 87–96, New York, NY, USA, 2014. ACM. ISBN 978-1-4503-3001-5. doi: 10.1145/2684200.2684298. URL http://doi.acm.org/10.1145/2684200.2684298. G. Kumar, H. Jerbi, and M. P. O’Mahony. Personalised recommendations for modes of transport: A sequence-based approach. The 5th ACM SIGKDD International Workshop on Urban Computing (UrbComp 2016), 2016. Q. Li, Y. Zheng, X. Xie, Y. Chen, W. Liu, and W.-Y. Ma. Mining user similarity based on location history. In Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS ’08, pages 34:1–34:10, New York, NY, USA, 2008. ACM. ISBN 978-1-60558-323-5. doi: 10.1145/1463434.1463477. URL http://doi.acm.org/10.1145/1463434.1463477. X. Liu, Y. Liu, K. Aberer, and C. Miao. Personalized point-of-interest recommendation by mining users’ preference transition. In Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, CIKM ’13, pages 733–738, New York, NY, USA, 2013. ACM. ISBN 978-1-4503-2263-8. doi: 10.1145/2505515.2505639. URL http://doi.acm.org/10.1145/2505515.2505639. G. Shani, D. Heckerman, and R. I. Brafman. An MDP-based recommender system. Journal of Machine Learning Research, 6:1265–1295, Dec. 2005. ISSN 1532-4435. URL http://dl.acm.org/citation.cfm?id=1046920.1088715. Insight Centre for Data Analytics RecTour 2017 Slide 2
  • 36. References III Y. Sun, N. J. Yuan, X. Xie, K. McDonald, and R. Zhang. Collaborative nowcasting for contextual recommendation. In Proceedings of the 25th International Conference on World Wide Web, WWW ’16, pages 1407–1418, Republic and Canton of Geneva, Switzerland, 2016. International World Wide Web Conferences Steering Committee. ISBN 978-1-4503-4143-1. doi: 10.1145/2872427.2874812. URL http://dx.doi.org/10.1145/2872427.2874812. P. Symeonidis, A. Papadimitriou, Y. Manolopoulos, P. Senkul, and I. Toroslu. Geo-social recommendations based on incremental tensor reduction and local path traversal. In Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks, LBSN ’11, pages 89–96, New York, NY, USA, 2011. ACM. ISBN 978-1-4503-1033-8. doi: 10.1145/2063212.2063228. URL http://doi.acm.org/10.1145/2063212.2063228. C.-Y. Wang, Y.-H. Wu, and S.-C. T. Chou. Toward a ubiquitous personalized daily-life activity recommendation service with contextual information: A services science perspective. Information Systems and E-Business Management, 8(1):13–32, January 2010. Z. Xing, J. Pei, and E. Keogh. A brief survey on sequence classification. SIGKDD Explor. Newsl., 12(1):40–48, Nov. 2010. ISSN 1931-0145. doi: 10.1145/1882471.1882478. URL http://doi.acm.org/10.1145/1882471.1882478. Insight Centre for Data Analytics RecTour 2017 Slide 3
  • 37. References IV H. Yoon, Y. Zheng, X. Xie, and W. Woo. Smart itinerary recommendation based on user-generated GPS trajectories. In Proceedings of the 7th International Conference on Ubiquitous Intelligence and Computing, UIC’10, pages 19–34, Berlin, Heidelberg, 2010. Springer-Verlag. ISBN 3-642-16354-8, 978-3-642-16354-8. URL http://dl.acm.org/citation.cfm?id=1929661.1929669. V. W. Zheng, B. Cao, Y. Zheng, X. Xie, and Q. Yang. Collaborative filtering meets mobile recommendation: A user-centered approach. In AAAI 2010. Association for Computing Machinery, Inc., July 2010. URL http://research.microsoft.com/apps/pubs/default.aspx?id=143146. V. W. Zheng, Y. Zheng, X. Xie, and Q. Yang. Towards mobile intelligence: Learning from GPS history data for collaborative recommendation. Artif. Intell., 184-185: 17–37, June 2012. ISSN 0004-3702. doi: 10.1016/j.artint.2012.02.002. URL http://dx.doi.org/10.1016/j.artint.2012.02.002. Y. Zheng, L. Zhang, X. Xie, and W.-Y. Ma. Mining interesting locations and travel sequences from GPS trajectories. In Proceedings of the 18th International Conference on World Wide Web, WWW ’09, pages 791–800, New York, NY, USA, 2009. ACM. ISBN 978-1-60558-487-4. doi: 10.1145/1526709.1526816. URL http://doi.acm.org/10.1145/1526709.1526816. Insight Centre for Data Analytics RecTour 2017 Slide 4
  • 38. Some Extra Bits Insight Centre for Data Analytics RecTour 2017 Slide 5
  • 39. Trend across matching-unit N 5 6 7 8 0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000 1111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 2222222222222222222222222222222222222222222222222222222222222222222222222222222222222222222222222222 3333333333333333333333333333333333333333333333333333333333333333333333333333333333333333333333333333 4444444444444444444444444444444444444444444444444444444444444444444444444444444444444444444444444444 6666666666666666666666666666666666666666666666666666666666666666666666666666666666666666666666666666 8888888888888888888888888888888888888888888888888888888888888888888888888888888888888888888888888888 N−count Mean%agreement Figure: MRR versus matching unit for three user groups for first sequence index.
  • 40. Two-level Distance • Inspired by edit distance • Adapted for sequence of objects 09:00 10:05 10:35 10:50 11:3211:25
  • 41. Two-level Distance • Inspired by edit distance • Adapted for sequence of objects 09:00 10:05 10:35 10:50 11:32 Step 1 Align activities 11:25 ( ( dactivity (T1, T2) = r i=1 wobj × cins + s j=1 wobj × cdel + t k=1 wobj × csub
  • 42. Two-level Distance • Inspired by edit distance • Adapted for sequence of objects 09:00 10:05 10:35 10:50 11:32 Step 1 Step 2 Align activities Align features 11:25 ( ( ( ( d(T1, T2) = dactivity (T1, T2) + n i=1 dfeature(ao1 i , ao2 i ) .
  • 43. Methodology Learning Optimal Matching Unit range • Wrapper attribute selection: C4.5 algorithm, greedy backward search and area under ROC curve as evaluation measure. • Classification: pruned attribute vectors for each user fed into a C4.5 induction algorithm to predict optimal matching unit range. Insight Centre for Data Analytics RecTour 2017 Slide 8
  • 44. Attribute Extraction: Timeline Decomposition • Each user represented by an attribute vector. • For attribute extraction: timelines are decomposed into features-sequence : User Timeline Time 00 hrs 00 hrs 00 hrs
  • 45. Attribute Extraction: Timeline Decomposition • Each user represented by an attribute vector. • For attribute extraction: timelines are decomposed into features-sequence : User Timeline Time 00 hrs 00 hrs 00 hrs dist-travel start-geo start-time
  • 46. Attribute Extraction: Timeline Decomposition • Each user represented by an attribute vector. • For attribute extraction: timelines are decomposed into features-sequence : User Timeline Time 00 hrs 00 hrs 00 hrs dist-travel start-geo dist-travel start-geo start-time start-time
  • 48. Timeline Attributes Regularity Attributes: Sample Entropy 1. SampEnp z : sample entropy of a feature sequence Sz for epoch length p, 2. µSampEnp T : mean sample entropy over all feature sequences Sz , z = 1, 2, ..., m of timeline T for epoch length p, 3. σSampEnp T : standard deviation of sample entropy over all feature sequences Sz , z = 1, 2, ..., m of timeline T for epoch length p. Insight Centre for Data Analytics RecTour 2017 Slide 11
  • 49. Timeline Attributes Repetition Attributes: k-gram attributes Previously used for sequence classification, biological sequence analysis and text classification [Xing et al., 2010; Dong and Pei, 2007]. 1. ηk z : total number of distinct k-grams in feature sequence Sz , normalised by total number of k-grams occurring in Sz , 2. µf k z : mean frequency of occurrence of distinct k-grams in feature sequence Sz , normalised by total number of k-grams occurring in Sz , 3. σf k z : standard deviation of frequency of occurrence of distinct k-grams in feature sequence Sz , normalised by length of Sz . Insight Centre for Data Analytics RecTour 2017 Slide 12