SlideShare una empresa de Scribd logo
1 de 47
Sequential Patterns Wen-Chih Peng ( 彭文志 ) Dept. of Computer Science National Chiao-Tung University
Sequence Data Sequence Database: Object Timestamp Events A 10 2, 3, 5 A 20 6, 1 A 23 1 B 11 4, 5, 6 B 17 2 B 21 7, 8, 1, 2 B 28 1, 6 C 14 1, 8, 7
Examples of Sequence Data Sequence E1 E2 E1 E3 E2 E3 E4 E2 Element (Transaction) Event  (Item) Bases A,T,G,C An element of the DNA sequence  DNA sequence of a particular species Genome sequences Types of alarms generated by sensors  Events triggered by a sensor at time t History of events generated by a given sensor Event data Home page, index page, contact info, etc A collection of files viewed by a Web visitor after a single mouse click Browsing activity of a particular Web visitor Web Data Books, diary products, CDs, etc A set of items bought by a customer at time t Purchase history of a given customer Customer Event (Item) Element (Transaction) Sequence Sequence Database
Formal Definition of a Sequence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Examples of Sequence ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Formal Definition of a Subsequence ,[object Object],[object Object],[object Object],< {2,4} {2,4} {2,5} > < {1,2} {3,4} >  < {2,4} {3,5,6} {8} > Data sequence Yes < {2} {4} > No < {1} {2} > Yes < {2} {3,5} > Contain? Subsequence
What Is Sequential Pattern Mining? ,[object Object],A  sequence database   A  sequence  : < (ef) (ab)  (df) c b > An element may contain a set of items. Items within an element are unordered and we list them alphabetically.   <a(bc)dc> is a  subsequence  of  < a (a bc )(ac) d ( c f)> Given  support threshold   min_sup  =2, <(ab)c> is a  sequential pattern <eg(af)cbc> 40 <(ef)( ab )(df) c b> 30 <(ad)c(bc)(ae)> 20 <a( ab c)(a c )d(cf)> 10 sequence SID
Sequential Pattern Mining: Definition ,[object Object],[object Object],[object Object],[object Object],[object Object]
Extracting Sequential Patterns ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Sequential Pattern Mining: Challenge ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Mining Sequential Patterns ,[object Object],[object Object],[object Object],[object Object],[object Object]
Sequential Patterns (cont’d) ,[object Object],[object Object],[object Object]
Algorithm ,[object Object],[object Object],[object Object],[object Object]
Algorithm (cont’d) ,[object Object],[object Object],[object Object],[object Object]
Original Database
Sort Phase
Customer-Sequence Version  of the Database
Large itemset Phase (support:2)
Transformation Phase
Sequence Phase ,[object Object],[object Object],2 <2 3 4> 2 <1 3 5> 3 <1 3 4>  2 <1 2 4> 2 <1 2 3> Support Sequence <1 3 5 4> <1 3 4 5> <1 2 4 3> <1 2 3 4>
<{1 5} {2} {3} {4}> <{1} {3} {4} {3 5}> <{1} {2} {3} {4}> <{1} {3} {5}> <{4} {5}> Customer Sequences Sequence Support <1> 4 <2> 2 <3> 4 <4> 4 <5> 4 Large 1-Sequences Sequence Support <1 2> 2 <1 3> 4 <1 4> 3 <1 5> 2 <2 3> 2 <2 4> 2 <3 4> 3 <3 5> 2 <4 5> 2 Large 2-Sequences Sequence Support <1 2 3> 2 <1 2 4> 2 <1 3 4> 3 <1 3 5> 2 <2 3 4> 2 Large 3-Sequences Sequence Support <1 2 3 4> 2 Large 4-Sequences Sequence Support <1 2 3 4> 2 <1 3 5> 2 <4 5> 2 Maximal Large Sequences
Maximal Sequence ,[object Object],[object Object],[object Object]
Sequential Patterns With support > 25% < (30) (90) > < (30) (40 70) > The answer set
Notes ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Rule Discovery from Time Sequences ,[object Object],[object Object],[object Object],[object Object],[object Object]
Episode Mining ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],E B A F B A C
Example of Episode Mining  ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],0  10  20  30  40  50  60  70  80  90  100  110  120 130 140  150 D  C  A  B  D  A  B  C  A  D  C  A  B  D  A
Event Sequence ,[object Object],[object Object]
Sliding Window ,[object Object],[object Object],0  10  20  30  40  50  60  70  80  90 D  C  A  B  D  A  B  C
Frequency of an Episode ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Find Frequent Episodes ,[object Object],[object Object],[object Object],[object Object]
FreeSpan ,[object Object],[object Object],[object Object],[object Object]
Example of FreeSpan f_list = a: 4 ,b: 4 ,c: 4 ,d: 3 ,e: 3 ,f: 3 g is deleted because of support of g <2 . Example database: min support = 2 <eg(af)cbc> 40 <(ef)(ab)(df)cb> 30 <(ad)c(bc)(ae)> 20 <a(abc)(ac)d(cf)> 10 Sequence Sequence id
Example of FreeSpan (cont’d) ,[object Object],Frequent Patterns <a> <aa> => {a}-projected database <e( a f)cbc> 40 <(ef)( a b)(df)cb> 30 <( a d)c(bc)( a e)> 20 < a ( a bc)( a c)d(cf)> 10 Sequence Sequence id <a> 40 <a> 30 <aa> 20 <aaa> 10
Example of FreeSpan (cont’d) ,[object Object],Frequent Patterns <b> <ab> <ba> <(ab)> => {b}-projected database <e( a f)c b c> 40 <(ef)( ab )(df)c b > 30 <( a d)c( b c)( a e)> 20 < a ( ab c)( a c)d(cf)> 10 Sequence Sequence id <ab> 40 <(ab)b> 30 <aba> 20 <a(ab)a> 10
Example of FreeSpan (cont’d) ,[object Object],Frequent Patterns <c> <ac> <bc> <(bc)> <ca> <cb> <(ab)c> <acc> <acb> => {c}-projected database <e( a f) cbc > 40 <(ef)( ab )(df) cb > 30 <( a d) c ( bc )( a e)> 20 < a ( abc )( ac )d( c f)> 10 Sequence Sequence id <acbc> 40 <(ab)cb> 30 <ac(bc)a> 20 <a(abc)(ac)c> 10
PrefixSpan ,[object Object],[object Object],[object Object]
PrefixSpan - Concepts ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
PrefixSpan – Concepts ,[object Object],[object Object],[object Object],[object Object]
Example of PrefixSpan <a>-projected database By scanning <a>-projected database once, all the length-2 sequential patterns having prefix <a> can be found. <aa>:2  <ab>:4  <(ab)>:2  <ac>:4  <ad>:2  <af>:2 Recursively, patterns with prefix <a> can be partitioned into 6 subsets. => <e( a f)cbc > 40 <(ef)( a b)(df)cb > 30 <( a d)c(bc)(ae )> 20 < a (abc)(ac)d(cf) > 10 Sequence Sequence id <(_f)cbc> 40 <(_b)(df)cb> 30 <(_d)c(bc)(ae)> 20 <(abc)(ac)d(cf)> 10
Example of PrefixSpan (cont’d) < aa >-projected database => < ab >-projected database => Sequential patterns of <ab>-projected db: <(_c)>,<(_c)a>,<a><c> <(_f)c b c> 40 <(_b)(df)cb> 30 <(_d)c( b c)( a e)> 20 <( a b c)(ac)d(cf)> 10 Sequence Sequence id <c> 40 <(_c)(ae)> 20 <(_c)(ac)d(cf)> 10 <(_e)> 20 <(_bc)(ac)d(cf)> 10
Example of PrefixSpan (cont’d) <b>-projected database Sequential patterns <b> <ba> <bc> <(bc)> <(bc)a> <bd> <bdc> <bf> => <e(af)c b c > 40 <(ef)(a b )(df)cb > 30 <(ad)c( b c)(ae )> 20 <a(a b c)(ac)d(cf)> 10 Sequence Sequence id <c> 40 <(df)cb> 30 <(_c)(ae)> 20 <(_c)(ac)d(cf)> 10
References ,[object Object],[object Object],[object Object]
References (cont’d) ,[object Object],[object Object],[object Object],[object Object]
References (cont’d) ,[object Object],[object Object],[object Object]
References (cont’d) ,[object Object],[object Object],[object Object]
References (cont’d) ,[object Object],[object Object],[object Object],[object Object]

Más contenido relacionado

La actualidad más candente

AST: threats and opportunities
AST: threats and opportunitiesAST: threats and opportunities
AST: threats and opportunitiesAlexander Lifanov
 
Python memory management_v2
Python memory management_v2Python memory management_v2
Python memory management_v2Jeffrey Clark
 
From java to kotlin beyond alt+shift+cmd+k - Droidcon italy
From java to kotlin beyond alt+shift+cmd+k - Droidcon italyFrom java to kotlin beyond alt+shift+cmd+k - Droidcon italy
From java to kotlin beyond alt+shift+cmd+k - Droidcon italyFabio Collini
 
Python 표준 라이브러리
Python 표준 라이브러리Python 표준 라이브러리
Python 표준 라이브러리용 최
 
Yoyak ScalaDays 2015
Yoyak ScalaDays 2015Yoyak ScalaDays 2015
Yoyak ScalaDays 2015ihji
 
The Ring programming language version 1.8 book - Part 53 of 202
The Ring programming language version 1.8 book - Part 53 of 202The Ring programming language version 1.8 book - Part 53 of 202
The Ring programming language version 1.8 book - Part 53 of 202Mahmoud Samir Fayed
 
Erlang Introduction Bcberlin3
Erlang Introduction Bcberlin3Erlang Introduction Bcberlin3
Erlang Introduction Bcberlin3guesta3202
 
Python fundamentals - basic | WeiYuan
Python fundamentals - basic | WeiYuanPython fundamentals - basic | WeiYuan
Python fundamentals - basic | WeiYuanWei-Yuan Chang
 
Taint-based Dynamic Analysis (CoC Research Day 2009)
Taint-based Dynamic Analysis (CoC Research Day 2009)Taint-based Dynamic Analysis (CoC Research Day 2009)
Taint-based Dynamic Analysis (CoC Research Day 2009)James Clause
 
Haskellで学ぶ関数型言語
Haskellで学ぶ関数型言語Haskellで学ぶ関数型言語
Haskellで学ぶ関数型言語ikdysfm
 
java 8 Hands on Workshop
java 8 Hands on Workshopjava 8 Hands on Workshop
java 8 Hands on WorkshopJeanne Boyarsky
 
집단지성 프로그래밍 08-가격모델링
집단지성 프로그래밍 08-가격모델링집단지성 프로그래밍 08-가격모델링
집단지성 프로그래밍 08-가격모델링Kwang Woo NAM
 
Introducción a Elixir
Introducción a ElixirIntroducción a Elixir
Introducción a ElixirSvet Ivantchev
 
MySQL 5.7 NF – JSON Datatype 활용
MySQL 5.7 NF – JSON Datatype 활용MySQL 5.7 NF – JSON Datatype 활용
MySQL 5.7 NF – JSON Datatype 활용I Goo Lee
 
Advanced Python, Part 2
Advanced Python, Part 2Advanced Python, Part 2
Advanced Python, Part 2Zaar Hai
 
Getting started cpp full
Getting started cpp   fullGetting started cpp   full
Getting started cpp fullVõ Hòa
 
Tokyo APAC Groundbreakers tour - The Complete Java Developer
Tokyo APAC Groundbreakers tour - The Complete Java DeveloperTokyo APAC Groundbreakers tour - The Complete Java Developer
Tokyo APAC Groundbreakers tour - The Complete Java DeveloperConnor McDonald
 
Обзор фреймворка Twisted
Обзор фреймворка TwistedОбзор фреймворка Twisted
Обзор фреймворка TwistedMaxim Kulsha
 

La actualidad más candente (20)

AST: threats and opportunities
AST: threats and opportunitiesAST: threats and opportunities
AST: threats and opportunities
 
Python memory management_v2
Python memory management_v2Python memory management_v2
Python memory management_v2
 
From java to kotlin beyond alt+shift+cmd+k - Droidcon italy
From java to kotlin beyond alt+shift+cmd+k - Droidcon italyFrom java to kotlin beyond alt+shift+cmd+k - Droidcon italy
From java to kotlin beyond alt+shift+cmd+k - Droidcon italy
 
Python 표준 라이브러리
Python 표준 라이브러리Python 표준 라이브러리
Python 표준 라이브러리
 
Yoyak ScalaDays 2015
Yoyak ScalaDays 2015Yoyak ScalaDays 2015
Yoyak ScalaDays 2015
 
The Ring programming language version 1.8 book - Part 53 of 202
The Ring programming language version 1.8 book - Part 53 of 202The Ring programming language version 1.8 book - Part 53 of 202
The Ring programming language version 1.8 book - Part 53 of 202
 
Erlang Introduction Bcberlin3
Erlang Introduction Bcberlin3Erlang Introduction Bcberlin3
Erlang Introduction Bcberlin3
 
Python fundamentals - basic | WeiYuan
Python fundamentals - basic | WeiYuanPython fundamentals - basic | WeiYuan
Python fundamentals - basic | WeiYuan
 
Taint-based Dynamic Analysis (CoC Research Day 2009)
Taint-based Dynamic Analysis (CoC Research Day 2009)Taint-based Dynamic Analysis (CoC Research Day 2009)
Taint-based Dynamic Analysis (CoC Research Day 2009)
 
Haskellで学ぶ関数型言語
Haskellで学ぶ関数型言語Haskellで学ぶ関数型言語
Haskellで学ぶ関数型言語
 
PDBC
PDBCPDBC
PDBC
 
java 8 Hands on Workshop
java 8 Hands on Workshopjava 8 Hands on Workshop
java 8 Hands on Workshop
 
집단지성 프로그래밍 08-가격모델링
집단지성 프로그래밍 08-가격모델링집단지성 프로그래밍 08-가격모델링
집단지성 프로그래밍 08-가격모델링
 
Introducción a Elixir
Introducción a ElixirIntroducción a Elixir
Introducción a Elixir
 
MySQL 5.7 NF – JSON Datatype 활용
MySQL 5.7 NF – JSON Datatype 활용MySQL 5.7 NF – JSON Datatype 활용
MySQL 5.7 NF – JSON Datatype 활용
 
Advanced Python, Part 2
Advanced Python, Part 2Advanced Python, Part 2
Advanced Python, Part 2
 
Intro
IntroIntro
Intro
 
Getting started cpp full
Getting started cpp   fullGetting started cpp   full
Getting started cpp full
 
Tokyo APAC Groundbreakers tour - The Complete Java Developer
Tokyo APAC Groundbreakers tour - The Complete Java DeveloperTokyo APAC Groundbreakers tour - The Complete Java Developer
Tokyo APAC Groundbreakers tour - The Complete Java Developer
 
Обзор фреймворка Twisted
Обзор фреймворка TwistedОбзор фреймворка Twisted
Обзор фреймворка Twisted
 

Destacado

Analyzing Time Series Data with Apache Spark and Cassandra
Analyzing Time Series Data with Apache Spark and CassandraAnalyzing Time Series Data with Apache Spark and Cassandra
Analyzing Time Series Data with Apache Spark and CassandraPatrick McFadin
 
Preview - The Mind of Business Plan
Preview - The Mind of Business PlanPreview - The Mind of Business Plan
Preview - The Mind of Business Plans.anshuman
 
Videopillola Eurostoxx: E ora si riparte al rialzo?
Videopillola Eurostoxx: E ora si riparte al rialzo?Videopillola Eurostoxx: E ora si riparte al rialzo?
Videopillola Eurostoxx: E ora si riparte al rialzo?Zio Romolo
 
The Push-Pull of Communications and Marketing: Reduce Spending, Extend Reach
The Push-Pull of Communications and Marketing: Reduce Spending, Extend ReachThe Push-Pull of Communications and Marketing: Reduce Spending, Extend Reach
The Push-Pull of Communications and Marketing: Reduce Spending, Extend ReachMichael L. Warden, APR
 
Preview book the rabbit who forgot to fly
Preview book the rabbit who forgot to flyPreview book the rabbit who forgot to fly
Preview book the rabbit who forgot to flys.anshuman
 
Diario Blog 0119 0123
Diario Blog 0119 0123Diario Blog 0119 0123
Diario Blog 0119 0123Zio Romolo
 
Evaluarea Dinamica Vs Evaluarea Psihometrica
Evaluarea Dinamica Vs Evaluarea PsihometricaEvaluarea Dinamica Vs Evaluarea Psihometrica
Evaluarea Dinamica Vs Evaluarea Psihometrica1Leu
 
Tulburari De Personalitate
Tulburari De PersonalitateTulburari De Personalitate
Tulburari De Personalitate1Leu
 
Architecture Enforcement Aspects Itarc2009
Architecture Enforcement Aspects Itarc2009Architecture Enforcement Aspects Itarc2009
Architecture Enforcement Aspects Itarc2009guest2b8cd
 
Dementastd
DementastdDementastd
Dementastd1Leu
 
Pdi abril2011 (1)
Pdi abril2011 (1)Pdi abril2011 (1)
Pdi abril2011 (1)lpalacin
 
Toxicostd
ToxicostdToxicostd
Toxicostd1Leu
 
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLD
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLDPresentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLD
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLDZio Romolo
 
Diario Operativo Blog Gennaio Marzo 2010
Diario Operativo Blog Gennaio Marzo 2010Diario Operativo Blog Gennaio Marzo 2010
Diario Operativo Blog Gennaio Marzo 2010Zio Romolo
 
Pendidikan Jasmani Zaman Purba
Pendidikan Jasmani Zaman PurbaPendidikan Jasmani Zaman Purba
Pendidikan Jasmani Zaman Purbakaloky
 
Comportamentele Pozitive
Comportamentele PozitiveComportamentele Pozitive
Comportamentele Pozitive1Leu
 
New York Bluejays
New York BluejaysNew York Bluejays
New York Bluejaysjawad723
 
Preview - Fifty Two Steps: Discover a New You
Preview - Fifty Two Steps: Discover a New YouPreview - Fifty Two Steps: Discover a New You
Preview - Fifty Two Steps: Discover a New Yous.anshuman
 
Preview - Stop! Playing the Dice: Devices to Design Amazing Products
Preview - Stop! Playing the Dice: Devices to Design Amazing ProductsPreview - Stop! Playing the Dice: Devices to Design Amazing Products
Preview - Stop! Playing the Dice: Devices to Design Amazing Productss.anshuman
 
EFT On Feeling Disappointed
EFT On Feeling DisappointedEFT On Feeling Disappointed
EFT On Feeling Disappointedchriscase
 

Destacado (20)

Analyzing Time Series Data with Apache Spark and Cassandra
Analyzing Time Series Data with Apache Spark and CassandraAnalyzing Time Series Data with Apache Spark and Cassandra
Analyzing Time Series Data with Apache Spark and Cassandra
 
Preview - The Mind of Business Plan
Preview - The Mind of Business PlanPreview - The Mind of Business Plan
Preview - The Mind of Business Plan
 
Videopillola Eurostoxx: E ora si riparte al rialzo?
Videopillola Eurostoxx: E ora si riparte al rialzo?Videopillola Eurostoxx: E ora si riparte al rialzo?
Videopillola Eurostoxx: E ora si riparte al rialzo?
 
The Push-Pull of Communications and Marketing: Reduce Spending, Extend Reach
The Push-Pull of Communications and Marketing: Reduce Spending, Extend ReachThe Push-Pull of Communications and Marketing: Reduce Spending, Extend Reach
The Push-Pull of Communications and Marketing: Reduce Spending, Extend Reach
 
Preview book the rabbit who forgot to fly
Preview book the rabbit who forgot to flyPreview book the rabbit who forgot to fly
Preview book the rabbit who forgot to fly
 
Diario Blog 0119 0123
Diario Blog 0119 0123Diario Blog 0119 0123
Diario Blog 0119 0123
 
Evaluarea Dinamica Vs Evaluarea Psihometrica
Evaluarea Dinamica Vs Evaluarea PsihometricaEvaluarea Dinamica Vs Evaluarea Psihometrica
Evaluarea Dinamica Vs Evaluarea Psihometrica
 
Tulburari De Personalitate
Tulburari De PersonalitateTulburari De Personalitate
Tulburari De Personalitate
 
Architecture Enforcement Aspects Itarc2009
Architecture Enforcement Aspects Itarc2009Architecture Enforcement Aspects Itarc2009
Architecture Enforcement Aspects Itarc2009
 
Dementastd
DementastdDementastd
Dementastd
 
Pdi abril2011 (1)
Pdi abril2011 (1)Pdi abril2011 (1)
Pdi abril2011 (1)
 
Toxicostd
ToxicostdToxicostd
Toxicostd
 
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLD
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLDPresentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLD
Presentazione WMO Milan 13 december 2014 $SPY $FTMIB $EUROSTOXX50 $GOLD
 
Diario Operativo Blog Gennaio Marzo 2010
Diario Operativo Blog Gennaio Marzo 2010Diario Operativo Blog Gennaio Marzo 2010
Diario Operativo Blog Gennaio Marzo 2010
 
Pendidikan Jasmani Zaman Purba
Pendidikan Jasmani Zaman PurbaPendidikan Jasmani Zaman Purba
Pendidikan Jasmani Zaman Purba
 
Comportamentele Pozitive
Comportamentele PozitiveComportamentele Pozitive
Comportamentele Pozitive
 
New York Bluejays
New York BluejaysNew York Bluejays
New York Bluejays
 
Preview - Fifty Two Steps: Discover a New You
Preview - Fifty Two Steps: Discover a New YouPreview - Fifty Two Steps: Discover a New You
Preview - Fifty Two Steps: Discover a New You
 
Preview - Stop! Playing the Dice: Devices to Design Amazing Products
Preview - Stop! Playing the Dice: Devices to Design Amazing ProductsPreview - Stop! Playing the Dice: Devices to Design Amazing Products
Preview - Stop! Playing the Dice: Devices to Design Amazing Products
 
EFT On Feeling Disappointed
EFT On Feeling DisappointedEFT On Feeling Disappointed
EFT On Feeling Disappointed
 

Similar a 03

Team activity analysis / visualization
Team activity analysis / visualizationTeam activity analysis / visualization
Team activity analysis / visualizationNicolas Maisonneuve
 
Reverse Engineering Dojo: Enhancing Assembly Reading Skills
Reverse Engineering Dojo: Enhancing Assembly Reading SkillsReverse Engineering Dojo: Enhancing Assembly Reading Skills
Reverse Engineering Dojo: Enhancing Assembly Reading SkillsAsuka Nakajima
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...Data Con LA
 
Data in Motion: Streaming Static Data Efficiently
Data in Motion: Streaming Static Data EfficientlyData in Motion: Streaming Static Data Efficiently
Data in Motion: Streaming Static Data EfficientlyMartin Zapletal
 
Data Presentations Cassandra Sigmod
Data  Presentations  Cassandra SigmodData  Presentations  Cassandra Sigmod
Data Presentations Cassandra SigmodJeff Hammerbacher
 
RxJava In Baby Steps
RxJava In Baby StepsRxJava In Baby Steps
RxJava In Baby StepsAnnyce Davis
 
Mining of time series data base using fuzzy neural information systems
Mining of time series data base using fuzzy neural information systemsMining of time series data base using fuzzy neural information systems
Mining of time series data base using fuzzy neural information systemsDr.MAYA NAYAK
 
Extracting a Micro State Transition Table Using KLEE
Extracting a Micro State Transition Table Using KLEEExtracting a Micro State Transition Table Using KLEE
Extracting a Micro State Transition Table Using KLEENorihiro Yoshida
 
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...Flink Forward
 
Writing Domain-Specific Languages for BeepBeep
Writing Domain-Specific Languages for BeepBeepWriting Domain-Specific Languages for BeepBeep
Writing Domain-Specific Languages for BeepBeepSylvain Hallé
 
Basics & asymptotic notations
Basics & asymptotic notationsBasics & asymptotic notations
Basics & asymptotic notationsRajendran
 
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Marlon Dumas
 
Data Structure: Algorithm and analysis
Data Structure: Algorithm and analysisData Structure: Algorithm and analysis
Data Structure: Algorithm and analysisDr. Rajdeep Chatterjee
 
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0WSO2
 
Java 5 6 Generics, Concurrency, Garbage Collection, Tuning
Java 5 6 Generics, Concurrency, Garbage Collection, TuningJava 5 6 Generics, Concurrency, Garbage Collection, Tuning
Java 5 6 Generics, Concurrency, Garbage Collection, TuningCarol McDonald
 

Similar a 03 (20)

Team activity analysis / visualization
Team activity analysis / visualizationTeam activity analysis / visualization
Team activity analysis / visualization
 
chapter-8.ppt
chapter-8.pptchapter-8.ppt
chapter-8.ppt
 
Reverse Engineering Dojo: Enhancing Assembly Reading Skills
Reverse Engineering Dojo: Enhancing Assembly Reading SkillsReverse Engineering Dojo: Enhancing Assembly Reading Skills
Reverse Engineering Dojo: Enhancing Assembly Reading Skills
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Data Provenance Support in...
 
04 sequentialbasics 1
04 sequentialbasics 104 sequentialbasics 1
04 sequentialbasics 1
 
SMDMS'13
SMDMS'13SMDMS'13
SMDMS'13
 
Data in Motion: Streaming Static Data Efficiently
Data in Motion: Streaming Static Data EfficientlyData in Motion: Streaming Static Data Efficiently
Data in Motion: Streaming Static Data Efficiently
 
Data Presentations Cassandra Sigmod
Data  Presentations  Cassandra SigmodData  Presentations  Cassandra Sigmod
Data Presentations Cassandra Sigmod
 
RxJava In Baby Steps
RxJava In Baby StepsRxJava In Baby Steps
RxJava In Baby Steps
 
Mining of time series data base using fuzzy neural information systems
Mining of time series data base using fuzzy neural information systemsMining of time series data base using fuzzy neural information systems
Mining of time series data base using fuzzy neural information systems
 
dfl
dfldfl
dfl
 
Odp
OdpOdp
Odp
 
Extracting a Micro State Transition Table Using KLEE
Extracting a Micro State Transition Table Using KLEEExtracting a Micro State Transition Table Using KLEE
Extracting a Micro State Transition Table Using KLEE
 
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...
Flink Forward SF 2017: Stefan Richter - Improvements for large state and reco...
 
Writing Domain-Specific Languages for BeepBeep
Writing Domain-Specific Languages for BeepBeepWriting Domain-Specific Languages for BeepBeep
Writing Domain-Specific Languages for BeepBeep
 
Basics & asymptotic notations
Basics & asymptotic notationsBasics & asymptotic notations
Basics & asymptotic notations
 
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
Accurate and Reliable What-If Analysis of Business Processes: Is it Achievable?
 
Data Structure: Algorithm and analysis
Data Structure: Algorithm and analysisData Structure: Algorithm and analysis
Data Structure: Algorithm and analysis
 
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
 
Java 5 6 Generics, Concurrency, Garbage Collection, Tuning
Java 5 6 Generics, Concurrency, Garbage Collection, TuningJava 5 6 Generics, Concurrency, Garbage Collection, Tuning
Java 5 6 Generics, Concurrency, Garbage Collection, Tuning
 

Último

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 

Último (20)

Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

03

  • 1. Sequential Patterns Wen-Chih Peng ( 彭文志 ) Dept. of Computer Science National Chiao-Tung University
  • 2. Sequence Data Sequence Database: Object Timestamp Events A 10 2, 3, 5 A 20 6, 1 A 23 1 B 11 4, 5, 6 B 17 2 B 21 7, 8, 1, 2 B 28 1, 6 C 14 1, 8, 7
  • 3. Examples of Sequence Data Sequence E1 E2 E1 E3 E2 E3 E4 E2 Element (Transaction) Event (Item) Bases A,T,G,C An element of the DNA sequence DNA sequence of a particular species Genome sequences Types of alarms generated by sensors Events triggered by a sensor at time t History of events generated by a given sensor Event data Home page, index page, contact info, etc A collection of files viewed by a Web visitor after a single mouse click Browsing activity of a particular Web visitor Web Data Books, diary products, CDs, etc A set of items bought by a customer at time t Purchase history of a given customer Customer Event (Item) Element (Transaction) Sequence Sequence Database
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 17. Customer-Sequence Version of the Database
  • 18. Large itemset Phase (support:2)
  • 20.
  • 21. <{1 5} {2} {3} {4}> <{1} {3} {4} {3 5}> <{1} {2} {3} {4}> <{1} {3} {5}> <{4} {5}> Customer Sequences Sequence Support <1> 4 <2> 2 <3> 4 <4> 4 <5> 4 Large 1-Sequences Sequence Support <1 2> 2 <1 3> 4 <1 4> 3 <1 5> 2 <2 3> 2 <2 4> 2 <3 4> 3 <3 5> 2 <4 5> 2 Large 2-Sequences Sequence Support <1 2 3> 2 <1 2 4> 2 <1 3 4> 3 <1 3 5> 2 <2 3 4> 2 Large 3-Sequences Sequence Support <1 2 3 4> 2 Large 4-Sequences Sequence Support <1 2 3 4> 2 <1 3 5> 2 <4 5> 2 Maximal Large Sequences
  • 22.
  • 23. Sequential Patterns With support > 25% < (30) (90) > < (30) (40 70) > The answer set
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33. Example of FreeSpan f_list = a: 4 ,b: 4 ,c: 4 ,d: 3 ,e: 3 ,f: 3 g is deleted because of support of g <2 . Example database: min support = 2 <eg(af)cbc> 40 <(ef)(ab)(df)cb> 30 <(ad)c(bc)(ae)> 20 <a(abc)(ac)d(cf)> 10 Sequence Sequence id
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40. Example of PrefixSpan <a>-projected database By scanning <a>-projected database once, all the length-2 sequential patterns having prefix <a> can be found. <aa>:2 <ab>:4 <(ab)>:2 <ac>:4 <ad>:2 <af>:2 Recursively, patterns with prefix <a> can be partitioned into 6 subsets. => <e( a f)cbc > 40 <(ef)( a b)(df)cb > 30 <( a d)c(bc)(ae )> 20 < a (abc)(ac)d(cf) > 10 Sequence Sequence id <(_f)cbc> 40 <(_b)(df)cb> 30 <(_d)c(bc)(ae)> 20 <(abc)(ac)d(cf)> 10
  • 41. Example of PrefixSpan (cont’d) < aa >-projected database => < ab >-projected database => Sequential patterns of <ab>-projected db: <(_c)>,<(_c)a>,<a><c> <(_f)c b c> 40 <(_b)(df)cb> 30 <(_d)c( b c)( a e)> 20 <( a b c)(ac)d(cf)> 10 Sequence Sequence id <c> 40 <(_c)(ae)> 20 <(_c)(ac)d(cf)> 10 <(_e)> 20 <(_bc)(ac)d(cf)> 10
  • 42. Example of PrefixSpan (cont’d) <b>-projected database Sequential patterns <b> <ba> <bc> <(bc)> <(bc)a> <bd> <bdc> <bf> => <e(af)c b c > 40 <(ef)(a b )(df)cb > 30 <(ad)c( b c)(ae )> 20 <a(a b c)(ac)d(cf)> 10 Sequence Sequence id <c> 40 <(df)cb> 30 <(_c)(ae)> 20 <(_c)(ac)d(cf)> 10
  • 43.
  • 44.
  • 45.
  • 46.
  • 47.