SlideShare una empresa de Scribd logo
1 de 35
Descargar para leer sin conexiΓ³n
Jared Simpson
!
Ontario Institute for Cancer Research
&
Department of Computer Science
University ofToronto
Error correction, assembly
and consensus algorithms
for MinION data
London Calling, May 14th, 2015
Our collaboration
2
An overview of NGS assembly
β€’ Illumina data: short reads, very accurate, very deep	

β€’ nearly all Illumina assembly is based on exact matching algorithms	

β€’ fragmented assemblies	

!
β€’ Algorithms for Illumina data do not work for long, noisy reads	

β€’ PacBio developed a pipeline (β€œHGAP”) to assemble their data	

β€’ We used this recipe as a starting point but with custom components
3
Long read assembly pipeline
4
Error correction
Celera Assembler
Consensus
Input reads
Genome Assembly
Input Data
β€’ First challenge is finding overlaps for reads with 15-20% errors
5
Overlap Detection
6
we use github.com/thegenemyers/daligner to compute overlaps
Partial Order Graphs
7
add read GCTACGAT that we want to correct to graph
Partial Order Graphs
8
add sequence GCTCGAT to graph
Partial Order Graphs
9
add sequence GCTCGATT to graph
Partial Order Graphs
10
maximum weight path GCTCGAT is the corrected read
Error Correction
11
Contig Assembly
12
Celera Assembler produces one contig at 98.5% identity
Assembly Polishing
β€’ Consensus problem is viewed as choosing a sequence C’ that maximizes
the probability of the event data
13
C0
= arg max
S2C
P(D|S)
P(D|S) =
rY
k=1
P(ei,k, ei+1,k, ..., ej,k|S, β‡₯)
where
Selecting a Consensus
14
Mutate
ACTACGATCGACTTACGA
CCTACGATCGACTTACGA
TCTACGATCGACTTACGA
...
-CTACGATCGACTTACGA
G-TACGATCGACTTACGA
GC-ACGATCGACTTACGA
GCT-CGATCGACTTACGA
...
GACTACGATCGACTTACGA
GCCTACGATCGACTTACGA
GGCTACGATCGACTTACGA
GTCTACGATCGACTTACGA
P(D|S)
-190
-187
-192
-176
-191
-193
-168
-198
-191
-195
-181
GCTACGATCGACTTACGA
Selecting a Consensus
15
GCT-CGATCGACTTACGA
Mutate
A
C
T
...
-
G
GC
GCT
...
G
G
G
G
P(D|S)
-190
-187
-192
-176
-191
-193
-168
-198
-191
-195
-181
Select new consensus
GCT-CGATCGACTTACGA -168
Pore Models
16
Generating Events
β€’ What do we expect events from a given sequence to look like?
17
GCTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Generating Events
β€’ What do we expect events from a given sequence to look like?
18
GCTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Generating Events
β€’ What do we expect events from a given sequence to look like?
19
GCTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Generating Events
β€’ What do we expect events from a given sequence to look like?
20
GCTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●●
●
●
●●●
●
●
●●●
●●●
●
●●●
●
●
●
●●●
●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●●
●●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Generating Events
β€’ What do we expect events from a given sequence to look like?
21
CTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●●
●
●
●●●
●
●
●●●
●●●
●
●●●
●
●
●
●●●
●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●●●
●●●
●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●●●●●●
●
●
●
●
●●
●●●●
●
●●●
●●
●
●
●
●
●●●
●●
●
●
●
●●●
●
●
●
●●
●●
●
●
●
●
●●
●
●●
●●●●
●●●
●
●
●
●
●●
●●
●●●
●
●
●●
●●●
●
●
●
●●●
●●●●●
●
●
●
●
●
●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Event Detection
22
●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●●
●
●
●●●
●
●
●●●
●●●
●
●●●
●
●
●
●●●
●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●●●
●●●
●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●●●●●●
●
●
●
●
●●
●●●●
●
●●●
●●
●
●
●
●
●●●
●●
●
●
●
●●●
●
●
●
●●
●●
●
●
●
●
●●
●
●●
●●●●
●●●
●
●
●
●
●●
●●
●●●
●
●
●●
●●●
●
●
●
●●●
●●●●●
●
●
●
●
●
●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Event mean current (pA) current stdv duration (s)
1 60.3 0.7 0.521
2 40.6 1.0 0.112
3 52.2 2.0 0.356
4 54.1 1.2 0.291
5 49.5 1.5 0.141
A simple model
β€’ What is the probability of observing events E given a sequence S?	

β€’ Assuming for the moment there are no missing or extra events:
23
P(e1, e2, ..., en|s1, s2, ..., sn, β‡₯) =
nY
i=1
P(ei|si, Β΅si , si )
P(ei|k, Β΅k, k) = N(Β΅k, 2
k)
Complications
24
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA)
Complications
25
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA)
Is this an event ?
Complications
26
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA) One event or two ?
Nanopore HMM
β€’ must consider:	

β€’ over segmentation	

β€’ under segmentation	

β€’ missed short events	

β€’ HMM:	

β€’ M states: match event to 5-mers	

β€’ E states: extra obs. of an event	

β€’ K states: no event obs. for 5-mer
27
P(D|S)
P(⇑, e1, e2, ..., en|S, β‡₯) =
nY
i=1
P(ei|⇑i, Β΅si , si )P(⇑i|⇑i 1, S)
P(e1, e2, ..., en|S, β‡₯) =
X
⇑
P(⇑, e1, e2, ..., en|S, β‡₯)
Transition Probabilities
β€’ Probability of not observing an
event is a function of absolute
difference between (expected)
current
28
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA)
Transition Probabilities
β€’ Probability of not observing an
event is a function of absolute
difference between (expected)
current
29
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA)
Transition Probabilities
30
●
●
●
●
●
●
●
●
●
●
●
● ● ● ●
●
●
●
● ● ● ● ●
● ●
● ● ●
●
●
●
●
● ●
●
●
●
●
● ●
0.1
0.2
0.3
0.4
0.5
0 5 10 15 20
absolute difference (pA)
SkipProbability
Assembly Accuracy
31
0
5000
10000
0 5000 10000
5 mer count in reference
5mercountindraftassembly
0
3000
6000
9000
12000
TTTTT AAAAA TTTTG CAAAA CTTTT AAAAG CCCCC GGGGG
kmer
count
draft
reference
y
12000
A
C
B
D
0
5000
0 5000 10000
5 mer count in reference
5mercou
0
5000
10000
0 5000 10000
5 mer count in reference
5mercountinpolishedassembly
C D
Draft: 98.5% accuracy Polished: 99.5% accuracy
Assembly Accuracy
32
0
3000
6000
9000
12000
TTTTT AAAAA TTTTG CAAAA CTTTT AAAAG CCCCC GGGGG
kmer
count
draft
reference
9000
12000
B
D
0
0 5000 10000
5 mer count in reference
5mer
0
3000
TTTTT AAAAA TTTTG CAAAA CTTTT AAAAG CCCCC GGGGG
kmer
0
5000
10000
0 5000 10000
5 mer count in reference
5mercountinpolishedassembly
0
3000
6000
9000
12000
TTTTT AAAAA TTTTG CAAAA CTTTT AAAAG CCCCC GGGGG
kmer
count
polished
reference
C D
Aligning Events to a Reference
β€’ HMM can also align events to a reference genome



!
!
!
!
!
β€’ Read about it here:	

β€’ http://simpsonlab.github.io/2015/04/08/eventalign/
33
Planned Improvements
β€’ Model dwell duration to better call homopolymers	

!
!
!
β€’ SNP calling/genotyping	

!
!
!
β€’ Improve scalability to handle larger genomes	

β€’ Use signal data during error correction
34
CTAAAAAAAAAAAAGTACA
P(gi|D) =
P(D|gi)P(gi)
P(D)
Acknowledgements & Code
β€’ Collaborators:	

β€’ Nick Loman, Josh Quick (Birmingham)	

β€’ Jonathan Dursi (OICR)	

!
!
β€’ Code:	

β€’ github.com/jts/nanocorrect (error correction)	

β€’ github.com/jts/nanopolish (signal-level algorithms)	

β€’ github.com/jts/nanopore-paper-analysis (reproduce our paper)
35

MΓ‘s contenido relacionado

Similar a 150514 jts london_calling

Metrics with Ganglia
Metrics with GangliaMetrics with Ganglia
Metrics with GangliaGareth Rushgrove
Β 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesWesley De Neve
Β 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdfFrangoCamila
Β 
A Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert HodgesA Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert HodgesAltinity Ltd
Β 
Proportional-integral genetic algorithm controller for stability of TCP network
Proportional-integral genetic algorithm controller for stability of TCP network Proportional-integral genetic algorithm controller for stability of TCP network
Proportional-integral genetic algorithm controller for stability of TCP network IJECEIAES
Β 
Overlap Layout Consensus assembly
Overlap Layout Consensus assemblyOverlap Layout Consensus assembly
Overlap Layout Consensus assemblyZhuyi Xue
Β 
Python lec 1004_ch02_excercies
Python lec 1004_ch02_excerciesPython lec 1004_ch02_excercies
Python lec 1004_ch02_excerciesRamadan Babers, PhD
Β 
Automated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAutomated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAijun Zhang
Β 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceKyong-Ha Lee
Β 
PLNOG20 - PaweΕ‚ MaΕ‚achowski - Stress your DUT–wykorzystanie narzΔ™dzi open sou...
PLNOG20 - PaweΕ‚ MaΕ‚achowski - Stress your DUT–wykorzystanie narzΔ™dzi open sou...PLNOG20 - PaweΕ‚ MaΕ‚achowski - Stress your DUT–wykorzystanie narzΔ™dzi open sou...
PLNOG20 - PaweΕ‚ MaΕ‚achowski - Stress your DUT–wykorzystanie narzΔ™dzi open sou...PROIDEA
Β 
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...ijfcstjournal
Β 
Clustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofClustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofijfcstjournal
Β 
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...Mokhtar SELLAMI
Β 
Improving go-git performance
Improving go-git performanceImproving go-git performance
Improving go-git performancesource{d}
Β 
Langmead bosc2010 cloud-genomics
Langmead bosc2010 cloud-genomicsLangmead bosc2010 cloud-genomics
Langmead bosc2010 cloud-genomicsBOSC 2010
Β 

Similar a 150514 jts london_calling (20)

Tree building 2
Tree building 2Tree building 2
Tree building 2
Β 
Metrics with Ganglia
Metrics with GangliaMetrics with Ganglia
Metrics with Ganglia
Β 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniques
Β 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdf
Β 
A Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert HodgesA Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert Hodges
Β 
Proportional-integral genetic algorithm controller for stability of TCP network
Proportional-integral genetic algorithm controller for stability of TCP network Proportional-integral genetic algorithm controller for stability of TCP network
Proportional-integral genetic algorithm controller for stability of TCP network
Β 
Overlap Layout Consensus assembly
Overlap Layout Consensus assemblyOverlap Layout Consensus assembly
Overlap Layout Consensus assembly
Β 
Python lec 1004_ch02_excercies
Python lec 1004_ch02_excerciesPython lec 1004_ch02_excercies
Python lec 1004_ch02_excercies
Β 
Automated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAutomated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform Designs
Β 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduce
Β 
Stress your DUT
Stress your DUTStress your DUT
Stress your DUT
Β 
PLNOG20 - PaweΕ‚ MaΕ‚achowski - Stress your DUT–wykorzystanie narzΔ™dzi open sou...
PLNOG20 - PaweΕ‚ MaΕ‚achowski - Stress your DUT–wykorzystanie narzΔ™dzi open sou...PLNOG20 - PaweΕ‚ MaΕ‚achowski - Stress your DUT–wykorzystanie narzΔ™dzi open sou...
PLNOG20 - PaweΕ‚ MaΕ‚achowski - Stress your DUT–wykorzystanie narzΔ™dzi open sou...
Β 
TiReX: Tiled Regular eXpression matching architecture
TiReX: Tiled Regular eXpression matching architectureTiReX: Tiled Regular eXpression matching architecture
TiReX: Tiled Regular eXpression matching architecture
Β 
Blast fasta 4
Blast fasta 4Blast fasta 4
Blast fasta 4
Β 
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
Β 
Clustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofClustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining of
Β 
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
Β 
Improving go-git performance
Improving go-git performanceImproving go-git performance
Improving go-git performance
Β 
Langmead bosc2010 cloud-genomics
Langmead bosc2010 cloud-genomicsLangmead bosc2010 cloud-genomics
Langmead bosc2010 cloud-genomics
Β 
Thiele
ThieleThiele
Thiele
Β 

Último

OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
Β 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
Β 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Tamer Koksalan, PhD
Β 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxnoordubaliya2003
Β 
User Guide: Magellan MXβ„’ Weather Station
User Guide: Magellan MXβ„’ Weather StationUser Guide: Magellan MXβ„’ Weather Station
User Guide: Magellan MXβ„’ Weather StationColumbia Weather Systems
Β 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
Β 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
Β 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
Β 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
Β 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
Β 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
Β 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
Β 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -INandakishor Bhaurao Deshmukh
Β 
Call Girls in Majnu Ka Tilla Delhi πŸ”9711014705πŸ” Genuine
Call Girls in Majnu Ka Tilla Delhi πŸ”9711014705πŸ” GenuineCall Girls in Majnu Ka Tilla Delhi πŸ”9711014705πŸ” Genuine
Call Girls in Majnu Ka Tilla Delhi πŸ”9711014705πŸ” Genuinethapagita
Β 
Best Call Girls In Sector 29 Gurgaon❀️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❀️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❀️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❀️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
Β 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
Β 
User Guide: Orionβ„’ Weather Station (Columbia Weather Systems)
User Guide: Orionβ„’ Weather Station (Columbia Weather Systems)User Guide: Orionβ„’ Weather Station (Columbia Weather Systems)
User Guide: Orionβ„’ Weather Station (Columbia Weather Systems)Columbia Weather Systems
Β 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
Β 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
Β 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
Β 

Último (20)

OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
Β 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
Β 
Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)Carbon Dioxide Capture and Storage (CSS)
Carbon Dioxide Capture and Storage (CSS)
Β 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptx
Β 
User Guide: Magellan MXβ„’ Weather Station
User Guide: Magellan MXβ„’ Weather StationUser Guide: Magellan MXβ„’ Weather Station
User Guide: Magellan MXβ„’ Weather Station
Β 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
Β 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
Β 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
Β 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
Β 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
Β 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
Β 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Β 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
Β 
Call Girls in Majnu Ka Tilla Delhi πŸ”9711014705πŸ” Genuine
Call Girls in Majnu Ka Tilla Delhi πŸ”9711014705πŸ” GenuineCall Girls in Majnu Ka Tilla Delhi πŸ”9711014705πŸ” Genuine
Call Girls in Majnu Ka Tilla Delhi πŸ”9711014705πŸ” Genuine
Β 
Best Call Girls In Sector 29 Gurgaon❀️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❀️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❀️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❀️8860477959 EscorTs Service In 24/7 Delh...
Β 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
Β 
User Guide: Orionβ„’ Weather Station (Columbia Weather Systems)
User Guide: Orionβ„’ Weather Station (Columbia Weather Systems)User Guide: Orionβ„’ Weather Station (Columbia Weather Systems)
User Guide: Orionβ„’ Weather Station (Columbia Weather Systems)
Β 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
Β 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Β 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Β 

150514 jts london_calling