SlideShare una empresa de Scribd logo
1 de 22
Descargar para leer sin conexión
Brief	
  explana,on	
  of	
  
“Integra,ng	
  dilu,on-­‐based	
  sequencing	
  
and	
  popula,on	
  genotypes	
  	
  
for	
  single	
  individual	
  haplotyping”	
Hirotaka	
  Matsumoto
INTRODUCTION
Single	
  individual	
  haplotyping	
  (SIH)	
•  Infer	
  haplotypes	
  from	
  sequence	
  fragments.	
(SNP	
  fragments)
Single	
  individual	
  haplotyping	
  (SIH)	
•  Infer	
  haplotypes	
  from	
  sequence	
  fragments.
Single	
  individual	
  haplotyping	
  (SIH)	
•  Infer	
  haplotypes	
  from	
  sequence	
  fragments.
Dilu,on-­‐based	
  sequencing	
•  SIH	
  needs	
  long	
  DNA	
  sequencing	
  reads	
  
•  Dilu,on-­‐based	
  sequencing	
  can	
  produce	
  long	
  reads	
  
–  Fosmid	
  pool-­‐based	
  NGS	
  
	
  
	
  
–  Long	
  fragment	
  technology	
  
–  Dilu,on-­‐amplifica,on-­‐based	
  sequencing
Process	
  of	
  dilu,on-­‐based	
  seq	
DNA	
  fragments	
  are	
  separated	
  into	
  mul,ple	
  low-­‐concentra,on	
  dilu,ons.	
  
	
  
ASer	
  sequencing	
  and	
  mapping	
  an	
  aliquot,	
  mapped	
  reads	
  form	
  clusters	
  
which	
  correspond	
  to	
  DNA	
  fragments.	
  
	
  
Clusters	
  are	
  merged	
  into	
  read	
  fragments	
  (SNP	
  fragments)	
(i)	
  
	
  
(ii)	
  
	
  
	
  
(iii)	
  
	
  
Chimeric	
  fragment	
  (CF)	
•  Problem	
  of	
  producing	
  chimeric	
  fragments	
  (CFs)	
  
–  Reads	
  with	
  different	
  chromosomal	
  origins	
  are	
  regarded	
  as	
  one	
  cluster	
  
and	
  merged	
  into	
  a	
  fragment	
  when	
  an	
  aliquot	
  happen	
  to	
  have	
  some	
  
long	
  DNA	
  fragments	
  derived	
  from	
  the	
  same	
  region.	
  
–  CFs	
  significantly	
  decrease	
  the	
  accuracy	
  of	
  SIH.
METHOD	
  
	
  
target:	
  detec,on	
  of	
  CFs
Detec,on	
  of	
  CFs	
•  Basis	
  of	
  our	
  strategy	
  
– CFs	
  correspond	
  to	
  an	
  ar,ficially	
  recombinant	
  
haplotype	
  and	
  differ	
  from	
  biological	
  haplotypes	
  in	
  
the	
  popula,on.	
  
PHASE	
•  Sta,s,cal	
  phasing	
  method	
  
–  Infer	
  haplotypes	
  from	
  popula,on.	
  
–  The	
  diversity	
  of	
  haplotypes	
  is	
  limited	
  and	
  there	
  are	
  
conserved	
  haplotypes.	
  
•  We	
  use	
  PHASE	
  to	
  obtain	
  the	
  haplotype	
  candidates.	
  
–  Example	
  of	
  output	
  
A	
  candidate	
  of	
  haplotypes	
  
and	
  its	
  probability.
CF	
  detec,on	
  model	
•  We	
  model	
  the	
  probabili,es	
  that	
  a	
  SNP	
  
fragment	
  is	
  normal	
  fragment	
  and	
  chimeric	
  
fragment.	
  
•  With	
  there	
  probabili,es	
  we	
  develop	
  a	
  
indicator	
  “CSP”	
  which	
  evaluates	
  the	
  chimerity	
  
of	
  a	
  SNP	
  fragment.
NF	
  probability	
•  NF	
  probability	
  
–  The	
  probability	
  that	
  a	
  SNP	
  fragment	
  is	
  normal	
  fragment	
  (NF).	
  
–  Calculate	
  the	
  consistency	
  between	
  sta,s,cally	
  phased	
  haplotypes	
  and	
  
a	
  fragment.	
  
CF	
  probability	
•  CF	
  probability	
  
–  The	
  probability	
  that	
  a	
  SNP	
  fragment	
  is	
  chimeric	
  fragment.	
  
–  LeS	
  and	
  right	
  parts	
  are	
  derived	
  from	
  different	
  haplotypes.	
  
	
ll
CSP	
•  Chimericy	
  based	
  on	
  sta,s,cal	
  phasing	
  (CSP)	
  
•  Low	
  CSP	
  values	
  means	
  
– the	
  fragment	
  correspond	
  to	
  recombinant	
  of	
  
sta,s,cally	
  phased	
  haplotypes.	
  
– the	
  fragment	
  is	
  suspected	
  of	
  CF.
Sliding-­‐window	
  approach	
•  Running	
  ,me	
  of	
  PHASE	
  increases	
  according	
  to	
  SNP	
  
fragment	
  size.	
  
–  Complexity	
  of	
  popula,on	
  haplotypes	
  increase	
  
exponen,ally.	
  
•  We	
  use	
  sliding-­‐window	
  approach	
  (W=5).	
sliding-­‐window
RESULT
dataset	
•  Dilu,on-­‐based	
  sequencing	
  
– Kaper’s	
  data	
  
– Duitama’s	
  data	
  
•  True	
  haplotypes	
  
– Trio-­‐based	
  haplotypes	
  
•  True	
  NFs	
  and	
  CFs	
  
– Defined	
  by	
  true	
  haplotypes
CSP	
  distribu,on	
•  CSP	
  of	
  CFs	
  is	
  lower	
  than	
  that	
  of	
  NFs	
Theore,cal	
  lowest	
  value	
  (W=5)	
  
	
  -­‐	
  	
  Change	
  haplotype	
  origin	
  at	
  second	
  or	
  third	
  site.	
Fragment:	
  	
  	
  	
  00011	
  
Haplotypes:	
  00000	
  /	
  11111
CF	
  detec,on	
•  CSP	
  is	
  a	
  highly	
  efficient	
  measure	
  to	
  detect	
  CFs.
SIH	
  accuracy	
  aSer	
  removing	
  CFs	
•  The	
  accuracies	
  of	
  SIH	
  increased	
  significantly	
  
aSer	
  removing	
  CSs	
  detected	
  by	
  CSP.
CONCLUSION	
•  CSP	
  is	
  a	
  highly	
  efficient	
  measure	
  to	
  detect	
  
chimeric	
  fragments.	
  
•  SIH	
  accuracy	
  increased	
  significantly	
  aSer	
  
removing	
  CFs	
  candidates	
  detected	
  using	
  CSP.

Más contenido relacionado

Similar a CSP

GENE gene marker blood typing , abo blood typing vntr
GENE gene marker blood typing , abo blood typing vntrGENE gene marker blood typing , abo blood typing vntr
GENE gene marker blood typing , abo blood typing vntrAparnaAjayan8
 
Gene mapping methods
Gene mapping methodsGene mapping methods
Gene mapping methodsMEENAKSHI DAS
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment DesignYaoyu Wang
 
Assembling NGS Data - IMB Winter School - 3 July 2012
Assembling NGS Data - IMB Winter School - 3 July 2012Assembling NGS Data - IMB Winter School - 3 July 2012
Assembling NGS Data - IMB Winter School - 3 July 2012Torsten Seemann
 
molecular marker RFLP, and application
molecular marker RFLP, and applicationmolecular marker RFLP, and application
molecular marker RFLP, and applicationKAUSHAL SAHU
 
GENE marker typing , abo blood grouping ,karl lamdsteiner
GENE marker typing , abo blood grouping ,karl lamdsteinerGENE marker typing , abo blood grouping ,karl lamdsteiner
GENE marker typing , abo blood grouping ,karl lamdsteinerAparnaAjayan8
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...VHIR Vall d’Hebron Institut de Recerca
 
Forsharing cshl2011 sequencing
Forsharing cshl2011 sequencingForsharing cshl2011 sequencing
Forsharing cshl2011 sequencingSean Davis
 
Single Nucleotide Polymorphisms (2)-1.pptx
Single Nucleotide Polymorphisms (2)-1.pptxSingle Nucleotide Polymorphisms (2)-1.pptx
Single Nucleotide Polymorphisms (2)-1.pptxkingmaxton8
 
Gene mapping and sequencing
Gene mapping and sequencingGene mapping and sequencing
Gene mapping and sequencingPREETAM PALKAR
 
Genetic fingerprinting
Genetic fingerprintingGenetic fingerprinting
Genetic fingerprintingaqsa ayoub
 
Fundamentals of Fluorescence in situ Hybridization
Fundamentals of Fluorescence in situ Hybridization Fundamentals of Fluorescence in situ Hybridization
Fundamentals of Fluorescence in situ Hybridization Amartya Pradhan
 
2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studiesFOODCROPS
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencingshinycthomas
 
Fluoroscent insitu hybridizatio nppt
Fluoroscent insitu hybridizatio npptFluoroscent insitu hybridizatio nppt
Fluoroscent insitu hybridizatio npptGenevia Vincent
 
Dna fingerprinting by laxmee dahal
Dna fingerprinting by laxmee dahalDna fingerprinting by laxmee dahal
Dna fingerprinting by laxmee dahallaxmidahal4
 
Molecular marker by anil bl gather
Molecular marker by anil bl gatherMolecular marker by anil bl gather
Molecular marker by anil bl gatherANIL BL GATHER
 
Single Nucleotide Polymorphism
Single Nucleotide PolymorphismSingle Nucleotide Polymorphism
Single Nucleotide PolymorphismDeepender Kumar
 
Nanoball squencing
Nanoball squencingNanoball squencing
Nanoball squencingshru1604
 

Similar a CSP (20)

GENE gene marker blood typing , abo blood typing vntr
GENE gene marker blood typing , abo blood typing vntrGENE gene marker blood typing , abo blood typing vntr
GENE gene marker blood typing , abo blood typing vntr
 
Gene mapping methods
Gene mapping methodsGene mapping methods
Gene mapping methods
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 
Assembling NGS Data - IMB Winter School - 3 July 2012
Assembling NGS Data - IMB Winter School - 3 July 2012Assembling NGS Data - IMB Winter School - 3 July 2012
Assembling NGS Data - IMB Winter School - 3 July 2012
 
molecular marker RFLP, and application
molecular marker RFLP, and applicationmolecular marker RFLP, and application
molecular marker RFLP, and application
 
GENE marker typing , abo blood grouping ,karl lamdsteiner
GENE marker typing , abo blood grouping ,karl lamdsteinerGENE marker typing , abo blood grouping ,karl lamdsteiner
GENE marker typing , abo blood grouping ,karl lamdsteiner
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
Forsharing cshl2011 sequencing
Forsharing cshl2011 sequencingForsharing cshl2011 sequencing
Forsharing cshl2011 sequencing
 
Single Nucleotide Polymorphisms (2)-1.pptx
Single Nucleotide Polymorphisms (2)-1.pptxSingle Nucleotide Polymorphisms (2)-1.pptx
Single Nucleotide Polymorphisms (2)-1.pptx
 
Gene mapping and sequencing
Gene mapping and sequencingGene mapping and sequencing
Gene mapping and sequencing
 
Genetic fingerprinting
Genetic fingerprintingGenetic fingerprinting
Genetic fingerprinting
 
Fundamentals of Fluorescence in situ Hybridization
Fundamentals of Fluorescence in situ Hybridization Fundamentals of Fluorescence in situ Hybridization
Fundamentals of Fluorescence in situ Hybridization
 
2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies2007. stephen chanock. technologic issues in gwas and follow up studies
2007. stephen chanock. technologic issues in gwas and follow up studies
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
Fluoroscent insitu hybridizatio nppt
Fluoroscent insitu hybridizatio npptFluoroscent insitu hybridizatio nppt
Fluoroscent insitu hybridizatio nppt
 
Dna fingerprinting by laxmee dahal
Dna fingerprinting by laxmee dahalDna fingerprinting by laxmee dahal
Dna fingerprinting by laxmee dahal
 
Molecular marker by anil bl gather
Molecular marker by anil bl gatherMolecular marker by anil bl gather
Molecular marker by anil bl gather
 
Single Nucleotide Polymorphism
Single Nucleotide PolymorphismSingle Nucleotide Polymorphism
Single Nucleotide Polymorphism
 
Nanoball squencing
Nanoball squencingNanoball squencing
Nanoball squencing
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 

Más de Hirotaka Matsumoto

球面と双曲面の幾何学入門の入門
球面と双曲面の幾何学入門の入門球面と双曲面の幾何学入門の入門
球面と双曲面の幾何学入門の入門Hirotaka Matsumoto
 
次元圧縮周りでの気付き&1細胞発現データにおける次元圧縮の利用例@第3回wacode
次元圧縮周りでの気付き&1細胞発現データにおける次元圧縮の利用例@第3回wacode次元圧縮周りでの気付き&1細胞発現データにおける次元圧縮の利用例@第3回wacode
次元圧縮周りでの気付き&1細胞発現データにおける次元圧縮の利用例@第3回wacodeHirotaka Matsumoto
 
MixSIH: a mixture model for single individual haplotyping
MixSIH: a mixture model for single individual haplotypingMixSIH: a mixture model for single individual haplotyping
MixSIH: a mixture model for single individual haplotypingHirotaka Matsumoto
 

Más de Hirotaka Matsumoto (7)

球面と双曲面の幾何学入門の入門
球面と双曲面の幾何学入門の入門球面と双曲面の幾何学入門の入門
球面と双曲面の幾何学入門の入門
 
ISMB/ECCB2019読み会_松本
ISMB/ECCB2019読み会_松本ISMB/ECCB2019読み会_松本
ISMB/ECCB2019読み会_松本
 
ISMB2018読み会
ISMB2018読み会ISMB2018読み会
ISMB2018読み会
 
PRML11.2 - 11.6
PRML11.2 - 11.6PRML11.2 - 11.6
PRML11.2 - 11.6
 
次元圧縮周りでの気付き&1細胞発現データにおける次元圧縮の利用例@第3回wacode
次元圧縮周りでの気付き&1細胞発現データにおける次元圧縮の利用例@第3回wacode次元圧縮周りでの気付き&1細胞発現データにおける次元圧縮の利用例@第3回wacode
次元圧縮周りでの気付き&1細胞発現データにおける次元圧縮の利用例@第3回wacode
 
Prml11 sup
Prml11 supPrml11 sup
Prml11 sup
 
MixSIH: a mixture model for single individual haplotyping
MixSIH: a mixture model for single individual haplotypingMixSIH: a mixture model for single individual haplotyping
MixSIH: a mixture model for single individual haplotyping
 

Último

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Último (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

CSP

  • 1. Brief  explana,on  of   “Integra,ng  dilu,on-­‐based  sequencing   and  popula,on  genotypes     for  single  individual  haplotyping” Hirotaka  Matsumoto
  • 3. Single  individual  haplotyping  (SIH) •  Infer  haplotypes  from  sequence  fragments. (SNP  fragments)
  • 4. Single  individual  haplotyping  (SIH) •  Infer  haplotypes  from  sequence  fragments.
  • 5. Single  individual  haplotyping  (SIH) •  Infer  haplotypes  from  sequence  fragments.
  • 6. Dilu,on-­‐based  sequencing •  SIH  needs  long  DNA  sequencing  reads   •  Dilu,on-­‐based  sequencing  can  produce  long  reads   –  Fosmid  pool-­‐based  NGS       –  Long  fragment  technology   –  Dilu,on-­‐amplifica,on-­‐based  sequencing
  • 7. Process  of  dilu,on-­‐based  seq DNA  fragments  are  separated  into  mul,ple  low-­‐concentra,on  dilu,ons.     ASer  sequencing  and  mapping  an  aliquot,  mapped  reads  form  clusters   which  correspond  to  DNA  fragments.     Clusters  are  merged  into  read  fragments  (SNP  fragments) (i)     (ii)       (iii)    
  • 8. Chimeric  fragment  (CF) •  Problem  of  producing  chimeric  fragments  (CFs)   –  Reads  with  different  chromosomal  origins  are  regarded  as  one  cluster   and  merged  into  a  fragment  when  an  aliquot  happen  to  have  some   long  DNA  fragments  derived  from  the  same  region.   –  CFs  significantly  decrease  the  accuracy  of  SIH.
  • 9. METHOD     target:  detec,on  of  CFs
  • 10. Detec,on  of  CFs •  Basis  of  our  strategy   – CFs  correspond  to  an  ar,ficially  recombinant   haplotype  and  differ  from  biological  haplotypes  in   the  popula,on.  
  • 11. PHASE •  Sta,s,cal  phasing  method   –  Infer  haplotypes  from  popula,on.   –  The  diversity  of  haplotypes  is  limited  and  there  are   conserved  haplotypes.   •  We  use  PHASE  to  obtain  the  haplotype  candidates.   –  Example  of  output   A  candidate  of  haplotypes   and  its  probability.
  • 12. CF  detec,on  model •  We  model  the  probabili,es  that  a  SNP   fragment  is  normal  fragment  and  chimeric   fragment.   •  With  there  probabili,es  we  develop  a   indicator  “CSP”  which  evaluates  the  chimerity   of  a  SNP  fragment.
  • 13. NF  probability •  NF  probability   –  The  probability  that  a  SNP  fragment  is  normal  fragment  (NF).   –  Calculate  the  consistency  between  sta,s,cally  phased  haplotypes  and   a  fragment.  
  • 14. CF  probability •  CF  probability   –  The  probability  that  a  SNP  fragment  is  chimeric  fragment.   –  LeS  and  right  parts  are  derived  from  different  haplotypes.   ll
  • 15. CSP •  Chimericy  based  on  sta,s,cal  phasing  (CSP)   •  Low  CSP  values  means   – the  fragment  correspond  to  recombinant  of   sta,s,cally  phased  haplotypes.   – the  fragment  is  suspected  of  CF.
  • 16. Sliding-­‐window  approach •  Running  ,me  of  PHASE  increases  according  to  SNP   fragment  size.   –  Complexity  of  popula,on  haplotypes  increase   exponen,ally.   •  We  use  sliding-­‐window  approach  (W=5). sliding-­‐window
  • 18. dataset •  Dilu,on-­‐based  sequencing   – Kaper’s  data   – Duitama’s  data   •  True  haplotypes   – Trio-­‐based  haplotypes   •  True  NFs  and  CFs   – Defined  by  true  haplotypes
  • 19. CSP  distribu,on •  CSP  of  CFs  is  lower  than  that  of  NFs Theore,cal  lowest  value  (W=5)    -­‐    Change  haplotype  origin  at  second  or  third  site. Fragment:        00011   Haplotypes:  00000  /  11111
  • 20. CF  detec,on •  CSP  is  a  highly  efficient  measure  to  detect  CFs.
  • 21. SIH  accuracy  aSer  removing  CFs •  The  accuracies  of  SIH  increased  significantly   aSer  removing  CSs  detected  by  CSP.
  • 22. CONCLUSION •  CSP  is  a  highly  efficient  measure  to  detect   chimeric  fragments.   •  SIH  accuracy  increased  significantly  aSer   removing  CFs  candidates  detected  using  CSP.