SlideShare a Scribd company logo
1 of 30
Download to read offline
Minimum information about an Adaptive Immune Receptor
Repertoire Sequencing Experiment
Presenter: Syed Ahmad Chan Bukhari, PhD
Department of Pathology, Yale School of Medicine
Inability to reproduce scientific experiments is a
big challenge.
Lithgow, G. J., Driscoll, M., & Phillips, P. (2017). A long
journey to reproducible results. Nature News, 548(7668), 387.
● A drug-like molecule could extend an
roundworm lifespan by as much as 67%.
● Other labs failed to replicate the studies.
● Two cancer labs spent more than a year
trying to understand inconsistencies with
same tumour biopsy.
● Because of lack of standards, both labs
were using different cell isolation
protocols.
Inability to reproduce scientific experiments is a
big challenge.
Begley, C. G., & Ellis, L. M. (2012). Drug development: Raise standards for
preclinical cancer research. Nature, 483(7391), 531-533.
● Amgen could reproduce the findings
in only 6 of 53 “landmark” papers
in cancer biology
● Bayer could validate only 25% of
67 preclinical studies
Inability to reproduce scientific experiments can
have multiple reasons behind.
● Undocumented scientific procedures
● Datasets size and variability
● Problem with statistical techniques
● A documented but a difficult procedure to follow
Standardization is a proven way to make sense
to scientific procedures and outcomes.
How and what array platform used?
Experiments in Immunology facing the similar
reproducibility challenges
● High-throughput sequencing (HTS) of B-cell (antibody, immunoglobulin) and T-cell
receptor repertoires has increased dramatically since the technique was
introduced in 2009.
○ Previously relied on low-resolution approaches, such as flow cytometry, spectratyping and
Sanger sequencing
● B cell receptors (BCRs) and T cell
receptors (TCRs) serve as the primary
means for specific detection of foreign
antigens.
Adaptive Immune Receptor Repertoire (AIRR) Sequencing
● Collection of BCRs or TCRs in an individual, tissue, cell subset or during an immune
response is referred to as the repertoire.
● AIRR-seq studies are associated with complex metadata, such as donor
phenotypes, cell types and nucleic acid material used.
○ Crucial for ensuring reproducibility and facilitating secondary and meta analyses
● AIRR sequencing has enormous promise for understanding the dynamics of the
immune repertoire in vaccinology, infectious disease, autoimmunity, and cancer
biology.
Adaptive Immune Receptor Repertoire (AIRR) Sequencing
(Popularity)
Adaptive Immune-Receptor Repertoire (AIRR) Community
Next-generation sequencing of B & T cell receptor repertoires (AIRR-seq)
Developing standard protocols for reporting and sharing AIRR-seq data to
optimize their use in biomedical research and patient care
AIRR Community
Formed
AIRR Community Data Elements
Each of the 6 high-level principles has been expanded into a set of data elements
“Accurate specification of the pathophysiological
condition is important for cross-comparison of
multiple studies”
● This set describes the experimental study design including the title of the
study, laboratory contact information etc
● For individual subjects, the species, sex, age, and ancestry are included
along with information about disease state(s) etc
● This set describes the metadata about the diagnosis process
“Information about the origin and expected
composition of the biological sample(s) is central
for the interpretation of downstream sequencing
results.”
“Proper interpretation of experimental results for
future comparative analysis require information”
● How cells are prepared for processing?
● how the sequencing is performed?
● Quality of the data produced are all critically important too.
“MiAIRR focuses on what information need to be
shared rather suggesting the analysis techniques
and tools”
Providing raw data enables the most
up-to-date data processing to be performed,
as the analysis tools for AIRR-seq data are
undergoing rapid evolution
● Providing the raw NGS data for each sequencing run (e.g., FASTQ files) permits the
reanalysis, secondary analysis and combination of multiple data sets from different
studies using meta-analysis techniques.
● Variety of tools are in use sequencing and processing. MiAIRR does not
provide tool specific details.
● MiAIRR defines broad categories that cover the essential data processing
steps.
● The software tools with version numbers, quality
thresholds, primer match and length cutoffs, etc.
● This final MiAIRR set will thus comprise the list of processed
sequences, along with sequence-level annotations.
● This should include the V(D)J gene segment and constant region (isotype)
annotation if used in the associated publication, along with the CDR3
sequence.
MiAIRR Elements Distribution to the NCBI
How MiAIRR elements look like?
https://github.com/airr-community/airr-standards/blob/master/AIRR_Minimal_Standard_Data_Elements.tsv
https://github.com/airr-community/airr-standards
BioSample
Sequence Read Archive
CAIRR: A pipeline to submit AIRR
data to the NCBI through the
CEDAR-workbench
NCBI is an important resource to archive biomedical data
● NCBI hosts a collection of biomedical databases:
○ BioProject, BioSample, SRA, GenBank, GEO etc.
● Provide infrastructure to submit experimental data and associated metadata
● Minimal use of standard terminologies to define the necessary metadata
○ Ontologies recommended for some data elements (Not implemented)
● NCBI metadata are often described using inconsistent terminologies
○ Limit our ability to access, find, interoperate and reuse the data sets
Goal: Leverage CEDAR to improve NCBI metadata submissions
NCBI BioSample guideline suggests to use Disease Ontology terms
What are the issues with the current NCBI
submission process?
● Rapid growth
● Lack of metadata standardization
● Error prone data entry
● Lack of community-specific metadata
(e.g., AIRR)
● Laborious metadata entry
NCBIGrowth
GenBankGrowth
Metadata Diversity in NCBI repositories
How are metadata currently submitted to NCBI?
BioProject
BioSample
Sequence Read Archive
Combination of web-based forms
and excel templates
● No mechanism to enforce standardized
vocabularies or ontology links
CAIRR Workflow
CAIRR Templates
Created CEDAR templates to submit metadata to:
NCBI BioProject, BioSample and SRA
AIRR Data Submission
CAIRR Metadata Generation
Data Submitter
NCBI CAIRR
Controlled Vocabularies
Predictive Entry
Interactive Metadata Entry
Metadata Findability
Metadata Accessibility
Metadata Interoperability
Metadata Reusability
represents limited features availability
Metadata submissions to NCBI BioProject, BioSample
and SRA are ontologically controlled and relationally
linked, which enables concept-based federated queries
across repositories that are silos otherwise.
Why CAIRR?
Resources
● Download AIRR NCBI templates:
https://github.com/airr-community/airr-standards
● How to submit AIRR data to NCBI Manual?
https://www.overleaf.com/read/tytddwptgkhb
Breden et. al. “Reproducibility and Reuse of Adaptive
Immune Receptor Repertoire Data” (2017)Rubelt, F., Busse, C., Bukhari, SAC et. al. “Adaptive Immune
Receptor Repertoire (AIRR) Community Recommendations for
Sharing Immune Repertoire Sequencing Data” (2017)
Kei-Hoi Cheung, Yale University, Dept. of Medical Informatics
● AIRR Community
Kleinstein Lab, Yale University, Dept. of Pathology

More Related Content

What's hot

Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
CEDAR: Center for Expanded Data Annotation and Retrieval
 
Bioinformatics Analysis of Nucleotide Sequences
Bioinformatics Analysis of Nucleotide SequencesBioinformatics Analysis of Nucleotide Sequences
Bioinformatics Analysis of Nucleotide Sequences
Adrian Gustavo Avellaneda Vergara
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical Informatics
Chimezie Ogbuji
 

What's hot (20)

Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
Final Acb All Hands 26 11 07.Key
Final Acb All Hands 26 11 07.KeyFinal Acb All Hands 26 11 07.Key
Final Acb All Hands 26 11 07.Key
 
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
 
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
 
Bioinformatics Analysis of Nucleotide Sequences
Bioinformatics Analysis of Nucleotide SequencesBioinformatics Analysis of Nucleotide Sequences
Bioinformatics Analysis of Nucleotide Sequences
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
NETTAB 2012
NETTAB 2012NETTAB 2012
NETTAB 2012
 
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata ...
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
 
Being FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
 
Enabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology supportEnabling faster analysis of vaccine adverse event reports with ontology support
Enabling faster analysis of vaccine adverse event reports with ontology support
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
 
Career oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of BioinformaticsCareer oppurtunities in the field of Bioinformatics
Career oppurtunities in the field of Bioinformatics
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbget
 
Drug Discovery- ELRIG -2012
Drug Discovery- ELRIG -2012Drug Discovery- ELRIG -2012
Drug Discovery- ELRIG -2012
 
Phylogenetics: Making publication-quality tree figures
Phylogenetics: Making publication-quality tree figuresPhylogenetics: Making publication-quality tree figures
Phylogenetics: Making publication-quality tree figures
 
NETTAB 2013
NETTAB 2013NETTAB 2013
NETTAB 2013
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical Informatics
 

Similar to MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Sequencing Experiment

2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)
Michael Atkins
 
FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_Presentation
Yatpang Cheung
 
Cancer Analytics Poster
Cancer Analytics PosterCancer Analytics Poster
Cancer Analytics Poster
Michael Atkins
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
Ian Foster
 
Multivariate data analysis
Multivariate data analysisMultivariate data analysis
Multivariate data analysis
Setia Pramana
 

Similar to MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Sequencing Experiment (20)

2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)
 
Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례
 
FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_Presentation
 
Leveraging CEDAR workbench for ontology-linked submission of adaptive immune ...
Leveraging CEDAR workbench for ontology-linked submission of adaptive immune ...Leveraging CEDAR workbench for ontology-linked submission of adaptive immune ...
Leveraging CEDAR workbench for ontology-linked submission of adaptive immune ...
 
Cancer Analytics Poster
Cancer Analytics PosterCancer Analytics Poster
Cancer Analytics Poster
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformatics
 
Enabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph DatabaseEnabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph Database
 
Standardization of the HIPC Data Templates
Standardization of the HIPC Data TemplatesStandardization of the HIPC Data Templates
Standardization of the HIPC Data Templates
 
Standardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So FarStandardization of the HIPC Data Templates: The Story So Far
Standardization of the HIPC Data Templates: The Story So Far
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious Diseases
 
Next-Generation Sequencing and Data Analysis.pptx
Next-Generation Sequencing and Data Analysis.pptxNext-Generation Sequencing and Data Analysis.pptx
Next-Generation Sequencing and Data Analysis.pptx
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptx
 
NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016NCI Cancer Genomic Data Commons for NCAB September 2016
NCI Cancer Genomic Data Commons for NCAB September 2016
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
 
Multivariate data analysis
Multivariate data analysisMultivariate data analysis
Multivariate data analysis
 
AIQC - ISCB 2022.pdf
AIQC - ISCB 2022.pdfAIQC - ISCB 2022.pdf
AIQC - ISCB 2022.pdf
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
 

More from Syed Ahmad Chan Bukhari, PhD

BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
Syed Ahmad Chan Bukhari, PhD
 

More from Syed Ahmad Chan Bukhari, PhD (9)

CEDAR: Easing Authoring of Metadata to Make Biomedical Data Sets More Findabl...
CEDAR: Easing Authoring of Metadata to Make Biomedical Data Sets More Findabl...CEDAR: Easing Authoring of Metadata to Make Biomedical Data Sets More Findabl...
CEDAR: Easing Authoring of Metadata to Make Biomedical Data Sets More Findabl...
 
Finding and Reusing Biomedical Datasets using CEDAR Metadata Repository and T...
Finding and Reusing Biomedical Datasets using CEDAR Metadata Repository and T...Finding and Reusing Biomedical Datasets using CEDAR Metadata Repository and T...
Finding and Reusing Biomedical Datasets using CEDAR Metadata Repository and T...
 
CEDAR Technologies for AIRR Submissions
CEDAR Technologies for AIRR SubmissionsCEDAR Technologies for AIRR Submissions
CEDAR Technologies for AIRR Submissions
 
CEDAR: Web-Based Tools for Accelerating the Creation of Standardized Metadata
CEDAR: Web-Based Tools for Accelerating the Creation of Standardized MetadataCEDAR: Web-Based Tools for Accelerating the Creation of Standardized Metadata
CEDAR: Web-Based Tools for Accelerating the Creation of Standardized Metadata
 
CAIRR: A pipeline to submit AIRR data to the NCBI through the CEDAR Workbench
CAIRR: A pipeline to submit AIRR data to the NCBI through the CEDAR WorkbenchCAIRR: A pipeline to submit AIRR data to the NCBI through the CEDAR Workbench
CAIRR: A pipeline to submit AIRR data to the NCBI through the CEDAR Workbench
 
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
BioNLP-SADI: A Suite of interoperable BioNLP Semantic Web Services based on S...
 
Type 2 fuzzy ontology ahmadchan
Type 2 fuzzy ontology ahmadchanType 2 fuzzy ontology ahmadchan
Type 2 fuzzy ontology ahmadchan
 
AN Intelligent Realtime multiple vessel collision risk assessment system
AN Intelligent Realtime multiple vessel collision risk assessment system AN Intelligent Realtime multiple vessel collision risk assessment system
AN Intelligent Realtime multiple vessel collision risk assessment system
 
Type-2 Fuzzy Ontology
Type-2 Fuzzy OntologyType-2 Fuzzy Ontology
Type-2 Fuzzy Ontology
 

Recently uploaded

Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
shivangimorya083
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
shivangimorya083
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
Lars Albertsson
 

Recently uploaded (20)

Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 

MiAIRR:Minimum information about an Adaptive Immune Receptor Repertoire Sequencing Experiment

  • 1. Minimum information about an Adaptive Immune Receptor Repertoire Sequencing Experiment Presenter: Syed Ahmad Chan Bukhari, PhD Department of Pathology, Yale School of Medicine
  • 2. Inability to reproduce scientific experiments is a big challenge. Lithgow, G. J., Driscoll, M., & Phillips, P. (2017). A long journey to reproducible results. Nature News, 548(7668), 387. ● A drug-like molecule could extend an roundworm lifespan by as much as 67%. ● Other labs failed to replicate the studies. ● Two cancer labs spent more than a year trying to understand inconsistencies with same tumour biopsy. ● Because of lack of standards, both labs were using different cell isolation protocols.
  • 3. Inability to reproduce scientific experiments is a big challenge. Begley, C. G., & Ellis, L. M. (2012). Drug development: Raise standards for preclinical cancer research. Nature, 483(7391), 531-533. ● Amgen could reproduce the findings in only 6 of 53 “landmark” papers in cancer biology ● Bayer could validate only 25% of 67 preclinical studies
  • 4. Inability to reproduce scientific experiments can have multiple reasons behind. ● Undocumented scientific procedures ● Datasets size and variability ● Problem with statistical techniques ● A documented but a difficult procedure to follow
  • 5. Standardization is a proven way to make sense to scientific procedures and outcomes. How and what array platform used?
  • 6. Experiments in Immunology facing the similar reproducibility challenges ● High-throughput sequencing (HTS) of B-cell (antibody, immunoglobulin) and T-cell receptor repertoires has increased dramatically since the technique was introduced in 2009. ○ Previously relied on low-resolution approaches, such as flow cytometry, spectratyping and Sanger sequencing ● B cell receptors (BCRs) and T cell receptors (TCRs) serve as the primary means for specific detection of foreign antigens.
  • 7. Adaptive Immune Receptor Repertoire (AIRR) Sequencing ● Collection of BCRs or TCRs in an individual, tissue, cell subset or during an immune response is referred to as the repertoire. ● AIRR-seq studies are associated with complex metadata, such as donor phenotypes, cell types and nucleic acid material used. ○ Crucial for ensuring reproducibility and facilitating secondary and meta analyses ● AIRR sequencing has enormous promise for understanding the dynamics of the immune repertoire in vaccinology, infectious disease, autoimmunity, and cancer biology.
  • 8. Adaptive Immune Receptor Repertoire (AIRR) Sequencing (Popularity)
  • 9. Adaptive Immune-Receptor Repertoire (AIRR) Community Next-generation sequencing of B & T cell receptor repertoires (AIRR-seq) Developing standard protocols for reporting and sharing AIRR-seq data to optimize their use in biomedical research and patient care AIRR Community Formed
  • 10. AIRR Community Data Elements Each of the 6 high-level principles has been expanded into a set of data elements
  • 11. “Accurate specification of the pathophysiological condition is important for cross-comparison of multiple studies” ● This set describes the experimental study design including the title of the study, laboratory contact information etc ● For individual subjects, the species, sex, age, and ancestry are included along with information about disease state(s) etc ● This set describes the metadata about the diagnosis process
  • 12. “Information about the origin and expected composition of the biological sample(s) is central for the interpretation of downstream sequencing results.”
  • 13. “Proper interpretation of experimental results for future comparative analysis require information” ● How cells are prepared for processing? ● how the sequencing is performed? ● Quality of the data produced are all critically important too. “MiAIRR focuses on what information need to be shared rather suggesting the analysis techniques and tools”
  • 14. Providing raw data enables the most up-to-date data processing to be performed, as the analysis tools for AIRR-seq data are undergoing rapid evolution ● Providing the raw NGS data for each sequencing run (e.g., FASTQ files) permits the reanalysis, secondary analysis and combination of multiple data sets from different studies using meta-analysis techniques.
  • 15. ● Variety of tools are in use sequencing and processing. MiAIRR does not provide tool specific details. ● MiAIRR defines broad categories that cover the essential data processing steps. ● The software tools with version numbers, quality thresholds, primer match and length cutoffs, etc.
  • 16. ● This final MiAIRR set will thus comprise the list of processed sequences, along with sequence-level annotations. ● This should include the V(D)J gene segment and constant region (isotype) annotation if used in the associated publication, along with the CDR3 sequence.
  • 18. How MiAIRR elements look like? https://github.com/airr-community/airr-standards/blob/master/AIRR_Minimal_Standard_Data_Elements.tsv
  • 20. CAIRR: A pipeline to submit AIRR data to the NCBI through the CEDAR-workbench
  • 21. NCBI is an important resource to archive biomedical data ● NCBI hosts a collection of biomedical databases: ○ BioProject, BioSample, SRA, GenBank, GEO etc. ● Provide infrastructure to submit experimental data and associated metadata ● Minimal use of standard terminologies to define the necessary metadata ○ Ontologies recommended for some data elements (Not implemented) ● NCBI metadata are often described using inconsistent terminologies ○ Limit our ability to access, find, interoperate and reuse the data sets Goal: Leverage CEDAR to improve NCBI metadata submissions NCBI BioSample guideline suggests to use Disease Ontology terms
  • 22. What are the issues with the current NCBI submission process? ● Rapid growth ● Lack of metadata standardization ● Error prone data entry ● Lack of community-specific metadata (e.g., AIRR) ● Laborious metadata entry NCBIGrowth GenBankGrowth Metadata Diversity in NCBI repositories
  • 23. How are metadata currently submitted to NCBI? BioProject BioSample Sequence Read Archive Combination of web-based forms and excel templates ● No mechanism to enforce standardized vocabularies or ontology links
  • 25. CAIRR Templates Created CEDAR templates to submit metadata to: NCBI BioProject, BioSample and SRA
  • 27. CAIRR Metadata Generation Data Submitter NCBI CAIRR Controlled Vocabularies Predictive Entry Interactive Metadata Entry Metadata Findability Metadata Accessibility Metadata Interoperability Metadata Reusability represents limited features availability Metadata submissions to NCBI BioProject, BioSample and SRA are ontologically controlled and relationally linked, which enables concept-based federated queries across repositories that are silos otherwise. Why CAIRR?
  • 28. Resources ● Download AIRR NCBI templates: https://github.com/airr-community/airr-standards ● How to submit AIRR data to NCBI Manual? https://www.overleaf.com/read/tytddwptgkhb
  • 29. Breden et. al. “Reproducibility and Reuse of Adaptive Immune Receptor Repertoire Data” (2017)Rubelt, F., Busse, C., Bukhari, SAC et. al. “Adaptive Immune Receptor Repertoire (AIRR) Community Recommendations for Sharing Immune Repertoire Sequencing Data” (2017)
  • 30. Kei-Hoi Cheung, Yale University, Dept. of Medical Informatics ● AIRR Community Kleinstein Lab, Yale University, Dept. of Pathology