Pine Biotech conducts monthly informational workshops on the topics related to high-throughput data analysis, interpretation and integration. The workshops highlight our research tools and educational resources developed with collaborators in the US and across the world.
5. 5
Patients
Most important clinical features
Normalizedclinicalfeatures
Missing Values
Group-to-group
associations
(bipartite network)
OmicsFeatures
PREDICTION
Enrichment of Clinical Data
Clinical characteristics of patients that are of most interest
can be predicted using other clinical features. This type of
group-to-group association network can be prepared within
a normalized set of clinical features across patients, thus
identifying patients at risk or selecting patients for specific
data to be generated. Validated associations can become a
part of a clinical application used regularly to select patients
or match patients to requirements.
Integration of Omics Data
Omics data represents a feature-rich resource that can be
used for discovery of underlying biology linked to a
particular condition, disease stage, co-morbidities and
population stratification. Integrating the omics data into a
usable, normalized resource and linking it to clinical
outcomes can enable precision targeting of patients and
more precise treatment for complex diseases.
Machine Learning Applications
7. the National Center
for Biotechnology
Information (NCBI)
Gene Expression
Omnibus repository
(GEO) alone
contains 80,985
public datasets,
spanning hundreds
of tissue types in
thousands of
organisms
Omics Data: What’s out there?
7
15. INDEPENDENT PROJECTS (ONCOLOGY SPECIALIZATION EXAMPLE)
Projects are prepared from high impact publications relevant to the specialization. Public domain datasets are curated to prepare focused
assignments illustrating how the data is processed and used to achieve similar results to the publication. Other approaches that can perform a
similar function are discussed, providing a review of the methods section of the paper. Finally, full dataset is organized into a project format that
can be analyzed for discovery.
17. 17
FALL 2018
INDIA:
USA:
ONLINE:
Louisiana Biomedical Research Network – September 2018
Georgetown University – October 2018
August 25 – workshop in Kolkata (APT SOFTWARE AT SALT LAKE)
September 6 – AIIMS (AIIMS, THEATRE ROOM )
September 15: Short Term Training in Next Generation Sequencing
(Transcriptomics and Genomics)
October 15: Short Term Training in Machine Learning for Biomedical Data
19. T-BioInfo is designed for processing, analysis and
integration of multi-omics data. The platform is used in
multiple research groups to extract meaningful insights
from large multi-omics datasets. Our current effort
expands to education, by enabling more people to
extract meaningful, data-driven insights from omics
datasets with biomedical applications. To learn more
about the platform and it’s research and educational
features, follow the highlighted links .
T-bio.info | edu.t-bio.info | server.t-bio.info
19
20. Modeling Precision Medicine
Machine Learning for Transcriptomics Data: Extracting
Meaningful insights from high-throughput biomedical data.
20
25. Files we will use in this session
25
/export-data/sciservice/data/pipelines/5629cc88e71b7bf5/upload/SRR925687_1.fq
/export-data/sciservice/data/pipelines/5629cc88e71b7bf5/upload/SRR925687_2.fq
/export-data/sciservice/data/pipelines/5629cc88e71b7bf5/upload/SRR925697_1.fq
/export-data/sciservice/data/pipelines/5629cc88e71b7bf5/upload/SRR925697_2.fq
29. Preprocessing:
• Adapters removal plus additional
trimming
• Removing PCR duplicates
29
Quantification of expression levels
Mapping
• Mapping on the set of known transcripts
• Mapping on genome (and potential
identification of novel transcripts)
• Combined strategy
RNA-Seq: overview
39. 39
Unsupervised analysis: Hierarchical Clustering
• Identify groups
• Associate sample to group
Why use clustering?
• Various methods
• Random selection in some methods
• Interpretation
Considerations:
53. 53
Upcoming Programs and Registration:
Elia Brodsky
Co-founder and CEO
Pine Biotech (pine-biotech.com)
bit.ly/2KG7iEj
https://edu.t-bio.info/workshops/
elia@pine.bio; sahil@pine.bio