SlideShare a Scribd company logo
1 of 88
Download to read offline
UCSC
genome browsing

        Paco Hulpiau



  http://www.bits.vib.be
Introduction


§
    Browse genes in their genomic context
§
    See features in and around a specific gene
§
    Investigate genome organization and explore larger

    chromosome regions
§
    Search and retrieve information on a gene- and

    genome-scale
§
    Compare genomes
Introduction

§
    Collaboration between main genome browsers
       Ensembl, UCSC and NCBI
       » use same genome assemblies
       » interlinking between sites

§
    Ensembl Genome Browser: http://www.ensembl.org/
§
    NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/
§
    UCSC Genome Browser: http://genome.ucsc.edu/
Introduction
Introduction
Introduction
Introduction
Introduction
Introduction

§
    Collaboration between main genome browsers
       Ensembl, UCSC and NCBI
       » use same genome assemblies
       » interlinking between sites

§
    Ensembl Genome Browser: http://www.ensembl.org/
§
    NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/
§
    UCSC Genome Browser: http://genome.ucsc.edu/
Introduction
Introduction
Introduction

§
    Collaboration between main genome browsers
       Ensembl, UCSC and NCBI
       » use same genome assemblies
       » interlinking between sites

§
    Ensembl Genome Browser: http://www.ensembl.org/
§
    NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/
§
    UCSC Genome Browser: http://genome.ucsc.edu/
Introduction
Introduction
Introduction
Introduction

§
    Other genome browsers and genome databases:
http://genome.jgi-psf.org      Eukaryotic (143) and prokaryotic (505) genomes

http://www.xenbase.org         Xenopus tropicalis

http://flybase.org             Drosophila genes & genomes

http://www.wormbase.org             C. elegans and some related nematodes

http://www.tigr.org => http://www.jcvi.org/ Comprehensive Microbial Resource (CMR)
                    => http://cmr.jcvi.org/tigr-scripts/CMR/CmrHomePage.cgi

http://genolist.pasteur.fr     Microbial genomes
Introduction
Introduction
§   The UCSC Genome browser was created by the

    Genome Bioinformatics Group

    at the University of California Santa Cruz (UCSC).



    http://genome.ucsc.edu/
§
    The Genome Browser zooms and scrolls
    over chromosomes, showing the work of
    annotators worldwide.
§   Blat quickly maps your sequence to the genome.

             BLAT is not BLAST !
BLAT works by keeping an index of the entire genome in memory.
The index consists of all non-overlapping DNA 11-mers or protein 4-mers.
The index is used to find areas of probable homology, which are then
loaded into memory for a detailed alignment.

BLAT on DNA can quickly find sequences of 95% and greater similarity
of length 40 bases or more.

BLAT on proteins finds sequences of 80% and greater similarity of length
20 amino acids or more.
§
    The Table Browser provides convenient
    access to the underlying database.
§   The Gene Sorter displays a sorted table of genes

    that are related to one another.

    The relationship can be one of several types, including protein-
level homology,

    similarity of gene expression profiles,

    or genomic proximity.
§   In-Silico PCR searches a sequence database with a pair of PCR
    primers, using an indexing strategy for fast performance.
§   When successful, the search returns a file (fasta) containing all
    sequences in the database that lie between and include the
    primer pair.
§   Genome Graphs is a tool for displaying

    genome-wide data sets such as the results

    of genome-wide SNP association studies,

    linkage studies and homozygosity mapping.
§   Galaxy allows you to do analyses you cannot do
    anywhere else without the need to install or
    download anything.
§   You can analyze multiple alignments, compare
    genomic annotations and much more...
§   VisiGene lets you browse through a large

    collection of in situ mouse and frog images.
§   The Proteome Browser provides a wealth of

    protein information presented in the form of
graphical images of tracks and histograms

    and links to other sites.
§   The Utilities page contains links to some tools

    created by the UCSC Genome Bioinformatics Group.


§   DNA Duster & Protein Duster remove non-sequence

    related characters from an input sequence.
§   The Utilities page contains links to some tools

    created by the UCSC Genome Bioinformatics Group.


§   DNA Duster & Protein Duster remove non-sequence

    related characters from an input sequence.
Clade – Genome - Assembly
GENOME
BROWSER
DISPLAY
POSITION
CONTROL
TRACK
CONTROL
Navigation: position control
Navigation: position control




§
    Click the zoom in and zoom out buttons on top

       to zoom in or out 1.5, 3 or 10-fold

       on the center of the window
Navigation: position control




§
    Zoom in 3-fold by clicking anywhere

       on the base position track
§
    Zoom to a specific region using “drag and zoom”
Navigation: position control




§
    To scroll the view of the display horizontally

       by set increments of 10%, 50% or 95%

       of the displayed size (as given in base pairs)

       click the corresponding move arrow
Navigation: position control




§
    To scroll the left of right side by a specified number of

    vertical gridlines while keeping the opposite side fixed

       click the appropriate move start or move end

    arrow
Navigation: position control




§
    To display a (completely) different position

       enter the new location in the position/search text

    box
§
    You can also jump to an other gene location
Annotation Tracks




 TRACK
CONTROL
HIDE = removes a track from view



FULL = each item on a separate line
DENSE = all items collapsed into single line

  SQUISH = all items on several lines
     PACKED and at 50% height
    PACK = each item separate and
    efficiently stacked (full height)
Annotation Tracks
Annotation Tracks




§
    Different genome/assembly => different tracks!
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks


§
    Now try to change the tracks as follows
Annotation Tracks


§
    and...
DENSE


 FULL




SQUISH




SQUISH


PACK
direction of transcription

UTR   EXON                                EXON
                      INTRON
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Annotation Tracks
Browser graphics in PDF



        TABLE              GET          CURRENT
        BROWSER            DNA          BROWSER
                                        GRAPHIC IN PDF




                                   TO GET
                                   OTHER
CLICK                               DATA
LINE
1
      CURRENT
      BROWSER
      GRAPHIC IN PDF




TO GET
OTHER
 DATA
Exercises (I)


1)   Search for your gene of interest
         on Human Feb. 2009 (GRCh37/hg19) Assembly


         » Include 1000 base pairs up- and downstream


         » Only show the tracks:
            RefSeq Genes (pack)
            Conservation (full, primates only)


         » Save graphical view as PDF (exercises1_1)
Exercises (I)


2)   How many transcripts are there?



        » Compare UCSC Genes with RefSeq and Ensembl genes!


        » Save graphical view as PDF (exercises1_2)
Exercises (I)


3)   What are the flanking genes?

         Are these conserved outside mammals?

        » Zoom out until you can see at least

          two or three flanking genes

         (may need to hide some tracks, leave RefSeq on)


        » Now have a look in the chicken genome

        » Save graphical view as PDF

          (exercises1_3a en exercises1_3b)
Exercises (I)


4)   Is there any regulatory information available?

         » Change the view to see the genomic region upstream

           (exon 1 and ~2000 upstream) and open some regulatory tracks

           e.g. ORegAnno, TFBS Conserved, TS miRNA sites

         » Save graphical view as PDF (exercises1_4)
BITS: UCSC genome browser - Part 1
BITS: UCSC genome browser - Part 1
BITS: UCSC genome browser - Part 1

More Related Content

What's hot

Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
hemantbreeder
 

What's hot (20)

Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
Tech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome BrowserTech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome Browser
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
The ensembl database
The ensembl databaseThe ensembl database
The ensembl database
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Tools and database of NCBI
Tools and database of NCBITools and database of NCBI
Tools and database of NCBI
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Clustal
ClustalClustal
Clustal
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Protein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural AlignmentProtein Structure, Databases and Structural Alignment
Protein Structure, Databases and Structural Alignment
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Kegg databse
Kegg databseKegg databse
Kegg databse
 
Parsimony methods
Parsimony methodsParsimony methods
Parsimony methods
 
Ddbj
DdbjDdbj
Ddbj
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
Dynamic programming
Dynamic programming Dynamic programming
Dynamic programming
 

Viewers also liked (9)

EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
 
NCBI
NCBINCBI
NCBI
 
Presentation on Biological database By Elufer Akram @ University Of Science ...
Presentation on Biological database  By Elufer Akram @ University Of Science ...Presentation on Biological database  By Elufer Akram @ University Of Science ...
Presentation on Biological database By Elufer Akram @ University Of Science ...
 
GenomeBrowser
GenomeBrowserGenomeBrowser
GenomeBrowser
 
BITS training - UCSC Genome Browser - Part 2
BITS training - UCSC Genome Browser - Part 2BITS training - UCSC Genome Browser - Part 2
BITS training - UCSC Genome Browser - Part 2
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biological Databases
Biological DatabasesBiological Databases
Biological Databases
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
 
Protein databases
Protein databasesProtein databases
Protein databases
 

Similar to BITS: UCSC genome browser - Part 1

Sequencedatabases
SequencedatabasesSequencedatabases
Sequencedatabases
Abhik Seal
 

Similar to BITS: UCSC genome browser - Part 1 (20)

Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research community
 
Sequencedatabases
SequencedatabasesSequencedatabases
Sequencedatabases
 
Apolo Taller en BIOS
Apolo Taller en BIOS Apolo Taller en BIOS
Apolo Taller en BIOS
 
Apollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityApollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research Community
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination Network
 
BLAST
BLASTBLAST
BLAST
 
Introduction to Apollo for i5k
Introduction to Apollo for i5kIntroduction to Apollo for i5k
Introduction to Apollo for i5k
 
Bioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptxBioinformaatics for M.Sc. Biotecchnology.pptx
Bioinformaatics for M.Sc. Biotecchnology.pptx
 
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
 
Genome Curation using Apollo
Genome Curation using ApolloGenome Curation using Apollo
Genome Curation using Apollo
 
Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07Apollo Introduction for i5K Groups 2015-10-07
Apollo Introduction for i5K Groups 2015-10-07
 
Web Apollo Workshop University of Exeter
Web Apollo Workshop University of ExeterWeb Apollo Workshop University of Exeter
Web Apollo Workshop University of Exeter
 
Bioinformatica t2-databases
Bioinformatica t2-databasesBioinformatica t2-databases
Bioinformatica t2-databases
 
Bioinformatics t2-databases v2014
Bioinformatics t2-databases v2014Bioinformatics t2-databases v2014
Bioinformatics t2-databases v2014
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
 
Understanding Genome
Understanding Genome Understanding Genome
Understanding Genome
 
Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02Sequencealignmentinbioinformatics 100204112518-phpapp02
Sequencealignmentinbioinformatics 100204112518-phpapp02
 
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVSExploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
 
BiDiBlast Tool Presentation
BiDiBlast Tool PresentationBiDiBlast Tool Presentation
BiDiBlast Tool Presentation
 

More from BITS

More from BITS (20)

RNA-seq for DE analysis: detecting differential expression - part 5
RNA-seq for DE analysis: detecting differential expression - part 5RNA-seq for DE analysis: detecting differential expression - part 5
RNA-seq for DE analysis: detecting differential expression - part 5
 
RNA-seq for DE analysis: extracting counts and QC - part 4
RNA-seq for DE analysis: extracting counts and QC - part 4RNA-seq for DE analysis: extracting counts and QC - part 4
RNA-seq for DE analysis: extracting counts and QC - part 4
 
RNA-seq for DE analysis: the biology behind observed changes - part 6
RNA-seq for DE analysis: the biology behind observed changes - part 6RNA-seq for DE analysis: the biology behind observed changes - part 6
RNA-seq for DE analysis: the biology behind observed changes - part 6
 
RNA-seq: analysis of raw data and preprocessing - part 2
RNA-seq: analysis of raw data and preprocessing - part 2RNA-seq: analysis of raw data and preprocessing - part 2
RNA-seq: analysis of raw data and preprocessing - part 2
 
RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1RNA-seq: general concept, goal and experimental design - part 1
RNA-seq: general concept, goal and experimental design - part 1
 
RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3RNA-seq: Mapping and quality control - part 3
RNA-seq: Mapping and quality control - part 3
 
Productivity tips - Introduction to linux for bioinformatics
Productivity tips - Introduction to linux for bioinformaticsProductivity tips - Introduction to linux for bioinformatics
Productivity tips - Introduction to linux for bioinformatics
 
Text mining on the command line - Introduction to linux for bioinformatics
Text mining on the command line - Introduction to linux for bioinformaticsText mining on the command line - Introduction to linux for bioinformatics
Text mining on the command line - Introduction to linux for bioinformatics
 
The structure of Linux - Introduction to Linux for bioinformatics
The structure of Linux - Introduction to Linux for bioinformaticsThe structure of Linux - Introduction to Linux for bioinformatics
The structure of Linux - Introduction to Linux for bioinformatics
 
Managing your data - Introduction to Linux for bioinformatics
Managing your data - Introduction to Linux for bioinformaticsManaging your data - Introduction to Linux for bioinformatics
Managing your data - Introduction to Linux for bioinformatics
 
Introduction to Linux for bioinformatics
Introduction to Linux for bioinformaticsIntroduction to Linux for bioinformatics
Introduction to Linux for bioinformatics
 
BITS - Genevestigator to easily access transcriptomics data
BITS - Genevestigator to easily access transcriptomics dataBITS - Genevestigator to easily access transcriptomics data
BITS - Genevestigator to easily access transcriptomics data
 
BITS - Comparative genomics: the Contra tool
BITS - Comparative genomics: the Contra toolBITS - Comparative genomics: the Contra tool
BITS - Comparative genomics: the Contra tool
 
BITS - Comparative genomics on the genome level
BITS - Comparative genomics on the genome levelBITS - Comparative genomics on the genome level
BITS - Comparative genomics on the genome level
 
BITS - Comparative genomics: gene family analysis
BITS - Comparative genomics: gene family analysisBITS - Comparative genomics: gene family analysis
BITS - Comparative genomics: gene family analysis
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomics
 
BITS - Protein inference from mass spectrometry data
BITS - Protein inference from mass spectrometry dataBITS - Protein inference from mass spectrometry data
BITS - Protein inference from mass spectrometry data
 
BITS - Overview of sequence databases for mass spectrometry data analysis
BITS - Overview of sequence databases for mass spectrometry data analysisBITS - Overview of sequence databases for mass spectrometry data analysis
BITS - Overview of sequence databases for mass spectrometry data analysis
 
BITS - Search engines for mass spec data
BITS - Search engines for mass spec dataBITS - Search engines for mass spec data
BITS - Search engines for mass spec data
 
BITS - Introduction to proteomics
BITS - Introduction to proteomicsBITS - Introduction to proteomics
BITS - Introduction to proteomics
 

Recently uploaded

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 

Recently uploaded (20)

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 

BITS: UCSC genome browser - Part 1

  • 1. UCSC genome browsing Paco Hulpiau http://www.bits.vib.be
  • 2. Introduction § Browse genes in their genomic context § See features in and around a specific gene § Investigate genome organization and explore larger chromosome regions § Search and retrieve information on a gene- and genome-scale § Compare genomes
  • 3. Introduction § Collaboration between main genome browsers Ensembl, UCSC and NCBI » use same genome assemblies » interlinking between sites § Ensembl Genome Browser: http://www.ensembl.org/ § NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/ § UCSC Genome Browser: http://genome.ucsc.edu/
  • 9. Introduction § Collaboration between main genome browsers Ensembl, UCSC and NCBI » use same genome assemblies » interlinking between sites § Ensembl Genome Browser: http://www.ensembl.org/ § NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/ § UCSC Genome Browser: http://genome.ucsc.edu/
  • 12. Introduction § Collaboration between main genome browsers Ensembl, UCSC and NCBI » use same genome assemblies » interlinking between sites § Ensembl Genome Browser: http://www.ensembl.org/ § NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/ § UCSC Genome Browser: http://genome.ucsc.edu/
  • 16. Introduction § Other genome browsers and genome databases: http://genome.jgi-psf.org Eukaryotic (143) and prokaryotic (505) genomes http://www.xenbase.org Xenopus tropicalis http://flybase.org Drosophila genes & genomes http://www.wormbase.org C. elegans and some related nematodes http://www.tigr.org => http://www.jcvi.org/ Comprehensive Microbial Resource (CMR) => http://cmr.jcvi.org/tigr-scripts/CMR/CmrHomePage.cgi http://genolist.pasteur.fr Microbial genomes
  • 19. § The UCSC Genome browser was created by the Genome Bioinformatics Group at the University of California Santa Cruz (UCSC). http://genome.ucsc.edu/
  • 20. § The Genome Browser zooms and scrolls over chromosomes, showing the work of annotators worldwide.
  • 21. § Blat quickly maps your sequence to the genome. BLAT is not BLAST ! BLAT works by keeping an index of the entire genome in memory. The index consists of all non-overlapping DNA 11-mers or protein 4-mers. The index is used to find areas of probable homology, which are then loaded into memory for a detailed alignment. BLAT on DNA can quickly find sequences of 95% and greater similarity of length 40 bases or more. BLAT on proteins finds sequences of 80% and greater similarity of length 20 amino acids or more.
  • 22. § The Table Browser provides convenient access to the underlying database.
  • 23. § The Gene Sorter displays a sorted table of genes that are related to one another. The relationship can be one of several types, including protein- level homology, similarity of gene expression profiles, or genomic proximity.
  • 24. § In-Silico PCR searches a sequence database with a pair of PCR primers, using an indexing strategy for fast performance. § When successful, the search returns a file (fasta) containing all sequences in the database that lie between and include the primer pair.
  • 25. § Genome Graphs is a tool for displaying genome-wide data sets such as the results of genome-wide SNP association studies, linkage studies and homozygosity mapping.
  • 26. § Galaxy allows you to do analyses you cannot do anywhere else without the need to install or download anything. § You can analyze multiple alignments, compare genomic annotations and much more...
  • 27. § VisiGene lets you browse through a large collection of in situ mouse and frog images.
  • 28. § The Proteome Browser provides a wealth of protein information presented in the form of graphical images of tracks and histograms and links to other sites.
  • 29. § The Utilities page contains links to some tools created by the UCSC Genome Bioinformatics Group. § DNA Duster & Protein Duster remove non-sequence related characters from an input sequence.
  • 30. § The Utilities page contains links to some tools created by the UCSC Genome Bioinformatics Group. § DNA Duster & Protein Duster remove non-sequence related characters from an input sequence.
  • 31.
  • 32. Clade – Genome - Assembly
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 44.
  • 45.
  • 46. Navigation: position control § Click the zoom in and zoom out buttons on top to zoom in or out 1.5, 3 or 10-fold on the center of the window
  • 47. Navigation: position control § Zoom in 3-fold by clicking anywhere on the base position track § Zoom to a specific region using “drag and zoom”
  • 48. Navigation: position control § To scroll the view of the display horizontally by set increments of 10%, 50% or 95% of the displayed size (as given in base pairs) click the corresponding move arrow
  • 49. Navigation: position control § To scroll the left of right side by a specified number of vertical gridlines while keeping the opposite side fixed click the appropriate move start or move end arrow
  • 50. Navigation: position control § To display a (completely) different position enter the new location in the position/search text box § You can also jump to an other gene location
  • 52. HIDE = removes a track from view FULL = each item on a separate line
  • 53. DENSE = all items collapsed into single line SQUISH = all items on several lines PACKED and at 50% height PACK = each item separate and efficiently stacked (full height)
  • 55. Annotation Tracks § Different genome/assembly => different tracks!
  • 63. Annotation Tracks § Now try to change the tracks as follows
  • 66. direction of transcription UTR EXON EXON INTRON
  • 67.
  • 68.
  • 69.
  • 78. Browser graphics in PDF TABLE GET CURRENT BROWSER DNA BROWSER GRAPHIC IN PDF TO GET OTHER CLICK DATA LINE
  • 79. 1 CURRENT BROWSER GRAPHIC IN PDF TO GET OTHER DATA
  • 80.
  • 81.
  • 82. Exercises (I) 1) Search for your gene of interest on Human Feb. 2009 (GRCh37/hg19) Assembly » Include 1000 base pairs up- and downstream » Only show the tracks: RefSeq Genes (pack) Conservation (full, primates only) » Save graphical view as PDF (exercises1_1)
  • 83. Exercises (I) 2) How many transcripts are there? » Compare UCSC Genes with RefSeq and Ensembl genes! » Save graphical view as PDF (exercises1_2)
  • 84. Exercises (I) 3) What are the flanking genes? Are these conserved outside mammals? » Zoom out until you can see at least two or three flanking genes (may need to hide some tracks, leave RefSeq on) » Now have a look in the chicken genome » Save graphical view as PDF (exercises1_3a en exercises1_3b)
  • 85. Exercises (I) 4) Is there any regulatory information available? » Change the view to see the genomic region upstream (exon 1 and ~2000 upstream) and open some regulatory tracks e.g. ORegAnno, TFBS Conserved, TS miRNA sites » Save graphical view as PDF (exercises1_4)