NOTE: We are working on migrating this site away from MediaWiki, so editing pages will be disabled for now.
Difference between revisions of "GBrowse2 REST API"
(→Getting Tracks from a Particular Datasource) |
(→Downloading a Track) |
||
(7 intermediate revisions by the same user not shown) | |||
Line 96: | Line 96: | ||
:The value of the '''track source''' option, which can be used to identify a mirrored track. | :The value of the '''track source''' option, which can be used to identify a mirrored track. | ||
;select | ;select | ||
− | :A space-delimited list of subtrack selection names. These can be used to selectively turn on particular subtracks using the syntax described in [[# | + | :A space-delimited list of subtrack selection names. These can be used to selectively turn on particular subtracks using the syntax described in [[#Selecting Tracks and Subtracks]]. |
+ | |||
+ | ===Getting Chromosome Sizes=== | ||
+ | |||
+ | To get the list of chromosomes and their sizes from a datasource, invoke the gbrowse URL with "action=chrom_sizes": | ||
+ | |||
+ | http://somewhere.com/gb2/gbrowse/SOURCE/?action=chrom_sizes | ||
+ | |||
+ | You will get a text file like the following: | ||
+ | |||
+ | ##species http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=6239 | ||
+ | ##genome-build WS180 | ||
+ | II 15279316 | ||
+ | MtDNA 13794 | ||
+ | V 20919568 | ||
+ | X 17718851 | ||
+ | III 13783681 | ||
+ | IV 17493785 | ||
+ | I 15072421 | ||
+ | |||
+ | The first column is the chromosome name. The second column is its size in bases. If the appropriate metadata was provided, the first few lines will contain taxon and genome build information as shown in the example. | ||
+ | |||
+ | ===Downloading a Track=== | ||
+ | |||
+ | To download the data from a track, invoke the gbrowse URL with | ||
+ | |||
+ | $URL/?gbgff=1;l=<trackname> | ||
+ | |||
+ | where <trackname> is the name of a track obtained from "scan" (see above). This will immediately start downloading the track data in the most appropriate format. | ||
+ | |||
+ | To limit the downloaded region to a chromosome: | ||
+ | |||
+ | $URL/?gbgff=1;l=<trackname>;q=chrXX | ||
+ | |||
+ | To limit the downloaded region to a part of a chromosome (example 110000 to 120000: | ||
+ | |||
+ | $URL/?gbgff=1;l=<trackname>;q=chrXX:110000..1200000 | ||
==Generating Static Images== | ==Generating Static Images== | ||
Line 105: | Line 141: | ||
;SOURCE | ;SOURCE | ||
− | :The data source, such as | + | :The data source, such as |
+ | ==Focusing on a Region== | ||
+ | ==Managing Sessions== | ||
+ | |||
+ | A user's state is stored in a cookie named "gbrowse_sess". The session ID is a 64-character hexadecimal string (numerals 0-9 and characters a-f). You can select different session IDs by passing gbrowse the CGI parameter "id", as in: | ||
+ | |||
+ | $HOST/gb2/gbrowse/yeast/?id=134947aacaa27a3c60b06f94e073b6e7 | ||
+ | |||
+ | From the user interface, you can obtain the current session's id by choosing Help->Show My User ID... This will also return the uploads id, which is used for sharing uploaded track data withing sharing the user's entire session. | ||
+ | |||
+ | From within a script, you may capture the ID from the cookie as shown in the [[media:gbrowse_get_sessionid.pl]] example script. You may also obtain the ID by calling GBrowse with "action=get_ids" (version 2.Session ID: 1e7995e8ced0494dcca25af4cee37f69 | ||
+ | Upload ID: efee698db7bc6ebc8a69af04072a7143 | ||
+ | 23 and higher): | ||
+ | |||
+ | $HOST/gb2/gbrowse/yeast/?action=get_ids | ||
+ | |||
+ | This will return a plain text file with the following structure: | ||
+ | |||
+ | Session ID: 1e7995e8ced0494dcca25af4cee37f69 | ||
+ | Upload ID: efee698db7bc6ebc8a69af04072a7143 | ||
+ | |||
+ | |||
+ | ==Centering on a region== | ||
− | |||
==Selecting Tracks and Subtracks== | ==Selecting Tracks and Subtracks== | ||
+ | |||
+ | To select subtracks within a track, use the syntax '''l=trackname/st1+st2+st3''', where ''trackname'' is the name of the track, and ''st1'', ''st2'', and so forth, are the subtrack names as indicated by the "subtrack table" option. Note that subtrack names are separated by spaces. Use "+" or "%20" to escape the URL. The parameter name is "l", short for '''l'''abel. | ||
==Uploading Datasets== | ==Uploading Datasets== |
Latest revision as of 15:29, 27 August 2012
This article describes the GBrowse2 REST API.
For the main GBrowse 2.0 HOWTO article, see: GBrowse 2.0 HOWTO.
Contents
The GBrowse2 REST API
Getting Information from the Server
You can get a list of public data sources from the GBrowse server, and interrogate each one for a list and description of the tracks it supports.
Getting a List of Sources
Invoke gbrowse with "action=list":
http://somewhere.com/gb2/gbrowse/?action=list
This will return a plain text tab-delimited document like the following:
# Name Description Species TaxID CoordinateType BuildAuthority BuildVersion BuildURL elegans C. elegans genes Caenorhabditis elegans 6239 Chromosome WS 180 http://www.dasregistry.org/coordsys/CS_DS109 yeast Yeast chromosomes 1+2 Saccharomyces cerevisiae 4932 Chromosome SGD 1 http://www.dasregistry.org/coordsys/CS_DS69 bamtest BAM/SAM Test homo_36 Human NCI 36i coordinates volvox Volvox Tutorial Database
Each row is a data source. The fields are:
- Name
- Data source name to use in subsequent calls to gbrowse.
- Description
- Human readable text describing the data source
- Species (optional)
- Human readable species name.
- TaxID (optional)
- NCBI taxon ID.
- CoordinateType (optional)
- Description of the type of coordinate, such as "Chromosome" or "Contig"
- BuildAuthority (optional)
- The group that maintains the coordinate system. See http://www.dasregistry.org/help_coordsys.jsp for a list of codes.
- BuildVersion (optional)
- Version of the build.
- BuildURL (optional)
- URL that will provide information about the build (can also be used as a unique ID for the build).
These fields are derived from GBrowse.conf and the "metadata" option in the configuration file for the data source. Also see GBrowse_2.0_HOWTO#The_GENERAL_Section.
Getting Tracks from a Particular Datasource
Given a datasource name returned by "action=list", you may retrieve information about each track that is publicly exported by the datasource. Note that the maintainer of the datasource may elect not to make information on all tracks available. Append the datasource name to the gbrowse URL, and use "action=scan":
http://somewhere.com/gb2/gbrowse/SOURCE/?action=scan
This will return a text/plain document similar to the following:
# Discoverable tracks from http://localhost/gb2/gbrowse/yeast/ [Genes] key = Named gene [CDS] key = CDS citation = This track shows CDS reading frames. [tRNAs] key = tRNAs [Transp] key = Transposons [LTRs] key = Long Terminal Repeats [Translation] key = 6-frame translation [TranslationF] key = 3-frame translation (forward) [DNA GC content] key = DNA and/or GC Content [TranslationR] key = 3-frame translation (reverse) [ncRNA] key = Noncoding RNAs
Each [stanza] contains the track name, and is followed by zero or more option=value pairs. The possible options that can be returned are:
- key
- The human-readable track title.
- citation
- The human-readable track citation.
- data source
- The value of the data source option, which can be used to identify where the track data originated.
- track source
- The value of the track source option, which can be used to identify a mirrored track.
- select
- A space-delimited list of subtrack selection names. These can be used to selectively turn on particular subtracks using the syntax described in #Selecting Tracks and Subtracks.
Getting Chromosome Sizes
To get the list of chromosomes and their sizes from a datasource, invoke the gbrowse URL with "action=chrom_sizes":
http://somewhere.com/gb2/gbrowse/SOURCE/?action=chrom_sizes
You will get a text file like the following:
##species http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=6239 ##genome-build WS180 II 15279316 MtDNA 13794 V 20919568 X 17718851 III 13783681 IV 17493785 I 15072421
The first column is the chromosome name. The second column is its size in bases. If the appropriate metadata was provided, the first few lines will contain taxon and genome build information as shown in the example.
Downloading a Track
To download the data from a track, invoke the gbrowse URL with
$URL/?gbgff=1;l=<trackname>
where <trackname> is the name of a track obtained from "scan" (see above). This will immediately start downloading the track data in the most appropriate format.
To limit the downloaded region to a chromosome:
$URL/?gbgff=1;l=<trackname>;q=chrXX
To limit the downloaded region to a part of a chromosome (example 110000 to 120000:
$URL/?gbgff=1;l=<trackname>;q=chrXX:110000..1200000
Generating Static Images
To generate static images of a region in PNG, SVG or PDF format, invoke gbrowse_img:
$HOST/gb2/gbrowse_img/SOURCE/?ARGS
- SOURCE
- The data source, such as
Focusing on a Region
Managing Sessions
A user's state is stored in a cookie named "gbrowse_sess". The session ID is a 64-character hexadecimal string (numerals 0-9 and characters a-f). You can select different session IDs by passing gbrowse the CGI parameter "id", as in:
$HOST/gb2/gbrowse/yeast/?id=134947aacaa27a3c60b06f94e073b6e7
From the user interface, you can obtain the current session's id by choosing Help->Show My User ID... This will also return the uploads id, which is used for sharing uploaded track data withing sharing the user's entire session.
From within a script, you may capture the ID from the cookie as shown in the media:gbrowse_get_sessionid.pl example script. You may also obtain the ID by calling GBrowse with "action=get_ids" (version 2.Session ID: 1e7995e8ced0494dcca25af4cee37f69 Upload ID: efee698db7bc6ebc8a69af04072a7143 23 and higher):
$HOST/gb2/gbrowse/yeast/?action=get_ids
This will return a plain text file with the following structure:
Session ID: 1e7995e8ced0494dcca25af4cee37f69 Upload ID: efee698db7bc6ebc8a69af04072a7143
Centering on a region
Selecting Tracks and Subtracks
To select subtracks within a track, use the syntax l=trackname/st1+st2+st3, where trackname is the name of the track, and st1, st2, and so forth, are the subtrack names as indicated by the "subtrack table" option. Note that subtrack names are separated by spaces. Use "+" or "%20" to escape the URL. The parameter name is "l", short for label.