GBrowse

From GMOD
Revision as of 14:05, 29 October 2010 by DanBolser (Talk | contribs)

Jump to: navigation, search

The Generic Genome Browser (GBrowse) is a genome viewer and is GMOD's most popular component. For a demo of its features, see WormBase, FlyBase, or Human Genome Segmental Duplication Database and others.

Description

GBrowse is a combination of database and interactive web pages for manipulating and displaying annotations on genomes. Some of its features include:

GBrowse Versions

GBrowse 1.X (currently 1.70) is the older series that has been in use since 2002. It is recommended for applications which use a single database only and which must support legacy browsers.

GBrowse 2.0 is a rewrite of the original GBrowse to add dynamic updating via AJAX and a smoother user experience. In addition, it provides administrators with the ability to attach a different genome database to each GBrowse track, making it much easier to manage and update tracks. It also provides a distributed backend system of "slave" renderers, allowing each track to be rendered in parallel on a different machine and significantly increasing performance. GBrowse 2.0 is considered stable,but does not have full internationalization support. In addition, there may be issues with older browsers that do not support newer JavaScript features. See the GBrowse 2.0 HOWTO.

Installation

GBrowse is Perl-based. It can be installed using the standard Perl module build procedure, or automated using a network-based install script. In order to use the net installer, you will need to have Perl 5.8.6 or higher and the Apache web server installed. See the step-by-step instructions below for detailed instructions:

Documentation

On-line documentation

POD documentation

There are many useful POD documents included with the distribution. These are converted to HTML files when you install the package, and can be found in /gbrowse/docs/pod:

Since these are in Perl POD format these files may contain formatting code when viewed in a Web browser.

Downloads

Source Code Download (tar.gz file)

Download the source from the SourceForge download page.

Net-based Installer Script

The net installer script, located at the GBrowse SVN repository will automatically download and install GBrowse and its Perl libraries for you. See Installation for details on using this script.

SVN

There are often new features and bug fixes in the current development version which have not yet been released. To get the latest version, please use Subversion (SVN). The recommended branch to use is trunk, which is usually stable:

 svn co https://gmod.svn.sourceforge.net/svnroot/gmod/Generic-Genome-Browser/trunk Generic-Genome-Browser

Once you have successfully checked out the Generic-Genome-Browser distribution, fetch recent changes by executing svn update inside the Generic-Genome-Browser directory.

You can also browse the GBrowse SVN.

About Databases

GBrowse has a flexible adaptor (yes, it is spelled that way and is not "adapter") system for running off various types of databases/sources. A common question is "which adaptor should I be using?" This attempts to answer that question.

Adaptor Other required software Roughly how many users Pros Cons
Bio::DB::SeqFeature::Store (use bp_seqfeature_load.pl) MySQL, PostgreSQL, SQLite, BerkeleyDB Many and growing fast. Roughly 4X faster than Bio::DB::GFF for the same data; designed to work with GFF3 Developed for use with GFF3; about 2X slower than Bio::DB::GFF to load a database
Bio::DB::GFF (use bp_load_gff.pl, bp_bulk_load_gff.pl, bp_fast_load_gff.pl) A relational database server: MySQL, PostgreSQL, Oracle, or BerkeleyDB Lots! (Especially MySQL) Quite fast; large user base; Have to use this if your data is in the (now deprecated) GFF2 format. Does not work well with GFF3 formatted data
Bio::DB::Sam (available from CPAN) SAMtools Growing (particularly with GBrowse2) Very fast access to NextGen sequencing data Difficult to use with GBrowse 1.70
Bio::DB::BigWig and Bio::DB::BigWigSet (available from CPAN) UCSC Formats Growing (particularly with GBrowse2) Very fast access to data in bigWig format Difficult to use with GBrowse 1.70
Bio::DB::BigBed (available from CPAN) UCSC Formats Growing (particularly with GBrowse2) Very fast access to data in bigBed format Difficult to use with GBrowse 1.70
Bio::DB::Das::Chado (available from CPAN) PostgreSQL and a Chado schema Relatively few due to the specialized nature of Chado Allows 'live' viewing of the features in a Chado database Slow compared to Bio::DB::GFF
Bio::DB::Das::BioSQL (available from CPAN) MySQL and a BioSQL schema Relatively few due to the small number of BioSQL users Allows 'live' viewing of the features in a BioSQL database Slow compared to Bio::DB::GFF
Memory (ie, flat file database using either Bio::DB::GFF or SeqFeature::Store) None For real servers, none Easy for rapid development and testing Very slow for more than a few thousand features
LuceGene Lucene (searches indexed flat files) Relatively few

Email Threads

There have been some useful email threads on adaptor choices and tradeoffs.

Contacts

Please report bugs to the SourceForge Bug Tracker (select 'Category: Gbrowse').

Mailing List Link Description Archive(s)
GBrowse & GBrowse_syn gmod-gbrowse GBrowse and GBrowse_syn users and developers. Gmane, Nabble (2010/05+), Sourceforge
gmod-gbrowse-cmts Code updates. Sourceforge

The GBrowse logo was created by Alex Read, a participant in the Spring 2010 Logo Program, while a design student at Linn-Benton Community College.

References

Retrieved from "http://gmod.org/mediawiki/index.php?title=GBrowse&oldid=14930"