Difference between revisions of "MAKER"

From GMOD
Jump to: navigation, search
 
(47 intermediate revisions by 6 users not shown)
Line 1: Line 1:
{{ImageCenter|MAKERLogo.png|MAKER Logo||http://www.yandell-lab.org/software/maker.html}}
+
<!-- to alter this page, please edit the raw data, which is stored at http://gmod.org/wiki/MAKER/tool_data -->
  
__NOTITLE__
+
{{Tool data
MAKER is a portable and easy to configure genome [[:Category:Annotation|annotation]] pipeline. It's purpose is to allow smaller ''eukaryotic'' genome projects and (as of August 2009) ''prokaryotic'' genome projects to annotate their genomes and to create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ''ab initio'' gene predictions and automatically synthesizes these data into gene annotations having evidence-based quality values. MAKER is also easily trainable: outputs of preliminary runs can be used to automatically retrain its gene prediction algorithm, producing higher quality gene-models on subsequent runs. MAKER's inputs are minimal.  It outputs are in [[GFF3]] or [[Glossary#FASTA|FASTA]] format, and can be directly loaded into [[Chado]], [[GBrowse]], and [[Apollo]].
+
|use template=yes
 +
|name=MAKER
 +
|full name=
 +
|status=mature
 +
|dev=active
 +
|support=active
 +
|type=Genome Annotation
 +
|platform=linux, os x, unix
 +
|about=MAKER is an easy-to-configure, portable genome [[:Category:Annotation|annotation]] pipeline. MAKER allows smaller ''eukaryotic'' genome projects and ''prokaryotic'' genome projects to annotate their genomes and to create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ''ab initio'' gene predictions and automatically synthesizes these data into gene annotations with evidence-based quality values. MAKER is also easily trainable: outputs of preliminary runs can be used to automatically retrain its gene prediction algorithm, producing higher quality gene-models on subsequent runs. MAKER's inputs are minimal.  Its outputs are in [[GFF3]] or [[Glossary#FASTA|FASTA]] format, and can be directly loaded into [[Chado]], [[GBrowse]], [[JBrowse]], [[WebApollo]] or [[Apollo]].
  
Additional MAKER options/capabilities include:
+
For those annotating novel plant genomes, the [http://www.yandell-lab.org/software/maker-p.html MAKER-P pipeline] is designed to make the annotation tractable for small groups with limited bioinformatics experience and resources, and faster and more transparent for large groups with more experience and resources. The MAKER-P pipeline generates species-specific repeat libraries, as well as structural annotations of protein coding genes, non-coding RNAs, and pseudogenes. The [http://www.yandell-lab.org/software/maker-p.html MAKER-P website] has more information on the additional protocols for plant genome annotation.
*Map old annotation sets onto new assemblies.
+
*Merge multiple legacy annotation sets into a consensus set of annotations.
+
*Update existing annotations to take new evidence into account.
+
*Tag pre-existing gene models with evidence alignments and quality control metrics to assist in downstream manual curation.
+
*Use GFF3 pass-through to include both evidence alignments and predicted gene models from algorithms not natively supported by MAKER.
+
*MAKER is MPI capable for rapid parallelization across computer clusters.
+
*You can also easily integrate raw InterProScan results into MAKER, which will identify protein domains, add GO functional categories, and help assign putative gene functions to genome annotations. This data then becomes accessible as part of the GFF3 output and can be loaded into a [[Chado]] or [[GBrowse]].
+
  
 +
Additional MAKER options and capabilities include:
 +
* Map old annotation sets on to new assemblies.
 +
* Merge multiple legacy annotation sets into a consensus set of annotations.
 +
* Update existing annotations to take new evidence into account.
 +
* Tag pre-existing gene models with evidence alignments and quality control metrics to assist in downstream manual curation.
 +
* Use GFF3 pass-through to include both evidence alignments and predicted gene models from algorithms not natively supported by MAKER.
 +
* MAKER is <abbr title="message-passing interface">MPI</abbr>-capable for rapid parallelization across computer clusters.
 +
* You can also easily integrate raw InterProScan results into MAKER, which will identify protein domains; add GO functional categories; and help assign putative gene functions to genome annotations. This data then becomes accessible as part of the GFF3 output and can be loaded into a [[Chado]] database, [[JBrowse]], [[GBrowse]], [[WebApollo]] or [[Apollo]].
  
== Annotations ==
+
MAKER comes with sample data for testing purposes.  See the <tt>/data</tt> directory in the download.
{| cellpadding="5px"
+
|release date=2008
|-
+
|open source=Caveats apply
| valign="top" rowspan="2"| The MAKER genome annotation pipeline generates several different types of annotations, including
+
|licence=[http://opensource.org/licenses/gpl-license GNU GPL (for academics)], [http://opensource.org/licenses/artistic-license-2.0 Artistic License 2.0 (for academics)], custom license for commercial usage
* ''ab-initio'' gene predictions from both SNAP and MAKER
+
|language=Perl
* EST alignments from both EXONERATE and BLASTN
+
|input=[[GFF3]], [[Glossary#FASTA|FASTA]]
* Protein alignments from EXONERATE and BLASTX
+
|output=[[GFF3]], [[Glossary#FASTA|FASTA]]
* Repeats
+
|audience=private
| valign="top" style="border: 1px solid gray" | [[Image:MAKERAnnotations500x301.jpg | border ]]
+
|logo=MAKERLogo.png
|-
+
|screenshot=[[File:MAKER_Apollo_view.jpg|none|thumb|500px|View of [[MAKER]] annotations in the [[Apollo]] genome annotation curation tool. Supporting evidence is shown in the upper dark panel. Gene annotations are shown in the blue panel.]]
|valign="top" align="center" |MAKER generated annotations, shown in [[Apollo]].
+
|survey link=MAKER
|}
+
|mail={{MailingListsFor|MAKER}}
  
== Requirements ==
+
Support for MAKER is provided on a [http://yandell-lab.org/mailman/listinfo/maker-devel_yandell-lab.org mailing list], and through a Trac-based [http://malachite.genetics.utah.edu/projects/maker/wiki wiki], [http://malachite.genetics.utah.edu/projects/maker/report bug tracker] and [http://malachite.genetics.utah.edu/projects/maker/browser code browser].
 
+
|contact email=[mailto:carson.holt@genetics.utah.edu Carson Holt], [bmoore@genetics.utah.edu Barry Moore]
* Perl 5.8.0 or higher
+
|papers=
 +
*MAKER-P: a tool-kit for the rapid creation, management, and quality control of plant genome annotations. <ref name=PMID:24306534 />
 +
*MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. <ref name=PMID:22192575 />
 +
*MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. <span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.genre=article&amp;rft_id=info%3Apmid%2F18025269">PMID:18025269</span> <ref name=PMID:18025269/>
 +
|presentations=* [[:Image:MAKERSMBE2009.pdf|MAKER—the easy-to-use genome annotation pipeline]], presented by Mark Yandell at the [http://ccg.biology.uiowa.edu/smbe/symposia.php?action=view&sym_ID=27 SMBE 2009 GMOD Workshop].
 +
|tutorials=;[[MAKER Tutorial]]
 +
:taught as part of the [[2013 GMOD Summer School]]
 +
[https://pods.iplantcollaborative.org/wiki/display/sciplant/MAKER-P+at+iPlant MAKER at iPlant]
 +
:how to access MAKER using Atmosphere cloud image and TACC Lonestar
 +
|getting started preamble=We recommend that users read the MAKER paper [below] and the README file included with the download before installing and using MAKER.
 +
|req=* Perl 5.8.0 or higher
 
* [[BioPerl]] 1.6 or higher.
 
* [[BioPerl]] 1.6 or higher.
* [http://blast.wustl.edu WU-BLAST 2.0] or [http://www.ncbi.nlm.nih.gov/ NCBI-BLAST 2.2.X]
+
* [http://blast.wustl.edu WU-BLAST] 2.0 or higher or [http://www.ncbi.nlm.nih.gov/ NCBI-BLAST] 2.2.X or higher
* [http://homepage.mac.com/iankorf SNAP version 2008-04-10].  Also works with version 2006-07-28.
+
* [http://www.repeatmasker.org/ RepeatMasker] 3.1.6 or higher
* [http://www.repeatmasker.org/ RepeatMasker 3.1.6]
+
 
** RepeatMasker requires a repeat library, available from [http://www.girinst.org/repbase/index.html Repbase].
 
** RepeatMasker requires a repeat library, available from [http://www.girinst.org/repbase/index.html Repbase].
* [http://www.ebi.ac.uk/~guy/exonerate Exonerate 1.4].  MAKER also appears to work with Exonerate 2.0, but this has not yet been thoroughly tested.
+
* [http://www.animalgenome.org/bioinfo/resources/manuals/exonerate/ Exonerate] 1.4 or higher.
  
=== mpi_maker ===
+
Optional extras:
 +
* [http://korflab.ucdavis.edu/software.html SNAP] version 2009-02-03 or higher (for eukaryotic genomes).
 +
* [http://augustus.gobics.de Augustus] 2.0 or higher (for eukaryotic genomes)
 +
* [http://exon.biology.gatech.edu GeneMark-ES] (for eukaryotic genomes)
 +
* [http://www.softberry.com FGENESH] 2.6 or higher - requires licence (for eukaryotic genomes)
 +
* [http://exon.biology.gatech.edu GeneMarkS] (for prokaryotic genomes)
 +
|install=Please see the [http://www.yandell-lab.org/software/maker_install.html installation guide] or the README file in the download for instructions.
 +
|config=
 +
|doc=By default, MAKER runs on a single computer. A parallel version, mpi_maker, is also available. To run mpi_maker you will need a [http://en.wikipedia.org/wiki/Message_Passing_Interface message passing interface (MPI)] package installed on all participating computers; try [http://www.mcs.anl.gov/research/projects/mpich2/ MPICH2].  Remember to install MPICH2 with the <tt>--enable-sharedlibs</tt> flag set to the appropriate value .See the [http://www.mcs.anl.gov/research/projects/mpich2/documentation/index.php?s=docs MPICH2 Installer's Guide] for more information.
  
MAKER, by default runs on a single computer.  A parallel version, mpi_maker, is also available.  To run mpi_maker you will need an [http://en.wikipedia.org/wiki/Message_Passing_Interface MPI] package installed on all participating computers.  Try [http://www.mcs.anl.gov/research/projects/mpich2/ MPICH2].  Remember to install MPICH2 with the <tt>--enable-sharedlibs</tt> flag set to the appropriate value (See the[http://www.mcs.anl.gov/research/projects/mpich2/documentation/index.php?s=docs MPICH2 Installer's Guide].)
+
See also the [http://www.yandell-lab.org/software/maker-p.html MAKER-P website] for documentation on the additional pipeline steps for annotating plant genomes.
  
== Documentation ==
 
  
MAKER documentation is available from these sources:
+
=== Annotations ===
* [http://www.genome.org/cgi/content/full/18/1/188 MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes], by Brandi L. Cantarel, Ian Korf, Sofia M.C. Robb, Genis Parra, Eric Ross, Barry Moore, Carson Holt, Alejandro, Sánchez Alvarado and Mark Yandell. 2008 18: 188-196; Genome Res. (requires subscription)
+
* README file in the download
+
  
Read both the paper and the README before installing MAKER.
+
The MAKER genome annotation pipeline generates several different types of annotations, including
 +
* ''Ab initio'' gene predictions from SNAP, Augustus, FGENESH, and GeneMark
 +
* Final gene models from MAKER
 +
* EST alignments from both EXONERATE and BLASTN
 +
* Protein alignments from EXONERATE and BLASTX
 +
* Repeats from RepeatMasker and the MAKER internal RepeatRunner
  
=== Presentations ===
+
|dev ppl=Yandell Lab, Utah; Ian Korf (University of California Davis)
 +
|dev status=
 +
|dl tracking=
 +
|logo info=
 +
|see also=Almost all the ant genomes at the [http://hymenopteragenome.org/ant_genomes/ Ant Genomes Portal] were independently annotated with MAKER.
 +
|extra=
 +
}}
  
* [[:Image:MAKERSMBE2009.pdf|MAKER—the easy-to-use genome annotation pipeline]], presented by Mark Yandell at the [http://ccg.biology.uiowa.edu/smbe/symposia.php?action=view&sym_ID=27 SMBE 2009 GMOD Workshop].
+
{{SemanticLink
 +
|linkurl=http://www.yandell-lab.org/software/maker.html
 +
|linktype=download
 +
}}
  
== Downloads ==
+
{{SemanticLink
 
+
|linkurl=http://yandell.topaz.genetics.utah.edu/cgi-bin/maker_license.cgi
A compressed TAR file is available at http://www.yandell-lab.org/software/maker.html.
+
|linktype=source code
 
+
}}
== Demo ==
+
 
+
MAKER comes with sample data for testing purposes.  See the <tt>/data</tt> directory in the download.
+
  
Since MAKER is a command driven pipeline there are no screenshots or online demos available.
+
{{SemanticLink
 +
|linkurl=http://www.yandell-lab.org/software/maker.html
 +
|linktype=website
 +
}}
  
== Support ==
+
{{SemanticLink
 +
|linkurl=https://www.e-biogenouest.org/resources/48
 +
|linktype=publication
 +
|linktitle=A beginner's guide to eukaryotic genome annotation
 +
|linkdesc=Review article on eukaryotic genome annotation from the MAKER team.
 +
}}
  
Support for MAKER is provided on a [http://yandell-lab.org/mailman/listinfo/maker-devel_yandell-lab.org mailing list], and through a [http://trac.edgewall.org/ Trac] based [http://malachite.genetics.utah.edu/projects/maker/wiki wiki], [http://malachite.genetics.utah.edu/projects/maker/report bug tracker] and [http://malachite.genetics.utah.edu/projects/maker/browser code browser].
+
{{SemanticLink
 +
|linkurl=http://www.yandell-lab.org/software/maker-p.html
 +
|linktype=other
 +
|linktitle=MAKER-P
 +
|linkdesc=MAKER with additional software for plant genome annotation
 +
}}
  
== Contact ==
+
{{SemanticLink
 +
|linkurl=http://derringer.genetics.utah.edu/cgi-bin/MWAS/maker.cgi
 +
|linktype=public server
 +
|linktitle=MAKER Web Annotation Server
 +
|linkdesc=online version of the MAKER tool
 +
}}
  
Mark Yandell <[mailto:myandell@genetics.utah.edu myandell@genetics.utah.edu]>
+
{{SemanticLink
 +
|linkurl=http://hymenopteragenome.org/ant_genomes/
 +
|linktype=wild urls
 +
|linktitle=Ant Genomes Portal
 +
|linkdesc=The ant genomes were annotated using MAKER.
 +
}}
  
 
[[Category:GMOD Components]]
 
[[Category:GMOD Components]]
 
[[Category:Annotation]]
 
[[Category:Annotation]]
 
[[Category:MAKER]]
 
[[Category:MAKER]]

Latest revision as of 21:53, 2 February 2014


MAKER logo
Status
  • Mature release
  • Development: active
  • Support: active
Licence


GNU GPL (for academics), Artistic License 2.0 (for academics), custom license for commercial usage

Resources


About MAKER

MAKER is an easy-to-configure, portable genome annotation pipeline. MAKER allows smaller eukaryotic genome projects and prokaryotic genome projects to annotate their genomes and to create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab initio gene predictions and automatically synthesizes these data into gene annotations with evidence-based quality values. MAKER is also easily trainable: outputs of preliminary runs can be used to automatically retrain its gene prediction algorithm, producing higher quality gene-models on subsequent runs. MAKER's inputs are minimal. Its outputs are in GFF3 or FASTA format, and can be directly loaded into Chado, GBrowse, JBrowse, WebApollo or Apollo.

For those annotating novel plant genomes, the MAKER-P pipeline is designed to make the annotation tractable for small groups with limited bioinformatics experience and resources, and faster and more transparent for large groups with more experience and resources. The MAKER-P pipeline generates species-specific repeat libraries, as well as structural annotations of protein coding genes, non-coding RNAs, and pseudogenes. The MAKER-P website has more information on the additional protocols for plant genome annotation.

Additional MAKER options and capabilities include:

  • Map old annotation sets on to new assemblies.
  • Merge multiple legacy annotation sets into a consensus set of annotations.
  • Update existing annotations to take new evidence into account.
  • Tag pre-existing gene models with evidence alignments and quality control metrics to assist in downstream manual curation.
  • Use GFF3 pass-through to include both evidence alignments and predicted gene models from algorithms not natively supported by MAKER.
  • MAKER is MPI-capable for rapid parallelization across computer clusters.
  • You can also easily integrate raw InterProScan results into MAKER, which will identify protein domains; add GO functional categories; and help assign putative gene functions to genome annotations. This data then becomes accessible as part of the GFF3 output and can be loaded into a Chado database, JBrowse, GBrowse, WebApollo or Apollo.

MAKER comes with sample data for testing purposes. See the /data directory in the download.


Visit the MAKER website.

Screenshots

View of MAKER annotations in the Apollo genome annotation curation tool. Supporting evidence is shown in the upper dark panel. Gene annotations are shown in the blue panel.

Downloads

Using MAKER

We recommend that users read the MAKER paper [below] and the README file included with the download before installing and using MAKER.

System Requirements

Optional extras:

  • SNAP version 2009-02-03 or higher (for eukaryotic genomes).
  • Augustus 2.0 or higher (for eukaryotic genomes)
  • GeneMark-ES (for eukaryotic genomes)
  • FGENESH 2.6 or higher - requires licence (for eukaryotic genomes)
  • GeneMarkS (for prokaryotic genomes)

Installation

Please see the installation guide or the README file in the download for instructions.

Documentation

By default, MAKER runs on a single computer. A parallel version, mpi_maker, is also available. To run mpi_maker you will need a message passing interface (MPI) package installed on all participating computers; try MPICH2. Remember to install MPICH2 with the --enable-sharedlibs flag set to the appropriate value .See the MPICH2 Installer's Guide for more information.

See also the MAKER-P website for documentation on the additional pipeline steps for annotating plant genomes.


Annotations

The MAKER genome annotation pipeline generates several different types of annotations, including

  • Ab initio gene predictions from SNAP, Augustus, FGENESH, and GeneMark
  • Final gene models from MAKER
  • EST alignments from both EXONERATE and BLASTN
  • Protein alignments from EXONERATE and BLASTX
  • Repeats from RepeatMasker and the MAKER internal RepeatRunner

Publications, Tutorials, and Presentations

Publications on or mentioning MAKER

  • MAKER-P: a tool-kit for the rapid creation, management, and quality control of plant genome annotations. [1]
  • MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. [2]
  • MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. PMID:18025269 [3]

Tutorials

MAKER Tutorial
taught as part of the 2013 GMOD Summer School

MAKER at iPlant

how to access MAKER using Atmosphere cloud image and TACC Lonestar

Presentations

Contacts and Mailing Lists

Mailing List Link Description Archive(s)
MAKER maker-devel MAKER developers and users list. Google, Nabble (2010/05+)

Support for MAKER is provided on a mailing list, and through a Trac-based wiki, bug tracker and code browser.


MAKER Development

Development team

Yandell Lab, Utah; Ian Korf (University of California Davis)

See also

Almost all the ant genomes at the Ant Genomes Portal were independently annotated with MAKER.

More on MAKER

See Category:MAKER


  1. Cite error: Invalid <ref> tag; no text was provided for refs named PMID:24306534
  2. Cite error: Invalid <ref> tag; no text was provided for refs named PMID:22192575
  3. Cite error: Invalid <ref> tag; no text was provided for refs named PMID:18025269

"wild urls" is not in the list of possible values (website, download, source code, public server, demo server, wild URL, documentation, publication, social media, other) for this property.

Facts about "MAKER"RDF feed
Available on platformlinux +, os x + and unix +
Has URLhttp://www.yandell-lab.org/software/maker.html +, http://yandell.topaz.genetics.utah.edu/cgi-bin/maker_license.cgi +, https://www.e-biogenouest.org/resources/48 +, http://www.yandell-lab.org/software/maker-p.html +, http://derringer.genetics.utah.edu/cgi-bin/MWAS/maker.cgi + and http://hymenopteragenome.org/ant_genomes/ +
Has descriptionMAKER is an easy-to-configure, portable geMAKER is an easy-to-configure, portable genome annotation pipeline. MAKER allows smaller eukaryotic genome projects and prokaryotic genome projects to annotate their genomes and to create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab initio gene predictions and automatically synthesizes these data into gene annotations with evidence-based quality values. MAKER is also easily trainable: outputs of preliminary runs can be used to automatically retrain its gene prediction algorithm, producing higher quality gene-models on subsequent runs. MAKER's inputs are minimal. Its outputs are in GFF3 or FASTA format, and can be directly loaded into Chado, GBrowse, JBrowse, WebApollo or Apollo.

For those annotating novel plant genomes, the MAKER-P pipeline is designed to make the annotation tractable for small groups with limited bioinformatics experience and resources, and faster and more transparent for large groups with more experience and resources. The MAKER-P pipeline generates species-specific repeat libraries, as well as structural annotations of protein coding genes, non-coding RNAs, and pseudogenes. The MAKER-P website has more information on the additional protocols for plant genome annotation.

Additional MAKER options and capabilities include:

  • Map old annotation sets on to new assemblies.
  • Merge multiple legacy annotation sets into a consensus set of annotations.
  • Update existing annotations to take new evidence into account.
  • Tag pre-existing gene models with evidence alignments and quality control metrics to assist in downstream manual curation.
  • Use GFF3 pass-through to include both evidence alignments and predicted gene models from algorithms not natively supported by MAKER.
  • MAKER is MPI-capable for rapid parallelization across computer clusters.
  • You can also easily integrate raw InterProScan results into MAKER, which will identify protein domains; add GO functional categories; and help assign putative gene functions to genome annotations. This data then becomes accessible as part of the GFF3 output and can be loaded into a Chado database, JBrowse, GBrowse, WebApollo or Apollo.
MAKER comes with sample data for testing purposes. See the /data directory in the download.
/data directory in the download. +, Review article on eukaryotic genome annotation from the MAKER team. +, MAKER with additional software for plant genome annotation +, online version of the MAKER tool + and The ant genomes were annotated using MAKER. +
Has development statusactive +
Has input formatGFF3 + and FASTA +
Has licenceGNU GPL (for academics) +, Artistic License 2.0 (for academics) + and custom license for commercial usage +
Has logoMAKERLogo.png +
Has output formatGFF3 + and FASTA +
Has software maturity statusmature +
Has support statusactive +
Has titleA beginner's guide to eukaryotic genome annotation +, MAKER-P +, MAKER Web Annotation Server + and Ant Genomes Portal +
Has topicMAKER +
Is open sourceCaveats apply +
Link typedownload +, source code +, website +, publication +, other + and public server +
Release date2008 +
Tool functionality or classificationGenome Annotation +
Written in languagePerl +
Has subobjectThis property is a special property in this wiki.MAKER#http://www.yandell-lab.org/software/maker.html +, MAKER#http://yandell.topaz.genetics.utah.edu/cgi-bin/maker_license.cgi +, MAKER#https://www.e-biogenouest.org/resources/48 +, MAKER#http://www.yandell-lab.org/software/maker-p.html +, MAKER#http://derringer.genetics.utah.edu/cgi-bin/MWAS/maker.cgi + and MAKER#http://hymenopteragenome.org/ant_genomes/ +