Difference between revisions of "GSoC"

From GMOD
Jump to: navigation, search
(About Genome Informatics GSoC)
m (Google Summer of Code 2019 @ Open Genome Informatics)
(260 intermediate revisions by 20 users not shown)
Line 1: Line 1:
==Welcome to the Genome Informatics GSoC==
+
[[File:GoogleSummer_2016logo.jpg|373px|right|link=GSoC]]
''“Google Summer of Code (GSoC) is a global program that offers student developers stipends to write code for various open source software projects. We have worked with several open source, free software, and technology-related groups to identify and fund several projects over a three month period. Since its inception in 2005, the program has brought together over 4,500 students and more than more than 4,000 mentors & co-mentors from over 85 countries worldwide, all for the love of code. Through Google Summer of Code, accepted student applicants are paired with a mentor or mentors from the participating projects, thus gaining exposure to real-world software development scenarios and the opportunity for employment in areas related to their academic pursuits. In turn, the participating projects are able to more easily identify and bring in new developers. Best of all, more source code is created and released for the use and benefit of all.”''<br>
+
  
GSoC has several goals:
+
== Google Summer of Code 2019 @ Open Genome Informatics ==
  
*get more open source code created and released for the benefit of all
+
'''[https://summerofcode.withgoogle.com/ Google Summer of Code]''' is a global program that offers student developers stipends to write code for various open source software projects. We work with many open source, free software, and technology-related groups to identify and fund projects over a three month period. Since its inception in 2005, the program has brought together over 14,000 successful student participants from 118 countries, 651 open source organizations, and over 35 million lines of code. Through Google Summer of Code, accepted student applicants are paired with a mentor or mentors from the participating projects, thus gaining exposure to real-world software development scenarios and the opportunity for employment in areas related to their academic pursuits. In turn, the participating projects are able to more easily identify and bring in new developers. Best of all, more source code is created and released for the use and benefit of all. (''Excerpt from the [https://summerofcode.withgoogle.com/ Google Summer of Code website]'')
*inspire young developers to begin participating in open source development
+
*help open source projects identify and bring in new developers and committers
+
*provide students the opportunity to do work related to their academic pursuits during the summer
+
*give students more exposure to real-world software development scenarios.
+
  
[http://socghop.appspot.com Google Summer of Code (GSoC)]
+
Since 2011, the Open Genome Informatics group has served as an "umbrella organization" to a variety of bioinformatics projects, including [[Main Page|GMOD]] and its software projects -- [[JBrowse]], [[Apollo]], [[Chado]], [[Galaxy]] etc.; [http://www.informatics.jax.org/ Mouse Genome Informatics]; [https://oicr.on.ca/research-portfolio/ OICR]; [http://www.reactome.org Reactome]; [http://www.wormbase.org WormBase]; and [https://bioconda.github.io/ Bioconda].
  
====About Genome Informatics GSoC====
+
'''More information about this year's participating bioinformatics groups can be found [[GSOC_Groups | here]].'''
The Genome Infomatics group is organizing the joint efforts of WormBase, Reactome, and GBrowse (see below). This is a great opportunity for students to contribute to the work of three different bioinformatics projects.<br>
+
  
;'''[http://www.wormbase.org Wormbase]''' : An online bioinformatics database of the biology and genome of the model organism Caenorhabditis elegans and related nematodes. It is used by the C. elegans research community both as an information resource and as a mode to publish and distribute their results. The database is constantly updated and new versions are released on a monthly basis. WormBase is a collaboration among the Wellcome Trust Sanger Institute, Ontario Institute for Cancer Research, Washington University in St. Louis, and the California Institute of Technology. Links: [http://www.wormbase.org Website].
+
To learn more about this year's event and how GSoC works, please refer to the [https://developers.google.com/open-source/gsoc/faq FAQ].
  
;'''[http://www.reactome.org Reactome]''' : A manually curated database of core pathways and reactions in human biology that functions as a data mining resource and electronic textbook. The Reactome data model describes diverse processes in the human system, including the pathways of intermediary metabolism, regulatory pathways, signal transduction, and high-level processes, such as the cell cycle. Reactome software uses only freely available (and often open source) components and has been created with cross-platform compatibility and wide usability in mind. Data is stored in a MySQL database, the web site is implemented in Perl and data entry tool in Java programming language. The Reactome team is composed of individuals who are both biologists and programmers at the Ontario Institute for Cancer Research, New York University Langone Medical Center, Cold Spring Harbor Laboratory, and The European Bioinformatics Institute. Links: [http://www.reactome.org Website], [http://wiki.reactome.org ReactomeWiki ].
+
==Mailing lists, IRC, and other ways to get in touch  ==
  
;'''[http://www.gmod.org Generic Model Organism Database (GMOD)]''' : An open source project to develop a complete set of software for creating and administering a model organism database. Components of this project include genome visualization and editing tools, literature curation tools, a robust database schema, biological ontology tools, and a set of standard operating procedures. This project is collaboration of several database projects, including WormBase, FlyBase, Mouse Genome Informatics, Gramene, the Rat Genome Database, TAIR, EcoCyc, and the Saccharomyces Genome Database. Links: [http://www.gmod.org Website], [http://blog.gmod.org GMOD Blog]
+
*Email: [mailto:robin.haw@oicr.on.ca robin.haw@oicr.on.ca] '''and''' [mailto:help@gmod.org help@gmod.org] -- find out more about GSoC, a specific project, or your potential mentor(s).
 
+
====Contact Us====
+
*Email: robin.haw[AT]oicr.on.ca - contact me to find out more about a project or your potential mentor(s).
+
 
*Discussion mailing lists: [http://groups.google.com/group/genome-informatics Genome Informatics Google Groups] - ask about our projects; join the community!
 
*Discussion mailing lists: [http://groups.google.com/group/genome-informatics Genome Informatics Google Groups] - ask about our projects; join the community!
 +
*IRC channel: #genomeinformatics on Freenode.
 +
* Students and Mentors can email both [[User:Robin.haw|Robin]] and [[User:Scott|Scott]] to get more information about the program.
  
====How to apply====
+
== [[GSOC_Project_Ideas_2019 | Project Ideas]] ==
We would like to know who you are and how you think. Incorporate the following into your application:
+
 
+
*Your information
+
**Name, email, and website (optional)
+
*Brief background: education and relevant work experience
+
*Your programming interests and strengths
+
**What are your languages of choice?
+
**Any prior experience with open source development?
+
**Your interest and background in biology or bioinformatics
+
**Any prior exposure to biology or bioinformatics?
+
*Your ideas for a project (an original idea or one expanded from our Ideas Page)
+
**Provide as much detail as possible
+
**Strong applicants include an implementation plan and timeline (hint!)
+
**Refer to and link to other projects or products that illustrate your ideas
+
**Identify possible hurdles and questions that will require more research/planning
+
*What can you bring to the team?
+
 
+
====Guidelines and Advice====
+
*[http://www.booki.cc/gsocstudentguide GSoC Student Guide]
+
*[http://www.google-melange.com/document/show/gsoc_program/google/gsoc2011/faqs#applying Application Guidelines]
+
*[http://code.google.com/p/google-summer-of-code/wiki/AdviceforStudents Advice for Student Applicants]
+
 
+
====Project Ideas====
+
As we are developing new features for WormBase, Reactome and GBrowse, we are exploring a number of areas ideal for Google Summer of Code students. These projects include a broad set of skills, technologies and domains, such as GUIs, database integration and algorithms. You are also encouraged to propose your own ideas related to our projects. If you have strong computer skills and have an interest in biology or bioinformatics, then you should apply!
+
 
+
=====IDEA 1: Your Original Idea=====
+
Feel free to propose your own idea. As long as it relates to one of our projects, we will give it serious consideration. Creativity and self-motivation are great traits for open source programmers, but make sure your proposal is also relevant.
+
 
+
=====IDEA 2: Export Reactome layout in BioPAX=====
+
It is difficult to exchange pathway layout or diagrams among different pathway databases. BioPAX is a pathway exchange format, but without support of pathway diagram. In this project, we propose to create an XML file format for visualization based on Reactome pathway diagrams for BioPAX level 2 and 3 exports, and a Java applet or Web Start visualization tool based on our curator tool to display this format.
+
 
+
* Language and Skills:  Java, XML, OWL and BioPAX
+
* Idea by: Guanming Wu
+
* Potential Mentors: Guanming Wu
+
 
+
=====IDEA 3: HTML 5 canvas based visualization tool for pathways and networks=====
+
With modern browsers' support of HTML5 canvas, it is a time to develop a canvas-based, dynamic visualization tool for biological interaction networks and pathways based on the new canvas tag. In this project, we propose to develop a canvas-based network interaction visualization prototype that can run in both modern browsers in a full-fledged computer, or in a tablet (e.g. ipad). The whole ideas are based on these two canvas based web applications: [http://www.canvasdemos.com/2010/08/05/lucidchart/ LucidChart] and [http://dev.opera.com/articles/view/html5-canvas-painting/ HTML 5 canvas painting].
+
 
+
* Language and Skills: HTML5 (esp canvas), JavaScript, Java, GWT
+
* Idea by: Guanming Wu
+
* Potential Mentors: Guanming Wu, Robin Haw and Marc Gillespie
+
 
+
=====IDEA 4: '''Cytoscape Web''' pathway visualization tool=====
+
[http://www.reactome.org Reactome] currently uses custom Javascript to to provide an interactive pathway browsing tool.  This project proposes to replace the Javascript with [http://cytoscapeweb.cytoscape.org/ Cytoscape Web] technology, to provide powerful, extendable and browser-independent visualization for pathways.
+
 
+
[http://cytoscapeweb.cytoscape.org/ Cytoscape Web] is a new technology that allows programmers to present networks of nodes and arcs on a webpage in a very flexible way.  It is part of the Cytoscape project, but unlike regular Cytoscape, it can seamlessly integrate into a website, without the need for Java Webstart.  Currently, [http://cytoscapeweb.cytoscape.org/ Cytoscape Web] uses Flash for performing animation, but the special appeal for developers is that the API is independent of the animation engine, so that if a future version switches to HTML5, all of the code written for this project will continue to work.
+
 
+
This would reuse pre-existing server-side code wherever possible, to reduce the initial learning curve, and would give the student an opportunity to work with cutting edge bioinformatics technology.
+
 
+
* Language and Skills: Java, Javascript
+
* Idea by: David Croft
+
* Potential Mentors: David Croft and Guanming Wu
+
 
+
=====IDEA 5: Pathway Summary visualization=====
+
The original [http://www.reactome.org Reactome] website provided a view called the "Sky", which gave a visual summary of all of the pathways in the [http://www.reactome.org Reactome] database.  Unfortunately, this overview was lost in the migration to the new, [http://code.google.com/webtoolkit/ GWT]-based website.
+
 
+
This project would produce a replacement for the old "Sky".  In particular, it would show expression and species comparison information in a multi-pathway context.  The project would be [http://code.google.com/webtoolkit/ GWT]-based, giving a student experience with an increasingly popular website construction environment.
+
 
+
* Language and Skills: Java, GWT
+
* Idea by: David Croft
+
* Potential Mentors: David Croft
+
 
+
=====IDEA 6: Reactome RESTful API=====
+
It is crucial to have a nice API for Reactome so that  the Reactome annotated data can be used programmatically and integrated in other web applications (e.g. Wormbase) easily with very low maintenance. In this project, we propose to develop a Reactome RESTful API which should be lightweight, easy to use, and well-documented.
+
 
+
* Language and Skills:RESTful Web Service, Java, Jersey , MySQL
+
* Idea by: Guanming Wu
+
* Potential Mentors: Guanming Wu
+
 
+
=====IDEA 7: Chado RESTful API=====
+
[[Chado]] is the database schema underlying several GMOD projects, and there are many installed instances around the world.  Having a RESTful API would allow both the creation of "mashup" sites that pull data from various Chado instances but would also facilitate data sharing between sites that have Chado installed.
+
 
+
* Language and Skills: RESTful Web Service, Perl (Dancer or Catalyst), PostgreSQL
+
* Idea by: [[User:Scott|Scott Cain]]
+
* Potential Mentors: [[User:Scott|Scott Cain]], [[User:Jogoodma|Josh Goodman]]
+
 
+
=====IDEA 8: GIS Indexing for Chado=====
+
Genome feature sequence data is stored in Chado and getting it out quickly can be a bottleneck.  In this project, we propose creating "geographic" coordinate indexes on the feature location table, as well as functions to use them.
+
 
+
* Language and Skills: PostgreSQL and plpgsql, Perl
+
* Idea by: [[User:Scott|Scott Cain]]
+
* Potential Mentors: [[User:Scott|Scott Cain]]
+
 
+
=====IDEA 9: Develop collaborative genome assembly tools and databases=====
+
''This idea is speculative!''
+
 
+
With the advent of NGS sequencing technology, more complex genomes are being sequenced than ever before. However, after the initial sequencing funds have been spent, much more long term 'curation' work is needed. Most sequencing projects will produce an obligatory 'draft' genome, after which the community needs tools to enable it to continue to work on improving the assembly and annotation of this draft.
+
 
+
The rough outline of the idea is to create a hybrid between a genome sequence assembly database (such as Chado, BAM or Bio::SeqFeature::Store) and a version control system (such as GIT). Using this system the community can work together on building and rebuilding the reference genome for their model organism.
+
 
+
* Language and Skills: Perl (BioPerl, Catalyst, Moose), Chado, Java, PHP (MediaWiki), GIT, RDBMS, creativity, vision, insight
+
* Potential Mentors: [[User:Scott|Scott Cain]], [[User:DanBolser|Dan Bolser]] <-- ''Click names for contact details.''
+
 
+
====== Details ======
+
* See [[GSoC/IDEA 9]].
+
 
+
=====IDEA 10: The genome game: crowd-sourcing better crops=====
+
''This idea is a mix of a few different ideas...''
+
 
+
With the ongoing 'data deluge' in biology, approaches for integrating knowledge from disparate groups of specialists have never been so important.
+
 
+
The genome of a specific individual (potato, tomato or human) varies at specific positions along it's length, and these variations (e.g. SNPs) give different individuals their different traits and propensities. In many cases, traits can be 'scored' according to some objective requirement (typically responses to various biotic and abiotic stresses or disease propensities).
+
 
+
We can build a wiki system that allows the community to collectively assign (and negotiate) scores for specific SNPs via published association studies. Players can then submit and rank their personal genomes (in a given 'environment'), as well as work on defining 'optimum' genotypes for given requirements (in a given 'environment'). The system would provide a platform for education, allowing direct comparison of cumulative risks for specific combinations of lifestyle choices and genetic factors.
+
  
* Language and Skills: The best tools for the job, [http://www.arg.dundee.ac.uk/ argumentation], [http://www.youtube.com/watch?v=ihUt-163gZI game mechanics], wiki ([http://www.mediawiki.org MediaWiki], [http://www.semantic-mediawiki.org Semantic MediaWiki]), creativity, vision, insight, fun, ...
+
'''Got an idea for a GSOC project? [[GSOC_Project_Ideas_2019 |Add it here]].'''  Ideas will be included in the proposal we send to GSOC, and great ideas make for a great proposal, so please add yours now.
* Potential Mentors: [[User:DanBolser|Dan Bolser]], many others. ''<-- Click names for contact details.''
+
 +
These projects can use a broad set of skills, technologies, and domains, such as GUIs, database integration, and algorithms. Students are also encouraged to propose their own ideas related to our projects. If you have strong computer skills and have an interest in biology or bioinformatics, you should definitely apply! '''Do not hesitate to propose your own project idea: some of the best applications we see are by students that go this route.''' As long as it is relevant to one of our projects, we will give it serious consideration. Creativity and self-motivation are great traits for open source programmers.
  
====== Details ======
 
Similar projects:
 
* [http://www.snpedia.com SNPedia], a wiki for human SNP association studies.
 
* [https://www.23andme.com/ 23andMe], when they eventually steal this idea.
 
* The PGP 'wiki': [http://evidence.personalgenomes.org/about GET-Evidence], "assists genome analysis by providing a Wikipedia-style location where individuals can record their interpretations of individual variants and review relevant literature".
 
* The [http://solgenomics.net/ Solanaceae Genomics Network], some of the best plant bioinformaticians around.
 
  
Correspondence:
+
== Preparing for GSoC 2019 ==
* See [[GSoC/IDEA 10]].
+
Right now it is the organization application process for GSoC - we won't know if Open Genome Informatics has been accepted as a GSOC 2019 mentoring organization until [https://developers.google.com/open-source/gsoc/timeline February 6th]. Nevertheless, it is a perfect time if students would like to talk to mentors about project ideas. If you are interested in mentoring, please check the Mentors section below, and contact the organization admin.
  
====Resources====
+
===Students===
*[http://socghop.appspot.com GSoC Main Site]
+
More information about [[GSOC_Applications_Guide | writing your application]] will be available closer to the start of the student application period.
*[http://code.google.com/p/google-summer-of-code/wiki/GsocFlyers#2011_Flyers This year's flyer]
+
*[http://code.google.com/p/google-summer-of-code/wiki/ProgramPresentations GSoC Presentations]
+
*[http://socghop.appspot.com/document/show/gsoc_program/google/gsoc2011/faqs Program FAQ]
+
*[http://socghop.appspot.com/document/show/gsoc_program/google/gsoc2011/faqs#timeline Program Timeline]
+
*[http://groups.google.com/group/google-summer-of-code-discuss GSoC Discussion List]
+
*[http://groups.google.com/group/google-summer-of-code-announce Announcement List]
+
*[http://www.youtube.com/user/googOSPOstudntprgrms GSoC YouTube Channel]
+
*[http://google-opensource.blogspot.com/ GSoC Blog]
+
  
=====For Students=====
+
===Mentors===
*[http://socghop.appspot.com/document/show/gsoc_program/google/gsoc2010/faqs#payments When do I get paid?!]
+
We encourage mentors and mentoring organizations to think about new projects year round! If you'd like help with your ideas page or your separate mentoring org application, please feel to contact the organization admins. Links to [[GSOC_Mentoring_Guide | advice about mentoring and other resources]] are available.
*[http://code.google.com/p/google-summer-of-code/wiki/AdviceforStudents Advice for Student Applicants]
+
*[http://groups.google.com/group/google-summer-of-code-students-list GSoC Students-Only List]
+
  
=====For Mentors=====
+
[[Category:Galaxy]]
*[http://socghop.appspot.com/ GSoC Main Site]
+
[[Category:JBrowse]]
*[http://en.flossmanuals.net/GSoCMentoringGuide Mentoring: The Book]
+
[[Category:MGI]]
*[http://code.google.com/p/google-summer-of-code/wiki/AdviceforMentors Advice for Mentors]
+
[[Category:WormBase]]
*[http://groups.google.com/group/google-summer-of-code-mentors-list GSoC Mentors-Only List]
+
[[Category:GSoC]]
 +
[[Category:Reactome]]
 +
[[Category:WebApollo]]

Revision as of 17:09, 18 December 2018

GoogleSummer 2016logo.jpg

Google Summer of Code 2019 @ Open Genome Informatics

Google Summer of Code is a global program that offers student developers stipends to write code for various open source software projects. We work with many open source, free software, and technology-related groups to identify and fund projects over a three month period. Since its inception in 2005, the program has brought together over 14,000 successful student participants from 118 countries, 651 open source organizations, and over 35 million lines of code. Through Google Summer of Code, accepted student applicants are paired with a mentor or mentors from the participating projects, thus gaining exposure to real-world software development scenarios and the opportunity for employment in areas related to their academic pursuits. In turn, the participating projects are able to more easily identify and bring in new developers. Best of all, more source code is created and released for the use and benefit of all. (Excerpt from the Google Summer of Code website)

Since 2011, the Open Genome Informatics group has served as an "umbrella organization" to a variety of bioinformatics projects, including GMOD and its software projects -- JBrowse, Apollo, Chado, Galaxy etc.; Mouse Genome Informatics; OICR; Reactome; WormBase; and Bioconda.

More information about this year's participating bioinformatics groups can be found here.

To learn more about this year's event and how GSoC works, please refer to the FAQ.

Mailing lists, IRC, and other ways to get in touch

Project Ideas

Got an idea for a GSOC project? Add it here. Ideas will be included in the proposal we send to GSOC, and great ideas make for a great proposal, so please add yours now.

These projects can use a broad set of skills, technologies, and domains, such as GUIs, database integration, and algorithms. Students are also encouraged to propose their own ideas related to our projects. If you have strong computer skills and have an interest in biology or bioinformatics, you should definitely apply! Do not hesitate to propose your own project idea: some of the best applications we see are by students that go this route. As long as it is relevant to one of our projects, we will give it serious consideration. Creativity and self-motivation are great traits for open source programmers.


Preparing for GSoC 2019

Right now it is the organization application process for GSoC - we won't know if Open Genome Informatics has been accepted as a GSOC 2019 mentoring organization until February 6th. Nevertheless, it is a perfect time if students would like to talk to mentors about project ideas. If you are interested in mentoring, please check the Mentors section below, and contact the organization admin.

Students

More information about writing your application will be available closer to the start of the student application period.

Mentors

We encourage mentors and mentoring organizations to think about new projects year round! If you'd like help with your ideas page or your separate mentoring org application, please feel to contact the organization admins. Links to advice about mentoring and other resources are available.