GSOC Project Ideas 2019

Revision as of 00:03, 9 February 2019 by Clements

Got an idea for GSOC 2019?

Then please post it. You can either

  1. Add it here, by directly editing this page. Just copy, paste and update the template below. This requires that you have or create a login.
  2. Describe it in the online form and we'll post it here.

Projects can use a broad set of skills, technologies, and domains, such as GUIs, database integration and algorithms.

Students are also encouraged to propose their own ideas related to our projects. If you have strong computer skills and an interest in biology or bioinformatics, you should definitely apply! Do not hesitate to propose your own project idea: some of the best applications we see come from students who go this route. As long as it is relevant to one of our projects, we will give it serious consideration. Creativity and self-motivation are great traits for open source programmers.

Proposed project ideas for 2019

Be the first to add a project idea.


  • Project Idea Name (Project Name/Lab Name)
    • Brief explanation: Brief description of the idea, including any relevant links, etc.
    • Expected results: describe the outcome of the project idea.
    • Project Home Page URL: if there is one.
    • Project paper reference and URL: Is there a paper about the project this effort will be a part of?
    • Knowledge prerequisites: programming language(s) to be used, plus any other particular computer science skills needed.
    • Skill level: Basic, Medium or Advanced.
    • Mentors: name + contact details of the lead mentor, name + contact details of 1 or 2 backup mentors.

Automated Bioinformatics Help in Galaxy

  • Brief explanation:
    • Galaxy users often encounter errors when trying to run a bioinformatics analysis. These errors may be user or data errors (e.g. misformatted dataset) or errors due to the underlying computing hardware (e.g. disk is full). Helping users and Galaxy support staff determine which kind of error they encountered would be useful because a user can likely address the first type of error, while the second type requires expert intervention.
    • This project will improve Galaxy’s error system by using heuristics or machine learning to identify common types of user/data errors and to suggest likely causes of each error and how it might be fixed. This will benefit Galaxy users by giving them clear and actionable error messages, and support staff by reducing the number of reported non-system errors.
  • Expected results:
    • Create a tool for analyzing, identifying, and classifying common error messages from the extensive history of error messages from the main public Galaxy server (
      • The diversity and size of this data suggest a machine learning approach, but the specific approach taken would be decided by the student and mentor.
    • Extend Galaxy’s tool definition syntax to support defining common error classes and suggested resolutions.
    • Update Galaxy’s user interface to display potential resolutions and suggested actions based on the types of errors found in an analysis.
  • Project Home Page URL:
  • Project paper reference and URL: The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update, Enis Afgan et al., Nucleic Acids Research, Volume 46, Issue W1, 2 July 2018, Pages W537–W544,
  • Knowledge prerequisites: programming language(s) to be used, plus any other particular computer science skills needed.
  • Skill level: Medium.
  • Mentors:
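
To make the heuristic option in the Automated Bioinformatics Help idea more concrete, here is a minimal sketch of rule-based error classification. It is an illustration only, not part of Galaxy: the ErrorRule class, the RULES table, the classify_error function, the category labels, and the regular expressions are all hypothetical names and patterns chosen for this example, and a real project would derive its rules (or a trained model) from the actual error history.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class ErrorRule:
    """One hypothetical heuristic: a pattern, an error class, and a hint."""
    pattern: re.Pattern
    category: str      # "system" or "user/data" (labels assumed for this sketch)
    suggestion: str


# Illustrative rules only; these are assumptions, not rules taken from Galaxy.
RULES = [
    ErrorRule(
        re.compile(r"no space left on device|disk quota exceeded", re.I),
        "system",
        "Report this error to the server administrators.",
    ),
    ErrorRule(
        re.compile(r"out of memory|cannot allocate memory", re.I),
        "system",
        "Report this error to the server administrators.",
    ),
    ErrorRule(
        re.compile(r"(invalid|malformed|unrecognized) .*(format|header)", re.I),
        "user/data",
        "Check that the input dataset has the format the tool expects.",
    ),
    ErrorRule(
        re.compile(r"empty (file|dataset|input)", re.I),
        "user/data",
        "Check that the input dataset is not empty.",
    ),
]


def classify_error(message: str) -> tuple[str, str]:
    """Return (category, suggestion) for the first rule that matches."""
    for rule in RULES:
        if rule.pattern.search(message):
            return rule.category, rule.suggestion
    return "unknown", "No suggestion available; report the error if it persists."


if __name__ == "__main__":
    for msg in (
        "OSError: [Errno 28] No space left on device",
        "Error: invalid FASTQ header on line 3",
        "Segmentation fault",
    ):
        category, suggestion = classify_error(msg)
        print(f"{category}: {suggestion}")
```

A rule table like this is easy to review and extend, but it only covers messages someone has already seen; that limitation is one reason the idea above also mentions machine learning for a large and diverse error history.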