Tool Guide

Introduction

This tool guide is an overview of the GenomeSpace functionality in the user interfaces of the GenomeSpace tools and data sources.  This is not meant to be comprehensive documentation for the given tools.  For more information on each tool, see the list of GenomeSpace tools and the help links provided on each tool's page.

If you are having problems launching certain desktop tools, which depend on the Java Network Launch Protocol (JNLP), see FAQ2.2 for solutions. These solutions were updated February of 2015, and should take you about five minutes to incorporate. They involve adjusting your browser pop-up blocker and system's Java security exception site list.

 

For other tool-related documentation, check out the following guides:

Loading Data into GenomeSpace

Moving Data Between Tools viaGenomeSpace


ArrayExpress

Project Website: http://www.ebi.ac.uk/arrayexpress/

ArrayExpress is a repository of over 30,000 functional genomics experiments comprising nearly 1 million assays. Users can query and retrieve data in  a number of different formats including the MIAME and MINSEQE standards.

The ArrayExpress detail view for experiments provides a link you can use to send the data associated with that view (IDF, SDRF, and data files) directly to your GenomeSpace account.  To do this:

  1. Search in ArrayExpress for an experiment that contains data of interest to you.  For more information about searching the ArrayExpress experiment archive, see the ArrayExpress Help.
  2. Click the Send [experiment ID] data to GenomeSpace link on that experiment's detail page.

  1. Select the files that you want to upload to your account on GenomeSpace and click Upload.

The uploader provides the upload status of each file.  When the upload for each file is complete, a green checkmark appears next to the file name.  When all the files are uploaded, a message appears to tell you so and also provides a link to GenomeSpace.

You will find that ArrayExpress has created an ArrayExpress folder in your GenomeSpace cloud storage, with a subfolder that contains the experiment data files.  Future ArrayExpress exports will go into the ArrayExpress folder.

Note that ArrayExpress experiment data files are exported from ArrayExpress in MAGE-TAB format.

 

Read more documentation on how to load data from ArrayExpress into GenomeSpace.

Learn how to send data from ArrayExpress to GiTools using GenomeSpace.


cBioPortal

The cBioPortal for Cancer Genomics provides visualization, analysis and download of large-scale cancer genomics data sets.

Sending Files from cBioPortal to GenomeSpace

The cBioPortal "Download Data" tab on the home page provides an option for users to select a published dataset and save the entire file or a subset back to their GenomeSpace account.

  1. Click the "Download Data" tab on the home page of the cBioPortal
  2. Using the provided form, select your dataset by providing a cancer study, genomic profile, patient/case set and gene set
  3. Click the Send Data Matrix to GenomeSpace button at the bottom of the form to create your dataset
  4. You will be asked to authenticate your GenomeSpace account if you are not currently signed in
  5. Select a directory to save your file and click Submit

 

Read more documentation how to load data from cBioPortal to GenomeSpace.


Cistrome

Project Website: http://cistrome.org/ap/

A cistrome is defined as the set of cis-acting targets of a trans-acting factor on a genome scale. The Cistrome project integrates ChIP-chip/seq data for a cistrome with standard analysis pipelines, while  providing users with a user-friendly, powerful, flexible web interface with which they can access the data and pipelines.

In addition to the standard Galaxy functions, Cistrome has 29 ChIP-chip- and ChIP-seq-specific tools in three major categories, from preliminary peak calling and correlation analyses, to downstream genome feature association, gene expression analyses, and motif discovery.


Cytoscape

Project Website: http://www.cytoscape.org

Cytoscape is an open-source bioinformatics software platform for visualizing molecular interaction networks and biological pathways, and integrating these networks with annotations, gene expression profiles, and other state data.   Cytoscape core distribution provides a basic set of features for data integration and visualization.  Additional features are available as plugins.

Cytoscape is a Java-based application that downloads to your local machine when you launch it from GenomeSpace.  Once it's downloaded to your machine (after a launch from GenomeSpace), you can start it locally (by opening the JNLP) or launch from GenomeSpace.  If you launch Cytoscape from the Cytoscape website, at this time it will not be the version that is linked to GenomeSpace.

Like several other applications in the GenomeSpace suite of tools, Cytoscape can be expanded with plug-ins.  You can download plug-ins for your Cytoscape by choosing Plugins>Manage Plugins.

Cytoscape has several GenomeSpace menu items.

Logging In

When you launch Cytoscape from GenomeSpace, your login should be handled seamlessly.

If you start from your GenomeSpace-enabled JNLP of Cytoscape and want to log into GenomeSpace in order to access your Amazon cloud storage or other GenomeSpace tools, you will be asked to log in with your GenomeSpace username and password.

You can also Reset Password on your existing GenomeSpace account.

Import Menu

Clicking File>Import>GenomeSpace provides several options for importing files from GenomeSpace.

Load Network Loads a network from your GenomeSpace cloud storage.
Load Network from Table Starts the import from your GenomeSpace cloud storage of a network from a delimited text file or single-sheet MS Excel workbook.
Load Attributes Loads attributes (for node/edge/network) from your GenomeSpace cloud storage. The load dialog allows you to opt to either Load node attributes or Load edge attributes.
Load CyTable Loads a file containing attributes for a CyNetwork from your GenomeSpace cloud storage.  The load dialog allows you to opt to either Load node attributes or Load edge attributes.
Load Attributes from Table Starts the import from your GenomeSpace cloud storage of delimited text or single-sheet MS Excel workbook containing attribute data tables that are not formatted into Cytoscape node or edge attribute file formats.
Load Session Loads a CYS file from your GenomeSpace cloud storage.  A session file (CYS) contains all networks, attributes (for node/edge/network), Desktop states (selected/hidden nodes and edges, window sizes), Properties, some plugin states, and Visual Styles present in a given working state.
Load Ontology and Annotations Loads an annotation/ontology file from your GenomeSpace cloud storage. Annotations in Cytoscape are stored as a set of ontologies (a set of controlled vocabulary terms that annotate the objects).The standard file formats used in Cytoscape are OBO and Gene Association.


Export Menu

Clicking File>Export>GenomeSpace provides options for exporting files to GenomeSpace.

Upload File Uploads a file from your local machine to the root level of your GenomeSpace cloud storage.
Save Session As Saves a session file (CYS), containing all networks, attributes (for node/edge/network), Desktop states (selected/hidden nodes and edges, window sizes), Properties, some plugin states, and Visual Styles in your present working state to your GenomeSpace cloud storage.
Save Network As Save a network to your GenomeSpace cloud storage.

GenomeSpace Menu

Clicking File>GenomeSpace provides several options for managing your files in your GenomeSpace cloud storage and using GenomeSpace to launch other applications.

Delete File in GenomeSpace Delete a file from your GenomeSpace cloud storage.
Download File Download a file in your GenomeSpace cloud storage to your local machine.
List Available Files in GenomeSpace Navigate your GenomeSpace directories.
Login To GenomeSpace Log into your GenomeSpace account.
Launch Launch another GenomeSpace tool

 


Cytoscape 3

Project Website: http://www.cytoscape.org

New Documentation for Cytoscape 3 is on its way... In the meantime, below is the help from the previous revision (Cytoscape 2.8)

Cytoscape is an open-source bioinformatics software platform for visualizing molecular interaction networks and biological pathways, and integrating these networks with annotations, gene expression profiles, and other state data.   Cytoscape core distribution provides a basic set of features for data integration and visualization.  Additional features are available as plugins.

Cytoscape is a Java-based application that downloads to your local machine when you launch it from GenomeSpace.  Once it's downloaded to your machine (after a launch from GenomeSpace), you can start it locally (by opening the JNLP) or launch from GenomeSpace.  If you launch Cytoscape from the Cytoscape website, at this time it will not be the version that is linked to GenomeSpace.

Like several other applications in the GenomeSpace suite of tools, Cytoscape can be expanded with plug-ins.  You can download plug-ins for your Cytoscape by choosing Plugins>Manage Plugins.

Cytoscape has several GenomeSpace menu items.

Logging In

When you launch Cytoscape from GenomeSpace, your login should be handled seamlessly.

If you start from your GenomeSpace-enabled JNLP of Cytoscape and want to log into GenomeSpace in order to access your Amazon cloud storage or other GenomeSpace tools, you will be asked to log in with your GenomeSpace username and password.

You can also Reset Password on your existing GenomeSpace account.

Import Menu

Clicking File>Import>GenomeSpace provides several options for importing files from GenomeSpace.

Load Network Loads a network from your GenomeSpace cloud storage.
Load Network from Table Starts the import from your GenomeSpace cloud storage of a network from a delimited text file or single-sheet MS Excel workbook.
Load Attributes Loads attributes (for node/edge/network) from your GenomeSpace cloud storage. The load dialog allows you to opt to either Load node attributes or Load edge attributes.
Load CyTable Loads a file containing attributes for a CyNetwork from your GenomeSpace cloud storage.  The load dialog allows you to opt to either Load node attributes or Load edge attributes.
Load Attributes from Table Starts the import from your GenomeSpace cloud storage of delimited text or single-sheet MS Excel workbook containing attribute data tables that are not formatted into Cytoscape node or edge attribute file formats.
Load Session Loads a CYS file from your GenomeSpace cloud storage.  A session file (CYS) contains all networks, attributes (for node/edge/network), Desktop states (selected/hidden nodes and edges, window sizes), Properties, some plugin states, and Visual Styles present in a given working state.
Load Ontology and Annotations Loads an annotation/ontology file from your GenomeSpace cloud storage. Annotations in Cytoscape are stored as a set of ontologies (a set of controlled vocabulary terms that annotate the objects).The standard file formats used in Cytoscape are OBO and Gene Association.


Export Menu

Clicking File>Export>GenomeSpace provides options for exporting files to GenomeSpace.

Upload File Uploads a file from your local machine to the root level of your GenomeSpace cloud storage.
Save Session As Saves a session file (CYS), containing all networks, attributes (for node/edge/network), Desktop states (selected/hidden nodes and edges, window sizes), Properties, some plugin states, and Visual Styles in your present working state to your GenomeSpace cloud storage.
Save Network As Save a network to your GenomeSpace cloud storage.

GenomeSpace Menu

Clicking File>GenomeSpace provides several options for managing your files in your GenomeSpace cloud storage and using GenomeSpace to launch other applications.

Delete File in GenomeSpace Delete a file from your GenomeSpace cloud storage.
Download File Download a file in your GenomeSpace cloud storage to your local machine.
List Available Files in GenomeSpace Navigate your GenomeSpace directories.
Login To GenomeSpace Log into your GenomeSpace account.
Launch Launch another GenomeSpace tool

 


FireBrowse

FireBrowse is a companion portal to the Broad Institute GDAC Firehose analysis pipeline, and was developed to cull and analyze data generated by The Cancer Genome Atlas (TCGA), which characterizes and identifies genomic patterns in human cancer models. Backed by a powerful compute infrastructure, programming interface, online reports and modern graphical tools, FireBrowse provides a simple yet capable means of visually and programmatically exploring one of the most comprehensive and deeply-characterized open cancer datasets in the world. FireBrowse provides access to a variety of cancer genomics data, such as clinical annotations, DNA copy number, miR, miRseq, mRNA and mRNAseq; as well as a comprehensive suite of more than 100 interdependent analyses of those data, including: correlations, clustering, and GISTIC and MutSigCV. FireBrowse is developed at the Broad Institute of MIT and Harvard.

More information about FireBrowse can be found using the following links:

 

Downloading data from FireBrowse to GenomeSpace

When you launch FireBrowse from GenomeSpace, you will be automatically redirected to the FireBrowse front page (http://firebrowse.org/). On this page, you can access FireBrowse data by selecting a cohort.

To do this, you can (A) select a cohort from the drop-down menu; or, (B) simply click on the respective row of the samples barchart for that cohort. In the example below, we will be downloading Breast invasive carcinoma (BRCA) data. Therefore, we can either: (A) “Breast invasive carcinoma (BRCA)” from the drop-down menu; or (B) click on the BRCA barchart.

Next, select the data that you wish to download. FireBrowse collects data from a variety of analyses: clinical, copy number, miR, miRseq, mRNA, mRNAseq, mutation and pathway analyses.

To select a data type, either: (A) select the data type from the left-hand menu; or (B) click on the corresponding barchart. In our example, we will download SNP6 Copy Number variation data. We therefore either: (A) Click on the "CopyNumber Analyses" box; or (B) click on the "SNP6 CopyNum" barchart.

When you select a data type to download, this will prompt a data download interface. To send data to GenomeSpace, use the following method:

  1. Click on the "Send To" button.
  2. Select which data file(s) to download by clicking on the checkboxes next to the file(s).
  3. Click the "GenomeSpace Upload" button.

This will prompt you to choose a GenomeSpace directory to save the file(s) to. Once you have chosen a directory, you can click the "Submit" button. This will send the file(s) to your GenomeSpace directory as a .tar.gz file.

When you return to GenomeSpace, you will have to right-click on the file and choose "Expand Archive". This will unzip the .tar.gz file and create a new folder with the same name. The files you chose to download in Step B will be available in that folder.


Galaxy

Project Website: http://galaxyproject.org/

Galaxy is an open-source, scalable framework for tool integration that allows users to analyze multiple alignments, compare genomic annotations, and profile metagenomic samples, among many possible analyses; workflows allow the linking together of analyses.

Please note that during the GenomeSpace Beta, you will be connected with a Galaxy test server.

Logging In

When you launch Galaxy from GenomeSpace, you will see a login window.  Do not enter anything in the Email address, Password, or OpenID URL fields!

In the final line under the OpenID login section ("Or, authenticate with your _____ account."), select GenomeSpace and click Login.


Galaxy Tools

Galaxy has its own tools that perform different functions or analyses, and there are tools for interacting with GenomeSpace.

Get Data

To import data to Galaxy from GenomeSpace, click Get Data in the Tools menu.  Under that, select GenomeSpace import from file browser.  This opens your GenomeSpace directories.  The navigation is a little different here than it is in GenomeSpace, since there is no separate directories pane.  Double-click folders to open them, and double-click ..(To Parent) to return to the previous level of the folder structure.

Select the checkboxes for the files you want to bring into Galaxy. You can then select a file format from the file's drop-down menu, if there is a converter.  Click the Send to Galaxy button to queue the files for import into Galaxy.

Send Data

To send data back to GenomeSpace from Galaxy, click Send Data in the Tools menu.  Under that, select GenomeSpace Exporter.

  1. Select the dataset you want to export from the drop-down menu.
  2. Select the target directory -- you can open the base directory by clicking the so that you can select a subdirectory:

  1. You can enter a file name.  If you do not enter one, Galaxy will generate a file name automatically.
  2. Click Execute to send the file to GenomeSpace.

 

Learn how to send data from Galaxy to IGV using GenomeSpace.


GenePattern

Project Website: http://www.genepattern.org

GenePattern is a powerful genomic analysis platform that provides access to more than 150 tools for gene expression analysis, proteomics, SNP analysis, flow cytometry, RNA-seq analysis, and common data processing tasks. A web-based interface provides easy access to these modules and allows for the creation of multi-step analysis pipelines that enable reproducible in silico research.

A GenomeSpace-enabled GenePattern installation has a number of elements in it that help move data files from GenomeSpace and analysis results to GenomeSpace.

Logging In

When you launch GenePattern from GenomeSpace, GenePattern allows you to log in using your GenomeSpace account.  You will see the GenePattern login screen, and you can click Sign in Using GenomeSpace.

If your GenomeSpace username is not recognized as being associated with any GenePattern username, GenePattern gives you the opportunity to either:

Sending Files to GenePattern from GenomeSpace

GenePattern receives your files and offers you a number of options of how you would like it to handle those files.

Each file can be saved to the Upload Directory by simply clicking Submit next to the Save a Copy field.  Or you can opt to send the file to a module.  If the file can be converted by one of the GenomeSpace conversion scripts, you can opt for a conversion as you send it to the module of your choice.

GenomeSpace Tab

Within the GenePattern UI, the right side of the window is occupied with several tabs that provide access to recently run jobs, uploaded files, and, in GenomeSpace-enabled installations, your GenomeSpace directories.  You can navigate these directories.

To use one of your GenomeSpace files in a module, drag and drop the name of the file on the field box.

 

Save To GenomeSpace

Files in the Job Results and Uploads tabs, as well as on the Job Results page, can be sent directly to GenomeSpace. Click the file name to generate a navigation menu for the file.

Then, in the drop-down menu, select the location in your GenomeSpace cloud storage where you want to save the file and click Save.

GenomeSpace Menu

There is also a menu of GenomeSpace functionality, allowing you to log out of GenomeSpace, open GenomeSpace, or go to the GenomeSpace web site.

 

Learn how to send data from GenePattern to IGV using GenomeSpace.

Learn how to send data from Genomica to GenePattern using GenomeSpace.


Genomica

Project Website: http://genomica.weizmann.ac.il/

Genomica is an analysis and visualization tool for genomic data that can integrate gene expression data, DNA sequence data, and gene and experiment annotation information.

Genomica is a Java-based application that downloads to your local machine when you launch it from GenomeSpace.  Once it's downloaded to your machine (after a launch from GenomeSpace), you can start it locally (by opening the JNLP) or launch from GenomeSpace.  If you launch Genomica from the Genomica website, at this time it will not be the version that is linked to GenomeSpace.

Logging In

When you launch Genomica from GenomeSpace, your login should be handled seamlessly.

If you start from your GenomeSpace-enabled JNLP of Genomica and want to log into GenomeSpace in order to access your cloud storage or other GenomeSpace tools, you need to select GenomeSpace>Login.  You will be asked to log in with your GenomeSpace username and password.

Menus

Genomica has two menus containing GenomeSpace items.

GenomeSpace Menu

The GenomeSpace menu in Genomica allows you to interact with your GenomeSpace cloud storage in a number of ways.

Login GenomeSpace/Logout GenomeSpace Signs into or out of GenomeSpace.
Open From GenomeSpace

Opens the GenomeSpace file browser to enable you to open a file from your GenomeSpace cloud storage into Genomica.

Save To GenomeSpace Saves a file from Genomica to your GenomeSpace cloud storage.
Export Network To GenomeSpace Export a network file from Genomica to your GenomeSpace cloud storage.
Launch GenomeSpace Tool

Allows you to launch another GenomeSpace-enabled tool from Genomica.

If you click Launch with GS File, the GenomeSpace file browser opens to allow you to select a file to use in launching your selected tool.

Upload Local File To GenomeSpace Uploads a file from your local machine to your GenomeSpace cloud storage.
Download GenomeSpace File to Local File Downloads a file from your GenomeSpace cloud storage to your local machine.
Delete From GenomeSpace Deletes a file from your GenomeSpace cloud storage.

Sets Menu

The Sets menu contains two GenomeSpace options.

With the Sets menu, you can load either Gene Sets or Experiment Sets from GenomeSpace.

In the User Interface

Select Algorithms>Create a Module Network...  Under the Regulation tab is a GenomeSpace Load button that enables you to load a list of candidate regulator genes for the analysis.

 

Learn how to send data from Genomica to GenePattern using GenomeSpace.


geWorkbench

Project Website: http://wiki.c2b2.columbia.edu/workbench/index.php/Overview

geWorkbench is an open-source bioinformatics platform that offers a comprehensive and extensible collection of tools for the management, analysis, visualization, and annotation of biomedical data. Many kinds of analysis are supported.  For microarrays, there are tools for filtering and normalization, basic statistical analyses, clustering, network reverse engineering, as well as many common visualization tools. For sequence data, there are routines such as BLAST, pattern detection, transcription factor mapping, and syntenic region analysis. geWorkbench provides access to a variety of external data sources, including microarray gene expression repositories (caArray), BLAST, gene annotation pages (via caBIO),protein and DNA sequence retrieval, and pathway diagrams (BioCarta), as well as acting a gateway to several computational services currently hosted on Columbia servers and clusters, including Pattern Discovery, Pudge (protein structure modeling), and SkyBase (database of molecular models).

Logging In

When you launch geWorkbench from GenomeSpace, you will se the Launch geWorkbench dialog.

If you have installed geWorkbench in the default location previously, click OK.  This will launch geWorkbench.

If you have never used geWorkbench before, or have not used the current version of geWorkbench, you need to install it.

Installing geWorkbench

If you have never used geWorkbench before, you can use the prompt you receive when you launch it from GenomeSpace to install it.  If you have used it and installed it in a non-standard directory, you can point GenomeSpace to that location in the prompt dialog.

To install geWorkbench:

  1. Click the link in the "please click here to install it" line and you will be taken to a page of installers. 
  2. Select and download the appropriate installer for your machine.
  3. Expand the ZIP archive containing the installer.
  4. Run the installer and go through the installation wizard.  It is recommended that you install geWorkbench in the default location; if you do not, you will have to enter the nonstandard location path in the prompt window every time you launch the tool.
  5. Go back to the prompt dialog and click OK. This should complete the launch of geWorkbench you began in GenomeSpace.

Enabling GenomeSpace Within geWorkbench

To access GenomeSpace in geWorkbench, enable the GenomeSpace component using the Component Configuration Manager.

  1. Select Tools> Component Configuration.
  1. In the Component Configuration Manager dialog, select the Genome Space checkbox.
  2. Click Apply.
  3. Click Close.

Using GenomeSpace in geWorkbench

When you click Workspace in the upper left corner of the screen, the GenomeSpace pane will appear to the right.

Log in by clicking the Login button next to the GenomeSpace Directories header, and now the GenomeSpace pane will display your directories.

You can work with your files within GenomeSpace:

Moving Files From GenomeSpace to geWorkbench

To move one of your GenomeSpace files into geWorkbench:

  1. If you do not already have a project in geWorkbench, right-click on Workspace and select New Project in the popup menu.

  1. Select a project.
  2. Right-click the file you want to use in geWorkbench and select Download in the popup menu.

Note that if you download a file that is not in a geWorkbench file format, GenomeSpace will convert it if there is a converter available.

Moving Files from geWorkbench to GenomeSpace

To move files from geWorkbench to GenomeSpace:

  1. Click the GenomeSpace tab.

  1. Right-click the file you want to move to GenomeSpace and select Upload to GenomeSpace from the popup menu.

  1. Select the GenomeSpace directory to which you want to upload the geWorkbench file.

  1. Click OK.

Gitools

Project Website: http://www.gitools.org/

Gitools is a framework for analysis and visualization of genomic data using interactive heatmaps that also allows data to be imported from various sources, including Biomart, Ensembl, IntOGen, and GenomeSpace.

In the Gitools interactive heatmaps, every cell is associated with one or several values, and users can visualize them and navigate among them.  Multiple annotation labels can be added to rows and columns. A number of actions can be performed within the heatmap that help to explore and interpret the data effectively, such as search, filter by value or label, cluster the heat-map, sort rows and columns by different criteria, and more. There are many editable features that make the heatmaps thoroughly customizable, such as color scales, size of cells, grid lines, font size, and color and type for labels.

It is possible to load multidimensional data genomic sets into the heatmaps.  For example, a cell in a heatmap that represents a gene in a tumour sample can represent the expression level, mutation status, copy number alterations, and methylation changes. With these properties complemented by a set of analyses (such as correlations, overlaps, enrichment, or group comparison), the exploration of large multidimensional genomics datasets becomes very effective.

Importing Data From GenomeSpace

To open a data file from GenomeSpace, you can either click the GenomeSpace icon on the Gitools welcome page:

Or by selecting File>Open>Heatmap from Genomespace...

 

Learn how to send data from ArrayExpress to GiTools using GenomeSpace.


Integrative Genomics Viewer (IGV)

Project Website: http://www.broadinstitute.org/igv/

Logging In

When you launch IGV from GenomeSpace, your login should be handled seamlessly.

Sending Files to IGV From GenomeSpace

When you send to a data file to IGV from GenomeSpace, IGV opens up with the file in the viewer, as shown below.  You can then rearrange the data tracks by dragging and dropping them where you want to view them, or otherwise manipulate the view.

GenomeSpace Menu

Within the IGV UI, there is a menu for GenomeSpace functionality:

Load from GenomeSpace

This opens up a dialog box that allows you to load one data file from your GenomeSpace directories in the cloud.

Navigate through the directories and select the file you want to load.  Click Open.

Save Session to GenomeSpace

A session in IGV is an XML snapshot of the current state of IGV (e.g., the genome that is selected, the data file(s) that is/are loaded into the viewer, the regions of interest that are defined).  You can save an IGV session locally normally, but with the GenomeSpace menu, you can save an IGV session to your GenomeSpace cloud storage.

Load Session From GenomeSpace

Loading a session file from GenomeSpace is similar to loading a data file, and uses a very similar dialog box.

 

Learn how to send data from Galaxy to IGV using GenomeSpace.

Learn how to send data from GenePattern to IGV using GenomeSpace.


ISAcreator

Project Website: http://isa-tools.org/

The open source ISA metadata tracking tools help to manage an increasingly diverse set of life science, environmental, and biomedical experiments that employ one or a combination of technologies.

Logging In

When you launch ISAcreator from GenomeSpace, the Java application launches and gives you three options, including GenomeSpace:

Click the GenomeSpace button and you will be asked to log in with your GenomeSpace username and password. 

Once you log in, you'll be asked to select a configuration to load.  A configuration is a template for describing and curating experiments.

Opening Files from GenomeSpace

When you have logged in and have your configuration loaded, you can then load ISA-TAB datasets from GenomeSpace.  From the main menu, select load an existing isatab file.

Click search genomespace.

In the GenomeSpace dialog, browse to the directory containing your ISA-TAB datasets, select it, and click load selected.

This loads the ISA-TAB files into your investigation.

Saving Files to GenomeSpace

You can save your configuration and ISA-TAB files to GenomeSpace, using either File>save in GenomeSpace or File>save in GenomeSpace as...

For more information on defining your studies, see the ISAcreator User Manual.

 


MSigDB Online Tools

Project Website: http://www.broadinstitute.org/msigdb

Logging In

When you launch MSigDB from GenomeSpace, GenomeSpace tries to log into MSigDB with your email address.  If you log in successfully, you will be brought to the Investigate Gene Sets page.  Note that MSigDB is closely affiliated with the Gene Set Enrichment Analysis (GSEA) tool, and thus shares its website.  You are in the correct place, even though the logo in the corner says GSEA.

If the email address you use for GenomeSpace is not registered with MSigDB, you will be informed that the login failed.  Click the click here link to register as a new user, or log in with a registered email address.

Even if the email addresses are different, as long as you are logged into GenomeSpace and MSigDB concurrently, they can send data to one another.

Sending Data to the MSigDB Online Tools from GenomeSpace

You can send a list of genes in text format from GenomeSpace to be used as input the MSigDB Online Tools.  The list can be one per line, or delimited by spaces, tabs, or commas.

From GenomeSpace, you can:

OR

Your gene list will populate the Gene Identifiers box on the Investigate Gene Sets page.

Note that you can also launch MSigDB by clicking its icon in the tool bar, then paste a list of genes into the Gene Identifier box.

From here, select one or more collections in the Compute Overlaps list and click compute overlaps.

For more information on using the Compute Overlaps tool, see the Compute Overlaps help.

Save To GenomeSpace

From the Compute Overlaps results page, you can click the Save to: GenomeSpace link to save your results to GenomeSpace.

In the dialog, browse to the directory you want to save the overlap file to and click Submit.

 


UCSC Table Browser

Sending Files from the UCSC Table Browser to GenomeSpace

Project Website: http://genome.ucsc.edu/index.html

The UCSC Table Browser is a valuable data source linked to GenomeSpace.  Since it is a data source, you cannot launch the UCSC Table Browser on a file in GenomeSpace.  You can only launch the UCSC Table Browser from its button in GenomeSpace, and then send files from the UCSC Table Browser to GenomeSpace.

The Table Browser allows you to retrieve data associated with a track in text format, to calculate intersections between tracks, and to retrieve DNA sequence covered by a track.  After you select the options for your output file, you can opt to send your output file to your GenomeSpace cloud storage.  For help using the Table Browser's options, see the UCSC Table Browser documentation.

To output your selected track to GenomeSpace, remember to select the checkbox for GenomeSpace

Select an output format.  In the output file field, you need to specify the name of your output file.  There are two options for this:

NOTE: You must add the correct extension to your file name. If, for instance, you select BED in the output format drop-down, you will need to add .bed to your output file name.  The UCSC Table Browser does not do this for you, and some of the GenomeSpace tools depend on the file extension to determine file type.

When you click get output, you will be asked to log into GenomeSpace, if you have not previously logged in via the UCSC Genome Browser.

After you log in (or if you previously logged in), you will see an uploading message.

Followed by a completion message.

Click Back to return to the Table Browser.

The file you sent to GenomeSpace should appear in your root directory.

 

Read more documentation on how to load data from UCSC Genome Browser to GenomeSpace.


Tool Term Glossary

GSE Identifier:  A series record links together a group of related samples, providing a focal point and description for the whole study. Series records may also contain tables describing extracted data, summary conclusions, or analyses. Each series record is assigned a unique and stable GEO accession number (GSExxx).

Module (GenePattern): A module is a discrete portion of code/functionality that is packaged with launch code, UI, and metadata, and can be run within the GenePattern user interface.


Trinity

The Trinity Cancer Transcriptome Analysis Toolkit (CTAT) aims to provide tools for leveraging RNA-Seq to gain insights into the biology of cancer transcriptomes. Bioinformatics tool support is provided for mutation detection, fusion transcript identification, and de novo transcript assembly. Trinity CTAT was developed at The Broad Institute and is hosted on the Trinity CTAT Galaxy portal by the National Center for Genome Analysis Support (NCGAS) at Indiana University. CTAT is funded by the National Cancer Institute Informatics Technology for Cancer Research (NCI ITCR) program.

More information about Trinity CTAT can be found using the following links:

 

Creating an Account

Before you can launch Trinity from GenomeSpace, you must first sign up for a Trinity account through the National Center for Genome Analysis Support (NCGAS). This can be accomplished by filling out the Trinity Galaxy Registration Form.

Trinity Galaxy support by NCGAS is publically funded, so users are required to sign up and describe the nature of their work and their sources of funding. If you have any questions about this form, please contact the NCGAS team at help@ncgas.org.

Once you have submitted your registration form, you should receive an email from the NCGAS team with your account information and temporary password. This email should arrive within 24 hours of submitting your application.

 

Connecting Your Account

Once you have signed up for and activated your Trinity CTAT account, you can connect your Trinity account to GenomeSpace using the following method:

  1. On the main login screen, bypass the Trinity Login section and instead navigate to the OpenID Login section.
  2. Under the OpenID URL box, choose "GenomeSpace" from the drop-down menu to authenticate with your GenomeSpace account.
  3. Click "Login".
  4. Once you have successfully authenticated with OpenID, you will also need to associate a Galaxy account with this ID. To do this, enter your Trinity username and password again and click "Login" (see below). Do not create a new account.

Once you have connected your account, you can begin using Trinity normally.

 

Trinity Tools

Trinity encompasses a suite of tools hosted on a Galaxy server. There are three types of tools: (1) tools for interacting with GenomeSpace; (2) auxiliary tools that are common to Galaxy builds; and, (3) Trinity-specific tools which perform different functions or analyses.

Get Data

To import data to Trinity from GenomeSpace, click Get Data in the Tools menu and select GenomeSpace import from file browser.  This opens a view of your GenomeSpace directories.  The navigation is a little different here than it is in GenomeSpace, since there is no separate directories pane.  Double-click folders to open them, and double-click ..(To Parent) to return to the previous level of the folder structure.

Select the checkboxes for the files you want to bring into Trinity. You can optionally select a file format from the file's drop-down menu, provided there is a converter.  Click the Send to Galaxy button to queue the files for import into Trinity.

Send Data

To send data back to GenomeSpace from Trinity, click Send Data in the Tools menu and select GenomeSpace Exporter.

  1. Select the dataset you want to export from the drop-down menu.
  2. Select the target directory – you can open enclosing directories by clicking the  button.

  1. You can optionally enter a file name.  If you do not enter one, Trinity will generate a file name automatically.
  2. Click Execute to send the file to GenomeSpace.

 

Other Galaxy Tools

Trinity also has built-in Galaxy tools to help with basic file manipulation and other functions. These tools can by found by clicking on Helper Tools in the Tools menu.

To learn about other Galaxy tools that can be found in Trinity, please visit the Galaxy Wiki Resource.

 

Trinity-Specific Tools

The Trinity CTAT suite of tools is specific to this build of Galaxy, and is not currently deployed on the main Galaxy server which is also connected to GenomeSpace. Currently, CTAT supports tools and functions for completing analyses such as:

More information about the capabilities of these specific tools can be found on the Trinity CTAT wiki page. In the future, Trinity CTAT may also support lincRNA classification, foreign transcript detection, and single-cell cancer transcriptome analysis.