Recipe demo datasets
The scale of genomic data is growing and to minimize the duplication of resources spent to obtain and store data of interest to the wider scientific community, providing data server access is becoming more common. In addition to the specialized data portals provided under GenomeSpace Tools, GenomeSpace enables use of other public data resources. If a public data resource provides URL addresses for data files, then you can easily access the files from your GenomeSpace account. On this page we list the example datasets used in the Analysis Recipes as well as provide links to alternative datasets that showcase GenomeSpace functionality.
GenomeSpace Public Folder Datasets
The Public folder in your left directories panel contains user specific public data and a folder titled SharedData. Example datasets used in GenomeSpace Recipes are stored in the Demos subfolder of the SharedData folder. Here we briefly describe each shared dataset. Find more information about each dataset on the corresponding Recipe page, provided as links in the table below.
Table matching /Home/Public/SharedData/Demos subfolders to Analysis Recipes as of February 3, 2015
Amazon S3 Public Datasets
See aws.amazon.com for the comprehensive list of public S3 datasets. Of note because of their scale are two projects, the 1000 Genomes Project and the Human Microbiome Project. The User Guide details how to Connect an S3 Bucket to your GenomeSpace account.
Other Public Datasets
Please let us know of other public datasets GenomeSpace should link to. We refer you to curations on the following websites listing other public datasets.