All Phenotypes for wild sunflower populations
www.helianthome.org/rest/population/phenotype_matrix/wild.csv
www.helianthome.org/rest/population/phenotype_matrix/wild.json
All Phenotypes for cultivated SAM sunflower populations
www.helianthome.org/rest/population/phenotype_matrix/sam.csv
www.helianthome.org/rest/population/phenotype_matrix/sam.json
For a specific wild sunflower population by population id
Downloading a certain wild population by ID can be done with the REST API. In the following example we show how to download all phenotypes for the population ANN_01
www.helianthome.org/rest/population/phenotype_matrix/ANN_01.csv
www.helianthome.org/rest/population/phenotype_matrix/ANN_01.json
High Quality Imaging Data
All high-quality images for 1449 wild sunflower individuals as a single ZIP file
HighQualityImagesWildIndividuals.tar
You can also download images for certain species only. For this purpose, you can use the following URL endpoint:
www.helanthome.org/data/individuals/ANN1208.tar.gz
Just replace the Individual ID, e.g. ANN1208, with any other Individual ID.
Links to VCFs called on Ha412HOv2.0 reference genome
Hard filter was applied to retain only bi-allelic SNPs with 90% tranche, MAF > 0.01 and genotype rate > 50%. Additional filter of MAF > 0.01 and genotype rate > 50% was also applied for each subset.
Imputed datasets are generated with Beagle on all available samples for each species and the same filter was applied for each subset. (SNP set is slightly different from the un-imputed one.)
The VCFs with perennials are VQSR-ed with the same goldset used in the corresponding intra-species VCF to try to keep as many consistent SNPs as possible. To accommodate different requirements, multi-allelic SNPs are kept and filtered as follows: rare alleles with AF<0.01 are removed while others at the same site are kept; indels overlapping with a SNP are set as missing; sites that are variable after filtering and have >50% samples called are kept. Downstream filtration based on the genotypes (e.g. SNPs with no more than 1 missing data in the perennial outgroup, SNPs fixed between the target species and perennials, or SNPs that are variable within the target species but fixed in the perennial outgroup) should be conducted according to one’s purpose.
Links to Datasets
Annuus:
Results of genome-wide association studies for some phenotypes can be found on easyGWAS: