This is an old revision of the document!
openSNP (open genetics data)
We split into two groups:
- trying to run a GWAS, comparing the genotypes of openSNP users to the 1000 Genomes data set, to see whether there are significantly overrepresented variants in openSNP users
- trying to recreate the main graphics of this publication, a graph of a principal component analysis which shows that genetic variation clusters well according to geography:
Doing a PCA w/ openSNP data
- Decided to only use data from 23andMe
Data
- List and link your actual and ideal data sources.
Team
- and other team members
Links
- Repo for doing PCA on openSNP: https://github.com/ciyer/opensnp-fun