10.7 Data
The admixr package provides real example data from 10 human individuals, which can be acquired by running its download_data() function:
We now have a directory called snps that contain four files:
snps.geno: Genotype of each individual (column) at each SNP (row)- Represented as counts of the alternative allele (
0,1,2)
- Represented as counts of the alternative allele (
snps.ind: Population IDs for each individualsnps.snp: SNP IDs, positions, and allelesregions.bed: A file of genomic regions (not required for basicadmixranalysis)
EIGENSTRAT format
Together, the three .geno, .ind, and .snp files constitute EIGENSTRAT format. This is just a way of representing genotype data, similar to a VCF – in fact, several software packages exist to convert between VCF and EIGENSTRAT.