10.7 Data
The admixr package provides real example data from 10 human individuals, which can be downloaded by running its download_data() function.
We now have a directory called snps that contains four files:
- snps.geno: Genotype of each individual (column) at each SNP (row)
  - Represented as counts of the alternative allele (0, 1, 2)
- snps.ind: Population IDs for each individual
- snps.snp: SNP IDs, positions, and alleles
- regions.bed: A file of genomic regions (not required for basic admixr analysis)
EIGENSTRAT format
Together, the three .geno, .ind, and .snp files constitute the EIGENSTRAT format. This is just a way of representing genotype data, similar to a VCF. In fact, several software packages exist to convert between VCF and EIGENSTRAT.
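To make the layout concrete, here is a minimal Python sketch of how the three files fit together. It is not part of admixr (which reads these files for you) and it uses a tiny invented dataset held in strings rather than the downloaded example data. The function name parse_eigenstrat and the sample, population, and SNP names are hypothetical. The column layouts assumed here (ID, sex, and population label in .ind; ID, chromosome, genetic position, physical position, and two alleles in .snp) and the use of 9 as a missing-genotype code follow the usual EIGENSOFT convention and are not stated in the text above.

```python
# Toy EIGENSTRAT contents (invented values, NOT the real admixr example data).
GENO = """\
021
109
220
"""
IND = """\
ind1 U PopA
ind2 U PopA
ind3 U PopB
"""
SNP = """\
rs1 1 0.0 1000 A G
rs2 1 0.0 2000 C T
rs3 2 0.0 1500 G A
"""


def parse_eigenstrat(geno_text, ind_text, snp_text):
    """Return (samples, snps, genotypes) parsed from the text of the three files."""
    # .ind: one line per individual -> sample ID, sex, population label (assumed layout)
    samples = []
    for line in ind_text.splitlines():
        if line.strip():
            sample_id, sex, pop = line.split()
            samples.append({"id": sample_id, "sex": sex, "pop": pop})

    # .snp: one line per SNP -> ID, chromosome, genetic position,
    # physical position, and the two alleles (assumed layout)
    snps = []
    for line in snp_text.splitlines():
        if line.strip():
            snp_id, chrom, _gen_pos, phys_pos, allele1, allele2 = line.split()
            snps.append({"id": snp_id, "chrom": chrom,
                         "pos": int(phys_pos), "alleles": (allele1, allele2)})

    # .geno: one line per SNP (row), one character per individual (column)
    genotypes = []
    for line in geno_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if len(line) != len(samples):
            raise ValueError("each .geno row needs one character per individual")
        # 0, 1, 2 are allele counts; 9 is treated as missing (assumed convention)
        genotypes.append([None if ch == "9" else int(ch) for ch in line])

    if len(genotypes) != len(snps):
        raise ValueError(".geno and .snp must describe the same number of SNPs")
    return samples, snps, genotypes


samples, snps, genotypes = parse_eigenstrat(GENO, IND, SNP)

# Print every individual's genotype at the second SNP.
for sample, count in zip(samples, genotypes[1]):
    print(snps[1]["id"], sample["id"], sample["pop"], count)
```

The point of the sketch is only the indexing: row i of the .geno file corresponds to line i of the .snp file, and column j corresponds to line j of the .ind file. For real analyses, a dedicated reader or converter is a better choice than hand-rolled parsing.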