Calculate linkage disequilibrium from an rTASSEL genotype dataset.
Source:R/analysis_diversity_functions.R
linkageDiseq.Rd
Generates a linkage disequilibrium (LD) data set from SNP data.
Arguments
- tasObj
An object of class
TasselGenotypePenotype
that contains a genotype table.- ldType
How do you want LD calculated? Currently, the available options are
"All"
and"SlidingWindow"
. IfAll
is selected, LD will be calculated for every combination of sites in the alignment (NOTE: this may produce a massive series of combinations; use only on heavily filtered genotype tables). IfSlidingWindow
is selected, LD will be calculated for sites within a window of sites surrounding the current site.- windowSize
What size do you want your LD analysis window? If you have chosen
SlidingWindow
for theldType
parameter, you will need to specify window size.- hetCalls
How should heterozygous calls be handled? Current options are
"ignore"
(ignore altogether),"missing"
(set to missing), and"third"
(treat as third state).- verbose
Display messages? Defaults to
TRUE
.
Details
Linkage disequilibrium between any set of polymorphisms can be estimated by initially filtering a genotype dataset and then using this function. At this time, \(D'\), \(r^{2}\) and P-values will be estimated. The current version calculates LD between haplotypes with known phase only (unphased diploid genotypes are not supported; see PowerMarker or Arlequin for genotype support).
\(D'\) is the standardized disequilibrium coefficient, a useful statistic for determining whether recombination or homoplasy has occurred between a pair of alleles.
\(r^{2}\) represents the correlation between alleles at two loci, which is informative for evaluating the resolution of association approaches.
\(D'\) and \(r^{2}\) can be calculated when only two alleles are present. If more than two alleles, only the two most frequent alleles are used. P-values are determined by a two-sided Fisher's Exact test is calculated. Since LD is meaningless when scored with very small sample sizes, a minimum of 20 taxa must be present to calculate LD and there must be 2 or more minor alleles.
See also
linkageDiseq
, ldPlot