generate\(algorithm\)LDpred2
LDpred2 algorithm specification
Description
Configure a single LDpred2 algorithm specification.
Usage
`generate$algorithm$LDpred2`(reference.panel, mode = c("auto", "grid", "inf"), h2.est = NULL, p.causal = 10 ^ seq(log10(1e-4), log10(0.2), length.out = 30), pval = 1, use.MLE = FALSE, ld.size = 3000, ld.thr = 0.002, alpha = 1, allow.jump.sign = FALSE, shrink.corr = 0.95, ncores = 1)
Arguments
reference.panel
|
Character scalar reference-panel name used for LD. |
variant.space
|
Optional variant-space name. When supplied, PolyGenius first creates a restricted reference panel from |
mode
|
Character scalar; one of |
h2.est
|
Optional numeric vector of heritability candidates. |
p.causal
|
Numeric vector of causal-fraction candidates. Default is a tutorial-style log-spaced sequence from |
pval
|
Numeric scalar in |
use.MLE
|
Logical; forwarded to LDpred2-auto. |
ld.size
|
Integer LD construction control mapped to bigsnpr |
ld.thr
|
Numeric LD construction control mapped to bigsnpr |
alpha
|
Numeric LDpred2 parameter. |
allow.jump.sign
|
Logical; forwarded to LDpred2-auto. |
shrink.corr
|
Numeric shrinkage for LD correlations in LDpred2-auto. |
ncores
|
Integer worker cores to request for this algorithm. |
Details
This algorithm creates and consumes the following reactive resources:
-
polygenius.model(final output) -
gwas.sumstats(input GWAS summary statistics) -
ld.bigsnpr(bigsnpr-family LD storage)
ld.size and ld.thr are LD-construction controls used by the LDpred2 workflow. They are mapped onto bigsnpr LD storage fields:
-
ld.size->window -
ld.thr->threshold
These are separate from LDpred2 fitting hyperparameters such as mode, h2.est, p.causal, alpha, use.MLE, and shrink.corr.
The current LDpred2 wrapper fixes the bigsnpr LD fields statistic = “r”, signed = TRUE, and phased = FALSE.
In mode = “auto”, p.causal is forwarded as the tutorial-style vector of initial p values for bigsnpr::snp_ldpred2_auto(). The returned chains are combined as recommended in the bigsnpr tutorial: compute range <- sapply(chains, function(auto) diff(range(auto$corr_est))), keep chains where range > 0.95 * quantile(range, 0.95, na.rm = TRUE), and use the row mean of beta_est across the kept chains as the final effect vector.
In mode = “grid”, LDpred2 returns a grid of effect vectors. The bigsnpr tutorial chooses the final grid model using prediction performance in a validation set. generate$algorithm$LDpred2() does not receive validation genotypes or outcomes, so validation-set model choice should be handled downstream when such data are available.
For LDpred2 parameter guidance, refer to the bigsnpr LDpred2 tutorial and extended polygenic-score documentation: https://privefl.github.io/bigsnpr/articles/LDpred2.html and https://privefl.github.io/bigsnpr-extdoc/polygenic-scores-pgs.html.
Value
ResourceSpecSet of generate algorithm resources.