make.sc.deg.data.Rd
We will generate Y ~ Poisson(mu * rho) where mu ~ exp(log.mu/smudge), rho ~ Gamma(a,b)
make.sc.deg.data(
file.header,
nind = 40,
ngenes = 1000,
ncausal = 5,
nreverse = 0,
ncovar.conf = 3,
ncovar.batch = 0,
ncell.ind = 10,
ngenes.covar = ngenes,
num.mixtures = 1,
pve.1 = 0.3,
pve.c = 0.5,
pve.a = 0.5,
pve.r = 0,
smudge = 1,
rho.a = 2,
rho.b = 2,
rseed = 13,
exposure.type = c("binary", "continuous")
)
file set header
num of individuals
num of genes/features
num of causal genes
num of anti-causal genes
num of confounding covariates
num of confounding batch variables
num of cells per individual
num of genes affected by covariates
num of cell mixtures
variance of treatment/disease effect
variance of confounding effect
variance of confounders to the assignment
variance of reverse causation
a scaling factor for a GLM model (default: 1)
rho ~ gamma(a, b)
rho ~ gamma(a, b)
random seed
"binary" or "continuous"
simulation results
The simulation result list will have two lists:
data
:
a matrix market data file data$mtx
a file with row names data$row
a file with column names data$col
an indexing file for the columns data$idx
a mapping file between column and individual names "indv"
indv
:
obs.mu
observed (noisy) gene x individual matrix
clean.mu
clean gene x individual matrix
X
confounder x individual matrix
W
individual-level treatment assignment
rho
sequencing depth
causal
causal genes