PMF statistics to estimate factor loading
asap_pmf_regression_cbind_mtx.Rd
PMF statistics to estimate factor loading
Usage
asap_pmf_regression_cbind_mtx(
mtx_files,
row_files,
col_files,
idx_files,
log_beta,
beta_row_names,
log_delta = NULL,
r_batch_names = NULL,
rename_columns = FALSE,
do_stdize_beta = TRUE,
do_stdize_r = TRUE,
do_log1p = FALSE,
verbose = FALSE,
a0 = 1,
b0 = 1,
max_iter = 10L,
NUM_THREADS = 0L,
BLOCK_SIZE = 1000L,
MAX_ROW_WORD = 2L,
ROW_WORD_SEP = "_",
MAX_COL_WORD = 100L,
COL_WORD_SEP = "@"
)
Arguments
- log_beta
D x K log dictionary/design matrix
- beta_row_names
row names log_beta (D vector)
- log_delta
D x B log batch effects
- r_batch_names
batch names (optional)
- rename_columns
append batch name at the end of each column name (default: FALSE)
- do_stdize_beta
use standardized log_beta (default: TRUE)
- do_stdize_r
standardize correlation matrix R (default: TRUE)
- do_log1p
do log(1+y) transformation (default: FALSE)
- verbose
verbosity
- a0
gamma(a0, b0) (default: 1)
- b0
gamma(a0, b0) (default: 1)
- max_iter
maximum iterations (default: 10)
- NUM_THREADS
number of threads in data reading
- BLOCK_SIZE
disk I/O block size (number of columns)
- MAX_ROW_WORD
maximum words per line in
row_files[i]
- ROW_WORD_SEP
word separation character to replace white space
- MAX_COL_WORD
maximum words per line in
col_files[i]
- COL_WORD_SEP
word separation character to replace white space
- mtx_file
matrix-market-formatted data file (D x N, bgzip)
- row_file
row names file (D x 1)
- col_file
column names file (N x 1)
- idx_file
matrix-market colum index file
Value
a list that contains:
beta the dictionary matrix (row x factor)
delta the dictionary matrix of batch effects (row x batch)
corr empirical correlation (column x factor)
theta factor loading (column x factor)
log.theta log-scaled factor loading (column x factor)
colsum column sum (column x 1)
rownames row names
batch.names batch names (based on
batch.index
colnames column names