Executes machine learning pipeline for MDR analysis using logistic regression with parallel processing via the future backend. Trains models on all MDR parquet files and saves results to designated output directories.
Character scalar. Base directory containing MDR matrix files.
Integer. Number of parallel workers for model training. Default is 16.
Numeric vector of length 2. Train/validation split proportions.
Integer. Number of cross-validation folds. Default 5.
Numeric vector of length 2. Proportion range for variable-importance selection.
Numeric. PCA variance threshold. Default 0.99.
Logical. Print progress messages during model training. Default TRUE.
Logical. Return tuning results from cross-validation. Default TRUE.
Logical. Return fitted model objects. Default TRUE.
Logical. Return prediction results. Default TRUE.
Logical. Whether to inherit split/seed/n_fold from ml_parameters.json. Default TRUE.
Logical. Randomly shuffle labels for baseline runs. Default FALSE.
Logical. Use PCA on predictors. Default FALSE.
NULL (invisible). Called for side effects (model training and result saving).
createMLinputList for generating input file lists,
runMLmodels for non-MDR model execution
if (FALSE) { # \dontrun{
# Run MDR models with default settings
runMDRmodels("/path/to/results")
# Run with more threads and minimal output
runMDRmodels("/path/to/results",
threads = 32,
verbose = FALSE)
# Run without saving model fits (save disk space)
runMDRmodels("/path/to/results",
threads = 16,
return_fit = FALSE)
} # }