R/data_curation.R
retrieveGenomes.Rd
Default and fast method: method = "ftp".
retrieveGenomes(
  base_dir = ".",
  user_bacs,
  method = c("ftp", "cli"),
  image = "danylmb/bvbrc:5.3",
  skip_existing = TRUE,
  ftp_workers = 8L,
  cli_fasta_workers = 4L,
  cli_gff_workers = 4L,
  chunk_size = 50L,
  verbose = TRUE
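)
base_dir: Project root directory (results layout preserved).
user_bacs: Input label(s) used to locate the per-selection DB path.
method: Retrieval method, either "ftp" (default) or "cli".
image: Docker image for the CLI path (default "danylmb/bvbrc:5.3").
skip_existing: Logical; if TRUE, do not re-download genomes whose file sets are already complete. Default TRUE.
ftp_workers: Number of parallel workers for the FTP path (default 8).
cli_fasta_workers: Number of parallel chunk containers for FASTA+GTO (default 4).
cli_gff_workers: Number of parallel chunk containers for GFF export (default 4).
chunk_size: Number of genomes per chunk container (default 50).
verbose: Logical; if TRUE, print progress messages. Default TRUE.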
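The chunk_size argument suggests that the CLI path splits the genome IDs into batches, each handled by its own container. How the package does this internally is not shown on this page. As a rough sketch only, assuming plain sequential batching and made-up genome IDs, the split could look like this:

```r
# Hypothetical genome IDs for illustration; real IDs come from the
# per-selection DB, which is not shown here.
genome_ids <- sprintf("genome_%03d", 1:120)
chunk_size <- 50L

# Give each ID a chunk index (1, 1, ..., 2, 2, ...) and split on it.
chunk_index <- ceiling(seq_along(genome_ids) / chunk_size)
chunks <- split(genome_ids, chunk_index)

# Number of IDs in each chunk.
lengths(chunks)
```

With 120 IDs and chunk_size = 50 this gives three chunks of 50, 50, and 20 IDs, so a larger chunk_size means fewer containers, each with more work.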
Returns a character vector of genome IDs with complete file sets on disk.
Alternative path that bypasses FTP: method = "cli", which uses the Docker image given in image.
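A usage sketch based only on the signature documented here. The label "my_selection" is a placeholder, not a value taken from the package, and the calls are guarded so the snippet does nothing when retrieveGenomes() is not loaded. Running it for real is expected to need network access and, for the CLI path, Docker.

```r
# "my_selection" is a hypothetical label; replace it with your own input.
if (exists("retrieveGenomes")) {
  # Default FTP path with 8 parallel workers.
  ftp_ids <- retrieveGenomes(
    base_dir = ".",
    user_bacs = "my_selection",
    method = "ftp",
    skip_existing = TRUE,
    ftp_workers = 8L
  )

  # CLI path, bypassing FTP; chunk and worker values are the documented defaults.
  cli_ids <- retrieveGenomes(
    base_dir = ".",
    user_bacs = "my_selection",
    method = "cli",
    image = "danylmb/bvbrc:5.3",
    cli_fasta_workers = 4L,
    cli_gff_workers = 4L,
    chunk_size = 50L
  )

  # Both calls return genome IDs whose file sets are complete on disk.
  length(ftp_ids)
  length(cli_ids)
}
```

Because skip_existing defaults to TRUE, calling the function again after an interrupted run should only fetch genomes that are not yet complete.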