preprocess.yaml Reference
This configuration file corresponds to --option=preprocess and controls quality control, filtering, normalization, and dimensionality-reduction preparation.
Parameter |
Default |
Description |
|---|---|---|
|
|
Fixed identifier for the analysis stage |
|
|
Root directory for analysis outputs |
|
|
Root directory for raw input data |
|
|
Path to the sample list file |
|
|
Spatial transcriptomics platform type |
|
|
Single-sample or integrated analysis mode |
|
|
Whether to enable sample-specific filtering thresholds |
|
|
Path to a custom threshold file |
|
|
Minimum number of genes required per spot or cell |
|
|
Minimum number of cells in which a gene must appear |
|
|
Upper limit for mitochondrial proportion filtering |
|
|
Whether to perform highly variable gene selection |
|
|
Number of neighbors used for the neighbor graph |
|
|
Batch correction method for multi-sample analysis |
|
|
Number of highly variable genes |
|
|
Number of PCA components |
|
|
Whether to enable sketch-based analysis for large datasets |
|
|
Sampling fraction |
Tuning suggestions
When sample quality differs substantially across samples, enable
seg_filterto apply differential filtering.For multi-sample analyses, using
batch_methodis usually recommended to reduce batch effects.For datasets containing millions of cells, enable
sketchto improve iteration speed.