Select your data platform
This chapter is divided into six sub-tutorials according to the run_type platform. Each tutorial covers the corresponding starting steps for analysis.
Before running the workflow, you need to organize the raw data according to the platform requirements and prepare the sample list file sample.txt so that the SpatialData zarr object can be created correctly.
Complete input file checklist, including required and optional files, file formats, and filename patterns
File sources and how to obtain them, such as official platform downloads, experimental outputs, or placeholder paths
Reproducible directory structure examples
Example
integratecommands and matchingsample.txtformats
Every spatial transcriptomics platform provides its own post-sequencing analysis software, such as Space Ranger for 10x Genomics data or SAW for BGI (formerly MGI) platforms. These tools perform basic data processing tasks, including alignment of raw FASTQ reads and transcript counting. Spatialsnake takes the standardized outputs from each platform and integrates them into a unified SpatialData object, streamlining all downstream analyses and visualizations. For every supported platform, Spatialsnake offers a dedicated tutorial to help you get started quickly. Please first follow the instructions below to set up the basic directory structure:
mkdir -p project_root/data project_root/results
touch project_root/sample.txt
project_root/ (current working directory)
├── data/ (stores your raw data)
├── sample.txt (key sample description file)
└── results/ (stores analysis outputs; generated automatically)
After identifying your data platform, download and store the platform-specific output files under the data/ directory, using the sample name as the subdirectory name. Ensure that the folder hierarchy follows the official output structure of that platform. At the same time, add the corresponding sample name to sample.txt so that Spatialsnake can correctly read the input files.
Whether your goal is single-sample analysis or you have multiple samples from different experimental conditions and intend to perform multi-sample integration, we recommend first selecting the tutorial for your specific platform to learn the basic workflow. The multi-sample analysis pipeline is broadly similar once you understand the fundamentals.
Quick reference for run_type
run_type |
Output type |
Tutorial page |
|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
For each supported platform, we provide a public demonstration dataset.
run_type |
Demo dataset |
Source |
Download link |
|---|---|---|---|
|
Visium_BreastCancer_Section1 |
10x Genomics |
|
|
VisiumHD_MouseBrain_Demo |
10x Genomics |
|
|
Visium_Segmentation_Demo |
10x Genomics / Lab output |
|
|
Xenium_Human_Breast_Demo |
10x Genomics |
|
|
MERFISH_Vizgen_Demo |
Vizgen |
|
|
Stereo-seq Mouse_Brain demo |
Public repository |
Note
If you want to gain a basic understanding of SpatialSnake’s functionality using our sample data, please jump directly to Core Analysis Workflow and follow the instructions to proceed.
Detailed tutorials by data type
Note
If you want to run multi-sample integration analysis, we recommend moving to Spatialsnake for multi-sample integration after first reading the single-sample tutorials for the basic workflow.