Command Description
Purpose and Usage
mergeAdata: Merge Multiple h5ad Files
Merge multiple h5ad files into a single h5ad file
SDAS dataProcess mergeAdata -i mult.csv -o outdirh5ad2rds: Convert h5ad to rds
Convert h5ad format data to rds format
SDAS dataProcess h5ad2rds -i st.h5ad --run_mode stRNA -o outdirh5mu2h5ad: Convert h5mu to h5ad
Convert h5mu format data to h5ad format
SDAS dataProcess h5mu2h5ad -i st.h5mu -o outdirprintAdataInfo: Print adata Information
Output detailed information of the h5ad file to the shell or a specified directory
SDAS dataProcess printAdataInfo -i st.h5ad -o outdir
SDAS dataProcess printAdataInfo -i st.h5adsubsetAdata: Extract h5ad Subset
Extract a subset of h5ad based on specified conditions, supporting numeric range or list filtering
Numeric filtering:
List filtering:
Input Parameter Description
-i / --input
Yes
Input file, supports h5ad, h5mu, csv (for mergeAdata, input is csv, header in first row)
--label_key
Yes
Used in subsetAdata, column name in obs or var to extract adata subset
-o / --output
No
Output folder, if -o is not set for printAdataInfo, output adata information to shell
--run_mode
No
Used in h5ad2rds, input data type, stRNA or scRNA, default is stRNA
--gene_symbol_key
No
Used in mergeAdata, column name of gene in h5ad.var (_index means h5ad.var.index)
--layer
No
Used in h5ad2rds and subsetAdata, specifies the layer storing raw counts in h5ad
--list_include
No
Used in subsetAdata, elements to extract when label_key is a list, e.g., Fibroblast,B,NK
--list_exclude
No
Used in subsetAdata, elements not to extract when label_key is a list, e.g., Fibroblast,B,NK
--min
No
Used in subsetAdata, minimum value for filtering when label_key is numeric
--max
No
Used in subsetAdata, maximum value for filtering when label_key is numeric
Output Results Display
<input_name>.h5ad
h5ad converted from h5mu
<input_name>_subset.h5ad
Subset h5ad obtained by subsetAdata
combine.h5ad
h5ad after merging multiple files
<input_name>.rds
rds file converted from h5ad
<input_name>_adata_info.txt
Detailed information of adata
Detailed adata information
<input_name>_adata_info.txt: This file is used to quickly understand the structure of the AnnData object, the main information and the distribution of labels. The file mainly outputs the following types of information:Basic dimensions of the AnnData object (number of observations n_obs × number of features n_vars).
Field names contained in obs (observations/samples) and var (features/genes).
Types of analysis results or metadata stored in uns, obsm, layers, obsp, etc.
Column count statistics for obs and var, and the first five values of obs_names and var_names.
The number of unique values and specific values for each categorical field in obs (such as leiden cluster labels, sample information, etc.).
Last updated