API
XChrom: a cross-cell chromatin accessibility prediction model integrating genomic sequences and cellular context.
Import xchrom package
import xchrom
Reading
|
Read 10x Genomics h5 file and convert to h5ad format |
Preprocessing
|
Generate XChrom training and test inputs from a single dataset. |
|
Generate XChrom train set and test set from 2 datasets, to generate XChrom test inputs. |
|
Filter multiome RNA and ATAC data based on expression/accessibility thresholds and chromosomes |
Model training
|
Train XChrom model |
|
Generate input data for XChrom model training and create TensorFlow datasets. |
|
XChrom model construction |
Toolkit
|
Calculate the AUROC and AUPRC metrics for given true and predicted matrix. |
|
Calculate the cluster metrics of scATAC data, including the number of shared neighbors and labels. |
|
Calculate the PCA of the given data, and save the PCA components in .obsm['X_pca']. |
|
Extract sequences from BED file and write to FASTA file. |
|
Evaluate the performance in cross-cell prediction with within-sample data, calculate auROC & auPRC for overall, per-cell and per-peak. |
|
Evaluate the performance in cross-cell prediction with within-sample data, calculate neighbor score and label score for test cells. |
|
Evaluate the performance in cross-peak prediction with within-sample data, calculate auROC & auPRC for overall, per-cell and per-peak. |
|
Evaluate the performance in cross-both prediction with within-sample data, calculate auROC & auPRC for overall, per-cell and per-peak. |
|
Evaluate the performance of denoise in within-sample data, calculate neighbor score(k=10,50,100) and label score(k=10,50,100) for all cells. |
|
Evaluate the performance in cross-samples scenario, calculate auROC & auPRC for overall, per-cell and per-peak. |
|
Evaluate the performance in cross-samples scenario, calculate neighbor score(k=10,50,100) and label score(k=10,50,100) for all cells. |
|
Calculate ISM (In Silico Mutagenesis) for a single sequence |
|
Normalized the ISM scores for the four nucleotides at each position such that they summed to zero |
|
Calculate the ISM from BED file. |
|
Prepare motif data and background sequences for TF activity calculation |
|
Calculate motif insertion scores. |
Plotting
|
|
|
Plot the relative importance of the given matrix per position. |
|
Plot the per-cell auROC and auPRC scatter plot. |
|
Plot the per-peak auROC and auPRC scatter plot. |
|
Plot the activity of a single motif and celltype on UMAP. |