#=============================================== # RABC* Level1 raw : RABC*_Level1.txt #=============================================== This data represents the dataset to be processed obtained from other data sources, including raw data and unnormalized datasets. For example:*.cel, *.idat, *_count.txt, etc. #=============================================== # RABC* Level2 probe : RABC*_Level2.txt #=============================================== This data represents the output file after the RABC data preprocessing process. The preprocessing process includes background correction, quality control, normalization, etc. The specific preprocessing process for different data is shown in the Methods section of the article. #=============================================== # RABC3 Level3 file : RABC*_Level3.txt #=============================================== This data represents gene (genesymbol) expression/methylation matrix data after probe annotation processing on Level2 probe data. #=============================================== # RABC* Difference result : RABC* Difference result.txt #=============================================== Output result file after differential expression/methylation using the R language package limma. Column headers: symbol: Gene List logFC: estimate of the log2-fold-change corresponding to the effect or contrast AveExpr: average log2-expression for the gene over all arrays and channels t: moderated t-statistic P.Value: Statistical test p-value adj.P.Val: adjusted p-value B: log-odds that the gene is differentially expressed #=============================================== # RABC* GO result : RABC* GO result.txt #=============================================== Output result files after GO enrichment analysis using the R language package clusterProfiler for genes with p-values < 0.05. Column headers: ONTOLOGY: Three types of Gene ontology-BP (Biological Process), CC (Cellular Component), MF (Molecular Function) ID: GO term ID in Gene Ontology Database (http://geneontology.org/) Description: Functional description of GO term GeneRatio: k/n. k = size of the overlap of 'a vector of gene id' you input with the specific geneset (eg E2F_targets), only unique genes; (the number of genes within that list n, which are annotated to the node/n = size of the overlap of 'a vector of gene id' you input with all the members of the collection of genesets (eg the HALLMARK collection),only unique genes; is the size of the list of genes of interest BgRatio: M/N. M = size of the geneset (eg size of the E2F_targets); (is the number of genes within that distribution that are annotated (either directly or indirectly) to the node of interest)/N = size of all of the unique genes in the collection of genesets (example the HALLMARK collection); (is the total number of genes in the background distribution (universe) pvalue: Statistical test p-value p.adjust: adjusted p-value qvalue: q-value geneID: List of genes enriched to this term Count: Total number of genes enriched to this term #=============================================== # RABC* KEGG result : RABC* KEGG result.txt #=============================================== Output result files after KEGG enrichment analysis using the R language package clusterProfiler for genes with p-values < 0.05. Column headers: ONTOLOGY: Three types of Gene ontology-BP (Biological Process), CC (Cellular Component), MF (Molecular Function) ID: pathway ID in KEGG Database (https://www.genome.jp/kegg/) Description: Functional description of GO term GeneRatio: k/n. k = size of the overlap of 'a vector of gene id' you input with the specific geneset (eg E2F_targets), only unique genes; (the number of genes within that list n, which are annotated to the node/n = size of the overlap of 'a vector of gene id' you input with all the members of the collection of genesets (eg the HALLMARK collection),only unique genes; is the size of the list of genes of interest BgRatio: M/N. M = size of the geneset (eg size of the E2F_targets); (is the number of genes within that distribution that are annotated (either directly or indirectly) to the node of interest)/N = size of all of the unique genes in the collection of genesets (example the HALLMARK collection); (is the total number of genes in the background distribution (universe) pvalue: Statistical test p-value p.adjust: adjusted p-value qvalue: q-value geneID: List of genes enriched to this term Count: Total number of genes enriched to this term #=============================================== # RABC*_phenotype : RABC*_PhenotypeFile.csv #=============================================== Sample groups of the dataset RABC* Column headers: sample: Sample ID status: Grouping status/type: 1=group1(RA) 0=group2 (other status)