Pathway Analysis
Introduction
The pathway analysis feature in OmicsBox identifies pathways from multiple pathway databases for any set of sequences. Combined with differential expression data, the tool can also calculate pathway enrichment.
Generating the pathway example dataset required a full RNA-seq analysis and the annotated Mus musculus genome. The RNA-seq analysis was run in OmicsBox with the Transcriptomics Module and comprised FastQC, Preprocessing, RNA-Seq Alignment, Gene Level Quantification and Pairwise Differential Expression Analysis. The annotated Mus musculus genome was retrieved with the BioMart feature available in OmicsBox.
Dataset description
Transcriptome analysis of Mus musculus luminal and basal cell subpopulations in the lactating versus pregnant mammary gland.
Organism: Mus musculus
Type of experiment: RNA-Seq
Experimental design: 12 samples
2 replicates for each stage (Virgin, Pregnant, Lactating) within each cell type
6 samples for each cell type (Luminal and Basal)
Instrument: Illumina HiSeq 2000
Layout: Single-end
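The sample counts in the experimental design can be checked with a short sketch. It assumes that the 2 replicates apply to every combination of stage and cell type, which is what the totals of 12 samples and 6 per cell type imply. The sample labels are hypothetical and are not the names used in the original data.

```python
from itertools import product

# Factors as listed in the dataset description.
stages = ["Virgin", "Pregnant", "Lactating"]
cell_types = ["Luminal", "Basal"]
replicates = [1, 2]  # assumption: 2 replicates per stage and cell type

# Hypothetical sample labels, one per combination of the three factors.
samples = [
    f"{cell}_{stage}_rep{rep}"
    for cell, stage, rep in product(cell_types, stages, replicates)
]

per_cell_type = {
    cell: sum(name.startswith(cell) for name in samples) for cell in cell_types
}

print(len(samples))    # total number of samples
print(per_cell_type)   # samples per cell type
```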
Publication
Original Data
Links to the public repositories that host the original data:
BioProject: PRJNA258286
SRA: SRP045534
Reference: NCBI
Bioinformatic Analysis
Combined Pathway Analysis
Application
Input
Mus musculus Genome Annotation project
Pairwise differential expression project
Parameters
Input
Sequences (.box, .fasta, .annot): biomart_data_mmusculus_gene_ensembl.box
Add Pairwise Expression Data: true
Differential Expression Data (.box): lactate_vs_pregnant.box
Gene Set Enrichment Analysis: true
Fisher's Exact Test: true
Configuration Reactome
Run Reactome Pathway Analysis: true
Run Blast to link via Protein IDs: true
Link with GeneOntology Terms: true
Keep Most Specific Pathways: true
Give Priority to Taxon: true
Top Priority Taxon: Mus musculus
Blast Expectation Value: 1.0E-3
Include Categories: All
Configuration KEGG
Run KEGG Pathway Analysis: true
Link KEGG Orthologs via EggNog: true
Link via Enzyme Codes: true
Include Categories: All
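To illustrate what the Fisher's Exact Test option enabled above measures, the following is a minimal sketch of a one-sided over-representation test for a single pathway. It is not the OmicsBox implementation: the function name, its arguments and the gene counts are hypothetical, and the p-value is computed directly as the upper tail of the hypergeometric distribution using only the Python standard library.

```python
from math import comb


def fisher_enrichment_pvalue(de_in_pathway, de_total, pathway_size, background_size):
    """One-sided Fisher's exact test for over-representation.

    Returns the probability of seeing at least `de_in_pathway` pathway genes
    among `de_total` differentially expressed genes drawn from a background
    of `background_size` genes, `pathway_size` of which belong to the pathway.
    """
    max_overlap = min(de_total, pathway_size)
    all_draws = comb(background_size, de_total)
    tail = sum(
        comb(pathway_size, k) * comb(background_size - pathway_size, de_total - k)
        for k in range(de_in_pathway, max_overlap + 1)
    )
    return tail / all_draws


# Hypothetical counts: 20 background genes, 5 in the pathway,
# 4 differentially expressed genes, 3 of which fall in the pathway.
p_value = fisher_enrichment_pvalue(
    de_in_pathway=3, de_total=4, pathway_size=5, background_size=20
)
print(f"{p_value:.4f}")  # 155 / 4845, roughly 0.032
```

In a real analysis one such test is run per pathway, so the resulting p-values are normally corrected for multiple testing before pathways are reported as enriched.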
Execution Time
55 mins
Output
A table listing all pathways linked to the sequences, together with the pathways enriched in differentially expressed genes.
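Enrichment from the Gene Set Enrichment Analysis option is based on a ranked gene list rather than a fixed cutoff. As a rough illustration of the idea, the sketch below computes the unweighted running-sum enrichment score for one pathway. It is a simplified textbook form, not the procedure OmicsBox runs: the function name and gene identifiers are hypothetical, and the weighting by expression statistic and the permutation-based significance step are left out.

```python
def enrichment_score(ranked_genes, gene_set):
    """Unweighted running-sum enrichment score (Kolmogorov-Smirnov style).

    Walks down the ranked list, stepping up at pathway genes and down at all
    other genes, and returns the largest deviation from zero (sign kept).
    """
    hits = [gene in gene_set for gene in ranked_genes]
    n_hits = sum(hits)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("gene set must cover some, but not all, ranked genes")

    running = 0.0
    best = 0.0
    for is_hit in hits:
        running += 1.0 / n_hits if is_hit else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best


# Hypothetical genes ranked from most up-regulated to most down-regulated.
ranked = ["geneA", "geneB", "geneC", "geneD", "geneE", "geneF"]

print(enrichment_score(ranked, {"geneA", "geneB"}))  # pathway genes at the top
print(enrichment_score(ranked, {"geneE", "geneF"}))  # pathway genes at the bottom
```

A positive score indicates that the pathway genes cluster near the top of the ranking, a negative score that they cluster near the bottom.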