Utah State University Bioinformatics Facility

How RNA-Seq Analysis Works

RNA-Seq (RNA sequencing) uses next-generation sequencing (NGS) to measure the quantity and determine the sequences of RNA in a sample. It profiles the transcriptome and reveals the gene expression patterns of the sample.

A typical RNA-Seq data analysis at our facility follows the steps below. Two protocols are described: a reference-based protocol and a de novo assembly-based protocol.

A. When a Reference Genome is Available

Step 1 - Quality Control/Trimming

Adapter removal, trimming, pooling of samples
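
As a rough illustration of what adapter removal and quality trimming do to a single read, here is a minimal sketch in Python. The adapter sequence, the Phred cutoff of 20, and the function name trim_read are illustrative assumptions, not the facility's actual settings; real pipelines rely on dedicated trimming tools that also handle mismatches and partial adapters.

```python
def trim_read(seq, quals, adapter, min_qual=20):
    """Remove a 3' adapter and low-quality 3' bases from one read.

    seq: read sequence; quals: list of Phred scores, one per base.
    adapter and min_qual are illustrative assumptions.
    """
    # Cut the read at the first exact occurrence of the adapter, if any.
    pos = seq.find(adapter)
    if pos != -1:
        seq, quals = seq[:pos], quals[:pos]
    # Drop bases from the 3' end while their quality is below the cutoff.
    end = len(seq)
    while end > 0 and quals[end - 1] < min_qual:
        end -= 1
    return seq[:end], quals[:end]


# Toy read: 8 bases of insert followed by a 13-base adapter.
read = "ACGTACGTAGATCGGAAGAGC"
phred = [35] * 6 + [12, 10] + [30] * 13
trimmed_seq, trimmed_quals = trim_read(read, phred, adapter="AGATCGGAAGAGC")
print(trimmed_seq)  # ACGTAC
```

The adapter is removed first so that high-quality adapter bases do not shield the low-quality insert bases in front of them from trimming.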

Step 2 - Read Alignment

Read mapping with TopHat2 (or another aligner on request)

Step 3 - Expression Quantification

Generation of a table of raw counts of mapped reads for the chosen annotation feature (genes, exons, intergenic regions, etc.)
Conversion of read counts into RPKM values (FPKM for paired-end data)
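
The conversion from raw counts to RPKM (reads per kilobase of feature per million mapped reads) can be sketched as follows. The gene names, lengths, and library size are made-up toy values. FPKM uses the same formula with fragments (read pairs) counted instead of individual reads.

```python
def rpkm(count, feature_length_bp, total_mapped_reads):
    """Reads per kilobase of feature per million mapped reads.

    RPKM = count * 1e9 / (feature length in bp * total mapped reads)
    """
    return count * 1e9 / (feature_length_bp * total_mapped_reads)


# Toy inputs (hypothetical values, for illustration only).
counts = {"geneA": 500, "geneB": 1500}
lengths_bp = {"geneA": 2000, "geneB": 1000}
library_size = 10_000_000  # total mapped reads in the sample

rpkm_values = {g: rpkm(counts[g], lengths_bp[g], library_size) for g in counts}
for gene, value in rpkm_values.items():
    print(gene, value)
# geneA 25.0
# geneB 150.0
```

geneB has three times the reads of geneA but half the length, so its RPKM is six times higher: the measure corrects for both feature length and sequencing depth.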

Step 4 - Sample Tree (Correlation Analysis)
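
One common way to build such a sample tree is to compute pairwise Pearson correlations between samples on log-transformed expression values and then cluster the samples hierarchically using 1 - r as the distance. The sketch below assumes average linkage and a log2(x + 1) transform; the sample names and values are toy data, and the facility's actual method may differ.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def sample_tree(expr):
    """Average-linkage clustering of samples; returns a nested tuple.

    expr: dict mapping sample name -> expression values (same gene order).
    """
    logged = {s: [math.log2(v + 1) for v in vals] for s, vals in expr.items()}
    # Distance between two samples is 1 - Pearson correlation.
    dist = {
        frozenset((a, b)): 1 - pearson(logged[a], logged[b])
        for a, b in combinations(logged, 2)
    }
    # Each tree node maps to the tuple of samples it contains.
    clusters = {s: (s,) for s in logged}

    def average_distance(node_a, node_b):
        pairs = [(i, j) for i in clusters[node_a] for j in clusters[node_b]]
        return sum(dist[frozenset(p)] for p in pairs) / len(pairs)

    # Repeatedly merge the two closest clusters until one tree remains.
    while len(clusters) > 1:
        node_a, node_b = min(
            combinations(clusters, 2), key=lambda pair: average_distance(*pair)
        )
        members = clusters.pop(node_a) + clusters.pop(node_b)
        clusters[(node_a, node_b)] = members
    return next(iter(clusters))


# Toy expression values for five genes in four samples (hypothetical).
expression = {
    "ctrl_1": [10, 200, 35, 0, 80],
    "ctrl_2": [12, 180, 40, 1, 75],
    "treat_1": [100, 20, 300, 50, 5],
    "treat_2": [90, 25, 280, 60, 8],
}
tree = sample_tree(expression)
print(tree)
```

In this toy data the replicates of each condition pair up before the two conditions are joined, which is the pattern expected when replicates are consistent.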

Step 5 - Differential Expression (DE) Testing

Statistical identification of differentially expressed genes (DEGs) with edgeR or DESeq2
PCA plots, Venn diagrams
Gene clustering and heatmaps
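
The numbers shown in a two-set Venn diagram are simple set operations on the DEG lists from two comparisons. The sketch below assumes the DEG lists have already been produced (for example by edgeR or DESeq2); the gene identifiers and the function name venn_regions are hypothetical.

```python
def venn_regions(degs_a, degs_b):
    """Split two DEG lists into the three regions of a two-set Venn diagram."""
    a, b = set(degs_a), set(degs_b)
    return {"only_a": a - b, "shared": a & b, "only_b": b - a}


# Hypothetical DEG lists from two comparisons.
degs_comparison_1 = ["gene1", "gene2", "gene3", "gene5"]
degs_comparison_2 = ["gene2", "gene3", "gene4"]

regions = venn_regions(degs_comparison_1, degs_comparison_2)
for name, genes in regions.items():
    print(name, len(genes), sorted(genes))
# only_a 2 ['gene1', 'gene5']
# shared 2 ['gene2', 'gene3']
# only_b 1 ['gene4']
```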

Step 6 - Functional Interpretation

Gene Ontology (GO) term enrichment analysis
KEGG pathway analysis
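
Term enrichment is often assessed with a one-sided hypergeometric test: given how many genes in the background carry a GO term or belong to a KEGG pathway, how surprising is the number of DEGs that carry it? The sketch below is one standard formulation, not necessarily the test used by any particular enrichment tool, and all counts are hypothetical. It also omits the multiple-testing correction that a real analysis across many terms requires.

```python
from math import comb


def enrichment_p_value(n_background, n_annotated, n_degs, n_overlap):
    """One-sided hypergeometric p-value, P(X >= n_overlap).

    n_background: genes in the background set
    n_annotated:  background genes annotated with the term
    n_degs:       differentially expressed genes (all within the background)
    n_overlap:    DEGs annotated with the term
    """
    upper = min(n_annotated, n_degs)
    total = comb(n_background, n_degs)
    tail = sum(
        comb(n_annotated, k) * comb(n_background - n_annotated, n_degs - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


# Hypothetical example: 12,000 background genes, 150 carry the term,
# 400 DEGs, of which 15 carry the term (about 5 expected by chance).
p = enrichment_p_value(12000, 150, 400, 15)
print(f"p = {p:.3g}")
```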

Note: alignment files (*.bam) can be provided on request for visualizing read mappings, analysis results, and annotation data, e.g. in the IGV genome browser.

B. When a Reference Genome is Not Available

Step 1 - Quality Control/Trimming

Adapter removal, trimming, pooling of samples

Step 2 - De Novo Assembly of Reads to Construct a Reference Transcriptome

De novo assembly with Trinity; read mapping back to the assembly with Bowtie (or another aligner on request)

Step 3 - Assembly Quality Assessment

Alignment summary metrics
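
A typical alignment summary reports what fraction of the input reads map back to the assembled transcripts, since a good assembly should represent most of the reads. The sketch below computes such a summary from per-category read counts. The category names and counts are hypothetical and do not correspond to the output format of any specific tool.

```python
def alignment_summary(category_counts):
    """Summarize how well reads map back to an assembly.

    category_counts: dict mapping an alignment category to a read count;
    reads in the "unaligned" category did not map to any transcript.
    """
    total = sum(category_counts.values())
    aligned = total - category_counts.get("unaligned", 0)
    return {
        "total_reads": total,
        "overall_alignment_rate": aligned / total,
        "fractions": {cat: n / total for cat, n in category_counts.items()},
    }


# Hypothetical counts of read pairs by alignment category.
counts_by_category = {
    "proper_pairs": 800,
    "improper_pairs": 100,
    "single_end_only": 50,
    "unaligned": 50,
}
summary = alignment_summary(counts_by_category)
print(f"Overall alignment rate: {summary['overall_alignment_rate']:.1%}")
# Overall alignment rate: 95.0%
```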

Step 4 - Expression Quantification

Generation of a table of raw counts of mapped reads for the chosen annotation feature (genes, exons, etc.)
Conversion of read counts into RPKM values (FPKM for paired-end data)

Step 5 - Sample Tree (Correlation Analysis)

Step 6 - Differential Expression (DE) Testing

Statistical identification of differentially expressed genes (DEGs) with edgeR or DESeq2
PCA plots, Venn diagrams
Gene clustering and heatmaps

Step 7 - Functional Interpretation

Assignment of unigenes to the assembled transcriptome
Annotation of unigenes by BLAST
Gene Ontology (GO) term enrichment analysis
KEGG pathway analysis
