1. Introduction
OmicsAnalyst 2.0 is a comprehensive web-based platform designed to support end-to-end analysis of multi-omics data.
The platform bridges the gap between exploratory statistics and mechanistic interpretation, enabling researchers to move
seamlessly from pattern discovery to functional insight and causal hypothesis testing.
What's New in Version 2.0: Enhanced statistical methods, knowledge-based network integration capabilities and function exploration,
and new causal analysis tools including mediation analysis and interaction modeling.
OmicsAnalyst 2.0 operates through three complementary analytical components that are integrated within a unified workflow:
- Comprehensive Statistical Integration - Explore patterns and covariance through individual, pairwise, and multi-omics joint analysis
- Interpretable Network Integration - Visualize omics signatures in biological context across omics layers
- Causal Mechanistic Analysis - Elucidate regulatory mechanisms and driver features
2. System Requirements
OmicsAnalyst 2.0 is a web-based application that runs in your browser. For optimal performance, we recommend:
| Component |
Requirement |
| Browser |
Chrome, Firefox, Safari, or Edge (latest versions) |
| Internet Connection |
Stable broadband connection recommended |
| Screen Resolution |
1280x800 or higher |
| JavaScript |
Must be enabled |
3. Supported Omics Types
OmicsAnalyst 2.0 supports the following five omics data types for multi-omics integration analysis. A maximum of 5 omics data types can be uploaded per analysis:
Genes/mRNAs
Gene expression data from RNA-seq or microarray experiments. Supported ID types: Entrez ID, Ensembl Gene ID, Ensembl Transcript ID, Official Gene Symbol.
Proteins
Protein abundance data from mass spectrometry-based proteomics. Supported ID types: UniProt Protein ID, Entrez ID, Ensembl Gene ID, Official Gene Symbol.
miRNAs
MicroRNA expression data. Supported ID types: miRBase mature ID, miRBase accession, miRBase ID (e.g., hsa-miR-21).
Metabolites
Metabolite concentration data from metabolomics studies. Supported ID types: KEGG ID, PubChem ID, HMDB ID, Common Name.
Microbiome
Taxonomic abundance data from 16S rRNA or metagenomics. Supported ID types: Taxonomy label, OTU ID. Taxon levels: Phylum, Class, Order, Family, Genus, Species, Strain.
Note: At least two omics data types are required for multi-omics integration analysis.
4. Input Data Formats
OmicsAnalyst 2.0 provides two upload modes for different analysis workflows:
Upload Mode 1: Data Tables (Statistical Integration)
Upload expression/abundance data tables with a metadata file for comprehensive statistical analysis.
- Upload a single metadata file and at least two omics data tables
- The metadata table should describe the same sample IDs shared across all omics data
- A small percentage of missing values are acceptable
Upload Mode 2: Feature Lists (Network Integration)
Upload pre-identified feature lists from external analysis tools ( limma, DESeq2, edgeR, etc.) for network-based analysis.
- Upload one or more feature lists (genes, proteins, metabolites, etc.)
- Specify the model organism (Human, Mouse, or Other)
- For microbiome data, specify its host (currently Human or Mouse only)
Supported File Formats
| Format |
Extension |
Max Size |
| CSV |
.csv |
50 MB (Genes/mRNAs), 25 MB (others) |
| TXT |
.txt |
50 MB (Genes/mRNAs), 25 MB (others) |
| TSV |
.tsv |
50 MB (Genes/mRNAs), 25 MB (others) |
Data Table Structure
Each omics data table should follow this structure:
- First column: Feature identifiers (gene symbols, protein IDs, metabolite names, etc.)
- Subsequent columns: Sample measurements (one column per sample)
- First row: Header with sample names matching the metadata file
Metadata File Structure
A metadata file describing sample information is required for Statistical Integration:
- First column: Sample names (must match column headers in data files)
- Second column: Primary study factor (group/condition) - no missing values allowed
- Additional columns: Other sample attributes (batch, clinical variables, etc.)
Tips:
- Ensure sample names are consistent across all data files and the metadata file
- Avoid special characters in sample names and feature IDs
- No missing values allowed for the primary study factor (first column after sample names)
5. Key Features
OmicsAnalyst 2.0 bridges the gap between exploratory statistics and mechanistic interpretation through three key analytical components:
Statistical Integration
Comprehensive multi-omics statistical analysis including limma-based differential analysis,
correlation analysis, and advanced integration methods (MCIA, MOFA, DIABLO) to identify
significant features and patterns across data types.
Network Integration
Project significant features onto comprehensive molecular networks, including protein-protein interaction,
metabolic pathways, and regulatory networks for biological context and functional interpretation.
Causal Analysis
Test mechanistic hypotheses through mediation analysis and interaction modeling (IntLIM) to identify
regulatory relationships and driver features across omics layers.
Statistical Integration Methods
The Statistical Integration component provides a progressive analytical framework from individual omics characterization to full multi-omics integration:
Single-Omics Characterization - Analyze each omics layer individually:
- Significant Features: Identify features significantly associated with experimental factors using linear models (limma)
- Overall Patterns: Explore sample clustering, grouping patterns, and major sources of variation
- Variance Partitioning: Decompose global and feature-level variance to identify features driven by specific factors
Pairwise Omics Analysis - Discover relationships between two omics layers:
- Clustering Heatmap: Hierarchical clustering to reveal global correlation patterns between two omics layers
- Correlation Network: Network-based visualization of significant feature-to-feature correlations
- Differential Chord Diagram: Compare correlation structures between conditions to identify network rewiring
- Sparse CCA: Maximize correlation between two blocks and select key features driving the link
Multi-Omics Integration - Integrate all omics layers simultaneously:
- Global Exploration: Consensus PCA and Multiple Co-inertia Analysis (MCIA) for visualizing shared trends
- Latent Factor Discovery: Fast NMF, Semi-NMF, and MOFA for discovering biologically interpretable patterns
- Feature Selection: DIABLO for finding discriminative and correlated features across omics
Network Integration Methods
For feature list inputs, build and explore comprehensive molecular networks:
- Database Selection: Choose from protein-protein interaction, metabolic, regulatory, and other network databases
- Network Builder: Construct multi-omics networks connecting genes, proteins, metabolites, and microbiome
- Network Viewer: Interactive visualization with functional enrichment analysis
Causal Analysis Methods
Test mechanistic hypotheses and identify regulatory relationships:
- Pairwise Linear Model (IntLIM): Test for significant linear relationships between paired features across conditions
- Mediation Analysis: Identify mediating relationships where one omics layer influences another through an intermediate
6. Analytical Workflows
OmicsAnalyst 2.0 provides two entry points depending on your input data type:
Data Tables → Statistical Integration
Upload Data Tables
Metadata + expression/abundance tables for 2-5 omics types
→
Quality Control
Data filtering, normalization, and harmonization
→
Analysis Hub
Single-omics, pairwise, and multi-omics methods
→
Visualization
Interactive plots, heatmaps, and networks
Feature Lists → Network Integration
Upload Feature Lists
Pre-identified features from external tools (DESeq2, limma, etc.)
→
Database Selection
Choose network databases (PPI, metabolic, regulatory)
→
Network Building
Construct integrated molecular networks
→
Functional Analysis
Pathway enrichment and module identification
Example Analytical Pipelines
Based on these workflows, users can design their own analytical strategies to address specific biological questions.
Below are two example pipelines demonstrating how the three key components work together:
Discovery Workflow: Statistics → Networks → Function
Statistical Analysis
Upload multi-omics data and perform differential analysis
→
Feature Selection
Identify significant features and cross-omics correlations
→
Network Projection
Map features onto biological interaction networks
→
Functional Analysis
Perform pathway enrichment and functional annotation
Validation Workflow: Networks → Statistics → Causality
Network Analysis
Build networks from feature lists and identify hub genes
→
Hypothesis Generation
Form mechanistic questions based on network topology
→
Statistical Testing
Test candidate features with interaction modeling
→
Causal Validation
Quantify direct and mediated effects via mediation analysis
Note: These workflows are flexible. You can choose any combination of methods
based on your data characteristics to address your specific research questions.
7. Getting Started
To start using OmicsAnalyst 2.0, visit the Upload page
and choose your data input type. Before uploading, ensure your data files are properly formatted as described in
Section 4: Input Data Formats.
Which pathway should I choose?
- Data Tables: Choose this if you have raw expression/abundance data and want to perform comprehensive statistical analysis from scratch
- Feature Lists: Choose this if you have already identified significant features using external tools and want to focus on network-based interpretation
Next Steps: Continue with the detailed tutorials for each analytical component: