Welcome to PharmGEO

PharmGEO is a comprehensive, manually curated, and quality-controlled platform that systematically integrates pharmaco-transcriptomic data from the GEO database.

7,931

Pharmaco-transcriptomics Datasets

1,334

Unique Drugs with Metadata

17,776

Average Genes per Dataset

115,264

Drug-Drug Interactions

RNA-seq Datasets

Access 7,931 curated datasets
Standardized metadata annotations
Differential expression analysis
Pathway enrichment results

Drug Information

1,334 drugs with detailed profiles
High-consistency gene signatures
GMCS quality metrics
PubChem integration
Literature evidence support

Gene Information

Gene-centric drug associations
Cross-dataset validation
GeneCards integration

Drug-Drug Interactions

119,298 interaction records
Transcriptomic-level evidence
Multi-source integration
Network visualization

Introduction

This section presents a curated collection of pharmacotranscriptomic datasets systematically retrieved from the GEO database. You can efficiently browse and select datasets for in-depth analysis, including metadata inspection, differential gene expression profiling, and advanced functional enrichment analyses such as GO, KEGG, and GSEA.

Dataset Selection

Please select a dataset from the table below, then click Analyze to proceed.

Introduction

The Drug Information section provides details on specific drugs, including molecular properties, descriptions, and related genes. It also lists genes consistently found across datasets to identify key gene-drug interactions.

Select a Drug:

Drug Information

Highly Consistent Genes

Need help understanding the columns?

Introduction

The Drug-Drug Interaction section analyzes interactions between drugs. Selecting two drugs displays their details, intersecting genes, and interaction data from multiple databases(DDInter, MecDDI, RxNav). A network visualization further illustrates the drug-gene interaction relationships.

Select a Gene:

Hot Genes:

All Genes:

Gene Information

Associated Drugs

Need help understanding the columns?

Introduction

The Drug-Drug Interaction section enables users to explore and analyze interactions between different pharmaceutical compounds. Users can select two drugs to view their detailed information, intersecting genes, and interaction details from multiple databases. Additionally, an interaction network visualization provides insights into the relationships between the selected drugs and associated genes.

Select Drugs

Select Drug A:

Drug Information

Top N Genes per Category:

Available Downloads

Annotated Drug Metadata

Download curated drug annotation metadata (Excel format).

Download Metadata

Differential Gene Lists

Download experiment-specific lists of significantly differentially expressed genes (Rdata format).

Download Gene Lists

PharmaGEO Help Guide

PharmaGEO Help Guide

Module 1: RNA-seq Datasets

Purpose

The RNA-seq Datasets module provides access to drug-related bulk RNA-seq and microarray transcriptomic datasets systematically curated from the GEO database. This section serves as the primary entry point for dataset-specific analyses of pharmaco-transcriptomic studies.

$RNA-seq Datasets Interface$

Key Functionalities

1. Dataset Selection and Exploration

Interactive Table: Browse through 7,780 curated pharmaco-transcriptomic datasets
Metadata Filtering: Filter datasets by drug name, organism, cell type, dose, duration, and experimental conditions
GEO Integration: Direct links to original GEO accession numbers (GSE) for data provenance
Comprehensive Metadata: View dataset-specific information including drug name, dose, duration, cell type, organism, and platform details

Dataset Selection

2. Metadata Analysis

Standardized Annotations: Each dataset includes the following metadata fields:
- standard_name: Standardized drug name using PubChem database nomenclature
- drug_name: Original drug name as recorded in the GEO database
- organism: Species of the sequenced samples
- cell_type: Cell line or tissue type used in the sequencing experiment
- dose: Drug dosage concentration administered
- duration: Duration of drug treatment
- gse_id: Unique identifier for each dataset in the GEO database (clickable link to GEO page)
- GPL: Sequencing platform identifier providing technical parameter information
- ctrl_ids: Sample identifiers used as control group in differential expression analysis
- pert_ids: Sample identifiers used as treatment group in differential expression analysis
- type: Perturbation type - PharmGEO database exclusively contains drug-type perturbations
- Exp_type: Experimental platform type: includes 4,438 microarray datasets (Expression profiling by array) and 3,342 high-throughput sequencing datasets (Expression profiling by high throughput sequencing)

Dataset Metadata

3. Differential Expression Analysis

Volcano Plot Visualization: Interactive plots showing log fold change vs. statistical significance
Gene Expression Tables: Comprehensive lists of differentially expressed genes with:
- genes: Gene symbols with direct links to GeneCards
- logFC: Log fold change values
- p.value: Filtered p.value < 0.05

Differential Expression Analysis

4. Enrichment Analysis

Three complementary enrichment analysis approaches with interactive dot plot visualizations:

Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway Analysis

Interactive dot plots showing enriched pathways
X-axis: Gene count, Y-axis: Pathway descriptions
Color coding by statistical significance (p-value)
Point size representing gene count

Gene Ontology (GO) Analysis

Interactive dot plots for biological processes, molecular functions, and cellular components
X-axis: Gene count, Y-axis: GO term descriptions
Color coding by statistical significance (p-value)
Point size representing gene count

Gene Set Enrichment Analysis (GSEA)

Interactive dot plots showing gene set enrichment
X-axis: Normalized enrichment score (RichFactor), Y-axis: Gene set descriptions
Color coding by statistical significance (p-value)
Point size representing gene count

Enrichment Analysis

Module 2: Drug Information

Purpose

The Drug Information module enables detailed exploration of individual drugs and their associated gene signatures. This section focuses on drug-centric analysis with emphasis on consistency and variability metrics.

Drug Information Interface

Key Functionalities

1. Drug Selection and Properties

Comprehensive Drug Database: Access to 1,311 unique drugs.

Drug Selection

Molecular Information: Detailed drug properties including:
- Chemical Structure Visualization: 2D molecular structure display
- Drug Name: Standardized drug name
- CID: Canonical chemical identifier
- Formula: Molecular formula
- Weight: Molecular weight in g/mol
- Description: Comprehensive pharmacological description.
- IUPAC Name: International Union of Pure and Applied Chemistry nomenclature
- InChI: International Chemical Identifier string
- InChIKey: Standardized InChI key for database searching
- SMILES: Simplified Molecular Input Line Entry System representation

2. Highly Consistent Gene Analysis

Gene Mean Consistency Score (GMCS)

Formula: GMCS = (∑ᵢ₌₁ⁿ countᵢ) / n
Purpose: Measures gene expression consistency across datasets
Interpretation: Higher values indicate more consistent drug-gene associations

3. Gene Association Tables

High-Confidence Genes: Filtered for Top 25% consistency and Bottom 25% variability
Literature Support: PubMed co-occurrence counts for drug-gene pairs
ATC Classification: Anatomical Therapeutic Chemical classification system integration
Cross-References: Direct links to external databases (GeneCards, PubChem, WHO-ATC)

Gene Association

Module 3: Gene Information

Purpose

The Gene Information module provides gene-centric analysis, allowing users to explore which drugs affect specific genes and understand gene-drug relationship patterns.

Gene Information Interface

Key Functionalities

1. Gene Selection Options

Hot Genes: Quick access to 20 frequently studied genes (TP53, EGFR, KRAS, etc.)
Comprehensive Search: Search through complete gene database with live filtering

Gene Selection

2. Gene Annotation

Gene Cards Integration: Comprehensive gene information including:
- Official gene names and symbols
- Chromosomal location
- Entrez and MIM identifiers
- Gene aliases and alternative names
- Quick link to GeneCards

Gene Annotation

3. Drug Association Analysis

Associated Drugs Table: Comprehensive list of drugs affecting the selected gene
Quality Metrics: GMCS values for each drug-gene pair
Therapeutic Classification: ATC Level 3 codes and descriptions
Literature Evidence: PubMed co-occurrence statistics

Drug Association

Module 4: Drug-Drug Interaction

Purpose

The Drug-Drug Interaction module represents the first systematic transcriptomic-level analysis of drug interactions, providing mechanistic insights through shared gene targets.

Drug-Drug Interaction Interface

Key Functionalities

1. Drug Pair Selection

Interactive Selection: Choose two drugs from the comprehensive database
Validation: Automatic filtering for available interaction data
Real-time Updates: Dynamic drug B options based on drug A selection

DDI Selection

2. Molecular Information Display

Dual Drug Profiles: Side-by-side comparison of drug properties
Chemical Structures: Visual representation of molecular structures
Pharmacological Properties: Detailed molecular and pharmacokinetic information

Drugs Information

3. Gene Intersection Analysis

Shared Target Identification: Genes affected by both drugs
Directional Analysis: Expression change patterns for each drug
Interaction Categories (where A = Drug A, B = Drug B):
- A_up_B_down: Gene upregulated by Drug A but downregulated by Drug B
- A_down_B_up: Gene downregulated by Drug A but upregulated by Drug B
- A_up_B_up: Gene upregulated by both Drug A and Drug B
- A_down_B_down: Gene downregulated by both Drug A and Drug B

DDI Intersecting Genes

4. Multi-Source DDI Evidence

Integration of three major drug interaction databases:

DDInter: Comprehensive drug-drug interaction database
MecDDI: Mechanism-based drug-drug interactions
RxNav: Clinical drug interaction information

Multi-Source Support

5. Network Visualization

Interactive Network Graph: ECharts-based visualization showing:
- Drug nodes (distinct colors)
- Gene nodes (categorized by interaction type)
- Connection patterns between drugs and genes
Customizable Display: Adjustable top N genes per category
Dynamic Updates: Real-time network modification

DDI-Genes Network