About Desert Plant Endophyte Database

A comprehensive platform for desert plant-associated prokaryotic genomes.The Desert Plant Endophyte Genome Database is one of the largest and most diverse collections of endophytic bacteria from desert plants.It captures microbial diversity from inland deserts, mountain wadis, coastal mangroves, and date palm oases across Saudi Arabia, Jordan, and Pakistan.The resource includes whole-genome sequences for more than 530 bacterial isolates, representing a wide range of taxa and plant hosts.

By integrating genomic data with rich ecological metadata, Desert Plant Endophyte enables advanced exploration of microbial ecology, stress-adaptation strategies, and plant growth promoting gene discovery. This platform supports translational research to develop genome-informed microbial inoculants that can improve crop resilience, nutrient use efficiency, and productivity in saline and drought-prone environments. The information of database will serves as a bridge between fundamental science and sustainable agriculture in arid regions.

Using Desert Plant Endophyte Database

Genome

It includes multiple meta-data tables that give an overview of a genome contents including  Country/Host/Isolation source/Sequencing Platform/Assembly method/Genome Features/Complteness/Taxonomy and NCBI repository. Along with it there is an option to down whole genome assembled ,nucleotide and protein sequences.

Annotation

Gene level

The table presents a comprehensive genome annotation at the gene level, providing both structural and functional information for each predicted gene. Each column serves a specific role in describing gene identity, size, function, and sequence composition, as outlined below.

Gene ID

A unique identifier assigned to each predicted gene or open reading frame (ORF).

Length (bp)

The length of the gene in base pairs, calculated from the annotated start and end positions on the genome.

Gene Name

A standardized or putative gene symbol assigned based on sequence similarity to known genes in reference databases. When functional evidence is limited, this field may contain a hypothetical or predicted gene name.

COG (Clusters of Orthologous Groups)

The COG identifier links each gene to an evolutionarily conserved orthologous group. This classification provides functional context by grouping genes with shared ancestry and similar biological roles, facilitating functional categorization and comparative analysis across species.

Sequences

A descriptive annotation of the predicted protein product encoded by the gene. This is typically derived from homology-based searches against curated protein databases and reflects the inferred biochemical function or cellular role of the gene.

Nucleotide Sequence

The DNA sequence of the gene, representing the coding region extracted from the genome.

Protein Sequence

The translated amino acid sequence encoded by the gene.

Plant Growth Promoting Trait (PGPT)

The PGPT table hierarchically links individual genes to specific plant-beneficial mechanisms, enabling systematic genome-scale interpretation of plant–microbe interactions.

PGPT: Locus Tag

The locus tag corresponds to the unique gene identifier in the annotated genome that is associated with a plant growth–promoting function.

PGPT: Effect

This column describes the overall biological effect of the gene on plant growth or health.

PGPT: L1 (Level 1 – Major Trait Category)

PGPT: L2 (Level 2 – Functional Group)

PGPT: L3 (Level 3 – Biochemical Pathway or Mechanism)

PGPT: L4 (Level 4 – Gene Family or Enzyme Class)

PGPT: L5 (Level 5 – Specific Gene or Protein Function)

The overall select mode enables users to query and filter selections based on specific criteria along with download options.