| Protein Information |
| Information Type | Description |
|---|---|
| Protein name | EXP03972 |
| NCBI Accession ID | |
| Organism | Roseburia,Clostridium,Blautia |
| Left | |
| Right | |
| Strand | |
| Nucleotide Sequence | ATGGATATGAAGAAAACGACAGTAAAAGGAGAAGGATTGCTGGCGAGAGCCCTGCTCCATGAAATGGACCATCTGGATGGTGTGTTGTATGTGGACAAAGTAGAGGGATCTTTGCAGGATGTAACGGCGAATGAGGAGGAACAGTAA |
| Sequence | MDMKKTTVKGEGLLARALLHEMDHLDGVLYVDKVEGSLQDVTANEEEQ |
| Source of smORF | Metagenomic Ribo-seq |
| Function | |
| Pubmed ID | 32601270 |
| Domain | |
| Functional Category | Function not yet assigned |
| Uniprot ID | |
| ORF Length (Amino Acid) | 48 |
| Conservation Analysis |
| Sr.No. | Left Position | Right Position | Strand | NCBI Accession id | Species Name |
|---|---|---|---|---|---|
| 1 | 3059161 | 3059307 | - | NC_010001.1 | Lachnoclostridium phytofermentans ISDg |
| 2 | 2284290 | 2284442 | - | NC_014376.1 | [Clostridium] saccharolyticum WM1 |
| 3 | 2133884 | 2134024 | - | NZ_LT635479.1 | Lachnoclostridium phocaeense |
| 4 | 719101 | 719238 | + | NC_012778.1 | [Eubacterium] eligens ATCC 27750 |
| 5 | 3533973 | 3534095 | + | NZ_CP048000.1 | Anaerocolumna sedimenticola |
| 6 | 2850957 | 2851091 | + | NZ_CP070062.1 | Coprococcus comes |
| 7 | 3559013 | 3559159 | - | NZ_CP022464.2 | Enterocloster bolteae |
| 8 | 1980707 | 1980829 | - | NC_014614.1 | Acetoanaerobium sticklandii |
| 9 | 866946 | 867092 | + | NZ_LN879430.1 | Herbinix luporum |
| 10 | 3455778 | 3455912 | - | NZ_AP023367.1 | Anaerocolumna cellulosilytica |
| 11 | 6135769 | 6135921 | - | NC_013595.1 | Streptosporangium roseum DSM 43021 |
| Neighborhood Conservation Analysis |
| Sr.No. | Domain | Co-occurrence Frequency | No. of species in which domain occurs with smORF | Median distance b/w smORF and domain bearing ORFs | Orientation relative to smORF | PFAM Information |
|---|---|---|---|---|---|---|
| 1 | PF13672.8 | 0.64 | 7 | 4289 | same-strand | Protein phosphatase 2C |
| 2 | PF04055.23 | 0.82 | 9 | 3227 | same-strand | Radical SAM superfamily |
| 3 | PF01189.19 | 0.82 | 9 | 1797 | same-strand | 16S rRNA methyltransferase RsmB/F |
| 4 | PF01029.20 | 0.91 | 10 | 1854.5 | same-strand | NusB family |
| 5 | PF17125.7 | 0.64 | 7 | 1912 | same-strand | N-terminal domain of 16S rRNA methyltransferase RsmF |
| 6 | PF04298.14 | 0.91 | 10 | 1160.5 | same-strand | Putative neutral zinc metallopeptidase |
| 7 | PF00551.21 | 0.91 | 10 | 16.5 | same-strand | Formyl transferase |
| 8 | PF02911.20 | 0.91 | 10 | 16.5 | same-strand | Formyl transferase, C-terminal domain |
| 9 | PF04851.17 | 0.91 | 10 | 382.5 | same-strand | Type III restriction enzyme, res subunit |
| 10 | PF17764.3 | 0.91 | 10 | 382.5 | same-strand | 3'DNA-binding domain (3'BD) |
| 11 | PF00270.31 | 0.91 | 10 | 382.5 | same-strand | DEAD/DEAH box helicase |
| 12 | PF18319.3 | 0.91 | 10 | 382.5 | same-strand | PriA DNA helicase Cys-rich region (CRR) domain |
| 13 | PF08245.14 | 0.73 | 8 | 2772 | same-strand | Mur ligase middle domain |
| 14 | PF01225.27 | 0.73 | 8 | 2734 | same-strand | Mur ligase family, catalytic domain |
| 15 | PF02875.23 | 0.73 | 8 | 2773.0 | same-strand | Mur ligase family, glutamate ligase domain |
| 16 | PF18074.3 | 0.64 | 7 | 383 | same-strand | Primosomal protein N C-terminal domain |