| Protein Information |

| Information Type | Description |
|---|---|
| Protein name | EXP00642 |
| NCBI Accession ID | CP001509.3 |
| Organism | Escherichia coli BL21(DE3) |
| Left Position | 1195375 |
| Right Position | 1195626 |
| Strand | + |
| Nucleotide Sequence | ATGAGCAAGAATTTGTTTTGCCTGAGTGGTCAGGCGATTACGAGAAAACATCAGTTGCAGACGTCCGCCGCCGACCTGGTGCCAGAGCACGGCGGCGCGAATGCCTGCCAGCAGGGTTGCGCGAACTTTCGCCTGCACTTGTGGGCTTTGCAGTACAGCAGGGGAACCGGTGACCTGAATGCGCGGGCCAAGCGGGCTAATCACATCAACATAGATAGCAGCCATCGCGCTCATCAGCGTTTCGGACTGTAA |
| Amino Acid Sequence | MSKNLFCLSGQAITRKHQLQTSAADLVPEHGGANACQQGCANFRLHLWALQYSRGTGDLNARAKRANHINIDSSHRAHQRFGL |
| Source of smORF | Ribo-seq |
| Function | |
| PubMed ID | 30904393 |
| Domain | |
| Functional Category | Function not yet assigned |
| UniProt ID | |
| ORF Length (Amino Acid) | 83 |
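
The coordinates, nucleotide sequence, and ORF length listed above are linked by simple arithmetic: the locus spans Right − Left + 1 nucleotides, and the amino acid count is the number of codons minus the stop codon. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates and the standard codon assignments. The names `CODON_TABLE`, `translate`, and `orf_length_aa` are illustrative and are not part of the database record; only the first 30 and last 12 nucleotides of the sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks a stop codon.
# (Sense-codon assignments are the same in the bacterial table 11.)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    protein = []
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[nt[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def orf_length_aa(left: int, right: int) -> int:
    """Amino acid length from 1-based inclusive coordinates, excluding the stop codon."""
    span_nt = right - left + 1
    return span_nt // 3 - 1


left, right = 1195375, 1195626           # values from the table above
first_30_nt = "ATGAGCAAGAATTTGTTTTGCCTGAGTGGT"  # start of the nucleotide sequence
last_12_nt = "TTCGGACTGTAA"                      # end of the nucleotide sequence

print(right - left + 1)           # 252
print(orf_length_aa(left, right)) # 83
print(translate(first_30_nt))     # MSKNLFCLSG
print(translate(last_12_nt))      # FGL
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the 83-residue protein sequence in the record.
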

| Conservation Analysis |

| Sr.No. | Left Position | Right Position | Strand | NCBI Accession ID | Species Name |
|---|---|---|---|---|---|
| 1 | 1771761 | 1772012 | + | NZ_CP061527.1 | Shigella dysenteriae |
| 2 | 1192008 | 1192259 | + | NC_000913.3 | Escherichia coli str. K-12 substr. MG1655 |
| 3 | 1613871 | 1614122 | + | NC_002695.2 | Escherichia coli O157:H7 str. Sakai |
| 4 | 1196870 | 1197121 | + | NZ_AP014857.1 | Escherichia albertii |
| 5 | 2729260 | 2729526 | - | NZ_CP054058.1 | Scandinavium goeteborgense |
| 6 | 3172253 | 3172498 | + | NZ_CP020388.1 | Pluralibacter gergoviae |
| 7 | 2728380 | 2728625 | - | NZ_CP045845.1 | Kluyvera intermedia |
| 8 | 354909 | 355154 | - | NZ_CP044098.1 | Citrobacter portucalensis |
| 9 | 1940826 | 1941071 | + | NZ_CP012871.1 | [Enterobacter] lignolyticus |
| 10 | 570022 | 570267 | + | NZ_CP038469.1 | Citrobacter tructae |
| 11 | 4712853 | 4713098 | + | NZ_CP033744.1 | Citrobacter freundii |
| 12 | 2301339 | 2301617 | - | NZ_CP012268.1 | Cronobacter muytjensii ATCC 51329 |
| 13 | 1666940 | 1667218 | - | NZ_CP027107.1 | Cronobacter sakazakii |
| 14 | 3843453 | 3843698 | + | NZ_CP026047.1 | Raoultella planticola |
| 15 | 1731447 | 1731683 | + | NZ_CP012257.1 | Cronobacter universalis NCTC 9529 |
| 16 | 1080591 | 1080869 | - | NZ_CP045300.1 | Kosakonia arachidis |
| 17 | 3397787 | 3398020 | + | NZ_CP040428.1 | Jejubacter calystegiae |
| 18 | 2218204 | 2218422 | - | NC_012779.2 | Edwardsiella ictaluri 93-146 |
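
The homologous loci above are not all the same size as the 252-nucleotide reference locus. As a rough illustration, the span of each hit can be computed from its coordinates, again assuming 1-based inclusive positions. The sketch below uses only the first six rows of the table; the names `hits` and `span_nt` are illustrative.

```python
# (left, right, species) copied from rows 1-6 of the conservation table
hits = [
    (1771761, 1772012, "Shigella dysenteriae"),
    (1192008, 1192259, "Escherichia coli str. K-12 substr. MG1655"),
    (1613871, 1614122, "Escherichia coli O157:H7 str. Sakai"),
    (1196870, 1197121, "Escherichia albertii"),
    (2729260, 2729526, "Scandinavium goeteborgense"),
    (3172253, 3172498, "Pluralibacter gergoviae"),
]


def span_nt(left: int, right: int) -> int:
    """Locus length in nucleotides for 1-based inclusive coordinates."""
    return right - left + 1


locus_lengths = {species: span_nt(left, right) for left, right, species in hits}

for species, length in locus_lengths.items():
    print(f"{species}: {length} nt")
```

For these six rows the spans are 252 nt for the four Escherichia and Shigella hits, 267 nt for Scandinavium goeteborgense, and 246 nt for Pluralibacter gergoviae.
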

| Neighborhood Conservation Analysis |

| Sr.No. | Domain | Co-occurrence Frequency | No. of species in which domain occurs with smORF | Median distance between smORF and domain-bearing ORFs | Orientation relative to smORF | PFAM Information |
|---|---|---|---|---|---|---|
| 1 | PF08007.14 | 0.71 | 12 | 3776.5 | opposite-strand | Cupin superfamily protein |
| 2 | PF08918.12 | 0.88 | 15 | 2232.0 | opposite-strand | PhoQ Sensor |
| 3 | PF02518.28 | 0.88 | 15 | 2232.0 | opposite-strand | Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase |
| 4 | PF14501.8 | 0.88 | 15 | 2232.0 | opposite-strand | GHKL domain |
| 5 | PF00072.26 | 0.94 | 16 | 1561.0 | opposite-strand | Response regulator receiver domain |
| 6 | PF00486.30 | 0.94 | 16 | 1561.0 | opposite-strand | Transcriptional regulatory protein, C terminal |
| 7 | PF00206.22 | 1.0 | 17 | 22.0 | opposite-strand | Lyase |
| 8 | PF08328.13 | 1.0 | 17 | 22.0 | opposite-strand | Adenylosuccinate lyase C-terminal |
| 9 | PF04356.14 | 1.0 | 17 | -245.0 | opposite-strand | Protein of unknown function (DUF489) |
| 10 | PF03054.18 | 1.0 | 17 | 410.5 | opposite-strand | tRNA methyl transferase |
| 11 | PF00293.30 | 1.0 | 17 | 1573.0 | opposite-strand | NUDIX domain |
| 12 | PF00849.24 | 0.94 | 16 | 2365.0 | opposite-strand | RNA pseudouridylate synthase |
| 13 | PF00180.22 | 0.65 | 11 | 2954.0 | same-strand | Isocitrate/isopropylmalate dehydrogenase |
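
The co-occurrence frequencies in the table above are numerically consistent with dividing the species count by 17 and rounding to two decimals. The record does not state this denominator, so the value of 17 in the sketch below is an inference from the table, and the names `TOTAL_GENOMES` and `cooccurrence_frequency` are illustrative.

```python
# Assumption: 17 is inferred from the table values; the record does not state it.
TOTAL_GENOMES = 17


def cooccurrence_frequency(n_species: int, total: int = TOTAL_GENOMES) -> float:
    """Fraction of genomes in which the domain occurs near the smORF, rounded to 2 decimals."""
    return round(n_species / total, 2)


# Species counts that appear in the table
for n_species in (11, 12, 15, 16, 17):
    print(n_species, cooccurrence_frequency(n_species))
```

This reproduces the tabulated values 0.65, 0.71, 0.88, 0.94, and 1.0 for species counts of 11, 12, 15, 16, and 17.
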