Protein Information |
Information Type | Description |
---|---|
Protein name | Carboxysome shell vertex protein CsoS4B |
NCBI Accession ID | CP000109.2 |
Organism | Hydrogenovibrio crunogenus (strain XCL-2) (Thiomicrospira crunogena) |
Left | 922663 |
Right | 922917 |
Strand | + |
Nucleotide Sequence | ATGCAGATACTACAAGTTAAAAAACAGTTGGTTCTGACTAGCCGCTTAAAGGATTTGGGACACCTCCCTTTAAAGGCACTGAGTTCAGAAACGGGTGAAATATTTGTAGCGATGGATCCGGTAGGAACAAAAGATGGTGACTGGGTTTTTACCATCGCTAATTCTGCGGCAAGAGATGCGGCAGGAGACAAAAGATTATTAACAGATTTAACGGTTGGCGGCATCATTGATGACTGGCAGCCAAAACAGAAGTAA |
Sequence | MQILQVKKQLVLTSRLKDLGHLPLKALSSETGEIFVAMDPVGTKDGDWVFTIANSAARDAAGDKRLLTDLTVGGIIDDWQPKQK |
Source of smORF | Swiss-Prot |
Function | Probably forms vertices in the carboxysome, a polyhedral inclusion where RuBisCO (ribulose bisphosphate carboxylase, cbbL-cbbS) is sequestered. Has been modeled to induce curvature upon insertion into an otherwise flat hexagonal layer of major carboxysome subunits. {ECO:0000250|UniProtKB:O85043}. |
Pubmed ID | 17105352 28115547 |
Domain | CDD:413110 |
Functional Category | Others |
Uniprot ID | Q31HD4 |
ORF Length (Amino Acid) | 84 |
Conservation Analysis |
Sr.No. | Left Position | Right Position | Strand | NCBI Accession id | Species Name |
---|---|---|---|---|---|
1 | 1938755 | 1939009 | - | NZ_CP035033.1 | Hydrogenovibrio thermophilus |
2 | 2302364 | 2302618 | - | NZ_CP054020.1 | Thiomicrorhabdus xiamenensis |
3 | 316520 | 316765 | + | NZ_CP040602.1 | Thiomicrorhabdus sediminis |
4 | 339630 | 339896 | + | NZ_AP021889.1 | Thiosulfatimonas sediminis |
5 | 2655526 | 2655792 | - | NZ_AP021889.1 | Thiosulfatimonas sediminis |
6 | 2067830 | 2068102 | - | NZ_AP021888.1 | Thiosulfativibrio zosterae |
7 | 1801516 | 1801776 | - | NC_015581.1 | Thiomicrospira cyclica ALM1 |
8 | 354515 | 354784 | + | NZ_CP033040.1 | Thiomicrorhabdus indica |
9 | 1999144 | 1999404 | - | NZ_CP007030.1 | Thiomicrospira aerophila AL3 |
10 | 2475207 | 2475452 | - | NZ_CP045571.1 | Acidithiobacillus thiooxidans ATCC 19377 |
11 | 2652203 | 2652472 | - | NZ_CP007029.1 | Thioalkalivibrio paradoxus ARh 1 |
12 | 530609 | 530860 | + | NC_005042.1 | Prochlorococcus marinus subsp. marinus str. CCMP1375 |
13 | 8454 | 8705 | - | NC_019675.1 | Cyanobium gracile PCC 6307 |
14 | 3075597 | 3075866 | - | NC_019902.2 | Thioalkalivibrio nitratireducens DSM 14787 |
15 | 2156521 | 2156772 | - | NZ_AP018721.1 | Sulfuritortus calidifontis |
16 | 519495 | 519761 | + | NZ_AP018725.1 | Sulfuriflexus mobilis |
17 | 3241308 | 3241601 | + | NC_019940.1 | Thioflavicoccus mobilis 8321 |
18 | 2389875 | 2390141 | - | NZ_AP018795.1 | Acidithiobacillus ferridurans |
19 | 1457685 | 1457951 | - | NC_011761.1 | Acidithiobacillus ferrooxidans ATCC 23270 |
20 | 315382 | 315648 | + | NZ_CP017415.1 | Acidihalobacter yilgarnenesis |
21 | 3165223 | 3165474 | - | NZ_AP021881.1 | Sulfuriferula nivalis |
22 | 3164964 | 3165221 | - | NZ_AP021881.1 | Sulfuriferula nivalis |
23 | 3297716 | 3297967 | - | NZ_AP021884.1 | Sulfuriferula plumbiphila |
24 | 3105849 | 3106112 | - | NC_013851.1 | Allochromatium vinosum DSM 180 |
Neighborhood Conservation Analysis |
Sr.No. | Domain | Co-occurrence Frequency | No. of species in which domain occurs with smORF | Median distance b/w smORF and domain bearing ORFs | Orientation relative to smORF | PFAM Information |
---|---|---|---|---|---|---|
1 | PF00210.26 | 0.95 | 21 | 1078 | same-strand | Ferritin-like domain |
2 | PF02915.19 | 0.68 | 15 | 1096 | same-strand | Rubrerythrin |
3 | PF00936.21 | 1.0 | 22 | 380 | same-strand | BMC domain |
4 | PF03319.15 | 1.0 | 22 | 2.0 | same-strand | Ethanolamine utilisation protein EutN/carboxysome |
5 | PF08936.12 | 0.91 | 20 | 263.5 | same-strand | Carboxysome Shell Carbonic Anhydrase |
6 | PF12288.10 | 0.91 | 20 | 1816.0 | same-strand | Carboxysome shell peptide mid-region |
7 | PF00101.22 | 0.95 | 21 | 4232 | same-strand | Ribulose bisphosphate carboxylase, small chain |
8 | PF00016.22 | 0.91 | 20 | 4624.0 | same-strand | Ribulose bisphosphate carboxylase large chain, catalytic domain |
9 | PF02788.18 | 0.91 | 20 | 4624.0 | same-strand | Ribulose bisphosphate carboxylase large chain, N-terminal domain |