ProsmORF-pred
Result : P64629
Protein Information
Information Type Description
Protein name Uncharacterized protein YhfL
NCBI Accession ID AE005174.2
Organism Escherichia coli O157:H7
Left 4286584
Right 4286751
Strand +
Nucleotide Sequence ATGAACAAATTTATTAAAGTTGCACTGGTAGGTGCAGTACTGGCTACGTTAACTGCATGTACTGGTCATATTGAAAACCGTGATAAGAACTGCTCTTACGACTACCTGCTGCACCCAGCAATTTCTATTTCTAAAATCATTGGCGGTTGCGGTCCTACTGCACAGTAA
Sequence MNKFIKVALVGAVLATLTACTGHIENRDKNCSYDYLLHPAISISKIIGGCGPTAQ
Source of smORF Swiss-Prot
Function The ORF matches the profile of pfam13978. Profile description: Protein of unknown function (DUF4223). This family of proteins is functionally uncharacterized and is found in bacteria. Proteins in this family are approximately 60 amino acids in length and are likely to be lipoproteins (attachment site currently included in alignment).
Pubmed ID 11206551 11258796
Domain CDD:404801
Functional Category Others
Uniprot ID P64629
ORF Length (Amino Acid) 55
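The fields above can be cross-checked against one another. The following is a minimal Python sketch, not part of ProsmORF-pred: it translates the nucleotide sequence with the standard codon table (NCBI table 1, which assigns the same amino acids to codons as the bacterial table 11) and compares the result with the listed protein sequence, the coordinate span, and the ORF length. The function name `translate` and the variable names are illustrative, and the coordinates are assumed to be 1-based and inclusive.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order (NCBI table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Values copied from the record above.
LEFT, RIGHT = 4286584, 4286751
NUCLEOTIDE = (
    "ATGAACAAATTTATTAAAGTTGCACTGGTAGGTGCAGTACTGGCTACGTTAACTGCATGTACTGGTCAT"
    "ATTGAAAACCGTGATAAGAACTGCTCTTACGACTACCTGCTGCACCCAGCAATTTCTATTTCTAAAATC"
    "ATTGGCGGTTGCGGTCCTACTGCACAGTAA"
)
PROTEIN = "MNKFIKVALVGAVLATLTACTGHIENRDKNCSYDYLLHPAISISKIIGGCGPTAQ"

span_nt = RIGHT - LEFT + 1          # length implied by the coordinates
translated = translate(NUCLEOTIDE)  # protein implied by the nucleotide sequence

print("span (nt):", span_nt)
print("sequence (nt):", len(NUCLEOTIDE))
print("translated length (aa):", len(translated))
print("matches listed protein:", translated == PROTEIN)
```

Under these assumptions the span covers the 55 codons of the protein plus the stop codon, which is why the nucleotide length is three times the amino acid length plus three.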
Conservation Analysis
No. of Species: 47
Sr.No. Left Position Right Position Strand NCBI Accession ID Species Name
1 4219490 4219657 + NC_002695.2 Escherichia coli O157:H7 str. Sakai
2 3499448 3499615 + NC_000913.3 Escherichia coli str. K-12 substr. MG1655
3 3468666 3468833 + NC_004337.2 Shigella flexneri 2a str. 301
4 3987587 3987754 + NZ_LR134340.1 Escherichia marmotae
5 2404324 2404491 + NZ_CP057657.1 Escherichia fergusonii
6 3460407 3460574 + NZ_AP014857.1 Escherichia albertii
7 4694667 4694834 - NC_013716.1 Citrobacter rodentium ICC168
8 3635940 3636110 + NC_003197.2 Salmonella enterica subsp. enterica serovar Typhimurium str. LT2
9 1485064 1485234 + NZ_CP053416.1 Salmonella bongori
10 2779508 2779675 - NZ_CP028271.1 Mixta intestinalis
11 2778244 2778414 - NZ_CP044098.1 Citrobacter portucalensis
12 33792 33962 - NZ_CP012268.1 Cronobacter muytjensii ATCC 51329
13 4007979 4008149 + NZ_CP012264.1 Cronobacter condimenti 1330
14 3931332 3931502 + NZ_CP012257.1 Cronobacter universalis NCTC 9529
15 2290201 2290371 + NZ_CP033744.1 Citrobacter freundii
16 3753282 3753452 - NZ_CP027107.1 Cronobacter sakazakii
17 183188 183358 - NZ_CP013940.1 Cronobacter malonaticus LMG 23826
18 4351933 4352103 + NC_015968.1 Enterobacter soli
19 4098538 4098708 + NZ_CP012266.1 Cronobacter dublinensis subsp. dublinensis LMG 23823
20 3791050 3791220 - NZ_CP038469.1 Citrobacter tructae
21 4397386 4397556 + NC_009792.1 Citrobacter koseri ATCC BAA-895
22 404302 404472 - NZ_CP013990.1 Leclercia adecarboxylata
23 4308443 4308616 + NZ_CP050150.1 Hafnia alvei
24 2075867 2076037 + NZ_AP019007.1 Enterobacter oligotrophicus
25 5067093 5067260 - NZ_CP020388.1 Pluralibacter gergoviae
26 4440465 4440635 + NZ_CP009756.1 Enterobacter cloacae
27 2899140 2899310 + NZ_CP017279.1 Enterobacter ludwigii
28 4336484 4336654 + NZ_CP017184.1 Enterobacter roggenkampii
29 2963044 2963214 - NZ_CP025034.2 Enterobacter sp. SGAir0187
30 4006578 4006748 - NZ_CP045769.1 Enterobacter cancerogenus
31 4234050 4234220 + NZ_AP022508.1 Enterobacter bugandensis
32 3039012 3039182 + NZ_CP023529.1 Lelliottia amnigena
33 3406896 3407063 - NZ_CP061511.1 Mixta calida
34 4324210 4324380 + NZ_CP027986.1 Enterobacter sichuanensis
35 358149 358322 - NZ_AP023184.1 Buttiauxella agrestis
36 1621332 1621499 - NZ_LT556085.1 Citrobacter amalonaticus
37 2172761 2172928 - NZ_CP040428.1 Jejubacter calystegiae
38 340860 341030 - NZ_CP054058.1 Scandinavium goeteborgense
39 315106 315273 - NZ_CP047349.1 Proteus terrae subsp. cibarius
40 1893154 1893321 - NZ_CP026364.1 Proteus hauseri
41 418288 418461 - NZ_CP045845.1 Kluyvera intermedia
42 3786163 3786330 - NC_010554.1 Proteus mirabilis HI4320
43 1557884 1558051 + NZ_CP045205.1 Citrobacter telavivensis
44 97737 97904 - NZ_CP067059.1 Rahnella aceris
45 2175982 2176152 + NZ_CP029822.1 Entomomonas moraniae
46 3989968 3990135 - NZ_CP049044.1 Pseudomonas psychrophila
47 2081437 2081598 - NZ_LS483250.1 Moritella yayanosii
48 1787145 1787297 - NZ_AP019651.1 Vibrio taketomensis
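The rows above differ slightly in span, which suggests orthologs of different lengths. The following Python sketch parses a few rows copied from the table and derives the ORF length for each. It assumes that fields are whitespace-separated with the species name as the remainder of the line, and that coordinates are 1-based, inclusive, and include the stop codon (as holds for the query: 168 nt for 55 amino acids). The helper `parse_row` and the dictionary keys are hypothetical names chosen for illustration.

```python
# A few rows copied verbatim from the table above.
ROWS = """\
1 4219490 4219657 + NC_002695.2 Escherichia coli O157:H7 str. Sakai
8 3635940 3636110 + NC_003197.2 Salmonella enterica subsp. enterica serovar Typhimurium str. LT2
23 4308443 4308616 + NZ_CP050150.1 Hafnia alvei
47 2081437 2081598 - NZ_LS483250.1 Moritella yayanosii
48 1787145 1787297 - NZ_AP019651.1 Vibrio taketomensis
"""


def parse_row(line):
    """Split one table row into typed fields and derive the ORF length."""
    sr_no, left, right, strand, accession, species = line.split(maxsplit=5)
    left, right = int(left), int(right)
    length_nt = right - left + 1  # assumes inclusive coordinates
    return {
        "sr_no": int(sr_no),
        "left": left,
        "right": right,
        "strand": strand,
        "accession": accession,
        "species": species,
        "length_nt": length_nt,
        "length_aa": length_nt // 3 - 1,  # assumes the span includes the stop codon
    }


records = [parse_row(line) for line in ROWS.splitlines()]
for rec in records:
    print(rec["accession"], rec["strand"], rec["length_nt"], rec["length_aa"], rec["species"])
```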
Neighborhood Conservation Analysis
* Arrows in the genome diagram show ORFs; multiple PFAMs can be mapped to a single ORF.
* 'Small ORF' represents the entry/query analyzed.
* Image generated using 'gggenes' (R package).
Neighborhood Representative Chosen (Species): NC_002695.2
Sr.No. | Domain | Co-occurrence Frequency | No. of species in which domain occurs with smORF | Median distance between smORF and domain-bearing ORFs | Orientation relative to smORF | PFAM Information
1 | PF00590.22 | 0.62 | 29 | 249.0 | same-strand | Tetrapyrrole (Corrin/Porphyrin) Methylases
2 | PF13241.8 | 0.62 | 29 | 249.0 | same-strand | Putative NAD(P)-binding
3 | PF10414.11 | 0.62 | 29 | 249.0 | same-strand | Sirohaem synthase dimerisation region
4 | PF14824.8 | 0.62 | 29 | 249.0 | same-strand | Sirohaem biosynthesis protein central
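The page does not state how the co-occurrence frequency is computed. One reading consistent with the numbers shown is the number of species in which the domain occurs near the smORF divided by the total number of species in the conservation analysis (47). The Python sketch below works through that assumed formula; the function name `cooccurrence_frequency` is hypothetical.

```python
TOTAL_SPECIES = 47        # "No. of Species" reported in the Conservation Analysis
SPECIES_WITH_DOMAIN = 29  # species count reported for each of the four domains


def cooccurrence_frequency(n_with_domain, n_total):
    """Assumed definition: fraction of species with the domain, rounded to 2 places."""
    return round(n_with_domain / n_total, 2)


frequency = cooccurrence_frequency(SPECIES_WITH_DOMAIN, TOTAL_SPECIES)
print(frequency)
```

Under this assumption, 29 of 47 species reproduces the tabulated value of 0.62.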