FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0340, 507 aa
1>>>pF1KE0340 507 - 507 aa - 507 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3670+/-0.00036; mu= 18.7936+/- 0.022
mean_var=81.5746+/-16.704, 0's: 0 Z-trim(114.5): 14 B-trim: 1420 in 2/55
Lambda= 0.142003
statistics sampled from 24374 (24388) to 24374 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.658), E-opt: 0.2 (0.286), width: 16
Scan time: 9.220
The best scores are: opt bits E(85289)
NP_071381 (OMIM: 613159,613553) probable Xaa-Pro a ( 507) 3450 716.7 3.9e-206
NP_000276 (OMIM: 170100,613230) xaa-Pro dipeptidas ( 493) 464 105.0 5.4e-22
NP_001159529 (OMIM: 170100,613230) xaa-Pro dipepti ( 429) 376 86.9 1.3e-16
NP_001159528 (OMIM: 170100,613230) xaa-Pro dipepti ( 452) 316 74.6 6.8e-13
NP_001191756 (OMIM: 613159,613553) probable Xaa-Pr ( 55) 158 41.5 0.00075
>>NP_071381 (OMIM: 613159,613553) probable Xaa-Pro amino (507 aa)
initn: 3450 init1: 3450 opt: 3450 Z-score: 3821.8 bits: 716.7 E(85289): 3.9e-206
Smith-Waterman score: 3450; 100.0% identity (100.0% similar) in 507 aa overlap (1-507:1-507)
10 20 30 40 50 60
pF1KE0 MPWLLSAPKLVPAVANVRGLSGCMLCSQRRYSLQPVPERRIPNRYLGQPSPFTHPHLLRP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 MPWLLSAPKLVPAVANVRGLSGCMLCSQRRYSLQPVPERRIPNRYLGQPSPFTHPHLLRP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 GEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLSNPTYYMSNDIPYTFHQDNN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 GEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLSNPTYYMSNDIPYTFHQDNN
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 FLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPSRELWDGPRSGTDGAIALTGVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 FLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPSRELWDGPRSGTDGAIALTGVD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 EAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYMQPLTEAKAKSKNKVRGVQQLI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 EAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYMQPLTEAKAKSKNKVRGVQQLI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 QRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEEAFLYAKFEFECRARGADILAY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 QRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEEAFLYAKFEFECRARGADILAY
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 PPVVAGGNRSNTLHYVKNNQLIKDGEMVLLDGGCESSCYVSDITRTWPVNGRFTAPQAEL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 PPVVAGGNRSNTLHYVKNNQLIKDGEMVLLDGGCESSCYVSDITRTWPVNGRFTAPQAEL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE0 YEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQKLKDLGIMKNIKENNAFKAARKYCPH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 YEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQKLKDLGIMKNIKENNAFKAARKYCPH
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE0 HVGHYLGMDVHDTPDMPRSLPLQPGMVITIEPGIYIPEDDKDAPEKFRGLGVRIEDDVVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_071 HVGHYLGMDVHDTPDMPRSLPLQPGMVITIEPGIYIPEDDKDAPEKFRGLGVRIEDDVVV
430 440 450 460 470 480
490 500
pF1KE0 TQDSPLILSADCPKEMNDIEQICSQAS
:::::::::::::::::::::::::::
NP_071 TQDSPLILSADCPKEMNDIEQICSQAS
490 500
>>NP_000276 (OMIM: 170100,613230) xaa-Pro dipeptidase is (493 aa)
initn: 383 init1: 130 opt: 464 Z-score: 515.8 bits: 105.0 E(85289): 5.4e-22
Smith-Waterman score: 572; 29.3% identity (57.1% similar) in 468 aa overlap (72-500:23-476)
50 60 70 80 90 100
pF1KE0 PNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLS
.:: :..: ..:. :.:. .:::.
NP_000 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGS---IVVLQ
10 20 30 40
110 120 130 140 150
pF1KE0 N--PTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPS
. : . .: :.:.. : . : :: :.. :: . ::::: :
NP_000 GGEETQRYCTDTGVLFRQESFFHWAFGVTEPGCYGVIDVDTGK------STLFVPRLPAS
50 60 70 80 90 100
160 170 180 190 200 210
pF1KE0 RELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYM
. : : . . .::.. ..:. .: ..: . .. .: ... :
NP_000 HATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQKPSVLLT----LRGVNTDSGSVCR
110 120 130 140 150
220 230 240 250 260 270
pF1KE0 QPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEE
. .. .: . . .. : . :..:. :.: .. ..:..:.: :.: . :. ..:
NP_000 EASFDGISKFEVNNTILHPEIVECRVFKTDMELEVLRYTNKISSEAHREVMKAVKVGMKE
160 170 180 190 200 210
280 290 300 310 320 330
pF1KE0 AFLYAKFEFECRARGA-DILAYPPVVAGGNRSNTLHY----VKNNQLIKDGEMVLLDGGC
: . :: : .::. .: . ..:. : .::: . :.. :..:.: :.: :
NP_000 YELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGG
220 230 240 250 260 270
340 350 360 370 380 390
pF1KE0 ESSCYVSDITRTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQ
: :..:::: ..:.::.::: : .:::::. .: .. ::. ... . . .
NP_000 EYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLE
280 290 300 310 320 330
400 410 420 430 440
pF1KE0 KLKDLGIMK-NIKENNAFKAARKYCPHHVGHYLGMDVHDTPDMP-----------RSL--
.: .::.. .. . . . :: .::.::.::::. .: :::
NP_000 ELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRT
340 350 360 370 380 390
450 460 470 480
pF1KE0 --PLQPGMVITIEPGIYIPED---------------DKDAPEKFRGLG-VRIEDDVVVTQ
::::::.:.:::::. . .... ..:::.: ::::.:::::
NP_000 ARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVT-
400 410 420 430 440 450
490 500
pF1KE0 DSPLILSADCPKEMNDIEQICSQAS
:: . : . :. ...::
NP_000 DSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK
460 470 480 490
>>NP_001159529 (OMIM: 170100,613230) xaa-Pro dipeptidase (429 aa)
initn: 339 init1: 130 opt: 376 Z-score: 419.2 bits: 86.9 E(85289): 1.3e-16
Smith-Waterman score: 484; 33.7% identity (61.6% similar) in 294 aa overlap (244-500:120-412)
220 230 240 250 260 270
pF1KE0 LHSDYMQPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTS
:..:. :.: .. ..:..:.: :.: .
NP_001 SGSVCREASFDGISKFEVNNTILHPEIVECRVFKTDMELEVLRYTNKISSEAHREVMKAV
90 100 110 120 130 140
280 290 300 310 320
pF1KE0 KAPVEEAFLYAKFEFECRARGA-DILAYPPVVAGGNRSNTLHY----VKNNQLIKDGEMV
:. ..: : . :: : .::. .: . ..:. : .::: . :.. :..:.:
NP_001 KVGMKEYELESLFEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMC
150 160 170 180 190 200
330 340 350 360 370 380
pF1KE0 LLDGGCESSCYVSDITRTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMM
:.: : : :..:::: ..:.::.::: : .:::::. .: .. ::. ... .
NP_001 LFDMGGEYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLA
210 220 230 240 250 260
390 400 410 420 430
pF1KE0 LTLIGQKLKDLGIMK-NIKENNAFKAARKYCPHHVGHYLGMDVHDTPDMP----------
. ..: .::.. .. . . . :: .::.::.::::. .:
NP_001 DRIHLEELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPG
270 280 290 300 310 320
440 450 460 470
pF1KE0 -RSL----PLQPGMVITIEPGIYIPED---------------DKDAPEKFRGLG-VRIED
::: ::::::.:.:::::. . .... ..:::.: ::::.
NP_001 LRSLRTARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEE
330 340 350 360 370 380
480 490 500
pF1KE0 DVVVTQDSPLILSADCPKEMNDIEQICSQAS
::::: :: . : . :. ...::
NP_001 DVVVT-DSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK
390 400 410 420
>>NP_001159528 (OMIM: 170100,613230) xaa-Pro dipeptidase (452 aa)
initn: 383 init1: 130 opt: 316 Z-score: 352.5 bits: 74.6 E(85289): 6.8e-13
Smith-Waterman score: 453; 27.6% identity (51.5% similar) in 468 aa overlap (72-500:23-435)
50 60 70 80 90 100
pF1KE0 PNRYLGQPSPFTHPHLLRPGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLS
.:: :..: ..:. :.:. .:::.
NP_001 MAAATGPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGS---IVVLQ
10 20 30 40
110 120 130 140 150
pF1KE0 N--PTYYMSNDIPYTFHQDNNFLYLCGFQEPDSILVLQSLPGKQLPSHKAILFVPRRDPS
. : . .: :.:.. : . : :: :.. :: . ::::: :
NP_001 GGEETQRYCTDTGVLFRQESFFHWAFGVTEPGCYGVIDVDTGK------STLFVPRLPAS
50 60 70 80 90 100
160 170 180 190 200 210
pF1KE0 RELWDGPRSGTDGAIALTGVDEAYTLEEFQHLLPKMKAETNMVWYDWMRPSHAQLHSDYM
. : : . . .::.. ..:. .: ..: :: .
NP_001 HATWMGKIHSKEHFKEKYAVDDVQYVDEIASVLTSQK------------PS-------VL
110 120 130 140
220 230 240 250 260 270
pF1KE0 QPLTEAKAKSKNKVRGVQQLIQRLRLIKSPAEIERMQIAGKLTSQAFIETMFTSKAPVEE
: ... : . : . : : .... . . ..: .
NP_001 LTLRGVNTDSGSVCREA-----------SFDGISKFEVNNTILHPEIVECL---------
150 160 170 180
280 290 300 310 320 330
pF1KE0 AFLYAKFEFECRARGA-DILAYPPVVAGGNRSNTLHY----VKNNQLIKDGEMVLLDGGC
:: : .::. .: . ..:. : .::: . :.. :..:.: :.: :
NP_001 ------FEHYCYSRGGMRHSSYTCICGSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGG
190 200 210 220 230
340 350 360 370 380 390
pF1KE0 ESSCYVSDITRTWPVNGRFTAPQAELYEAVLEIQRDCLALCFPGTSLENIYSMMLTLIGQ
: :..:::: ..:.::.::: : .:::::. .: .. ::. ... . . .
NP_001 EYYCFASDITCSFPANGKFTADQKAVYEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLE
240 250 260 270 280 290
400 410 420 430 440
pF1KE0 KLKDLGIMK-NIKENNAFKAARKYCPHHVGHYLGMDVHDTPDMP-----------RSL--
.: .::.. .. . . . :: .::.::.::::. .: :::
NP_001 ELAHMGILSGSVDAMVQAHLGAVFMPHGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRT
300 310 320 330 340 350
450 460 470 480
pF1KE0 --PLQPGMVITIEPGIYIPED---------------DKDAPEKFRGLG-VRIEDDVVVTQ
::::::.:.:::::. . .... ..:::.: ::::.:::::
NP_001 ARHLQPGMVLTVEPGIYFIDHLLDEALADPARASFLNREVLQRFRGFGGVRIEEDVVVT-
360 370 380 390 400 410
490 500
pF1KE0 DSPLILSADCPKEMNDIEQICSQAS
:: . : . :. ...::
NP_001 DSGIELLTCVPRTVEEIEACMAGCDKAFTPFSGPK
420 430 440 450
>>NP_001191756 (OMIM: 613159,613553) probable Xaa-Pro am (55 aa)
initn: 170 init1: 156 opt: 158 Z-score: 190.1 bits: 41.5 E(85289): 0.00075
Smith-Waterman score: 158; 60.0% identity (71.1% similar) in 45 aa overlap (1-39:1-45)
10 20 30 40 50
pF1KE0 MPWLLSAPKLVPAVANVRGLSGCMLCSQRRY------SLQPVPERRIPNRYLGQPSPFTH
:::::::::::::::::::::: .. .: .:::. :
NP_001 MPWLLSAPKLVPAVANVRGLSGSYFVTQAGVQWRDPITLQPMSSRLKQSFYQLPE
10 20 30 40 50
60 70 80 90 100 110
pF1KE0 PHLLRPGEVTPGLSQVEYALRRHKLMSLIQKEAQGQSGTDQTVVVLSNPTYYMSNDIPYT
507 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 15:13:11 2016 done: Thu Nov 3 15:13:13 2016
Total Scan time: 9.220 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]