FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0329, 508 aa
1>>>pF1KE0329 508 - 508 aa - 508 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.6201+/-0.000417; mu= 5.4288+/- 0.026
mean_var=317.4879+/-64.226, 0's: 0 Z-trim(119.8): 15 B-trim: 0 in 0/59
Lambda= 0.071980
statistics sampled from 34190 (34205) to 34190 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.728), E-opt: 0.2 (0.401), width: 16
Scan time: 11.790
The best scores are: opt bits E(85289)
NP_055422 (OMIM: 611216) UBX domain-containing pro ( 508) 3263 352.8 1.3e-96
XP_016873363 (OMIM: 616378) PREDICTED: UBX domain- ( 297) 361 51.2 5e-06
NP_001273006 (OMIM: 616378) UBX domain-containing ( 297) 361 51.2 5e-06
NP_001273007 (OMIM: 616378) UBX domain-containing ( 238) 338 48.7 2.3e-05
XP_005274090 (OMIM: 616378) PREDICTED: UBX domain- ( 312) 340 49.0 2.4e-05
NP_056937 (OMIM: 616378) UBX domain-containing pro ( 312) 340 49.0 2.4e-05
>>NP_055422 (OMIM: 611216) UBX domain-containing protein (508 aa)
initn: 3263 init1: 3263 opt: 3263 Z-score: 1855.2 bits: 352.8 E(85289): 1.3e-96
Smith-Waterman score: 3263; 100.0% identity (100.0% similar) in 508 aa overlap (1-508:1-508)
10 20 30 40 50 60
pF1KE0 MLWFQGAIPAAIATAKRSGAVFVVFVAGDDEQSTQMAASWEDDKVTEASSNSFVAIKIDT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 MLWFQGAIPAAIATAKRSGAVFVVFVAGDDEQSTQMAASWEDDKVTEASSNSFVAIKIDT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 KSEACLQFSQIYPVVCVPSSFFIGDSGIPLEVIAGSVSADELVTRIHKVRQMHLLKSETS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 KSEACLQFSQIYPVVCVPSSFFIGDSGIPLEVIAGSVSADELVTRIHKVRQMHLLKSETS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 VANGSQSESSVSTPSASFEPNNTCENSQSRNAELCEIPPTSDTKSDTATGGESAGHATSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 VANGSQSESSVSTPSASFEPNNTCENSQSRNAELCEIPPTSDTKSDTATGGESAGHATSS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 QEPSGCSDQRPAEDLNIRVERLTKKLEERREEKRKEEEQREIKKEIERRKTGKEMLDYKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 QEPSGCSDQRPAEDLNIRVERLTKKLEERREEKRKEEEQREIKKEIERRKTGKEMLDYKR
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 KQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKTKEEVEAAKAAALLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 KQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKTKEEVEAAKAAALLA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 KQAEMEVKRESYARERSTVARIQFRLPDGSSFTNQFPSDAPLEEARQFAAQTVGNTYGNF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 KQAEMEVKRESYARERSTVARIQFRLPDGSSFTNQFPSDAPLEEARQFAAQTVGNTYGNF
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE0 SLATMFPRREFTKEDYKKKLLDLELAPSASVVLLPAGRPTASIVHSSSGDIWTLLGTVLY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 SLATMFPRREFTKEDYKKKLLDLELAPSASVVLLPAGRPTASIVHSSSGDIWTLLGTVLY
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE0 PFLAIWRLISNFLFSNPPPTQTSVRVTSSEPPNPASSSKSEKREPVRKRVLEKRGDDFKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_055 PFLAIWRLISNFLFSNPPPTQTSVRVTSSEPPNPASSSKSEKREPVRKRVLEKRGDDFKK
430 440 450 460 470 480
490 500
pF1KE0 EGKIYRLRTQDDGEDENNTWNGNSTQQM
::::::::::::::::::::::::::::
NP_055 EGKIYRLRTQDDGEDENNTWNGNSTQQM
490 500
>>XP_016873363 (OMIM: 616378) PREDICTED: UBX domain-cont (297 aa)
initn: 354 init1: 218 opt: 361 Z-score: 229.1 bits: 51.2 E(85289): 5e-06
Smith-Waterman score: 363; 31.8% identity (59.2% similar) in 255 aa overlap (156-393:43-291)
130 140 150 160 170 180
pF1KE0 QSESSVSTPSASFEPNNTCENSQSRNAELCEIPPTSDTKSDTATGGESAGHATSSQ----
: : : .: : . . :::.
XP_016 MGFPRGRAEKALALTGNQGIEAAMDWLMEHEDDPDVDEPLETPLGHILGREPTSSEQGGL
20 30 40 50 60 70
190 200 210 220 230
pF1KE0 EPSGCS--DQRPAEDLNIRVERLTKKLE--ERREEKRKEEEQRE-IKKEIERRKTGKEML
: :: . . .:: . . : :. . :: .....:.:.:.:: ...: .::. :.:.
XP_016 EGSGSAAGEGKPALSEEERQEQTKRMLELVAQKQREREEREEREALERERQRRRQGQELS
80 90 100 110 120 130
240 250 260 270 280 290
pF1KE0 DYKRKQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKT----KEEVEA
... .:. .: ::: :::::. :::.:....: :.:::: ... . :
XP_016 AARQRLQEDEMRRAAEERRREKAEELAARQRVREKIERDKAERAKKYGGSVGSQPPPVAP
140 150 160 170 180 190
300 310 320 330 340 350
pF1KE0 AKAAALLAKQAEMEVKRESYARERSTVARIQFRLPDGSSFTNQFPSDAPLEEARQFAAQT
. . . . : .::: : . ::: :::::.:.:. : . : .: ..
XP_016 EPGPVPSSPSQEPPTKRE-YDQ-----CRIQVRLPDGTSLTQTFRAREQLAAVRLYVELH
200 210 220 230 240
360 370 380 390 400
pF1KE0 VGNTYGN----FSLATMFPRREFTKEDYKKKLLDLELAPSASVVLLPAGRPTASIVHSSS
:. :. .: . :::: :.. :... : .: :.::: ...
XP_016 RGEELGGGQDPVQLLSGFPRRAFSEADMERPLQELGLVPSAVLIVAKKCPS
250 260 270 280 290
410 420 430 440 450 460
pF1KE0 GDIWTLLGTVLYPFLAIWRLISNFLFSNPPPTQTSVRVTSSEPPNPASSSKSEKREPVRK
>>NP_001273006 (OMIM: 616378) UBX domain-containing prot (297 aa)
initn: 354 init1: 218 opt: 361 Z-score: 229.1 bits: 51.2 E(85289): 5e-06
Smith-Waterman score: 363; 31.8% identity (59.2% similar) in 255 aa overlap (156-393:43-291)
130 140 150 160 170 180
pF1KE0 QSESSVSTPSASFEPNNTCENSQSRNAELCEIPPTSDTKSDTATGGESAGHATSSQ----
: : : .: : . . :::.
NP_001 MGFPRGRAEKALALTGNQGIEAAMDWLMEHEDDPDVDEPLETPLGHILGREPTSSEQGGL
20 30 40 50 60 70
190 200 210 220 230
pF1KE0 EPSGCS--DQRPAEDLNIRVERLTKKLE--ERREEKRKEEEQRE-IKKEIERRKTGKEML
: :: . . .:: . . : :. . :: .....:.:.:.:: ...: .::. :.:.
NP_001 EGSGSAAGEGKPALSEEERQEQTKRMLELVAQKQREREEREEREALERERQRRRQGQELS
80 90 100 110 120 130
240 250 260 270 280 290
pF1KE0 DYKRKQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKT----KEEVEA
... .:. .: ::: :::::. :::.:....: :.:::: ... . :
NP_001 AARQRLQEDEMRRAAEERRREKAEELAARQRVREKIERDKAERAKKYGGSVGSQPPPVAP
140 150 160 170 180 190
300 310 320 330 340 350
pF1KE0 AKAAALLAKQAEMEVKRESYARERSTVARIQFRLPDGSSFTNQFPSDAPLEEARQFAAQT
. . . . : .::: : . ::: :::::.:.:. : . : .: ..
NP_001 EPGPVPSSPSQEPPTKRE-YDQ-----CRIQVRLPDGTSLTQTFRAREQLAAVRLYVELH
200 210 220 230 240
360 370 380 390 400
pF1KE0 VGNTYGN----FSLATMFPRREFTKEDYKKKLLDLELAPSASVVLLPAGRPTASIVHSSS
:. :. .: . :::: :.. :... : .: :.::: ...
NP_001 RGEELGGGQDPVQLLSGFPRRAFSEADMERPLQELGLVPSAVLIVAKKCPS
250 260 270 280 290
410 420 430 440 450 460
pF1KE0 GDIWTLLGTVLYPFLAIWRLISNFLFSNPPPTQTSVRVTSSEPPNPASSSKSEKREPVRK
>>NP_001273007 (OMIM: 616378) UBX domain-containing prot (238 aa)
initn: 345 init1: 209 opt: 338 Z-score: 217.3 bits: 48.7 E(85289): 2.3e-05
Smith-Waterman score: 338; 32.8% identity (63.1% similar) in 198 aa overlap (205-393:41-232)
180 190 200 210 220 230
pF1KE0 GHATSSQEPSGCSDQRPAEDLNIRVERLTKKLEERREEKRKEEEQRE-IKKEIERRKTGK
.: .....:.:.:.:: ...: .::. :.
NP_001 IEMGFPRGRAEKALALTGNQGIEAAMDWMLELVAQKQREREEREEREALERERQRRRQGQ
20 30 40 50 60 70
240 250 260 270 280
pF1KE0 EMLDYKRKQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKT----KEE
:. ... .:. .: ::: :::::. :::.:....: :.:::: ... .
NP_001 ELSAARQRLQEDEMRRAAEERRREKAEELAARQRVREKIERDKAERAKKYGGSVGSQPPP
80 90 100 110 120 130
290 300 310 320 330 340
pF1KE0 VEAAKAAALLAKQAEMEVKRESYARERSTVARIQFRLPDGSSFTNQFPSDAPLEEARQFA
: . . . . : .::: : . ::: :::::.:.:. : . : .: ..
NP_001 VAPEPGPVPSSPSQEPPTKRE-YDQ-----CRIQVRLPDGTSLTQTFRAREQLAAVRLYV
140 150 160 170 180
350 360 370 380 390 400
pF1KE0 AQTVGNTYGN----FSLATMFPRREFTKEDYKKKLLDLELAPSASVVLLPAGRPTASIVH
:. :. .: . :::: :.. :... : .: :.::: ...
NP_001 ELHRGEELGGGQDPVQLLSGFPRRAFSEADMERPLQELGLVPSAVLIVAKKCPS
190 200 210 220 230
410 420 430 440 450 460
pF1KE0 SSSGDIWTLLGTVLYPFLAIWRLISNFLFSNPPPTQTSVRVTSSEPPNPASSSKSEKREP
>>XP_005274090 (OMIM: 616378) PREDICTED: UBX domain-cont (312 aa)
initn: 280 init1: 218 opt: 340 Z-score: 217.1 bits: 49.0 E(85289): 2.4e-05
Smith-Waterman score: 342; 31.5% identity (58.5% similar) in 248 aa overlap (156-386:43-284)
130 140 150 160 170 180
pF1KE0 QSESSVSTPSASFEPNNTCENSQSRNAELCEIPPTSDTKSDTATGGESAGHATSSQ----
: : : .: : . . :::.
XP_005 MGFPRGRAEKALALTGNQGIEAAMDWLMEHEDDPDVDEPLETPLGHILGREPTSSEQGGL
20 30 40 50 60 70
190 200 210 220 230
pF1KE0 EPSGCS--DQRPAEDLNIRVERLTKKLE--ERREEKRKEEEQRE-IKKEIERRKTGKEML
: :: . . .:: . . : :. . :: .....:.:.:.:: ...: .::. :.:.
XP_005 EGSGSAAGEGKPALSEEERQEQTKRMLELVAQKQREREEREEREALERERQRRRQGQELS
80 90 100 110 120 130
240 250 260 270 280 290
pF1KE0 DYKRKQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKT----KEEVEA
... .:. .: ::: :::::. :::.:....: :.:::: ... . :
XP_005 AARQRLQEDEMRRAAEERRREKAEELAARQRVREKIERDKAERAKKYGGSVGSQPPPVAP
140 150 160 170 180 190
300 310 320 330 340 350
pF1KE0 AKAAALLAKQAEMEVKRESYARERSTVARIQFRLPDGSSFTNQFPSDAPLEEARQFAAQT
. . . . : .::: : . ::: :::::.:.:. : . : .: ..
XP_005 EPGPVPSSPSQEPPTKRE-YDQ-----CRIQVRLPDGTSLTQTFRAREQLAAVRLYVELH
200 210 220 230 240
360 370 380 390 400
pF1KE0 VGNTYGN----FSLATMFPRREFTKEDYKKKLLDLELAPSASVVLLPAGRPTASIVHSSS
:. :. .: . :::: :.. :... : .: .:
XP_005 RGEELGGGQDPVQLLSGFPRRAFSEADMERPLQELGMAARLETRTRNWGSREACLGKGGM
250 260 270 280 290 300
410 420 430 440 450 460
pF1KE0 GDIWTLLGTVLYPFLAIWRLISNFLFSNPPPTQTSVRVTSSEPPNPASSSKSEKREPVRK
XP_005 QREGAL
310
>>NP_056937 (OMIM: 616378) UBX domain-containing protein (312 aa)
initn: 280 init1: 218 opt: 340 Z-score: 217.1 bits: 49.0 E(85289): 2.4e-05
Smith-Waterman score: 342; 31.5% identity (58.5% similar) in 248 aa overlap (156-386:43-284)
130 140 150 160 170 180
pF1KE0 QSESSVSTPSASFEPNNTCENSQSRNAELCEIPPTSDTKSDTATGGESAGHATSSQ----
: : : .: : . . :::.
NP_056 MGFPRGRAEKALALTGNQGIEAAMDWLMEHEDDPDVDEPLETPLGHILGREPTSSEQGGL
20 30 40 50 60 70
190 200 210 220 230
pF1KE0 EPSGCS--DQRPAEDLNIRVERLTKKLE--ERREEKRKEEEQRE-IKKEIERRKTGKEML
: :: . . .:: . . : :. . :: .....:.:.:.:: ...: .::. :.:.
NP_056 EGSGSAAGEGKPALSEEERQEQTKRMLELVAQKQREREEREEREALERERQRRRQGQELS
80 90 100 110 120 130
240 250 260 270 280 290
pF1KE0 DYKRKQEEELTKRMLEERNREKAEDRAARERIKQQIALDRAERAARFAKT----KEEVEA
... .:. .: ::: :::::. :::.:....: :.:::: ... . :
NP_056 AARQRLQEDEMRRAAEERRREKAEELAARQRVREKIERDKAERAKKYGGSVGSQPPPVAP
140 150 160 170 180 190
300 310 320 330 340 350
pF1KE0 AKAAALLAKQAEMEVKRESYARERSTVARIQFRLPDGSSFTNQFPSDAPLEEARQFAAQT
. . . . : .::: : . ::: :::::.:.:. : . : .: ..
NP_056 EPGPVPSSPSQEPPTKRE-YDQ-----CRIQVRLPDGTSLTQTFRAREQLAAVRLYVELH
200 210 220 230 240
360 370 380 390 400
pF1KE0 VGNTYGN----FSLATMFPRREFTKEDYKKKLLDLELAPSASVVLLPAGRPTASIVHSSS
:. :. .: . :::: :.. :... : .: .:
NP_056 RGEELGGGQDPVQLLSGFPRRAFSEADMERPLQELGMAARLETRTRNWGSREACLGKGGM
250 260 270 280 290 300
410 420 430 440 450 460
pF1KE0 GDIWTLLGTVLYPFLAIWRLISNFLFSNPPPTQTSVRVTSSEPPNPASSSKSEKREPVRK
NP_056 QREGAL
310
508 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 15:45:26 2016 done: Thu Nov 3 15:45:28 2016
Total Scan time: 11.790 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]