FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8214, 257 aa
1>>>pF1KB8214 257 - 257 aa - 257 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.4992+/-0.000308; mu= 11.4996+/- 0.019
mean_var=148.8485+/-30.511, 0's: 0 Z-trim(121.0): 277 B-trim: 43 in 1/50
Lambda= 0.105124
statistics sampled from 36794 (37108) to 36794 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.775), E-opt: 0.2 (0.435), width: 16
Scan time: 6.890
The best scores are: opt bits E(85289)
NP_004088 (OMIM: 600034) homeobox protein EMX1 [Ho ( 290) 1751 276.4 3.8e-74
XP_005264260 (OMIM: 600034) PREDICTED: homeobox pr ( 280) 1385 220.9 1.9e-57
NP_004089 (OMIM: 269160,600035) homeobox protein E ( 252) 897 146.9 3.4e-35
XP_011530999 (OMIM: 600034) PREDICTED: homeobox pr ( 119) 774 127.9 8.4e-30
NP_005506 (OMIM: 142994,176450) motor neuron and p ( 401) 350 64.1 4.4e-10
NP_057254 (OMIM: 604240) T-cell leukemia homeobox ( 284) 298 56.1 8.2e-08
NP_002720 (OMIM: 604420) hematopoietically-express ( 270) 296 55.7 9.8e-08
NP_002439 (OMIM: 106600,142983,189500,608874) home ( 303) 296 55.8 1.1e-07
XP_011538046 (OMIM: 186770) PREDICTED: T-cell leuk ( 342) 293 55.4 1.6e-07
NP_001418 (OMIM: 131310) homeobox protein engraile ( 333) 285 54.2 3.6e-07
NP_006159 (OMIM: 602563) homeobox protein Nkx-6.1 ( 367) 283 53.9 4.7e-07
NP_066305 (OMIM: 604640) T-cell leukemia homeobox ( 291) 276 52.7 8.4e-07
NP_005512 (OMIM: 186770) T-cell leukemia homeobox ( 330) 276 52.8 9.2e-07
NP_002136 (OMIM: 142967) homeobox protein Hox-B2 [ ( 356) 261 50.6 4.7e-06
XP_011538047 (OMIM: 186770) PREDICTED: T-cell leuk ( 269) 258 50.0 5.3e-06
NP_001091639 (OMIM: 611074) brain-specific homeobo ( 233) 254 49.3 7.3e-06
NP_001106175 (OMIM: 604294,614402) ventral anterio ( 334) 256 49.8 7.6e-06
NP_036608 (OMIM: 604295) ventral anterior homeobox ( 290) 255 49.6 7.7e-06
NP_796374 (OMIM: 605955) homeobox protein Nkx-6.2 ( 277) 252 49.1 1e-05
XP_016872278 (OMIM: 605955) PREDICTED: homeobox pr ( 277) 250 48.8 1.3e-05
XP_005269800 (OMIM: 600647) PREDICTED: homeobox pr ( 273) 247 48.3 1.7e-05
NP_005510 (OMIM: 600647) homeobox protein HMX2 [Ho ( 273) 247 48.3 1.7e-05
NP_067545 (OMIM: 603260) homeobox protein BarH-lik ( 254) 245 48.0 2e-05
XP_005249787 (OMIM: 142954) PREDICTED: homeobox pr ( 443) 248 48.7 2.1e-05
XP_005249789 (OMIM: 142954) PREDICTED: homeobox pr ( 443) 248 48.7 2.1e-05
NP_705895 (OMIM: 142954) homeobox protein Hox-A3 i ( 443) 248 48.7 2.1e-05
NP_109377 (OMIM: 142954) homeobox protein Hox-A3 i ( 443) 248 48.7 2.1e-05
XP_011513645 (OMIM: 142954) PREDICTED: homeobox pr ( 443) 248 48.7 2.1e-05
XP_005249788 (OMIM: 142954) PREDICTED: homeobox pr ( 443) 248 48.7 2.1e-05
XP_006715778 (OMIM: 142954) PREDICTED: homeobox pr ( 443) 248 48.7 2.1e-05
NP_005915 (OMIM: 600535) homeobox protein MOX-2 [H ( 304) 243 47.8 2.8e-05
NP_001417 (OMIM: 131290) homeobox protein engraile ( 392) 244 48.0 3e-05
NP_001182446 (OMIM: 186770) T-cell leukemia homeob ( 257) 241 47.4 3.1e-05
NP_001159396 (OMIM: 269160,600035) homeobox protei ( 169) 237 46.6 3.5e-05
NP_064448 (OMIM: 605211) barH-like 1 homeobox prot ( 327) 240 47.3 4e-05
NP_064447 (OMIM: 605212) barH-like 2 homeobox prot ( 387) 240 47.4 4.5e-05
NP_005513 (OMIM: 142955,601536) homeobox protein H ( 335) 238 47.0 5e-05
NP_003308 (OMIM: 118700,188550,600635,610978) home ( 371) 238 47.1 5.4e-05
NP_001073136 (OMIM: 118700,188550,600635,610978) h ( 401) 238 47.1 5.7e-05
NP_001158727 (OMIM: 142994,176450) motor neuron an ( 189) 233 46.0 5.8e-05
NP_001180 (OMIM: 602183,613330) homeobox protein N ( 333) 236 46.7 6.2e-05
NP_061975 (OMIM: 142952) homeobox protein Hox-A5 [ ( 270) 234 46.3 6.6e-05
NP_004378 (OMIM: 108900,187500,217095,225250,60058 ( 324) 235 46.6 6.8e-05
NP_000200 (OMIM: 125853,260370,600733,606176,60639 ( 283) 232 46.1 8.5e-05
XP_005257343 (OMIM: 142963) PREDICTED: homeobox pr ( 242) 230 45.7 9.4e-05
XP_005257332 (OMIM: 142967) PREDICTED: homeobox pr ( 247) 230 45.7 9.5e-05
NP_001243268 (OMIM: 602041) homeobox protein Nkx-3 ( 159) 226 44.9 0.00011
NP_055436 (OMIM: 142981) homeobox protein Hox-D4 [ ( 255) 229 45.6 0.00011
NP_671725 (OMIM: 607410) diencephalon/mesencephalo ( 382) 231 46.0 0.00012
NP_061815 (OMIM: 142992,612109) homeobox protein H ( 348) 230 45.9 0.00012
>>NP_004088 (OMIM: 600034) homeobox protein EMX1 [Homo s (290 aa)
initn: 1751 init1: 1751 opt: 1751 Z-score: 1452.1 bits: 276.4 E(85289): 3.8e-74
Smith-Waterman score: 1751; 100.0% identity (100.0% similar) in 257 aa overlap (1-257:34-290)
10 20 30
pF1KB8 MFQPAAKRGFTIESLVAKDGGTGGGTGGGG
::::::::::::::::::::::::::::::
NP_004 AGCTPRKAAAPGRGALPRARLPRTAPAAATMFQPAAKRGFTIESLVAKDGGTGGGTGGGG
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB8 AGSHLLAAAASEEPLRPTALNYPHPSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFPEAM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 AGSHLLAAAASEEPLRPTALNYPHPSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFPEAM
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB8 NHPALTVHPAHQLGASPLQPPHSFFGAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 NHPALTVHPAHQLGASPLQPPHSFFGAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLL
130 140 150 160 170 180
160 170 180 190 200 210
pF1KB8 LHGPFARKPKRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 LHGPFARKPKRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNR
190 200 210 220 230 240
220 230 240 250
pF1KB8 RTKYKRQKLEEEGPESEQKKKGSHHINRWRIATKQANGEDIDVTSND
:::::::::::::::::::::::::::::::::::::::::::::::
NP_004 RTKYKRQKLEEEGPESEQKKKGSHHINRWRIATKQANGEDIDVTSND
250 260 270 280 290
>>XP_005264260 (OMIM: 600034) PREDICTED: homeobox protei (280 aa)
initn: 1431 init1: 1385 opt: 1385 Z-score: 1152.3 bits: 220.9 E(85289): 1.9e-57
Smith-Waterman score: 1385; 99.5% identity (100.0% similar) in 204 aa overlap (1-204:34-237)
10 20 30
pF1KB8 MFQPAAKRGFTIESLVAKDGGTGGGTGGGG
::::::::::::::::::::::::::::::
XP_005 AGCTPRKAAAPGRGALPRARLPRTAPAAATMFQPAAKRGFTIESLVAKDGGTGGGTGGGG
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB8 AGSHLLAAAASEEPLRPTALNYPHPSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFPEAM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 AGSHLLAAAASEEPLRPTALNYPHPSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFPEAM
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB8 NHPALTVHPAHQLGASPLQPPHSFFGAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 NHPALTVHPAHQLGASPLQPPHSFFGAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLL
130 140 150 160 170 180
160 170 180 190 200 210
pF1KB8 LHGPFARKPKRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNR
:::::::::::::::::::::::::::::::::::::::::::::::::::::.
XP_005 LHGPFARKPKRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVRRVGVGL
190 200 210 220 230 240
220 230 240 250
pF1KB8 RTKYKRQKLEEEGPESEQKKKGSHHINRWRIATKQANGEDIDVTSND
XP_005 SDPQPSCCPGGGFQSLRAGRLPELREAGSAVSQLPPG
250 260 270 280
>>NP_004089 (OMIM: 269160,600035) homeobox protein EMX2 (252 aa)
initn: 940 init1: 664 opt: 897 Z-score: 752.8 bits: 146.9 E(85289): 3.4e-35
Smith-Waterman score: 1043; 62.9% identity (80.9% similar) in 267 aa overlap (1-257:1-252)
10 20 30 40 50 60
pF1KB8 MFQPAAKRGFTIESLVAKDGGTGGGTGGGGAGSHLLAAAASEEPLRPTALNYPHPSAAEA
::::: :: ::::::::::. : :. ::.:.::.::.: . : .
NP_004 MFQPAPKRCFTIESLVAKDSP--------------LPASRSEDPIRPAALSYANSSPINP
10 20 30 40
70 80 90 100 110
pF1KB8 AFVSGFPAAAAAGAGRSLYGGPELVFPEAMNHP---ALTVHPA---HQLGASPL---QPP
:..:: .::::.:::..:..:.::: ::..:: :. :::. : :.: :: . :
NP_004 -FLNGFHSAAAAAAGRGVYSNPDLVFAEAVSHPPNPAVPVHPVPPPHALAAHPLPSSHSP
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB8 HSFFGAQHRDPLHFYPWVL-RNRFFGHRFQASDVPQDGLLLHGPFARKPKRIRTAFSPSQ
: .:..:.::: ::::.. : :..:::::..:. ...:::. .:::::::::::::::
NP_004 HPLFASQQRDPSTFYPWLIHRYRYLGHRFQGNDTSPESFLLHNALARKPKRIRTAFSPSQ
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB8 LLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKRQKLEEEGPESEQKK
:::::.::::::::::::::::: ::::.::::::::::::::.::::::::: .:.:::
NP_004 LLRLEHAFEKNHYVVGAERKQLAHSLSLTETQVKVWFQNRRTKFKRQKLEEEGSDSQQKK
170 180 190 200 210 220
240 250
pF1KB8 KGSHHINRWRIATKQANGEDIDVTSND
::.:::::::::::::. :.:::::.:
NP_004 KGTHHINRWRIATKQASPEEIDVTSDD
230 240 250
>>XP_011530999 (OMIM: 600034) PREDICTED: homeobox protei (119 aa)
initn: 774 init1: 774 opt: 774 Z-score: 656.0 bits: 127.9 E(85289): 8.4e-30
Smith-Waterman score: 774; 100.0% identity (100.0% similar) in 117 aa overlap (141-257:3-119)
120 130 140 150 160 170
pF1KB8 PHSFFGAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLLLHGPFARKPKRIRTAFSPSQ
::::::::::::::::::::::::::::::
XP_011 MVASDVPQDGLLLHGPFARKPKRIRTAFSPSQ
10 20 30
180 190 200 210 220 230
pF1KB8 LLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKRQKLEEEGPESEQKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKRQKLEEEGPESEQKK
40 50 60 70 80 90
240 250
pF1KB8 KGSHHINRWRIATKQANGEDIDVTSND
:::::::::::::::::::::::::::
XP_011 KGSHHINRWRIATKQANGEDIDVTSND
100 110
>>NP_005506 (OMIM: 142994,176450) motor neuron and pancr (401 aa)
initn: 392 init1: 217 opt: 350 Z-score: 302.0 bits: 64.1 E(85289): 4.4e-10
Smith-Waterman score: 350; 36.3% identity (58.8% similar) in 226 aa overlap (20-232:100-315)
10 20 30 40
pF1KB8 MFQPAAKRGFTIESLVAKDGGTGGGTGGGGAGSH---------LLAAAA
:: ::::::: .: : ::::
NP_005 ADRLRAESPSPPRLLAAHCALLPKPGFLGAGGGGGGTGGGHGGPHHHAHPGAAAAAAAAA
70 80 90 100 110 120
50 60 70 80 90
pF1KB8 SEEPLRPTALNYPHPSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFPEAM--NHPALTVH
. ::. ::..:... .:.:: :: :. .:: . :. .::::.
NP_005 AAAAAGGLALGL-HPGGAQGG--AGLPAQAAL-YGHPVYGYSAAAAAAALAGQHPALSYS
130 140 150 160 170 180
100 110 120 130 140 150
pF1KB8 PAHQLGASPLQPPHSF-FGAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLLLHGPFAR
. :: : .: . .:: ... : :: : . .:. . .. .
NP_005 YPQVQGAHPAHPADPIKLGA---GTFQLDQW-LRASTAGMILPK--MPDFNSQAQSNLLG
190 200 210 220 230
160 170 180 190 200 210
pF1KB8 KPKRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKR-
: .: ::::. .:::.::. :. :.:. .: ..: :: :.:::::.:::::: :.::
NP_005 KCRRPRTAFTSQQLLELEHQFKLNKYLSRPKRFEVATSLMLTETQVKIWFQNRRMKWKRS
240 250 260 270 280 290
220 230 240 250
pF1KB8 QKLEEEGPESEQKKKGSHHINRWRIATKQANGEDIDVTSND
.: .:.. . .:.::
NP_005 KKAKEQAAQEAEKQKGGGGGAGKGGAEEPGAEELLGPPAPGDKGSGRRLRDLRDSDPEED
300 310 320 330 340 350
>>NP_057254 (OMIM: 604240) T-cell leukemia homeobox prot (284 aa)
initn: 258 init1: 220 opt: 298 Z-score: 261.2 bits: 56.1 E(85289): 8.2e-08
Smith-Waterman score: 301; 38.4% identity (61.6% similar) in 185 aa overlap (61-232:49-228)
40 50 60 70 80
pF1KB8 AGSHLLAAAASEEPLRPTALNYPHPSAAEAAFVSGFPAAAAAGAGRSL--YGGPELVFPE
:: .:. .:.. : . :: : : :
NP_057 FGIDQILSGPETPGGGLGLGRGGQGHGENGAFSGGYHGASGYGPAGSLAPLPGSSGVGPG
20 30 40 50 60 70
90 100 110 120 130 140
pF1KB8 AM----NHPALTVHPAHQLGASPLQP-PHSFFGAQHRDPLHFYPWVLRNRFFGH-RFQAS
.. : : : : :..: : : .. :: : : ::. .: :.. :. :.
NP_057 GVIRVPAHRPLPVPPPA--GGAPAVPGPSGLGGAGGLAGLTF-PWMDSGRRFAKDRLTAA
80 90 100 110 120 130
150 160 170 180 190
pF1KB8 DVPQDGLLLHG-PFARK--PKRI--RTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLS
: .: : :. . ::: ::.:: ::.:.::: : ...:...::: :: .:
NP_057 LSPFSGTRRIGHPYQNRTPPKRKKPRTSFSRSQVLELERRFLRQKYLASAERAALAKALR
140 150 160 170 180 190
200 210 220 230 240 250
pF1KB8 LSETQVKVWFQNRRTKYKRQKLEEEGPESEQKKKGSHHINRWRIATKQANGEDIDVTSND
....:::.::::::::..:: ::. :.:... :
NP_057 MTDAQVKTWFQNRRTKWRRQTAEER--EAERHRAGRLLLHLQQDALPRPLRPPLPPDPLC
200 210 220 230 240 250
NP_057 LHNSSLFALQNLQPWAEDNKVASVSGLASVV
260 270 280
>>NP_002720 (OMIM: 604420) hematopoietically-expressed h (270 aa)
initn: 259 init1: 190 opt: 296 Z-score: 259.9 bits: 55.7 E(85289): 9.8e-08
Smith-Waterman score: 299; 31.5% identity (56.8% similar) in 241 aa overlap (25-252:7-232)
10 20 30 40 50
pF1KB8 MFQPAAKRGFTIESLVAKDGGTGGGTGGGGAGSHLLAAAASEEPLRPTALNYPH-----P
: ..:..: : : . .: .:: . :
NP_002 MQYPHPGPAAGAVGVPLYAPTPLLQPAHPTPFYIEDILGRGP
10 20 30 40
60 70 80 90 100 110
pF1KB8 SAAEAAFVSGFPAAAAAGAGRSLYGGPELVFPEAMNHPALTVHPAHQLGASPLQPPHSFF
.: : . : .. .. : : : :. . :::.. : : :.:. : .:
NP_002 AAPTPAPTLPSPNSSFTSL-VSPYRTP--VYEPTPIHPAFSHHSAAALAAA--YGPGGFG
50 60 70 80 90
120 130 140 150 160 170
pF1KB8 GAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLLLHGPFARKPKRIRTA----FSPSQL
: ::. .: .. . . : . : : :: .:: ..: . : . :: .:
NP_002 G-----PLYPFPRTVND--YTHALLRHD-PLGKPLLWSPFLQRPLHKRKGGQVRFSNDQT
100 110 120 130 140
180 190 200 210 220 230
pF1KB8 LRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKRQKLEEEGPESEQKKK
..::. :: ..:. :::.:: :.::: :::.::::::.:..: :..:.:.:..:..
NP_002 IELEKKFETQKYLSPPERKRLAKMLQLSERQVKTWFQNRRAKWRR--LKQENPQSNKKEE
150 160 170 180 190 200
240 250
pF1KB8 ----GSHHINRWRIATKQANGEDIDVTSND
: .: . ..: .: ..:
NP_002 LESLDSSCDQRQDLPSEQNKGASLDSSQCSPSPASQEDLESEISEDSDQEVDIEGDKSYF
210 220 230 240 250 260
>>NP_002439 (OMIM: 106600,142983,189500,608874) homeobox (303 aa)
initn: 311 init1: 213 opt: 296 Z-score: 259.2 bits: 55.8 E(85289): 1.1e-07
Smith-Waterman score: 298; 33.9% identity (55.7% similar) in 230 aa overlap (9-222:13-236)
10 20 30 40
pF1KB8 MFQPAAKRGFTIE-SLVAKDGGTGGGTGGGGAGSHLLAAAASEEPLRPT-------
: .: : .: .: :.: . ..:.. : .:.:: .:
NP_002 MAPAADMTSLPLGVKVEDSAFGKPAGGGAGQAPSAAAATAAAMGADEEGAKPKVSPSLLP
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB8 ----ALNYPH--PSAAEAAFVSGFPAAAAAGAGRSLYGGP-ELVFPEAMNHPALTVHPAH
:: : :.: :.:.. . . ::.:... : : : :.: . : :
NP_002 FSVEALMADHRKPGAKESALAPSEGVQAAGGSAQPLGVPPGSLGAPDAPSSPRPLGH--F
70 80 90 100 110
110 120 130 140 150 160
pF1KB8 QLGASPLQPPHSFFGAQHRDPLHFYPWVLRNRFFGHRFQASDVPQDGLLLHGPFARKPKR
..:. : .. :. . . ::. :: . . : : : :::
NP_002 SVGGLLKLPEDALVKAESPEKPERTPWMQSPRFSPPPARRLSPPACTLRKHKT-NRKP--
120 130 140 150 160 170
170 180 190 200 210 220
pF1KB8 IRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKR-QKLE
:: :. .::: ::: :....:. ::: ....::::.:::::.::::::.: :: :. :
NP_002 -RTPFTTAQLLALERKFRQKQYLSIAERAEFSSSLSLTETQVKIWFQNRRAKAKRLQEAE
180 190 200 210 220 230
230 240 250
pF1KB8 EEGPESEQKKKGSHHINRWRIATKQANGEDIDVTSND
:
NP_002 LEKLKMAAKPMLPPAAFGLSFPLGGPAAVAAAAGASLYGASGPFQRAALPVAPVGLYTAH
240 250 260 270 280 290
>>XP_011538046 (OMIM: 186770) PREDICTED: T-cell leukemia (342 aa)
initn: 317 init1: 224 opt: 293 Z-score: 256.1 bits: 55.4 E(85289): 1.6e-07
Smith-Waterman score: 293; 32.1% identity (58.3% similar) in 218 aa overlap (20-231:80-285)
10 20 30 40
pF1KB8 MFQPAAKRGFTIESLVAKDGGTGGGTGGGGAGSHLLAAAASEEPLRPTA
:: :: .::::: : .. .. . .
XP_011 LVGGAYTYGGGGSAAATGAGGAGAYGTGGPGGPGGPAGGGGACS--MGPLTGSYNVNMAL
50 60 70 80 90 100
50 60 70 80 90 100
pF1KB8 LNYPHPSAAEAAFVSGFPAAAAAGAGRSLYGGPELVFPEAMNHPALTVHPAHQLGASPLQ
. : :... .. .: : .:::. : : . :. :: . . . : .
XP_011 AGGPGPGGGGGS-SGGAGALSAAGVIRVPAHRP---LAGAVAHPQPLATGLPTVPSVPAM
110 120 130 140 150 160
110 120 130 140 150 160
pF1KB8 PPHSFFGAQHRDPLHFYPWVLRNRFFGH-RFQASDVPQDGLLLHG-PFA-RKP---KRIR
: :... : : ::. :: . . :: .. : : :. : : :. :
XP_011 P-----GVNNLTGLTF-PWMESNRRYTKDRFTVALSPFTVTRRIGHPYQNRTPPKKKKPR
170 180 190 200 210
170 180 190 200 210 220
pF1KB8 TAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKRQKLEEEG
:.:. :. .::. :....:...::: :: .:.....:::.::::::::..:: ::.
XP_011 TSFTRLQICELEKRFHRQKYLASAERAALAKALKMTDAQVKTWFQNRRTKWRRQTAEERE
220 230 240 250 260 270
230 240 250
pF1KB8 PESEQKKKGSHHINRWRIATKQANGEDIDVTSND
: .: ..
XP_011 AERQQANRILLQLQQEAFQKSLAQPLPADPLCVHNSSLFALQNLQPWSDDSTKITSVTSV
280 290 300 310 320 330
>>NP_001418 (OMIM: 131310) homeobox protein engrailed-2 (333 aa)
initn: 328 init1: 201 opt: 285 Z-score: 249.7 bits: 54.2 E(85289): 3.6e-07
Smith-Waterman score: 285; 34.0% identity (57.9% similar) in 209 aa overlap (19-216:103-301)
10 20 30 40
pF1KB8 MFQPAAKRGFTIESLVAKDGGTGGGTGGGGAG-SHLLAAAASEEPLRP
.::..:. :::::: :. : ...:.:: :
NP_001 NILRPEFGRRKDAGTCCAGAGGGRGGGAGGEGGASGAEGGGGAGGSEQLLGSGSREP-RQ
80 90 100 110 120 130
50 60 70 80 90 100
pF1KB8 TALNYPHPSAAEAAFVSGFPAAAAAGAGR-SLYGGPELVFPEAMNHPALTVHPAHQLGAS
. : .. : : :. . .:. ::.:: . . : :. ::..
NP_001 NPPCAPGAGGPLPAAGSDSPGDGEGGSKTLSLHGGAKKGGDP--GGPLDGSLKARGLGGG
140 150 160 170 180
110 120 130 140 150
pF1KB8 PLQ----PPHSFFGAQ-HRDPLHFYPWVLRNRFFGHRFQASDVPQDGLLLHGPFARKP--
:. : ::. .:. . :: .:. :: :..: . : ..:
NP_001 DLSVSSDSDSSQAGANLGAQPMLWPAWVYCTRY-------SDRPSSGPRSRKPKKKNPNK
190 200 210 220 230 240
160 170 180 190 200 210
pF1KB8 --KRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLAGSLSLSETQVKVWFQNRRTKYKRQ
:: ::::. :: ::. :. :.:.. .:..:: :::.:.:.:.::::.:.: :.
NP_001 EDKRPRTAFTAEQLQRLKAEFQTNRYLTEQRRQSLAQELSLNESQIKIWFQNKRAKIKKA
250 260 270 280 290 300
220 230 240 250
pF1KB8 KLEEEGPESEQKKKGSHHINRWRIATKQANGEDIDVTSND
NP_001 TGNKNTLAVHLMAQGLYNHSTTAKEGKSDSE
310 320 330
257 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 10:38:17 2016 done: Fri Nov 4 10:38:18 2016
Total Scan time: 6.890 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]