FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3815, 485 aa
  1>>>pF1KE3815 485 - 485 aa - 485 aa
Library: /omim/omim.rfq.tfa
  64704883 residues in 91410 sequences

Statistics: Expectation_n fit: rho(ln(x))= 11.3942+/-0.000424; mu= -5.9047+/- 0.026
 mean_var=559.4730+/-113.338, 0's: 0 Z-trim(125.4): 44 B-trim: 862 in 1/60
 Lambda= 0.054223
 statistics sampled from 50730 (50802) to 50730 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.824), E-opt: 0.2 (0.556), width: 16
 Scan time: 5.570

The best scores are:                                       opt  bits E(91410)
NP_001292997 (OMIM: 606009) double homeobox protei ( 424) 3014 250.2 9.2e-66
NP_001280727 (OMIM: 606009) double homeobox protei ( 424) 3014 250.2 9.2e-66
XP_024308351 (OMIM: 606009) double homeobox protei ( 424) 3001 249.2 1.9e-65
XP_024308352 (OMIM: 606009) double homeobox protei ( 424) 3001 249.2 1.9e-65
NP_001350749 (OMIM: 606009) double homeobox protei ( 160) 1103 100.1 5.1e-21
NP_036281 (OMIM: 611444) double homeobox protein 5 ( 197) 1080  98.5   2e-20
NP_036278 (OMIM: 611441) double homeobox protein 1 ( 170)  926  86.3 7.7e-17

>>NP_001292997 (OMIM: 606009) double homeobox protein 4 (424 aa)
 initn: 3014 init1: 3014 opt: 3014 Z-score: 1302.2 bits: 250.2 E(91410): 9.2e-66
Smith-Waterman score: 3014; 100.0% identity (100.0% similar) in 424 aa overlap (62-485:1-424)

              40        50        60        70        80        90
pF1KE3 AVHSPAEVHGSPPASLCPRPSVKFRPGLTAMALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                     ::::::::::::::::::::::::::::::
NP_001                               MALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                             10        20        30

             100       110       120       130       140       150
pF1KE3 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
               40        50        60        70        80        90

             160       170       180       190       200       210
pF1KE3 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
              100       110       120       130       140       150

             220       230       240       250       260       270
pF1KE3 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
              160       170       180       190       200       210

             280       290       300       310       320       330
pF1KE3 AARAAPALQPSQAAPAEGISQPAPARGDFAYAAPAPPDGALSHPQAPRWPPHPGKSREDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AARAAPALQPSQAAPAEGISQPAPARGDFAYAAPAPPDGALSHPQAPRWPPHPGKSREDR
              220       230       240       250       260       270

             340       350       360       370       380       390
pF1KE3 DPQRDGLPGPCAVAQPGPAQAGPQGQGVLAPPTSQGSPWWGWGRGPQVAGAAWEPQAGAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DPQRDGLPGPCAVAQPGPAQAGPQGQGVLAPPTSQGSPWWGWGRGPQVAGAAWEPQAGAA
              280       290       300       310       320       330

             400       410       420       430       440       450
pF1KE3 PPPQPAPPDASASARQGQMQGIPAPSQALQEPAPWSALPCGLLLDELLASPEFLQQAQPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PPPQPAPPDASASARQGQMQGIPAPSQALQEPAPWSALPCGLLLDELLASPEFLQQAQPL
              340       350       360       370       380       390

             460       470       480
pF1KE3 LETEAPGELEASEEAASLEAPLSEEEYRALLEEL
       ::::::::::::::::::::::::::::::::::
NP_001 LETEAPGELEASEEAASLEAPLSEEEYRALLEEL
              400       410       420

>>NP_001280727 (OMIM: 606009) double homeobox protein 4 (424 aa)
 initn: 3014 init1: 3014 opt: 3014 Z-score: 1302.2 bits: 250.2 E(91410): 9.2e-66
Smith-Waterman score: 3014; 100.0% identity (100.0% similar) in 424 aa overlap (62-485:1-424)

              40        50        60        70        80        90
pF1KE3 AVHSPAEVHGSPPASLCPRPSVKFRPGLTAMALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                     ::::::::::::::::::::::::::::::
NP_001                               MALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                             10        20        30

             100       110       120       130       140       150
pF1KE3 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
               40        50        60        70        80        90

             160       170       180       190       200       210
pF1KE3 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
              100       110       120       130       140       150

             220       230       240       250       260       270
pF1KE3 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
              160       170       180       190       200       210

             280       290       300       310       320       330
pF1KE3 AARAAPALQPSQAAPAEGISQPAPARGDFAYAAPAPPDGALSHPQAPRWPPHPGKSREDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AARAAPALQPSQAAPAEGISQPAPARGDFAYAAPAPPDGALSHPQAPRWPPHPGKSREDR
              220       230       240       250       260       270

             340       350       360       370       380       390
pF1KE3 DPQRDGLPGPCAVAQPGPAQAGPQGQGVLAPPTSQGSPWWGWGRGPQVAGAAWEPQAGAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 DPQRDGLPGPCAVAQPGPAQAGPQGQGVLAPPTSQGSPWWGWGRGPQVAGAAWEPQAGAA
              280       290       300       310       320       330

             400       410       420       430       440       450
pF1KE3 PPPQPAPPDASASARQGQMQGIPAPSQALQEPAPWSALPCGLLLDELLASPEFLQQAQPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PPPQPAPPDASASARQGQMQGIPAPSQALQEPAPWSALPCGLLLDELLASPEFLQQAQPL
              340       350       360       370       380       390

             460       470       480
pF1KE3 LETEAPGELEASEEAASLEAPLSEEEYRALLEEL
       ::::::::::::::::::::::::::::::::::
NP_001 LETEAPGELEASEEAASLEAPLSEEEYRALLEEL
              400       410       420

>>XP_024308351 (OMIM: 606009) double homeobox protein 4- (424 aa)
 initn: 3001 init1: 3001 opt: 3001 Z-score: 1296.7 bits: 249.2 E(91410): 1.9e-65
Smith-Waterman score: 3001; 99.5% identity (99.5% similar) in 424 aa overlap (62-485:1-424)

              40        50        60        70        80        90
pF1KE3 AVHSPAEVHGSPPASLCPRPSVKFRPGLTAMALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                     ::::::::::::::::::::::::::::::
XP_024                               MALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                             10        20        30

             100       110       120       130       140       150
pF1KE3 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
               40        50        60        70        80        90

             160       170       180       190       200       210
pF1KE3 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
              100       110       120       130       140       150

             220       230       240       250       260       270
pF1KE3 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
              160       170       180       190       200       210

             280       290       300       310       320       330
pF1KE3 AARAAPALQPSQAAPAEGISQPAPARGDFAYAAPAPPDGALSHPQAPRWPPHPGKSREDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 AARAAPALQPSQAAPAEGISQPAPARGDFAYAAPAPPDGALSHPQAPRWPPHPGKSREDR
              220       230       240       250       260       270

             340       350       360       370       380       390
pF1KE3 DPQRDGLPGPCAVAQPGPAQAGPQGQGVLAPPTSQGSPWWGWGRGPQVAGAAWEPQAGAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 DPQRDGLPGPCAVAQPGPAQAGPQGQGVLAPPTSQGSPWWGWGRGPQVAGAAWEPQAGAA
              280       290       300       310       320       330

             400       410       420       430       440       450
pF1KE3 PPPQPAPPDASASARQGQMQGIPAPSQALQEPAPWSALPCGLLLDELLASPEFLQQAQPL
       :::::::::::::::::::::::::::::: :::::: ::::::::::::::::::::::
XP_024 PPPQPAPPDASASARQGQMQGIPAPSQALQXPAPWSAXPCGLLLDELLASPEFLQQAQPL
              340       350       360       370       380       390

             460       470       480
pF1KE3 LETEAPGELEASEEAASLEAPLSEEEYRALLEEL
       ::::::::::::::::::::::::::::::::::
XP_024 LETEAPGELEASEEAASLEAPLSEEEYRALLEEL
              400       410       420

>>XP_024308352 (OMIM: 606009) double homeobox protein 4- (424 aa)
 initn: 3001 init1: 3001 opt: 3001 Z-score: 1296.7 bits: 249.2 E(91410): 1.9e-65
Smith-Waterman score: 3001; 99.5% identity (99.5% similar) in 424 aa overlap (62-485:1-424)

              40        50        60        70        80        90
pF1KE3 AVHSPAEVHGSPPASLCPRPSVKFRPGLTAMALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                     ::::::::::::::::::::::::::::::
XP_024                               MALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                             10        20        30

             100       110       120       130       140       150
pF1KE3 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
               40        50        60        70        80        90

             160       170       180       190       200       210
pF1KE3 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
              100       110       120       130       140       150

             220       230       240       250       260       270
pF1KE3 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
              160       170       180       190       200       210

             280       290       300       310       320       330
pF1KE3 AARAAPALQPSQAAPAEGISQPAPARGDFAYAAPAPPDGALSHPQAPRWPPHPGKSREDR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 AARAAPALQPSQAAPAEGISQPAPARGDFAYAAPAPPDGALSHPQAPRWPPHPGKSREDR
              220       230       240       250       260       270

             340       350       360       370       380       390
pF1KE3 DPQRDGLPGPCAVAQPGPAQAGPQGQGVLAPPTSQGSPWWGWGRGPQVAGAAWEPQAGAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_024 DPQRDGLPGPCAVAQPGPAQAGPQGQGVLAPPTSQGSPWWGWGRGPQVAGAAWEPQAGAA
              280       290       300       310       320       330

             400       410       420       430       440       450
pF1KE3 PPPQPAPPDASASARQGQMQGIPAPSQALQEPAPWSALPCGLLLDELLASPEFLQQAQPL
       :::::::::::::::::::::::::::::: :::::: ::::::::::::::::::::::
XP_024 PPPQPAPPDASASARQGQMQGIPAPSQALQXPAPWSAXPCGLLLDELLASPEFLQQAQPL
              340       350       360       370       380       390

             460       470       480
pF1KE3 LETEAPGELEASEEAASLEAPLSEEEYRALLEEL
       ::::::::::::::::::::::::::::::::::
XP_024 LETEAPGELEASEEAASLEAPLSEEEYRALLEEL
              400       410       420

>>NP_001350749 (OMIM: 606009) double homeobox protein 4 (160 aa)
 initn: 1128 init1: 1103 opt: 1103 Z-score: 499.0 bits: 100.1 E(91410): 5.1e-21
Smith-Waterman score: 1103; 100.0% identity (100.0% similar) in 159 aa overlap (62-220:1-159)

              40        50        60        70        80        90
pF1KE3 AVHSPAEVHGSPPASLCPRPSVKFRPGLTAMALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                     ::::::::::::::::::::::::::::::
NP_001                               MALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                             10        20        30

             100       110       120       130       140       150
pF1KE3 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
               40        50        60        70        80        90

             160       170       180       190       200       210
pF1KE3 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
              100       110       120       130       140       150

             220       230       240       250       260       270
pF1KE3 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
       :::::::::
NP_001 GQGGRAPAQV
              160

>>NP_036281 (OMIM: 611444) double homeobox protein 5 [Ho (197 aa)
 initn: 1341 init1: 1080 opt: 1080 Z-score: 488.2 bits: 98.5 E(91410): 2e-20
Smith-Waterman score: 1080; 82.1% identity (89.3% similar) in 196 aa overlap (36-231:2-197)

          10        20        30        40        50        60
pF1KE3 LPACGPLQGRLAGWLAVRAGLLAAPAAVHSPAEVHGSPPASLCPRPSVKFRPGLTAMALP
                                     ::::::::::::::  ::::::::  :::
NP_036                              MPAEVHGSPPASLCPCQSVKFRPGLPEMALL
                                            10        20        30

          70        80        90       100       110       120
pF1KE3 TPSDSTLPAEARGRGRRRRLVWTPSQSEALRACFERNPYPGIATRERLAQAIGIPEPRVQ
       :  :.::: ::.: :::  :. :::::.::::::::: ::::::.:.:::.: :::::::
NP_036 TALDDTLPEEAQGPGRRMILLSTPSQSDALRACFERNLYPGIATKEELAQGIDIPEPRVQ
              40        50        60        70        80        90

         130       140       150       160       170       180
pF1KE3 IWFQNERSRQLRQHRRESRPWPGRRGPPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAR
       :::::::: :::::::.:::::::: : .:::::::.:::::::::::::::::::::::
NP_036 IWFQNERSCQLRQHRRQSRPWPGRRDPQKGRRKRTAITGSQTALLLRAFEKDRFPGIAAR
             100       110       120       130       140       150

         190       200       210       220       230       240
pF1KE3 EELARETGLPESRIQIWFQNRRARHPGQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTG
       ::::::::::::::::::::::::: ::.::::.::.  :.::: :
NP_036 EELARETGLPESRIQIWFQNRRARHRGQSGRAPTQASIRCNAAPIG
             160       170       180       190

         250       260       270       280       290       300
pF1KE3 AWGTGLPAPHVPCAPGALPQGAFVSQAARAAPALQPSQAAPAEGISQPAPARGDFAYAAP

>>NP_036278 (OMIM: 611441) double homeobox protein 1 [Ho (170 aa)
 initn: 1152 init1: 926 opt: 926 Z-score: 423.8 bits: 86.3 E(91410): 7.7e-17
Smith-Waterman score: 926; 81.8% identity (90.0% similar) in 170 aa overlap (62-231:1-170)

              40        50        60        70        80        90
pF1KE3 AVHSPAEVHGSPPASLCPRPSVKFRPGLTAMALPTPSDSTLPAEARGRGRRRRLVWTPSQ
                                     ::: :  :.::: ::.: :::  :. ::::
NP_036                               MALLTALDDTLPEEAQGPGRRMILLSTPSQ
                                             10        20        30

             100       110       120       130       140       150
pF1KE3 SEALRACFERNPYPGIATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRG
       :.::::::::: ::::::.:.:::.: ::::::::::::::: :::::::.::::::::
NP_036 SDALRACFERNLYPGIATKEELAQGIDIPEPRVQIWFQNERSCQLRQHRRQSRPWPGRRD
               40        50        60        70        80        90

             160       170       180       190       200       210
pF1KE3 PPEGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHP
       : .:::::::.::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 PQKGRRKRTAITGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHR
              100       110       120       130       140       150

             220       230       240       250       260       270
pF1KE3 GQGGRAPAQAGGLCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQ
       ::.::::.::.  :.::: :
NP_036 GQSGRAPTQASIRCNAAPIG
              160       170


485 residues in 1 query sequences
64704883 residues in 91410 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Jul 24 16:39:25 2018 done: Tue Jul 24 16:39:26 2018
 Total Scan time: 5.570 Total Display time: 0.050

Function used was FASTA [36.3.4 Apr, 2011]