FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB0989, 135 aa
  1>>>pF1KB0989 135 - 135 aa - 135 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6950+/-0.000234; mu= 12.6870+/- 0.015
 mean_var=115.0654+/-23.013, 0's: 0 Z-trim(125.4): 22 B-trim: 425 in 1/54
 Lambda= 0.119564
 statistics sampled from 49138 (49165) to 49138 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.865), E-opt: 0.2 (0.576), width: 16
 Scan time: 5.950

The best scores are:                                       opt  bits E(85289)
NP_689781 (OMIM: 610772) homeobox protein Nkx-6.3  ( 135)  944 171.6 3.4e-43
XP_016868632 (OMIM: 610772) PREDICTED: homeobox pr ( 265)  551 104.1 1.4e-22
NP_006159 (OMIM: 602563) homeobox protein Nkx-6.1  ( 367)  246  51.7 1.2e-06
XP_016872278 (OMIM: 605955) PREDICTED: homeobox pr ( 277)  244  51.2 1.2e-06
NP_796374 (OMIM: 605955) homeobox protein Nkx-6.2  ( 277)  241  50.7 1.8e-06

>>NP_689781 (OMIM: 610772) homeobox protein Nkx-6.3 [Hom  (135 aa)
 initn: 944 init1: 944 opt: 944  Z-score: 896.5  bits: 171.6 E(85289): 3.4e-43
Smith-Waterman score: 944; 100.0% identity (100.0% similar) in 135 aa overlap (1-135:1-135)

               10        20        30        40        50        60
pF1KB0 MQQGQLAPGSRLCSGPWGLPELQPAAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_689 MQQGQLAPGSRLCSGPWGLPELQPAAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB0 RTKWRKKSALEPSSSTPRAPGGAGAGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_689 RTKWRKKSALEPSSSTPRAPGGAGAGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRK
               70        80        90       100       110       120

              130
pF1KB0 HRAAFSVLSLGAHSV
       :::::::::::::::
NP_689 HRAAFSVLSLGAHSV
              130

>>XP_016868632 (OMIM: 610772) PREDICTED: homeobox protei  (265 aa)
 initn: 573 init1: 551 opt: 551  Z-score: 526.5  bits: 104.1 E(85289): 1.4e-22
Smith-Waterman score: 551; 100.0% identity (100.0% similar) in 81 aa overlap (55-135:185-265)

           30        40        50        60        70        80
pF1KB0 AAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
                                     ::::::::::::::::::::::::::::::
XP_016 EKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
          160       170       180       190       200       210

           90       100       110       120       130
pF1KB0 AGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
       :::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
          220       230       240       250       260

>>NP_006159 (OMIM: 602563) homeobox protein Nkx-6.1 [Hom  (367 aa)
 initn: 294 init1: 149 opt: 246  Z-score: 240.4  bits: 51.7 E(85289): 1.2e-06
Smith-Waterman score: 246; 52.9% identity (77.1% similar) in 70 aa overlap (55-124:282-349)

           30        40        50        60        70        80
pF1KB0 AAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
                                     ::::::::::::: : : ...  .  . .
NP_006 EKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKKHAAEMATAKKKQDSETE
             260       270       280       290       300       310

           90       100       110       120       130
pF1KB0 AGAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
          :...  .:.:::.:::::::.::::::  ::.::...
NP_006 RLKGASE--NEEEDDDYNKPLDPNSDDEKITQLLKKHKSSSGGGGGLLLHASEPESSS
               320       330       340       350       360

>>XP_016872278 (OMIM: 605955) PREDICTED: homeobox protei  (277 aa)
 initn: 297 init1: 141 opt: 244  Z-score: 240.1  bits: 51.2 E(85289): 1.2e-06
Smith-Waterman score: 244; 55.1% identity (75.4% similar) in 69 aa overlap (55-122:194-259)

           30        40        50        60        70        80
pF1KB0 AAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
                                     ::::::::::::. :.: .:.  .  . :
XP_016 EKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKRHAVEMASAKKKQDSDAE
           170       180       190       200       210       220

            90       100       110       120       130
pF1KB0 A-GAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
          .::. :   ..:::::.::::.::::::  ::.::.
XP_016 KLKVGGSDA---EDDDEYNRPLDPNSDDEKITRLLKKHKPSNLALVSPCGGGAGDAL
           230          240       250       260       270

>>NP_796374 (OMIM: 605955) homeobox protein Nkx-6.2 [Hom  (277 aa)
 initn: 294 init1: 141 opt: 241  Z-score: 237.3  bits: 50.7 E(85289): 1.8e-06
Smith-Waterman score: 241; 55.1% identity (73.9% similar) in 69 aa overlap (55-122:194-259)

           30        40        50        60        70        80
pF1KB0 AAPSSSAAQLPWGESWGEEADTPACLSASGVWFQNRRTKWRKKSALEPSSSTPRAPGGAG
                                     ::::::::::::. : : .:.  .  . :
NP_796 EKTFEQTKYLAGPERARLAYSLGMTESQVKVWFQNRRTKWRKRHAAEMASAKKKQDSDAE
           170       180       190       200       210       220

            90       100       110       120       130
pF1KB0 A-GAGGDRAPSENEDDEYNKPLDPDSDDEKIRLLLRKHRAAFSVLSLGAHSV
          .::. :   ..:::::.::::.::::::  ::.::.
NP_796 KLKVGGSDA---EDDDEYNRPLDPNSDDEKITRLLKKHKPSNLALVSPCGGGAGDAL
           230          240       250       260       270

135 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 21:27:46 2016 done: Sat Nov 5 21:27:47 2016
 Total Scan time: 5.950 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]