FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0567, 255 aa
  1>>>pF1KE0567 255 - 255 aa - 255 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.1520+/-0.000496; mu= 8.4717+/- 0.031
 mean_var=263.1550+/-51.399, 0's: 0 Z-trim(118.0): 201 B-trim: 484 in 1/54
 Lambda= 0.079062
 statistics sampled from 30252 (30488) to 30252 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.657), E-opt: 0.2 (0.357), width: 16
 Scan time: 6.510

The best scores are:                                      opt bits E(85289)
NP_005544 (OMIM: 148021) keratin-associated protei ( 169) 556 76.0 4.9e-14
NP_112229 (OMIM: 608819) keratin-associated protei ( 177) 462 65.3 8.6e-11
NP_114163 (OMIM: 608822) keratin-associated protei ( 174) 449 63.8 2.4e-10
NP_112228 (OMIM: 608820) keratin-associated protei ( 167) 434 62.1 7.6e-10
NP_001005922 (OMIM: 148022) keratin-associated pro ( 278) 423 61.1 2.4e-09
NP_001244234 (OMIM: 608821) keratin-associated pro ( 121) 321 49.0 4.8e-06
NP_667345 (OMIM: 603255) transcriptional repressor ( 833) 301 47.9 6.9e-05
NP_002495 (OMIM: 603255) transcriptional repressor (1120) 301 48.1 8.2e-05
XP_011507814 (OMIM: 610278) PREDICTED: platelet en ( 868) 267 44.0 0.001
XP_016856731 (OMIM: 610278) PREDICTED: platelet en ( 909) 267 44.1 0.0011
XP_016856729 (OMIM: 610278) PREDICTED: platelet en ( 909) 267 44.1 0.0011
XP_016856730 (OMIM: 610278) PREDICTED: platelet en ( 909) 267 44.1 0.0011
XP_016878162 (OMIM: 612454) PREDICTED: multiple ep (1021) 263 43.7 0.0016
XP_016878161 (OMIM: 612454) PREDICTED: multiple ep (1044) 263 43.7 0.0016
NP_115821 (OMIM: 612454) multiple epidermal growth (1044) 263 43.7 0.0016
XP_016878160 (OMIM: 612454) PREDICTED: multiple ep (1092) 263 43.7 0.0016
XP_016878159 (OMIM: 612454) PREDICTED: multiple ep (1097) 263 43.7 0.0016
XP_016878164 (OMIM: 612454) PREDICTED: multiple ep ( 854) 251 42.2 0.0037
XP_016878163 (OMIM: 612454) PREDICTED: multiple ep ( 854) 251 42.2 0.0037
NP_787054 (OMIM: 600064) keratin-associated protei ( 163) 238 39.7 0.004

>>NP_005544 (OMIM: 148021) keratin-associated protein 5- (169 aa) initn: 356 init1: 356 opt: 556 Z-score: 372.9 bits: 76.0 E(85289): 4.9e-14 Smith-Waterman score: 558; 45.7% identity (57.7% similar) in 175 aa overlap (57-202:3-164) 30 40 50 60 70 80 pF1KE0 CELPCGTPSCCAPAPCLTLVCTPVSCVSSPCCQAACEPSACQSGCTSSCTPSCCQQS--S :: .:..:: ::: . :..: : NP_005 MGCC-------GCSGGCGSSC--GGCDSSCGS 10 20 90 100 110 120 pF1KE0 CQPACCTSSPCQQACCVPV-CCKPVCC-VPVC-------------------CGASSCCQQ : .: : .::.:: ::::::: ::.: ::. .: : NP_005 CGSGC---RGCGPSCCAPVYCCKPVCCCVPACSCSSCGKRGCGSCGGSKGGCGSCGCSQC 30 40 50 60 70 80 130 140 150 160 170 pF1KE0 SSCQPACCAS----SSCQQSCRVPVCCKAVCCVPTCSESS--SSCCQQSSCQPACCTSSP : :.: ::.: : :: :: : : . :: : :: :. :::::.: :.: ::.:: NP_005 SCCKPCCCSSGCGSSCCQCSCCKPYCSQCSCCKPCCSSSGRGSSCCQSSCCKP-CCSSSG 90 100 110 120 130 180 190 200 210 220 230 pF1KE0 CQQSCCVSVCCKPVCCKSICCVPVCSGASSPCCQQSSCQPACCTSSCCRPSSSVSLLCRP : .::: : :::: : .: :::::: NP_005 CGSSCCQSSCCKPCCSQSRCCVPVCYQCKI 140 150 160 >>NP_112229 (OMIM: 608819) keratin-associated protein 1- (177 aa) initn: 751 init1: 270 opt: 462 Z-score: 314.8 bits: 65.3 E(85289): 8.6e-11 Smith-Waterman score: 513; 37.0% identity (52.6% similar) in 192 aa overlap (25-202:2-177) 10 20 30 40 50 pF1KE0 MAASTMSICSSACTNSWQVDDCPESCCELP-CGTPSCCAPAPCLTLVCTPVSCVSSPCCQ .::. :: ::: . . : . : : :: : NP_112 MACCQTSFCGFPSCSTSGTCGSSCCQP-SC----CET 10 20 30 60 70 80 90 100 110 pF1KE0 AACEPSACQSGCTSSCTPSCCQQSSCQ-PACCTSSPCQQACCVPVCCKPVCCVPVCCGAS ..:.: :...: : ::::: : : :. :.. :...:: : ::. :: : : .: NP_112 SSCQPRCCETSC---CQPSCCQTSFCGFPSFSTGGTCDSSCCQPSCCETSCCQPSCYQTS 40 50 60 70 80 120 130 140 150 160 pF1KE0 SC---C---------QQSSCQPACCASSSCQQSCRVPVCCKAVCCVPTCSESSSSCCQQS :: : :..: . :. .::: : ::: .:.
: ::: NP_112 SCGTGCGIGGGIGYGQEGSSGAVSTRIRWCRPDCRVEGTCLPPCCVVSCTPPS--CCQLH 90 100 110 120 130 140 170 180 190 200 210 220 pF1KE0 SCQPACCTSSPCQQSCCVSVCCKPVCCKSICCVPVCSGASSPCCQQSSCQPACCTSSCCR . .:: : : :::: .:::: : :.: NP_112 HAEASCCRPSYCGQSCC-----RPVCC-CYCSEPTC 150 160 170 230 240 250 pF1KE0 PSSSVSLLCRPVCSRPASCSFSSGQKSSC >>NP_114163 (OMIM: 608822) keratin-associated protein 1- (174 aa) initn: 680 init1: 274 opt: 449 Z-score: 306.8 bits: 63.8 E(85289): 2.4e-10 Smith-Waterman score: 479; 38.2% identity (52.8% similar) in 178 aa overlap (25-192:2-174) 10 20 30 40 50 pF1KE0 MAASTMSICSSACTNSWQVDDCPESCCELP-CGTPSCCAPAPCLTLVCTPVSCVSSPCCQ .::. :: :: . : . : : :: . ::: NP_114 MTCCQTSFCGYPSFSISGTCGSSCCQP-SCCETSCCQ 10 20 30 60 70 80 90 100 110 pF1KE0 A-ACEPSACQSGCTSSCTPSCCQQSSCQPACCTSSPCQQACCVPVCCKPVCC-VPVC--- .:. : : : : : . :..: :::.:: .: :: .:: ::.: :: . : NP_114 PRSCQTSFC--GFPSFSTSGTCSSSCCQPSCCETSCCQPSCCETSCCQPSCCQISSCGTG 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE0 CGAS---SCCQQSSCQPACCASSSCQQSCRVPVCCKAVCCVPTCSESSSSCCQQSSCQPA :: . : :..: . :. . :: ::: .:. :::: : . NP_114 CGIGGGISYGQEGSSGAVSTRIRWCRPDSRVEGTYLPPCCVVSCTPP--SCCQLHHAQAS 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE0 CCTSSPCQQSCCVSVCC-KPVCCKSICCVPVCSGASSPCCQQSSCQPACCTSSCCRPSSS :: : : :::: ::: .:.: NP_114 CCRPSYCGQSCCRPVCCCEPTC 160 170 >>NP_112228 (OMIM: 608820) keratin-associated protein 1- (167 aa) initn: 280 init1: 280 opt: 434 Z-score: 297.8 bits: 62.1 E(85289): 7.6e-10 Smith-Waterman score: 497; 37.5% identity (51.0% similar) in 192 aa overlap (25-202:2-167) 10 20 30 40 50 pF1KE0 MAASTMSICSSACTNSWQVDDCPESCCELP-CGTPSCCAPAPCLTLVCTPVSCVSSPCCQ .::. :: ::: . . : .: ::: NP_112 MTCCQTSFCGYPSCSTSGTC-----------GSSCCQ 10 20 60 70 80 90 100 110 pF1KE0 AACEPSACQSGCTSSCTPSCCQQSSCQ-PACCTSSPCQQACCVPVCCKPVCCVPVCCGAS :: :...: : ::::: : : :. ::. :...:: : ::. 
:: : :: .: NP_112 ----PSCCETSC---CQPSCCQTSFCGFPSFSTSGTCSSSCCQPSCCETSCCQPSCCQTS 30 40 50 60 70 120 130 140 150 160 pF1KE0 SC---C---------QQSSCQPACCASSSCQQSCRVPVCCKAVCCVPTCSESSSSCCQQS :: : :..: . :. .::: : ::: .:. . ::: NP_112 SCGTGCGIGGGIGYGQEGSSGAVSTRIRWCRPDCRVEGTCLPPCCVVSCTPPT--CCQLH 80 90 100 110 120 130 170 180 190 200 210 220 pF1KE0 SCQPACCTSSPCQQSCCVSVCCKPVCCKSICCVPVCSGASSPCCQQSSCQPACCTSSCCR . .:: : : :::: ::: : : : :.: NP_112 HAEASCCRPSYCGQSCCRPVCC----CYS--CEPTC 140 150 160 230 240 250 pF1KE0 PSSSVSLLCRPVCSRPASCSFSSGQKSSC >>NP_001005922 (OMIM: 148022) keratin-associated protein (278 aa) initn: 368 init1: 227 opt: 423 Z-score: 288.7 bits: 61.1 E(85289): 2.4e-09 Smith-Waterman score: 505; 35.7% identity (51.5% similar) in 241 aa overlap (25-239:41-273) 10 20 30 40 50 pF1KE0 MAASTMSICSSACTNSWQVDDCPESCCELP--CGTPSCCAPAPCLTLVCTPVSC ::: .: : : :: : : .: NP_001 GSSCGGCGSGCGGCGSGCGGCGSGCGGSGSSCC-VPVCCCKPVCCRVPTCSCSSCGKGGC 20 30 40 50 60 60 70 80 90 100 pF1KE0 VSSPCCQAACEP-SACQSGCTSSCTPSCCQQSSC--QPACCTSSPCQQACCVPVC--CKP :: ...: ..:..:: .:: : .:: . . : : ... : : : NP_001 GSSGGSKGGCGSCGGCKGGC-GSCGGSKGGCGSCGGSKGGCGSCGGSKGGCGSGCGGCGS 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE0 VCCVPVCCGASSCCQQSSCQPACCASSSCQQSCRVPVCCKAVCCVPTCSESSSSCCQQSS ::::::: :: : ::: :: . .: : :..: .:. :...: . .. NP_001 SCCVPVCCCKPMCC----CVPACSCSSCGKGGCGSCGCSKGAC--GSCGGSKGGCGSCGG 130 140 150 160 170 180 170 180 190 200 pF1KE0 CQPAC--CTSSP--CQQSC--CVSVCCKPVCCKSI--C---------CVPVCSGAS--SP :. 
.: : .: : ..: : : : :::: : : : :: : .: NP_001 CKGGCGSCGGSKGGCGSGCGGCGSGCGVPVCCCSCSSCGSCAGSKGGCGSSCSQCSCCKP 190 200 210 220 230 240 210 220 230 240 250 pF1KE0 CCQQSSCQPACCTSSCCRPSSSVSLLCRPVCSRPASCSFSSGQKSSC :: .:.: .:: ::::.: : : : ::: NP_001 CCCSSGCGSSCCQSSCCKPCCSQSSCCVPVCCQCKI 250 260 270 >>NP_001244234 (OMIM: 608821) keratin-associated protein (121 aa) initn: 415 init1: 217 opt: 321 Z-score: 229.5 bits: 49.0 E(85289): 4.8e-06 Smith-Waterman score: 331; 35.4% identity (51.2% similar) in 127 aa overlap (88-202:3-121) 60 70 80 90 100 110 pF1KE0 CQAACEPSACQSGCTSSCTPSCCQQSSCQPACCTSSPCQQACCVPVCCKPVCCVPVCCGA .: ::. : ..:: : ::. :: : :: . NP_001 MASCSTSGTCGSSCCQPSCCETSCCQPSCCQT 10 20 30 120 130 140 150 160 pF1KE0 SSC---C---------QQSSCQPACCASSSCQQSCRVPVCCKAVCCVPTCSESSSSCCQQ ::: : :..: . :. .:.: : : . .:. : ::: NP_001 SSCGTGCGIGGGIGYGQEGSGGSVSTRIRWCHPDCHVEGTCLPPCYLVSCTPPS--CCQL 40 50 60 70 80 90 170 180 190 200 210 220 pF1KE0 SSCQPACCTSSPCQQSCCVSVCCKPVCCKSICCVPVCSGASSPCCQQSSCQPACCTSSCC . .:: : : :::: .:: :. :: :.: NP_001 HHAEASCCRPSYCGQSCCRPACC----CH--CCEPTC 100 110 120 230 240 250 pF1KE0 RPSSSVSLLCRPVCSRPASCSFSSGQKSSC >>NP_667345 (OMIM: 603255) transcriptional repressor NF- (833 aa) initn: 95 init1: 95 opt: 301 Z-score: 208.7 bits: 47.9 E(85289): 6.9e-05 Smith-Waterman score: 307; 25.6% identity (45.8% similar) in 262 aa overlap (18-255:529-778) 10 20 30 40 pF1KE0 MAASTMSICSSACTNSWQVDDCPESCCELPCGTPSCCAPA----PCL :: : . .. ::: . . :: NP_667 PCENILNCGQHQCAELCHGGQCQPCQIILNQVCYCGSTSRDVLCGTDVGKSDGFGDFSCL 500 510 520 530 540 550 50 60 70 80 90 pF1KE0 TLVCTPVSCVSSPCCQAACEPSACQSGCTS-----SCTPSCCQQSSCQPACCTSSPCQQA . ..: . : :. :.:. ::. : : : : :. . .: ... NP_667 KICGKDLKCGNHTCSQV-CHPQPCQQ-CPRLPQLVRCCP--CGQTPLSQLLELGSSSRKT 560 570 580 590 600 610 100 110 120 130 140 150 pF1KE0 CCVPV-CCKPVCCVPVCCGA----SSC---CQQSSCQPACCASSSCQQSCRVPVCCKAVC : :: : :: :. ::. .: :....: : :. .: ::: : . 
NP_667 CMDPVPSCGKVCGKPLPCGSLDFIHTCEKLCHEGDCGP--CSRTSVI-SCRCSFRTKELP 620 630 640 650 660 670 160 170 180 190 200 pF1KE0 CVPTCSESSSSCCQQSSCQPACCTSSPCQQSCCVSVC--CKPVCCKSI-CCVPVCSGASS :. ::... :.. . : :.. :::. : .: ... : . : NP_667 CTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLICGRKLRCGLHRCE---E 680 690 700 710 720 210 220 230 240 250 pF1KE0 PC----CQQSSCQPACCTSSCCRPSSSVSLLCRPVCSRPASCSFSSGQKSSC :: :: .: : :. ..:: : .:: :. . .. : NP_667 PCHRGNCQ--TCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSC 730 740 750 760 770 780 NP_667 HSEEKCPPCTFLTQKWCMGKHEQSHYWASTQKKRSHYMKKIPAHACL 790 800 810 820 830 >>NP_002495 (OMIM: 603255) transcriptional repressor NF- (1120 aa) initn: 148 init1: 95 opt: 301 Z-score: 207.4 bits: 48.1 E(85289): 8.2e-05 Smith-Waterman score: 307; 25.6% identity (45.8% similar) in 262 aa overlap (18-255:529-778) 10 20 30 40 pF1KE0 MAASTMSICSSACTNSWQVDDCPESCCELPCGTPSCCAPA----PCL :: : . .. ::: . . :: NP_002 PCENILNCGQHQCAELCHGGQCQPCQIILNQVCYCGSTSRDVLCGTDVGKSDGFGDFSCL 500 510 520 530 540 550 50 60 70 80 90 pF1KE0 TLVCTPVSCVSSPCCQAACEPSACQSGCTS-----SCTPSCCQQSSCQPACCTSSPCQQA . ..: . : :. :.:. ::. : : : : :. . .: ... NP_002 KICGKDLKCGNHTCSQV-CHPQPCQQ-CPRLPQLVRCCP--CGQTPLSQLLELGSSSRKT 560 570 580 590 600 610 100 110 120 130 140 150 pF1KE0 CCVPV-CCKPVCCVPVCCGA----SSC---CQQSSCQPACCASSSCQQSCRVPVCCKAVC : :: : :: :. ::. .: :....: : :. .: ::: : . NP_002 CMDPVPSCGKVCGKPLPCGSLDFIHTCEKLCHEGDCGP--CSRTSVI-SCRCSFRTKELP 620 630 640 650 660 670 160 170 180 190 200 pF1KE0 CVPTCSESSSSCCQQSSCQPACCTSSPCQQSCCVSVC--CKPVCCKSI-CCVPVCSGASS :. ::... :.. . : :.. :::. : .: ... : . : NP_002 CTSLKSEDATFMCDKRCNKKRLCGRHKCNEICCVDKEHKCPLICGRKLRCGLHRCE---E 680 690 700 710 720 210 220 230 240 250 pF1KE0 PC----CQQSSCQPACCTSSCCRPSSSVSLLCRPVCSRPASCSFSSGQKSSC :: :: .: : :. ..:: : .:: :. . .. 
: NP_002 PCHRGNCQ--TCWQASFDELTCHCGASVIYPPVPCGTRPPECTQTCARVHECDHPVYHSC 730 740 750 760 770 780 NP_002 HSEEKCPPCTFLTQKWCMGKHEFRSNIPCHLVDISCGLPCSATLPCGMHKCQRLCHKGEC 790 800 810 820 830 840 >>XP_011507814 (OMIM: 610278) PREDICTED: platelet endoth (868 aa) initn: 124 init1: 62 opt: 267 Z-score: 187.6 bits: 44.0 E(85289): 0.001 Smith-Waterman score: 289; 28.0% identity (45.5% similar) in 286 aa overlap (13-245:299-565) 10 20 30 40 pF1KE0 MAASTMSICSSACTNSWQVDDCPESCCELPCGTP--SCCAPA : ..:: .: : : :: :: : XP_011 CPPDTYGVNCSARCSCENAIACSPIDGECVCKEGWQRGNCSVPC---PPGTWGFSCNASC 270 280 290 300 310 320 50 60 70 80 pF1KE0 PCL-TLVCTPVS--CVSSPC-----CQAACEPSACQSGCTSSCTPSCCQQSSCQPA---C : ::.: . :. .: :: : . ::.: : .: ....:.:. : XP_011 QCAHEAVCSPQTGACTCTPGWHGAHCQLPCPKGQFGEGCASRC--DCDHSDGCDPVHGRC 330 340 350 360 370 380 90 100 110 120 130 pF1KE0 -CTS----SPCQQACCVPVCCKPVCCVPVCCGASSCCQQSSCQPA---C-CASS----SC : . . :. .: : : : .: .: . ..: : : :: . :: XP_011 QCQAGWMGARCHLSC--PEGLWGVNCSNTC----TCKNGGTCLPENGNCVCAPGFRGPSC 390 400 410 420 430 140 150 160 170 180 pF1KE0 QQSCRVPVCCKAVCCVPTCSESSSSCCQQSSCQPACC---TSSPCQQSC--------CVS :.::. : ::: :. .. : :. :. : :. :.: : :.. XP_011 QRSCQPGRYGKR--CVP-CKCANHSFCHPSNGTCYCLAGWTGPDCSQPCPPGHWGENCAQ 440 450 460 470 480 490 190 200 210 220 pF1KE0 VC-------CKPVCCKSIC--------CVPVCS-GASSPCCQQSSCQPACCTSSCCRPSS .: :.: . :: :. : :. . :.: :: : . :.: . XP_011 TCQCHHGGTCHPQDGSCICPLGWTGHHCLEGCPLGTFGANCSQP-CQ--CGPGEKCHPET 500 510 520 530 540 550 230 240 250 pF1KE0 SVSLLCRPVCSRPASCSFSSGQKSSC .. . 
: : : : : XP_011 GACV-CPPGHS-GAPCRIGIQEPFTVMPTTPVAYNSLGAVIGIAVLGSLVVALVALFIGY 560 570 580 590 600 >>XP_016856731 (OMIM: 610278) PREDICTED: platelet endoth (909 aa) initn: 92 init1: 62 opt: 267 Z-score: 187.4 bits: 44.1 E(85289): 0.0011 Smith-Waterman score: 289; 28.0% identity (45.5% similar) in 286 aa overlap (13-245:340-606) 10 20 30 40 pF1KE0 MAASTMSICSSACTNSWQVDDCPESCCELPCGTP--SCCAPA : ..:: .: : : :: :: : XP_016 CPPDTYGVNCSARCSCENAIACSPIDGECVCKEGWQRGNCSVPC---PPGTWGFSCNASC 310 320 330 340 350 360 50 60 70 80 pF1KE0 PCL-TLVCTPVS--CVSSPC-----CQAACEPSACQSGCTSSCTPSCCQQSSCQPA---C : ::.: . :. .: :: : . ::.: : .: ....:.:. : XP_016 QCAHEAVCSPQTGACTCTPGWHGAHCQLPCPKGQFGEGCASRC--DCDHSDGCDPVHGRC 370 380 390 400 410 420 90 100 110 120 130 pF1KE0 -CTS----SPCQQACCVPVCCKPVCCVPVCCGASSCCQQSSCQPA---C-CASS----SC : . . :. .: : : : .: .: . ..: : : :: . :: XP_016 QCQAGWMGARCHLSC--PEGLWGVNCSNTC----TCKNGGTCLPENGNCVCAPGFRGPSC 430 440 450 460 470 140 150 160 170 180 pF1KE0 QQSCRVPVCCKAVCCVPTCSESSSSCCQQSSCQPACC---TSSPCQQSC--------CVS :.::. : ::: :. .. : :. :. : :. :.: : :.. XP_016 QRSCQPGRYGKR--CVP-CKCANHSFCHPSNGTCYCLAGWTGPDCSQPCPPGHWGENCAQ 480 490 500 510 520 530 190 200 210 220 pF1KE0 VC-------CKPVCCKSIC--------CVPVCS-GASSPCCQQSSCQPACCTSSCCRPSS .: :.: . :: :. : :. . :.: :: : . :.: . XP_016 TCQCHHGGTCHPQDGSCICPLGWTGHHCLEGCPLGTFGANCSQP-CQ--CGPGEKCHPET 540 550 560 570 580 590 230 240 250 pF1KE0 SVSLLCRPVCSRPASCSFSSGQKSSC .. . : : : : : XP_016 GACV-CPPGHS-GAPCRIGIQEPFTVMPTTPVAYNSLGAVIGIAVLGSLVVALVALFIGY 600 610 620 630 640 650

255 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov 2 23:01:00 2016 done: Wed Nov 2 23:01:01 2016
 Total Scan time: 6.510 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]