FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0586, 165 aa
1>>>pF1KE0586 165 - 165 aa - 165 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.7069+/-0.000542; mu= 11.2269+/- 0.034
mean_var=327.1059+/-63.918, 0's: 0 Z-trim(118.9): 152 B-trim: 71 in 1/49
Lambda= 0.070914
statistics sampled from 32257 (32431) to 32257 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.62), E-opt: 0.2 (0.38), width: 16
Scan time: 3.910
The best scores are:                                       opt  bits E(85289)
NP_005544 (OMIM: 148021) keratin-associated protei ( 169) 1041 118.8    4e-27
NP_001005922 (OMIM: 148022) keratin-associated pro ( 278)  910 105.8  5.5e-23
NP_112229 (OMIM: 608819) keratin-associated protei ( 177)  368  50.0  2.2e-06
NP_114163 (OMIM: 608822) keratin-associated protei ( 174)  294  42.4  0.00041
NP_112228 (OMIM: 608820) keratin-associated protei ( 167)  268  39.7   0.0026
>>NP_005544 (OMIM: 148021) keratin-associated protein 5- (169 aa)
initn: 1038 init1: 652 opt: 1041 Z-score: 607.9 bits: 118.8 E(85289): 4e-27
Smith-Waterman score: 1106; 72.8% identity (75.5% similar) in 184 aa overlap (1-165:1-169)
10 20 30 40 50 60
pF1KE0 MGCCGCSEGCGSGCGGCGSGCGGCGSGCGGCGSSCCVPVCCCKPVCCCVPACSCSSCGSC
::::::: ::::.:::: :.::.::::: ::: :::.:: ::::::::::::::::::
NP_005 MGCCGCSGGCGSSCGGCDSSCGSCGSGCRGCGPSCCAPVYCCKPVCCCVPACSCSSCG--
10 20 30 40 50
70 80 90 100 110
pF1KE0 GGSKGGCGSCGGSKGGCGSCGGSKGGCGSCGCSQCSCYKPCCCSSGCGSSCCQSSCCKP-
: :::::::::::::::::::::: ::::::::::::::: :::::
NP_005 -------------KRGCGSCGGSKGGCGSCGCSQCSCCKPCCCSSGCGSSCCQCSCCKPY
60 70 80 90 100
120 130 140 150 160
pF1KE0 C------------------CCQSSCCKPCCCSSGCGSSCCQSSCCNPCCSQSSCCVPVCC
: ::::::::::: ::::::::::::::.:::::: ::::::
NP_005 CSQCSCCKPCCSSSGRGSSCCQSSCCKPCCSSSGCGSSCCQSSCCKPCCSQSRCCVPVCY
110 120 130 140 150 160
pF1KE0 QCKI
::::
NP_005 QCKI
>>NP_001005922 (OMIM: 148022) keratin-associated protein (278 aa)
initn: 1294 init1: 600 opt: 910 Z-score: 533.6 bits: 105.8 E(85289): 5.5e-23
Smith-Waterman score: 1080; 70.9% identity (77.2% similar) in 189 aa overlap (2-165:90-278)
10 20
pF1KE0 MGCCGCSEG-CGS------GCGGCGSGCGGC
: :: :.: ::: :::.::.. :::
NP_001 SCSSCGKGGCGSSGGSKGGCGSCGGCKGGCGSCGGSKGGCGSCGGSKGGCGSCGGSKGGC
60 70 80 90 100 110
30 40 50 60 70
pF1KE0 GSGCGGCGSSCCVPVCCCKPVCCCVPACSCSSCG-----SCGGSKGGCGSCGGSKGGCGS
::::::::::::::::::::.::::::::::::: ::: :::.:::::::::::::
NP_001 GSGCGGCGSSCCVPVCCCKPMCCCVPACSCSSCGKGGCGSCGCSKGACGSCGGSKGGCGS
120 130 140 150 160 170
80 90 100 110 120
pF1KE0 CGGSKGGCGSCGCSQCSCYKPCC-CSSGCGSS--CCQSSCCKPC----------CCQSSC
::: :::::::: :. .: . : :.:::: ::. : : : : : ::
NP_001 CGGCKGGCGSCGGSKGGCGSGCGGCGSGCGVPVCCCSCSSCGSCAGSKGGCGSSCSQCSC
180 190 200 210 220 230
130 140 150 160
pF1KE0 CKPCCCSSGCGSSCCQSSCCNPCCSQSSCCVPVCCQCKI
::::::::::::::::::::.::::::::::::::::::
NP_001 CKPCCCSSGCGSSCCQSSCCKPCCSQSSCCVPVCCQCKI
240 250 260 270
>--
initn: 735 init1: 442 opt: 624 Z-score: 375.5 bits: 76.5 E(85289): 3.6e-14
Smith-Waterman score: 624; 80.9% identity (83.0% similar) in 94 aa overlap (1-87:1-89)
10 20 30 40 50
pF1KE0 MGCCGCSEGCGSGCGGCGSGCGGCGSGCGGCGS-------SCCVPVCCCKPVCCCVPACS
::::::: ::::.:::::::::::::::::::: :::::::::::::: ::.::
NP_001 MGCCGCSGGCGSSCGGCGSGCGGCGSGCGGCGSGCGGSGSSCCVPVCCCKPVCCRVPTCS
10 20 30 40 50 60
60 70 80 90 100 110
pF1KE0 CSSCGSCGGSKGGCGSCGGSKGGCGSCGGSKGGCGSCGCSQCSCYKPCCCSSGCGSSCCQ
::::: :::::: :::::::::::: ::::
NP_001 CSSCG-----KGGCGSSGGSKGGCGSCGGCKGGCGSCGGSKGGCGSCGGSKGGCGSCGGS
70 80 90 100 110
120 130 140 150 160
pF1KE0 SSCCKPCCCQSSCCKPCCCSSGCGSSCCQSSCCNPCCSQSSCCVPVCCQCKI
NP_001 KGGCGSGCGGCGSSCCVPVCCCKPMCCCVPACSCSSCGKGGCGSCGCSKGACGSCGGSKG
120 130 140 150 160 170
>>NP_112229 (OMIM: 608819) keratin-associated protein 1- (177 aa)
initn: 559 init1: 214 opt: 368 Z-score: 235.6 bits: 50.0 E(85289): 2.2e-06
Smith-Waterman score: 403; 39.2% identity (50.8% similar) in 189 aa overlap (1-163:1-170)
10 20 30 40 50
pF1KE0 MGCCGCSEGCG-SGC---GGCGSGCGGCGSGCGGCGSSCCVPVCCCKPVCCCVPACSCSS
:.:: : :: .: : :::.: : .: : .: : : :: . :: :.: .:
NP_112 MACCQTSF-CGFPSCSTSGTCGSSC--CQPSC--CETSSCQPRCC--ETSCCQPSCCQTS
10 20 30 40 50
60 70 80 90 100
pF1KE0 -CGSCGGSKGGCGSCGGSKGGCGSCGGSKGGCGSCGCSQCSCYKPCCCSSGCG-------
:: . : : :.: .: : : : : .: : :::. :..:::
NP_112 FCGFPSFSTG--GTCDSS---C--CQPS---CCETSCCQPSCYQTSSCGTGCGIGGGIGY
60 70 80 90 100
110 120 130 140 150
pF1KE0 ----SSCCQSSC---CKPCC-CQSSCCKPCCCSSGCGSSCCQ-----SSCCNPC-CSQSS
:: :. :.: : ...: ::: : :::: .::: : :.::
NP_112 GQEGSSGAVSTRIRWCRPDCRVEGTCLPPCCVVSCTPPSCCQLHHAEASCCRPSYCGQS-
110 120 130 140 150 160
160
pF1KE0 CCVPVCCQCKI
:: :::: :
NP_112 CCRPVCC-CYCSEPTC
170
>>NP_114163 (OMIM: 608822) keratin-associated protein 1- (174 aa)
initn: 219 init1: 219 opt: 294 Z-score: 194.8 bits: 42.4 E(85289): 0.00041
Smith-Waterman score: 318; 48.2% identity (56.5% similar) in 85 aa overlap (92-162:3-87)
70 80 90 100 110
pF1KE0 GSKGGCGSCGGSKGGCGSCGGSKGGCGSCGCSQCS-CYKPCCCSSG-CGSSCCQSSCCKP
: : : : : :: ::::::: :::.
NP_114 MTCCQTSFCGYPSFSISGTCGSSCCQPSCCET
10 20 30
120 130 140 150 160
pF1KE0 CCCQSSCCKP--C----------CCSSGCGSSCCQSSCCNPCCSQSSCCVPVCCQCKI
::: :. : : :: : :::..:::.: : ..::: : :::
NP_114 SCCQPRSCQTSFCGFPSFSTSGTCSSSCCQPSCCETSCCQPSCCETSCCQPSCCQISSCG
40 50 60 70 80 90
NP_114 TGCGIGGGISYGQEGSSGAVSTRIRWCRPDSRVEGTYLPPCCVVSCTPPSCCQLHHAQAS
100 110 120 130 140 150
>>NP_112228 (OMIM: 608820) keratin-associated protein 1- (167 aa)
initn: 214 init1: 214 opt: 268 Z-score: 180.5 bits: 39.7 E(85289): 0.0026
Smith-Waterman score: 341; 54.7% identity (66.7% similar) in 75 aa overlap (92-162:3-77)
70 80 90 100 110
pF1KE0 GSKGGCGSCGGSKGGCGSCGGSKGGCGSCGCSQCS-CYKPCCCSSG-CGSSCCQSSCCKP
: : : : : : .:: ::::::: :::.
NP_112 MTCCQTSFCGYPSCSTSGTCGSSCCQPSCCET
10 20 30
120 130 140 150 160
pF1KE0 CCCQSSCCKPCCCS--SGCGSSCCQSSCCNPCCSQSSCCVPVCCQCKI
::: :::. :. : :. :.::::.: : ..::: : :::
NP_112 SCCQPSCCQTSFCGFPSFSTSGTCSSSCCQPSCCETSCCQPSCCQTSSCGTGCGIGGGIG
40 50 60 70 80 90
NP_112 YGQEGSSGAVSTRIRWCRPDCRVEGTCLPPCCVVSCTPPTCCQLHHAEASCCRPSYCGQS
100 110 120 130 140 150
165 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 21:12:02 2016 done: Wed Nov 2 21:12:03 2016
Total Scan time: 3.910 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]