FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7967, 283 aa
1>>>pF1KB7967 283 - 283 aa - 283 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.6052+/-0.000948; mu= -4.0140+/- 0.056
mean_var=310.3237+/-70.107, 0's: 0 Z-trim(114.6): 698 B-trim: 0 in 0/51
Lambda= 0.072806
statistics sampled from 14302 (15159) to 14302 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.801), E-opt: 0.2 (0.466), width: 16
Scan time: 2.210
The best scores are:                                opt  bits E(32554)
CCDS7060.1 KLF6 gene_id:1316|Hs108|chr10    ( 283) 1966 219.5 2.3e-57
CCDS53490.1 KLF6 gene_id:1316|Hs108|chr10   ( 237) 1537 174.3 7.3e-44
CCDS2373.1 KLF7 gene_id:8609|Hs108|chr2     ( 302)  703  86.8 2e-17
CCDS59440.1 KLF7 gene_id:8609|Hs108|chr2    ( 269)  698  86.2 2.7e-17
CCDS59438.1 KLF7 gene_id:8609|Hs108|chr2    ( 274)  698  86.3 2.7e-17
CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13  ( 402)  586  74.7 1.2e-13
CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4    ( 345)  576  73.5 2.3e-13
CCDS66562.1 KLF5 gene_id:688|Hs108|chr13    ( 366)  564  72.3 5.8e-13
CCDS9448.1 KLF5 gene_id:688|Hs108|chr13     ( 457)  564  72.4 6.8e-13
CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX   ( 359)  546  70.4 2.1e-12
CCDS6770.2 KLF4 gene_id:9314|Hs108|chr9     ( 479)  540  69.9 4e-12
CCDS12285.1 KLF1 gene_id:10661|Hs108|chr19  ( 362)  531  68.8 6.3e-12
CCDS12343.1 KLF2 gene_id:10365|Hs108|chr19  ( 355)  524  68.1 1e-11
>>CCDS7060.1 KLF6 gene_id:1316|Hs108|chr10 (283 aa)
initn: 1966 init1: 1966 opt: 1966 Z-score: 1143.5 bits: 219.5 E(32554): 2.3e-57
Smith-Waterman score: 1966; 100.0% identity (100.0% similar) in 283 aa overlap (1-283:1-283)
10 20 30 40 50 60
pF1KB7 MDVLPMCSIFQELQIVHETGYFSALPSLEEYWQQTCLELERYLQSEPCYVSASEIKFDSQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 MDVLPMCSIFQELQIVHETGYFSALPSLEEYWQQTCLELERYLQSEPCYVSASEIKFDSQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 EDLWTKIILAREKKEESELKISSSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEELS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 EDLWTKIILAREKKEESELKISSSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEELS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 PTAKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSREPSQLWGCVPGELPSPGKVRSGTS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 PTAKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSREPSQLWGCVPGELPSPGKVRSGTS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 GKPGDKGNGDASPDGRRRVHRCHFNGCRKVYTKSSHLKAHQRTHTGEKPYRCSWEGCEWR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS70 GKPGDKGNGDASPDGRRRVHRCHFNGCRKVYTKSSHLKAHQRTHTGEKPYRCSWEGCEWR
190 200 210 220 230 240
250 260 270 280
pF1KB7 FARSDELTRHFRKHTGAKPFKCSHCDRCFSRSDHLALHMKRHL
:::::::::::::::::::::::::::::::::::::::::::
CCDS70 FARSDELTRHFRKHTGAKPFKCSHCDRCFSRSDHLALHMKRHL
250 260 270 280
>>CCDS53490.1 KLF6 gene_id:1316|Hs108|chr10 (237 aa)
initn: 1600 init1: 1524 opt: 1537 Z-score: 900.9 bits: 174.3 E(32554): 7.3e-44
Smith-Waterman score: 1537; 97.4% identity (97.9% similar) in 234 aa overlap (1-234:1-234)
10 20 30 40 50 60
pF1KB7 MDVLPMCSIFQELQIVHETGYFSALPSLEEYWQQTCLELERYLQSEPCYVSASEIKFDSQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MDVLPMCSIFQELQIVHETGYFSALPSLEEYWQQTCLELERYLQSEPCYVSASEIKFDSQ
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 EDLWTKIILAREKKEESELKISSSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEELS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EDLWTKIILAREKKEESELKISSSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEELS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 PTAKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSREPSQLWGCVPGELPSPGKVRSGTS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 PTAKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSREPSQLWGCVPGELPSPGKVRSGTS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 GKPGDKGNGDASPDGRRRVHRCHFNGCRKVYTKSSHLKAHQRTHTGEKPYRCSWEGCEWR
:::::::::::::::::::::::::::::::::::::::::::::: : .:
CCDS53 GKPGDKGNGDASPDGRRRVHRCHFNGCRKVYTKSSHLKAHQRTHTGVFPGLTTWPCT
190 200 210 220 230
250 260 270 280
pF1KB7 FARSDELTRHFRKHTGAKPFKCSHCDRCFSRSDHLALHMKRHL
>>CCDS2373.1 KLF7 gene_id:8609|Hs108|chr2 (302 aa)
initn: 1099 init1: 661 opt: 703 Z-score: 426.2 bits: 86.8 E(32554): 2e-17
Smith-Waterman score: 930; 51.9% identity (66.8% similar) in 316 aa overlap (1-283:1-302)
10 20 30 40 50 60
pF1KB7 MDVLPMCSIFQELQIVHETGYFSALPSLEEYWQQTCLELERYLQSEPCYVSASEIKFDSQ
:::: :::::::.::.:::::::::::: :::::::::::::.:: .: . :
CCDS23 MDVLASYSIFQELQLVHDTGYFSALPSLEETWQQTCLELERYLQTEPRRISET---FG--
10 20 30 40 50
70 80 90 100 110
pF1KB7 EDLWTKIILAREKK--EESELKISSSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEE
::: .. : ::: .. : :. : : .: . . :... ::
CCDS23 EDL-DCFLHASPPPCIEESFRRL------DPLLLPVEAAICEKSSAVDILLSRDKLLSET
60 70 80 90 100
120 130 140 150 160 170
pF1KB7 LSPTAKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSR---EPSQLWGCVPGELP-----
.:. . . :....:.. .. :::::::::: . :: . : : .
CCDS23 CLSLQPASSSLDSYTAVNQAQLNAVTSLTPPSSPELSRHLVKTSQTLSAVDGTVTLKLVA
110 120 130 140 150 160
180 190 200
pF1KB7 ---------------------SPGKVRSGTSGKPGDKGN--GDASPDGRRRVHRCHFNGC
. : :.:: : . :.:. ..: :....:::::.::::
CCDS23 KKAALSSVKVGGVATAAAAVTAAGAVKSGQSDS--DQGGLGAEACPENKKRVHRCQFNGC
170 180 190 200 210 220
210 220 230 240 250 260
pF1KB7 RKVYTKSSHLKAHQRTHTGEKPYRCSWEGCEWRFARSDELTRHFRKHTGAKPFKCSHCDR
:::::::::::::::::::::::.:::::::::::::::::::.:::::::::::.::::
CCDS23 RKVYTKSSHLKAHQRTHTGEKPYKCSWEGCEWRFARSDELTRHYRKHTGAKPFKCNHCDR
230 240 250 260 270 280
270 280
pF1KB7 CFSRSDHLALHMKRHL
:::::::::::::::.
CCDS23 CFSRSDHLALHMKRHI
290 300
>>CCDS59440.1 KLF7 gene_id:8609|Hs108|chr2 (269 aa)
initn: 915 init1: 661 opt: 698 Z-score: 424.0 bits: 86.2 E(32554): 2.7e-17
Smith-Waterman score: 746; 48.2% identity (64.2% similar) in 282 aa overlap (35-283:2-269)
10 20 30 40 50 60
pF1KB7 PMCSIFQELQIVHETGYFSALPSLEEYWQQTCLELERYLQSEPCYVSASEIKFDSQEDLW
::::::::::.:: .: . : :::
CCDS59 MTCLELERYLQTEPRRISET---FG--EDL-
10 20
70 80 90 100 110 120
pF1KB7 TKIILAREKK--EESELKISSSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEELSPT
.. : ::: .. : :. : : .: . . :... ::
CCDS59 DCFLHASPPPCIEESFRRL------DPLLLPVEAAICEKSSAVDILLSRDKLLSETCLSL
30 40 50 60 70
130 140 150 160 170
pF1KB7 AKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSR---EPSQLWGCVPGELP---------
.:. . . :....:.. .. :::::::::: . :: . : : .
CCDS59 QPASSSLDSYTAVNQAQLNAVTSLTPPSSPELSRHLVKTSQTLSAVDGTVTLKLVAKKAA
80 90 100 110 120 130
180 190 200 210
pF1KB7 -----------------SPGKVRSGTSGKPGDKGN--GDASPDGRRRVHRCHFNGCRKVY
. : :.:: : .:.:. ..: :....:::::.::::::::
CCDS59 LSSVKVGGVATAAAAVTAAGAVKSGQSD--SDQGGLGAEACPENKKRVHRCQFNGCRKVY
140 150 160 170 180 190
220 230 240 250 260 270
pF1KB7 TKSSHLKAHQRTHTGEKPYRCSWEGCEWRFARSDELTRHFRKHTGAKPFKCSHCDRCFSR
:::::::::::::::::::.:::::::::::::::::::.:::::::::::.::::::::
CCDS59 TKSSHLKAHQRTHTGEKPYKCSWEGCEWRFARSDELTRHYRKHTGAKPFKCNHCDRCFSR
200 210 220 230 240 250
280
pF1KB7 SDHLALHMKRHL
:::::::::::.
CCDS59 SDHLALHMKRHI
260
>>CCDS59438.1 KLF7 gene_id:8609|Hs108|chr2 (274 aa)
initn: 915 init1: 661 opt: 698 Z-score: 423.9 bits: 86.3 E(32554): 2.7e-17
Smith-Waterman score: 748; 48.1% identity (63.9% similar) in 285 aa overlap (32-283:5-274)
10 20 30 40 50 60
pF1KB7 DVLPMCSIFQELQIVHETGYFSALPSLEEYWQQTCLELERYLQSEPCYVSASEIKFDSQE
: ::::::::::.:: .: . : :
CCDS59 MFPSWP-TCLELERYLQTEPRRISET---FG--E
10 20
70 80 90 100 110
pF1KB7 DLWTKIILAREKK--EESELKISSSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEEL
:: .. : ::: .. : :. : : .: . . :... ::
CCDS59 DL-DCFLHASPPPCIEESFRRL------DPLLLPVEAAICEKSSAVDILLSRDKLLSETC
30 40 50 60 70 80
120 130 140 150 160 170
pF1KB7 SPTAKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSR---EPSQLWGCVPGELP------
.:. . . :....:.. .. :::::::::: . :: . : : .
CCDS59 LSLQPASSSLDSYTAVNQAQLNAVTSLTPPSSPELSRHLVKTSQTLSAVDGTVTLKLVAK
90 100 110 120 130 140
180 190 200
pF1KB7 --------------------SPGKVRSGTSGKPGDKGN--GDASPDGRRRVHRCHFNGCR
. : :.:: : . :.:. ..: :....:::::.:::::
CCDS59 KAALSSVKVGGVATAAAAVTAAGAVKSGQSDS--DQGGLGAEACPENKKRVHRCQFNGCR
150 160 170 180 190
210 220 230 240 250 260
pF1KB7 KVYTKSSHLKAHQRTHTGEKPYRCSWEGCEWRFARSDELTRHFRKHTGAKPFKCSHCDRC
::::::::::::::::::::::.:::::::::::::::::::.:::::::::::.:::::
CCDS59 KVYTKSSHLKAHQRTHTGEKPYKCSWEGCEWRFARSDELTRHYRKHTGAKPFKCNHCDRC
200 210 220 230 240 250
270 280
pF1KB7 FSRSDHLALHMKRHL
::::::::::::::.
CCDS59 FSRSDHLALHMKRHI
260 270
>>CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13 (402 aa)
initn: 783 init1: 559 opt: 586 Z-score: 358.2 bits: 74.7 E(32554): 1.2e-13
Smith-Waterman score: 600; 52.0% identity (72.3% similar) in 173 aa overlap (113-283:238-400)
90 100 110 120 130 140
pF1KB7 SSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEELSPTAKFTS-DPIGEVLVSSGKLS
:::... :.. . : . : . .: ..
CCDS94 NTIVVPLLEDGRGHGKAQMDPRGLSPRQSKSDSDDDDLPNVTLDSVNETGSTALSIARAV
210 220 230 240 250 260
150 160 170 180 190 200
pF1KB7 SSVTSTPPSSPELSREPSQLWGCVPGELPSPGKVRSGTSGKPGDKGNGDASPDGR-RRVH
. : .: : . .: .: . : :: ...: . .. :::.: ::.:
CCDS94 QEVHPSPVSRVRGNRMNNQKFPCS----ISPFSIESTRRQRRSE------SPDSRKRRIH
270 280 290 300 310
210 220 230 240 250 260
pF1KB7 RCHFNGCRKVYTKSSHLKAHQRTHTGEKPYRCSWEGCEWRFARSDELTRHFRKHTGAKPF
:: :.:: ::::::::::::.:::::::::.:.:::: :.::::::::::.:::::.:::
CCDS94 RCDFEGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHYRKHTGVKPF
320 330 340 350 360 370
270 280
pF1KB7 KCSHCDRCFSRSDHLALHMKRHL
::. ::: :::::::::: .::.
CCDS94 KCADCDRSFSRSDHLALHRRRHMLV
380 390 400
>>CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4 (345 aa)
initn: 747 init1: 557 opt: 576 Z-score: 353.4 bits: 73.5 E(32554): 2.3e-13
Smith-Waterman score: 576; 72.1% identity (83.7% similar) in 104 aa overlap (182-283:240-343)
160 170 180 190 200
pF1KB7 PELSREPSQLWGCVPGELPSPGKVRSGTSGKPGDKGNGDASPDG--RRRVHRCHFNGCRK
.:: . ::: .::.::: ..:: :
CCDS34 YYPEEMSPPLMNSVSPPQALLQENHPSVIVQPGKRPLPVESPDTQRKRRIHRCDYDGCNK
210 220 230 240 250 260
210 220 230 240 250 260
pF1KB7 VYTKSSHLKAHQRTHTGEKPYRCSWEGCEWRFARSDELTRHFRKHTGAKPFKCSHCDRCF
:::::::::::.:::::::::.:.:::: :.:::::::::::::::: :::.: ::: :
CCDS34 VYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRKHTGIKPFQCPDCDRSF
270 280 290 300 310 320
270 280
pF1KB7 SRSDHLALHMKRHL
::::::::: :::.
CCDS34 SRSDHLALHRKRHMLV
330 340
>>CCDS66562.1 KLF5 gene_id:688|Hs108|chr13 (366 aa)
initn: 572 init1: 552 opt: 564 Z-score: 346.2 bits: 72.3 E(32554): 5.8e-13
Smith-Waterman score: 564; 62.0% identity (75.9% similar) in 137 aa overlap (147-282:232-364)
120 130 140 150 160 170
pF1KB7 EELSPTAKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSREPSQLWGCVPGELPSPGKVR
::: : . :.: :. ::. :
CCDS66 LPQQATYFPPSPPSSEPGSPDRQAEMLQNLTPPPS-YAATIASKLAIHNPN-LPTTLPVN
210 220 230 240 250
180 190 200 210 220 230
pF1KB7 SGTSGKPGDKGNGDASPD-GRRRVHRCHFNGCRKVYTKSSHLKAHQRTHTGEKPYRCSWE
: . .: . : ..:: .::.: : . :: :::::::::::: :::::::::.:.::
CCDS66 S-QNIQPV-RYNRRSNPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWE
260 270 280 290 300 310
240 250 260 270 280
pF1KB7 GCEWRFARSDELTRHFRKHTGAKPFKCSHCDRCFSRSDHLALHMKRHL
::.::::::::::::.:::::::::.:. :.: ::::::::::::::
CCDS66 GCDWRFARSDELTRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
320 330 340 350 360
>>CCDS9448.1 KLF5 gene_id:688|Hs108|chr13 (457 aa)
initn: 592 init1: 552 opt: 564 Z-score: 345.0 bits: 72.4 E(32554): 6.8e-13
Smith-Waterman score: 564; 62.0% identity (75.9% similar) in 137 aa overlap (147-282:323-455)
120 130 140 150 160 170
pF1KB7 EELSPTAKFTSDPIGEVLVSSGKLSSSVTSTPPSSPELSREPSQLWGCVPGELPSPGKVR
::: : . :.: :. ::. :
CCDS94 LPQQATYFPPSPPSSEPGSPDRQAEMLQNLTPPPS-YAATIASKLAIHNPN-LPTTLPVN
300 310 320 330 340 350
180 190 200 210 220 230
pF1KB7 SGTSGKPGDKGNGDASPD-GRRRVHRCHFNGCRKVYTKSSHLKAHQRTHTGEKPYRCSWE
: . .: . : ..:: .::.: : . :: :::::::::::: :::::::::.:.::
CCDS94 S-QNIQPV-RYNRRSNPDLEKRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWE
360 370 380 390 400
240 250 260 270 280
pF1KB7 GCEWRFARSDELTRHFRKHTGAKPFKCSHCDRCFSRSDHLALHMKRHL
::.::::::::::::.:::::::::.:. :.: ::::::::::::::
CCDS94 GCDWRFARSDELTRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
410 420 430 440 450
>>CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX (359 aa)
initn: 559 init1: 537 opt: 546 Z-score: 336.1 bits: 70.4 E(32554): 2.1e-12
Smith-Waterman score: 548; 58.3% identity (75.8% similar) in 132 aa overlap (166-282:225-356)
140 150 160 170 180
pF1KB7 SSGKLSSSVTSTPPSSPELSREPSQLWGCVPGELPSPGK---VRSGTSGKPGDKG-----
: :.:: .. ..::.:. . .:
CCDS14 DGGPAAITVPLIGGDGKNAGSVKVDPTSMSPLEIPSDSEESTIESGSSALQSLQGLQQEP
200 210 220 230 240 250
190 200 210 220 230 240
pF1KB7 ------NGDASPD-GRRRVHRCHFNGCRKVYTKSSHLKAHQRTHTGEKPYRCSWEGCEWR
.:. : : :::.:.: : :: ::::::::::::.: :::::::.:.:.:: :.
CCDS14 AAMAQMQGEESLDLKRRRIHQCDFAGCSKVYTKSSHLKAHRRIHTGEKPYKCTWDGCSWK
260 270 280 290 300 310
250 260 270 280
pF1KB7 FARSDELTRHFRKHTGAKPFKCSHCDRCFSRSDHLALHMKRHL
:::::::::::::::: :::.:. :.: :::::::.:: .::
CCDS14 FARSDELTRHFRKHTGIKPFRCTDCNRSFSRSDHLSLHRRRHDTM
320 330 340 350
283 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 10:18:42 2016 done: Sat Nov 5 10:18:43 2016
Total Scan time: 2.210 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]