FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3465, 345 aa
1>>>pF1KB3465 345 - 345 aa - 345 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 12.6023+/-0.00107; mu= -17.0451+/- 0.063
mean_var=424.8386+/-95.198, 0's: 0 Z-trim(115.5): 619 B-trim: 911 in 1/50
Lambda= 0.062225
statistics sampled from 15286 (16060) to 15286 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.801), E-opt: 0.2 (0.493), width: 16
Scan time: 3.310
The best scores are: opt bits E(32554)
CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4 ( 345) 2404 229.4 3.3e-60
CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13 ( 402) 668 73.6 3.1e-13
CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX ( 359) 665 73.3 3.4e-13
CCDS66562.1 KLF5 gene_id:688|Hs108|chr13 ( 366) 624 69.7 4.4e-12
CCDS9448.1 KLF5 gene_id:688|Hs108|chr13 ( 457) 624 69.7 5.2e-12
CCDS7060.1 KLF6 gene_id:1316|Hs108|chr10 ( 283) 576 65.3 7.1e-11
CCDS6770.2 KLF4 gene_id:9314|Hs108|chr9 ( 479) 582 66.0 7.4e-11
>>CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4 (345 aa)
initn: 2404 init1: 2404 opt: 2404 Z-score: 1194.4 bits: 229.4 E(32554): 3.3e-60
Smith-Waterman score: 2404; 100.0% identity (100.0% similar) in 345 aa overlap (1-345:1-345)
10 20 30 40 50 60
pF1KB3 MLMFDPVPVKQEAMDPVSVSYPSNYMESMKPNKYGVIYSTPLPEKFFQTPEGLSHGIQME
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MLMFDPVPVKQEAMDPVSVSYPSNYMESMKPNKYGVIYSTPLPEKFFQTPEGLSHGIQME
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 PVDLTVNKRSSPPSAGNSPSSLKFPSSHRRASPGLSMPSSSPPIKKYSPPSPGVQPFGVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PVDLTVNKRSSPPSAGNSPSSLKFPSSHRRASPGLSMPSSSPPIKKYSPPSPGVQPFGVP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 LSMPPVMAAALSRHGIRSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 LSMPPVMAAALSRHGIRSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSM
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB3 QVPVIESYEKPISQKKIKIEPGIEPQRTDYYPEEMSPPLMNSVSPPQALLQENHPSVIVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 QVPVIESYEKPISQKKIKIEPGIEPQRTDYYPEEMSPPLMNSVSPPQALLQENHPSVIVQ
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB3 PGKRPLPVESPDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PGKRPLPVESPDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWK
250 260 270 280 290 300
310 320 330 340
pF1KB3 FARSDELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV
:::::::::::::::::::::::::::::::::::::::::::::
CCDS34 FARSDELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV
310 320 330 340
>>CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13 (402 aa)
initn: 786 init1: 645 opt: 668 Z-score: 351.2 bits: 73.6 E(32554): 3.1e-13
Smith-Waterman score: 779; 40.8% identity (63.6% similar) in 385 aa overlap (9-345:36-402)
10 20 30
pF1KB3 MLMFDPVPVKQEAMDPVSVSYPSNYMESMKPNKYGVIY
...: .: .::. ::.. : . .
CCDS94 KRKTIKNINTFENRMLMLDGMPAVRVKTELLESEQGSPNVHNYPD--MEAV-PLLLNNVK
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB3 STPLPEKFFQTPEGLSHGIQMEPVDLTVNK-RSSPPSAGNSPSSL----KFPSSHRRASP
. : :: ... . : :::::..:: :.:: ....:: :. . ::: .:
CCDS94 GEP-PEDSLSVDH---FQTQTEPVDLSINKARTSPTAVSSSPVSMTASASSPSSTSTSSS
70 80 90 100 110
100 110 120 130 140 150
pF1KB3 GLSMPSSSPP-IKKYSPPSPGVQPFGVPLSMPPVMAAALSRHGIRSPGILPVIQPVV-VQ
. : .::: : . : : . .. :. :..:.: :. . .: .:.:: .
CCDS94 SSSRLASSPTVITSVSSASSS----STVLTPGPLVASA---SGVGGQQFLHIIHPVPPSS
120 130 140 150 160 170
160 170 180 190 200
pF1KB3 PVPFMYT--SHLQQ--------PLMVSLSEEMENSSSSMQVPVIESYEKPISQKKIKIEP
:. .. . ::... :.. . . : .... ::..:. . .. : ...:
CCDS94 PMNLQSNKLSHVHRIPVVVQSVPVVYTAVRSPGNVNNTIVVPLLEDGR---GHGKAQMDP
180 190 200 210 220
210 220 230 240
pF1KB3 -GIEPQ--RTDYYPEEMSPPLMNSVSPP--QAL-----LQENHPSVIVQP-GKR------
:. :. ..: ... ..::. :: .:: ::: . . :.:
CCDS94 RGLSPRQSKSDSDDDDLPNVTLDSVNETGSTALSIARAVQEVHPSPVSRVRGNRMNNQKF
230 240 250 260 270 280
250 260 270 280 290
pF1KB3 -----PLPVES---------PDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPY
:. .:: ::. :::::::::..:::::::::::::::::::::::::
CCDS94 PCSISPFSIESTRRQRRSESPDS-RKRRIHRCDFEGCNKVYTKSSHLKAHRRTHTGEKPY
290 300 310 320 330 340
300 310 320 330 340
pF1KB3 KCTWEGCTWKFARSDELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV
::::::::::::::::::::.:::::.:::.: ::::::::::::::::.:::::
CCDS94 KCTWEGCTWKFARSDELTRHYRKHTGVKPFKCADCDRSFSRSDHLALHRRRHMLV
350 360 370 380 390 400
>>CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX (359 aa)
initn: 692 init1: 586 opt: 665 Z-score: 350.4 bits: 73.3 E(32554): 3.4e-13
Smith-Waterman score: 696; 38.9% identity (59.5% similar) in 368 aa overlap (1-342:22-356)
10 20 30
pF1KB3 MLMFDPVPVKQEAMDPVSVSYPSNYMESM--------KP
: .: : .. . :: . : ::. .:
CCDS14 MVDMDKLINNLEVQLNSEGGSMQVFKQVTASVRNRDPPEIEYRSNMTSPTLLDANPMENP
10 20 30 40 50 60
40 50 60 70
pF1KB3 NKYGVIYSTPLPEKFFQTPEGLSHGIQMEPVDLTVNKRSSP-------------PSAGNS
.. : : ::... . .: :.:::::. .: ..: :. .:
CCDS14 ALFNDIKIEP-PEELLASDFSLP---QVEPVDLSFHKPKAPLQPASMLQAPIRPPKPQSS
70 80 90 100 110
80 90 100 110 120 130
pF1KB3 PSSLKFPSSHRRASPGLSMPSS-SPPIKKYSPPSPGVQP-FGVPLSMPPVMAAALSRHGI
:..: .: : . ..:. .: : : : : . : ..: : .. :
CCDS14 PQTLVVSTSTSDMSTSANIPTVLTPGSVLTSSQSTGSQQILHVIHTIPSVSLP--NKMG-
120 130 140 150 160 170
140 150 160 170 180 190
pF1KB3 RSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYEKPISQKK
:. . ::::: .:..::. : ... ... ::.: . : . .
CCDS14 ---GLKTI--PVVVQSLPMVYTT---LP--------ADGGPAAITVPLIGGDGK--NAGS
180 190 200 210
200 210 220 230 240 250
pF1KB3 IKIEP-GIEPQRTDYYPEEMSPPLMNS-VSPPQALLQENHPSVIVQ-PGKRPLPVESPDT
.:..: .. : . :: . .: .. :.: :: :....: :. :: :
CCDS14 VKVDPTSMSPLEIPSDSEESTIESGSSALQSLQGLQQE--PAAMAQMQGE-----ESLDL
220 230 240 250 260
260 270 280 290 300 310
pF1KB3 QRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRK
.: ::::.::. ::.:::::::::::::: :::::::::::.::.:::::::::::::::
CCDS14 KR-RRIHQCDFAGCSKVYTKSSHLKAHRRIHTGEKPYKCTWDGCSWKFARSDELTRHFRK
270 280 290 300 310 320
320 330 340
pF1KB3 HTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV
:::::::.: ::.:::::::::.:::.::
CCDS14 HTGIKPFRCTDCNRSFSRSDHLSLHRRRHDTM
330 340 350
>>CCDS66562.1 KLF5 gene_id:688|Hs108|chr13 (366 aa)
initn: 615 init1: 563 opt: 624 Z-score: 330.4 bits: 69.7 E(32554): 4.4e-12
Smith-Waterman score: 641; 35.3% identity (60.3% similar) in 368 aa overlap (6-342:11-364)
10 20 30 40
pF1KB3 MLMFDPVPV--KQEAMDPVSVSYPSNYMESMKPNKYGVIYSTPLPE------KFF
:::. ... . :.: .... . . :.. ... ::. ..
CCDS66 MEKYLTPQLPPVPIIPEHKKYRRDSASVVDQFFTDTEGLPYSINMNVFLPDITHLRTGLY
10 20 30 40 50 60
50 60 70 80 90
pF1KB3 QTPEGLSHGIQMEPVDLTVNKR--SSPPSAGNS--PSSLKFPSSHRRASPGLS-------
.. . :. ::: . .. ..:: : .. : .. :::. :.: ..
CCDS66 KSQRPCVTHIKTEPVAIFSHQSETTAPPPAPTQALPEFTSIFSSHQTAAPEVNNIFIKQE
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB3 MPSSSPPIKKYSPPSPG--VQPFGVP-LSMPPV--MAAALSRHGIRSPGILPVIQPVVVQ
.:. : .. : . : : ...: :.:: ..::.. .. . . .. . ..
CCDS66 LPT--PDLHLSVPTQQGHLYQLLNTPDLDMPSSTNQTAAMDTLNVSMSAAMAGLN-THTS
130 140 150 160 170
160 170 180 190 200 210
pF1KB3 PVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYEKPISQKKIKIEPGIEPQRTDYY
:: ....: : . : :. .: .: : .. ::: :.: .
CCDS66 AVPQTAVKQFQG--MPPCTYTMP----SQFLPQQATYFPPSPPSS---EPG-SPDRQAEM
180 190 200 210 220
220 230 240 250 260
pF1KB3 PEEMSPP--LMNSVSPPQALLQENHPSVIVQPGKRPLPVE-----SPDTQRKRRIHRCDY
....:: ... :. . : :... .. ::. .:: . ::::: :::
CCDS66 LQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQPVRYNRRSNPDLE-KRRIHYCDY
230 240 250 260 270 280
270 280 290 300 310 320
pF1KB3 DGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRKHTGIKPFQCPD
::.:::::::::::: :::::::::::::::: :.::::::::::.::::: :::::
CCDS66 PGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARSDELTRHYRKHTGAKPFQCGV
290 300 310 320 330 340
330 340
pF1KB3 CDRSFSRSDHLALHRKRHMLV
:.:::::::::::: :::
CCDS66 CNRSFSRSDHLALHMKRHQN
350 360
>>CCDS9448.1 KLF5 gene_id:688|Hs108|chr13 (457 aa)
initn: 615 init1: 563 opt: 624 Z-score: 329.1 bits: 69.7 E(32554): 5.2e-12
Smith-Waterman score: 641; 35.3% identity (60.3% similar) in 368 aa overlap (6-342:102-455)
10 20 30
pF1KB3 MLMFDPVPV--KQEAMDPVSVSYPSNYMESMKPNK
:::. ... . :.: .... . .
CCDS94 QPPATGPRLPPEDLVQTRCEMEKYLTPQLPPVPIIPEHKKYRRDSASVVDQFFTDTEGLP
80 90 100 110 120 130
40 50 60 70 80
pF1KB3 YGVIYSTPLPE------KFFQTPEGLSHGIQMEPVDLTVNKR--SSPPSAGNS--PSSLK
:.. ... ::. .... . :. ::: . .. ..:: : .. : .
CCDS94 YSINMNVFLPDITHLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPPAPTQALPEFTS
140 150 160 170 180 190
90 100 110 120 130
pF1KB3 FPSSHRRASPGLS-------MPSSSPPIKKYSPPSPG--VQPFGVP-LSMPPV--MAAAL
. :::. :.: .. .:. : .. : . : : ...: :.:: ..::.
CCDS94 IFSSHQTAAPEVNNIFIKQELPT--PDLHLSVPTQQGHLYQLLNTPDLDMPSSTNQTAAM
200 210 220 230 240
140 150 160 170 180 190
pF1KB3 SRHGIRSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYEKP
. .. . . .. . .. :: ....: : . : :. .: .: :
CCDS94 DTLNVSMSAAMAGLN-THTSAVPQTAVKQFQG--MPPCTYTMP----SQFLPQQATYFPP
250 260 270 280 290 300
200 210 220 230 240
pF1KB3 ISQKKIKIEPGIEPQRTDYYPEEMSPP--LMNSVSPPQALLQENHPSVIVQPGKRPLPVE
.. ::: :.: . ....:: ... :. . : :... .. ::.
CCDS94 SPPSS---EPG-SPDRQAEMLQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQPVR
310 320 330 340 350
250 260 270 280 290 300
pF1KB3 -----SPDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARS
.:: . ::::: ::: ::.:::::::::::: :::::::::::::::: :.::::
CCDS94 YNRRSNPDLE-KRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARS
360 370 380 390 400 410
310 320 330 340
pF1KB3 DELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV
::::::.::::: ::::: :.:::::::::::: :::
CCDS94 DELTRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN
420 430 440 450
>>CCDS7060.1 KLF6 gene_id:1316|Hs108|chr10 (283 aa)
initn: 747 init1: 557 opt: 576 Z-score: 308.7 bits: 65.3 E(32554): 7.1e-11
Smith-Waterman score: 576; 72.1% identity (83.7% similar) in 104 aa overlap (240-343:182-283)
210 220 230 240 250 260
pF1KB3 YYPEEMSPPLMNSVSPPQALLQENHPSVIVQPGKRPLPVESPDTQRKRRIHRCDYDGCNK
.:: . ::: .::.::: ..:: :
CCDS70 PELSREPSQLWGCVPGELPSPGKVRSGTSGKPGDKGNGDASPDG--RRRVHRCHFNGCRK
160 170 180 190 200
270 280 290 300 310 320
pF1KB3 VYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRKHTGIKPFQCPDCDRSF
:::::::::::.:::::::::.:.:::: :.:::::::::::::::: :::.: ::: :
CCDS70 VYTKSSHLKAHQRTHTGEKPYRCSWEGCEWRFARSDELTRHFRKHTGAKPFKCSHCDRCF
210 220 230 240 250 260
330 340
pF1KB3 SRSDHLALHRKRHMLV
::::::::: :::.
CCDS70 SRSDHLALHMKRHL
270 280
>>CCDS6770.2 KLF4 gene_id:9314|Hs108|chr9 (479 aa)
initn: 547 init1: 525 opt: 582 Z-score: 308.4 bits: 66.0 E(32554): 7.4e-11
Smith-Waterman score: 591; 39.0% identity (59.0% similar) in 290 aa overlap (58-342:207-478)
30 40 50 60 70 80
pF1KB3 SMKPNKYGVIYSTPLPEKFFQTPEGLSHGIQMEPVDLTVNKRSSPPSAGNSPSSLKFPSS
...:: . .. .::..: :: .
CCDS67 SAPPPTAPFNLADINDVSPSGGFVAELLRPELDPVYIP-PQQPQPPGGGLMG---KFVLK
180 190 200 210 220 230
90 100 110 120 130 140
pF1KB3 HRRASPGLSMPSSSPPIKKYSPPSP-GVQPFGV-PLSM-PPVMAAALSRHGIRSPGILPV
..:: . .:: . . : :: : .: : : . :: ...... : : .
CCDS67 ASLSAPGSEY--GSPSVISVSKGSPDGSHPVVVAPYNGGPPRTCPKIKQEAVSSCTHLGA
240 250 260 270 280 290
150 160 170 180 190 200
pF1KB3 IQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYEKPISQKKIKIEPGIE
:. : ..: . :: .: : .. . . : . . . . ::..
CCDS67 GPPLSNGHRP---AAH-DFPLGRQLP-----SRTTPTLGLEEVLSSRDCHPALPLPPGFH
300 310 320 330 340
210 220 230 240 250 260
pF1KB3 PQRTDYYPEEMSPPLMNSVSPPQALLQENHPSVIVQPGKRPLPVESPDTQRKRRI--HRC
:. :: . : :. :: :: : .: ..: : .. . ..: : :
CCDS67 PHPGPNYPSFL-PDQMQPQVPPLHY-QELMPPGSCMP-EEPKPKRGRRSWPRKRTATHTC
350 360 370 380 390
270 280 290 300 310 320
pF1KB3 DYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRKHTGIKPFQC
:: ::.:.:::::::::: :::::::::.: :.:: ::::::::::::.::::: .::::
CCDS67 DYAGCGKTYTKSSHLKAHLRTHTGEKPYHCDWDGCGWKFARSDELTRHYRKHTGHRPFQC
400 410 420 430 440 450
330 340
pF1KB3 PDCDRSFSRSDHLALHRKRHMLV
:::.:::::::::: :::
CCDS67 QKCDRAFSRSDHLALHMKRHF
460 470
345 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 13:19:20 2016 done: Thu Nov 3 13:19:21 2016
Total Scan time: 3.310 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]
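
The "best scores" summary in the report above lists one hit per line: accession, description, library sequence length in parentheses, opt score, bit score, and E-value. A minimal Python sketch for pulling those fields out of a saved report follows. The names `Hit`, `HIT_RE`, and `parse_best_scores` are hypothetical and not part of the FASTA package, and the pattern assumes the single-line layout shown in this report; other FASTA output formats would need a different approach.

```python
import re
from typing import Iterable, List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'best scores' summary (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row layout: accession, free-text description, "( len)", opt, bits, E-value.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>\S+)\s*$"
)


def parse_best_scores(lines: Iterable[str]) -> List[Hit]:
    """Return a Hit for every line that looks like a summary row."""
    hits = []
    for line in lines:
        m = HIT_RE.match(line.strip())
        if m is None:
            continue  # headers, alignment blocks, and statistics lines
        try:
            evalue = float(m["e"])
        except ValueError:
            continue  # last column was not a number, so not a summary row
        hits.append(
            Hit(m["acc"], m["desc"], int(m["len"]), int(m["opt"]),
                float(m["bits"]), evalue)
        )
    return hits


if __name__ == "__main__":
    sample = [
        "The best scores are: opt bits E(32554)",
        "CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4 ( 345) 2404 229.4 3.3e-60",
        "CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13 ( 402) 668 73.6 3.1e-13",
        ">>CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4 (345 aa)",
    ]
    for hit in parse_best_scores(sample):
        print(hit.accession, hit.opt, hit.bits, hit.evalue)
```

The `>>` alignment headers are skipped because their parenthesised length is followed by `aa` rather than by the three score columns.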