FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8909, 244 aa
1>>>pF1KB8909 244 - 244 aa - 244 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.2831+/-0.000901; mu= 1.9598+/- 0.053
mean_var=243.4958+/-53.436, 0's: 0 Z-trim(114.9): 780 B-trim: 834 in 1/51
Lambda= 0.082192
statistics sampled from 14501 (15438) to 14501 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.474), width: 16
Scan time: 2.400
The best scores are: opt bits E(32554)
CCDS6633.1 KLF9 gene_id:687|Hs108|chr9 ( 244) 1700 213.7 8.9e-56
CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 ( 288) 608 84.3 9.5e-17
CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 ( 252) 547 77.0 1.3e-14
CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 ( 323) 545 76.9 1.8e-14
CCDS47905.1 KLF10 gene_id:7071|Hs108|chr8 ( 469) 497 71.4 1.2e-12
CCDS6294.1 KLF10 gene_id:7071|Hs108|chr8 ( 480) 497 71.4 1.2e-12
CCDS5372.1 SP8 gene_id:221833|Hs108|chr7 ( 490) 484 69.9 3.7e-12
CCDS43555.1 SP8 gene_id:221833|Hs108|chr7 ( 508) 484 69.9 3.7e-12
CCDS46453.1 SP9 gene_id:100131390|Hs108|chr2 ( 484) 471 68.3 1.1e-11
CCDS44898.1 SP1 gene_id:6667|Hs108|chr12 ( 778) 473 68.8 1.2e-11
CCDS8857.1 SP1 gene_id:6667|Hs108|chr12 ( 785) 473 68.8 1.2e-11
CCDS54333.1 KLF11 gene_id:8462|Hs108|chr2 ( 495) 468 68.0 1.4e-11
CCDS1668.1 KLF11 gene_id:8462|Hs108|chr2 ( 512) 468 68.0 1.4e-11
CCDS33322.1 SP5 gene_id:389058|Hs108|chr2 ( 398) 456 66.4 3.2e-11
CCDS5373.1 SP4 gene_id:6671|Hs108|chr7 ( 784) 458 67.0 4.3e-11
CCDS3036.1 KLF15 gene_id:28999|Hs108|chr3 ( 416) 452 66.0 4.5e-11
CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX ( 359) 447 65.3 6.2e-11
CCDS73475.1 SP7 gene_id:121340|Hs108|chr12 ( 413) 446 65.3 7.4e-11
CCDS44897.1 SP7 gene_id:121340|Hs108|chr12 ( 431) 446 65.3 7.6e-11
CCDS46452.1 SP3 gene_id:6670|Hs108|chr2 ( 713) 450 66.0 7.7e-11
CCDS2254.1 SP3 gene_id:6670|Hs108|chr2 ( 781) 450 66.0 8.2e-11
>>CCDS6633.1 KLF9 gene_id:687|Hs108|chr9 (244 aa)
initn: 1700 init1: 1700 opt: 1700 Z-score: 1114.8 bits: 213.7 E(32554): 8.9e-56
Smith-Waterman score: 1700; 100.0% identity (100.0% similar) in 244 aa overlap (1-244:1-244)
10 20 30 40 50 60
pF1KB8 MSAAAYMDFVAAQCLVSISNRAAVPEHGVAPDAERLRLPEREVTKEHGDPGDTWKDYCTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 MSAAAYMDFVAAQCLVSISNRAAVPEHGVAPDAERLRLPEREVTKEHGDPGDTWKDYCTL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 VTIAKSLLDLNKYRPIQTPSVCSDSLESPDEDMGSDSDVTTESGSSPSHSPEERQDPGSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 VTIAKSLLDLNKYRPIQTPSVCSDSLESPDEDMGSDSDVTTESGSSPSHSPEERQDPGSA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 PSPLSLLHPGVAAKGKHASEKRHKCPYSGCGKVYGKSSHLKAHYRVHTGERPFPCTWPDC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 PSPLSLLHPGVAAKGKHASEKRHKCPYSGCGKVYGKSSHLKAHYRVHTGERPFPCTWPDC
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 LKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFMRSDHLTKHARRHTEFHPSMIKRSKKAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 LKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFMRSDHLTKHARRHTEFHPSMIKRSKKAL
190 200 210 220 230 240
pF1KB8 ANAL
::::
CCDS66 ANAL
>>CCDS10025.1 KLF13 gene_id:51621|Hs108|chr15 (288 aa)
initn: 717 init1: 574 opt: 608 Z-score: 414.1 bits: 84.3 E(32554): 9.5e-17
Smith-Waterman score: 714; 45.6% identity (66.9% similar) in 263 aa overlap (1-235:1-259)
10 20 30 40 50
pF1KB8 MSAAAYMDFVAAQCLVSISNRAAV--PEHGVA--PD-AERLRLPEREVTKEHGDPGDTWK
:.::::.: ::.::::.:.::.: :..: :. : : ..:. : :
CCDS10 MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDG----K
10 20 30 40 50
60 70 80 90 100
pF1KB8 DYCTLVTIAKSLLDLNKYRPIQTPS-----VCSDSLESP-------DEDMGSDSDVTTES
: .: ..:. : :::. : .:. . . . ..: : . .. .. .
CCDS10 DSASLFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAA
60 70 80 90 100 110
110 120 130 140 150
pF1KB8 GSSPSHS---PEERQDPGSAPSPLSLLHPGV---AAKGK-----HASEKRHKCPYSGCGK
::. : :: .: :.: . .::. . .:. .. ...::: :.:: :
CCDS10 PPSPAWSEPEPEAGLEPEREPGPAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEK
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB8 VYGKSSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRF
::::::::::: :.::::::: :.: :: :::.:::::.::::::::::.: ::.:::::
CCDS10 VYGKSSHLKAHLRTHTGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRF
180 190 200 210 220 230
220 230 240
pF1KB8 MRSDHLTKHARRHTEFHPSMIKRSKKALANAL
:::::::::::::..:::.:..:
CCDS10 MRSDHLTKHARRHANFHPGMLQRRGGGSRTGSLSDYSRSDASSPTISPASSP
240 250 260 270 280
>>CCDS12075.1 KLF16 gene_id:83855|Hs108|chr19 (252 aa)
initn: 734 init1: 495 opt: 547 Z-score: 375.8 bits: 77.0 E(32554): 1.3e-14
Smith-Waterman score: 571; 43.9% identity (61.5% similar) in 244 aa overlap (1-235:1-219)
10 20 30 40 50
pF1KB8 MSAA-AYMDFVAAQCLVSISNRAAV------PEHGVAPDAE-RLRLPEREVTKEHGDPGD
:::: : .:. ::. :..::. :.: :: :..: : .: .::... : ::
CCDS12 MSAAVACVDYFAADVLMAISSGAVVHRGRPGPE-GAGPAAGLDVRAARREAASP-GTPGP
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 TWKDYCTLVTIAKSLLDLNKYRP-IQTPSVCSDSLESPDEDMGSDSDVTTESGSSPSHSP
:.. : . . :. .: .: :. : . :.:: . ::
CCDS12 PPPP-----PAASGPGPGAAAAPHLLAASILADLRGGPGAAPGGASPA---SSSSAASSP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB8 EERQDPGSAPSPLSLLHPGVAAKGKHASEKRHKCPYSGCGKVYGKSSHLKAHYRVHTGER
. ::.::: :. : :.::. :.:.: ::::::.: :.:::::
CCDS12 SSGRAPGAAPS---------------AAAKSHRCPFPDCAKAYYKSSHLKSHLRTHTGER
120 130 140 150
180 190 200 210 220 230
pF1KB8 PFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFMRSDHLTKHARRHTEFHPS
:: : : : :::.:::::.::.:::::::.: :::: ::: :::::.:::::: :::.
CCDS12 PFACDWQGCDKKFARSDELARHHRTHTGEKRFSCPLCSKRFTRSDHLAKHARRHPGFHPD
160 170 180 190 200 210
240
pF1KB8 MIKRSKKALANAL
...:
CCDS12 LLRRPGARSTSPSDSLPCSLAGSPAPSPAPSPAPAGL
220 230 240 250
>>CCDS5825.1 KLF14 gene_id:136259|Hs108|chr7 (323 aa)
initn: 778 init1: 534 opt: 545 Z-score: 373.2 bits: 76.9 E(32554): 1.8e-14
Smith-Waterman score: 545; 60.7% identity (78.7% similar) in 122 aa overlap (118-238:172-291)
90 100 110 120 130 140
pF1KB8 SPDEDMGSDSDVTTESGSSPSHSPEERQDPGSAPSPLSLLHPGVAAKGKHASEKRHKCPY
:..:.: . : .. . :::.::.
CCDS58 ESSSDAPAVPSAPAAPGAPAASGGFSGGALGAGPAPAADQAP--RRRSVTPAAKRHQCPF
150 160 170 180 190
150 160 170 180 190 200
pF1KB8 SGCGKVYGKSSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPL
:: :.: ::::::.: :.::::::: : : :: :::.:::::.::::::::::.: :::
CCDS58 PGCTKAYYKSSHLKSHQRTHTGERPFSCDWLDCDKKFTRSDELARHYRTHTGEKRFSCPL
200 210 220 230 240 250
210 220 230 240
pF1KB8 CEKRFMRSDHLTKHARRHTEFHPSMIK-RSKKALANAL
: :.: :::::::::::: .::.::. :...
CCDS58 CPKQFSRSDHLTKHARRHPTYHPDMIEYRGRRRTPRIDPPLTSEVESSASGSGPGPAPSF
260 270 280 290 300 310
CCDS58 TTCL
320
>>CCDS47905.1 KLF10 gene_id:7071|Hs108|chr8 (469 aa)
initn: 501 init1: 475 opt: 497 Z-score: 340.4 bits: 71.4 E(32554): 1.2e-12
Smith-Waterman score: 497; 47.8% identity (67.3% similar) in 159 aa overlap (75-225:284-440)
50 60 70 80 90
pF1KB8 KEHGDPGDTWKDYCTLVTIAKSLLDLNKYRPIQTPSVCSDSL----ESPDED-MGSDSDV
: : :.:: . . : : .
CCDS47 GGVPPMPVICQMVPLPANNPVVTTVVPSTPPSQPPAVCPPVVFMGTQVPKGAVMFVVPQP
260 270 280 290 300 310
100 110 120 130 140 150
pF1KB8 TTESGSSPSHSPEERQDPGSAPSPLSLLHPGVAAKGKHASEKR---HKCPYSGCGKVYGK
...:.. : ::. . ::.: . :..: . . .: : : . ::::.: :
CCDS47 VVQSSKPPVVSPNGTRLSPIAPAP--GFSPSAAKVTPQIDSSRIRSHICSHPGCGKTYFK
320 330 340 350 360 370
160 170 180 190 200 210
pF1KB8 SSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFMRSD
::::::: :.::::.:: :.: : ..:.:::::.:: :::::::.: ::.:..::::::
CCDS47 SSHLKAHTRTHTGEKPFSCSWKGCERRFARSDELSRHRRTHTGEKKFACPMCDRRFMRSD
380 390 400 410 420 430
220 230 240
pF1KB8 HLTKHARRHTEFHPSMIKRSKKALANAL
:::::::::
CCDS47 HLTKHARRHLSAKKLPNWQMEVSKLNDIALPPTPAPTQ
440 450 460
>>CCDS6294.1 KLF10 gene_id:7071|Hs108|chr8 (480 aa)
initn: 501 init1: 475 opt: 497 Z-score: 340.3 bits: 71.4 E(32554): 1.2e-12
Smith-Waterman score: 497; 47.8% identity (67.3% similar) in 159 aa overlap (75-225:295-451)
50 60 70 80 90
pF1KB8 KEHGDPGDTWKDYCTLVTIAKSLLDLNKYRPIQTPSVCSDSL----ESPDED-MGSDSDV
: : :.:: . . : : .
CCDS62 GGVPPMPVICQMVPLPANNPVVTTVVPSTPPSQPPAVCPPVVFMGTQVPKGAVMFVVPQP
270 280 290 300 310 320
100 110 120 130 140 150
pF1KB8 TTESGSSPSHSPEERQDPGSAPSPLSLLHPGVAAKGKHASEKR---HKCPYSGCGKVYGK
...:.. : ::. . ::.: . :..: . . .: : : . ::::.: :
CCDS62 VVQSSKPPVVSPNGTRLSPIAPAP--GFSPSAAKVTPQIDSSRIRSHICSHPGCGKTYFK
330 340 350 360 370 380
160 170 180 190 200 210
pF1KB8 SSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFMRSD
::::::: :.::::.:: :.: : ..:.:::::.:: :::::::.: ::.:..::::::
CCDS62 SSHLKAHTRTHTGEKPFSCSWKGCERRFARSDELSRHRRTHTGEKKFACPMCDRRFMRSD
390 400 410 420 430 440
220 230 240
pF1KB8 HLTKHARRHTEFHPSMIKRSKKALANAL
:::::::::
CCDS62 HLTKHARRHLSAKKLPNWQMEVSKLNDIALPPTPAPTQ
450 460 470 480
>>CCDS5372.1 SP8 gene_id:221833|Hs108|chr7 (490 aa)
initn: 576 init1: 438 opt: 484 Z-score: 331.8 bits: 69.9 E(32554): 3.7e-12
Smith-Waterman score: 484; 46.0% identity (65.6% similar) in 163 aa overlap (70-226:280-439)
40 50 60 70 80 90
pF1KB8 EREVTKEHGDPGDTWKDYCTLVTIAKSLLDLNKYRPIQTPSVCSDSLESPDEDMGSD---
.. ..:. :. :: :: :..
CCDS53 GYNSDYSGLSHSAFSSGASSHLLSPAGQHLMDGFKPV-LPGSYPDSAPSPLAGAGGSMLS
250 260 270 280 290 300
100 110 120 130 140 150
pF1KB8 SDVTTESGSSPSHSPEERQDPGSAPSPLSLLHPGVAAKGKHASEKR---HKCPYSGCGKV
. .. :.:: : .. . .. : .. : :: .: :.: :::::
CCDS53 AGPSAPLGGSPRSSARRYSGRATCDCPNCQEAERLGPAG--ASLRRKGLHSCHIPGCGKV
310 320 330 340 350 360
160 170 180 190 200 210
pF1KB8 YGKSSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFM
:::.:::::: : ::::::: :.: : :.:.::::: :: :::::::.: ::.:.::::
CCDS53 YGKTSHLKAHLRWHTGERPFVCNWLFCGKRFTRSDELQRHLRTHTGEKRFACPVCNKRFM
370 380 390 400 410 420
220 230 240
pF1KB8 RSDHLTKHARRHTEFHPSMIKRSKKALANAL
:::::.::.. :.
CCDS53 RSDHLSKHVKTHSGGGGGGGSAGSGSGGKKGSDTDSEHSAAGSPPCHSPELLQPPEPGHR
430 440 450 460 470 480
>>CCDS43555.1 SP8 gene_id:221833|Hs108|chr7 (508 aa)
initn: 576 init1: 438 opt: 484 Z-score: 331.7 bits: 69.9 E(32554): 3.7e-12
Smith-Waterman score: 484; 46.0% identity (65.6% similar) in 163 aa overlap (70-226:298-457)
40 50 60 70 80 90
pF1KB8 EREVTKEHGDPGDTWKDYCTLVTIAKSLLDLNKYRPIQTPSVCSDSLESPDEDMGSD---
.. ..:. :. :: :: :..
CCDS43 GYNSDYSGLSHSAFSSGASSHLLSPAGQHLMDGFKPV-LPGSYPDSAPSPLAGAGGSMLS
270 280 290 300 310 320
100 110 120 130 140 150
pF1KB8 SDVTTESGSSPSHSPEERQDPGSAPSPLSLLHPGVAAKGKHASEKR---HKCPYSGCGKV
. .. :.:: : .. . .. : .. : :: .: :.: :::::
CCDS43 AGPSAPLGGSPRSSARRYSGRATCDCPNCQEAERLGPAG--ASLRRKGLHSCHIPGCGKV
330 340 350 360 370 380
160 170 180 190 200 210
pF1KB8 YGKSSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFM
:::.:::::: : ::::::: :.: : :.:.::::: :: :::::::.: ::.:.::::
CCDS43 YGKTSHLKAHLRWHTGERPFVCNWLFCGKRFTRSDELQRHLRTHTGEKRFACPVCNKRFM
390 400 410 420 430 440
220 230 240
pF1KB8 RSDHLTKHARRHTEFHPSMIKRSKKALANAL
:::::.::.. :.
CCDS43 RSDHLSKHVKTHSGGGGGGGSAGSGSGGKKGSDTDSEHSAAGSPPCHSPELLQPPEPGHR
450 460 470 480 490 500
>>CCDS46453.1 SP9 gene_id:100131390|Hs108|chr2 (484 aa)
initn: 525 init1: 435 opt: 471 Z-score: 323.6 bits: 68.3 E(32554): 1.1e-11
Smith-Waterman score: 471; 44.8% identity (66.3% similar) in 172 aa overlap (60-225:247-414)
30 40 50 60 70 80
pF1KB8 APDAERLRLPEREVTKEHGDPGDTWKDYCTLVTIAKSLLDLNKYRPIQTPSVCSDSLESP
:.. .. :: . ..:. :: ::: .
CCDS46 LGTYNPDFSSLTHSAFSSTGLGSSAAAASHLLSTSQHLLAQDGFKPV-LPSY-SDSSAAV
220 230 240 250 260 270
90 100 110 120 130 140
pF1KB8 DEDMGS---DSDVTTESGSSPSHSPEERQDPGSAPSPLSLLHPGVAAKGKHASEKR---H
.: .. ... .:.: ..: .. . .. : .. : :: .: :
CCDS46 AAAAASAMISGAAAAAAGGSSARSARRYSGRATCDCPNCQEAERLGPAG--ASLRRKGLH
280 290 300 310 320 330
150 160 170 180 190 200
pF1KB8 KCPYSGCGKVYGKSSHLKAHYRVHTGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQF
.: ::::::::.:::::: : ::::::: :.: : :.:.::::: :: :::::::.:
CCDS46 SCHIPGCGKVYGKTSHLKAHLRWHTGERPFVCNWLFCGKRFTRSDELQRHLRTHTGEKRF
340 350 360 370 380 390
210 220 230 240
pF1KB8 RCPLCEKRFMRSDHLTKHARRHTEFHPSMIKRSKKALANAL
::.:.:::::::::.:: . :
CCDS46 ACPVCNKRFMRSDHLSKHIKTHNGGGGGKKGSDSDTDASNLETPRSESPDLILHDSGVSA
400 410 420 430 440 450
>>CCDS44898.1 SP1 gene_id:6667|Hs108|chr12 (778 aa)
initn: 559 init1: 437 opt: 473 Z-score: 322.3 bits: 68.8 E(32554): 1.2e-11
Smith-Waterman score: 473; 50.4% identity (68.6% similar) in 137 aa overlap (91-225:566-701)
70 80 90 100 110
pF1KB8 VTIAKSLLDLNKYRPIQTPSVCSDSLESPDEDMGSDSDVTTESGSSPSHSPE--ERQDPG
. . .:. :. .::. .:. .:
CCDS44 QVHPIQGLPLAIANAPGDHGAQLGLHGAGGDGIHDDTAGGEEGENSPDAQPQAGRRTRRE
540 550 560 570 580 590
120 130 140 150 160 170
pF1KB8 SAPSPLSLLHPGVAAKGKHASEKRHKCPYSGCGKVYGKSSHLKAHYRVHTGERPFPCTWP
. : : ..: ...:.: : .::::::::.:::.:: : ::::::: :::
CCDS44 ACTCPYCKDSEG-RGSGDPGKKKQHICHIQGCGKVYGKTSHLRAHLRWHTGERPFMCTWS
600 610 620 630 640 650
180 190 200 210 220 230
pF1KB8 DCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFMRSDHLTKHARRHTEFHPSMIKRSKK
: :.:.::::: :: :::::::.: :: : :::::::::.:: . :
CCDS44 YCGKRFTRSDELQRHKRTHTGEKKFACPECPKRFMRSDHLSKHIKTHQNKKGGPGVALSV
660 670 680 690 700 710
240
pF1KB8 ALANAL
CCDS44 GTLPLDSGAGSEGSGTATPSALITTNMVAMEAICPEGIARLANSGINVMQVADLQSINIS
720 730 740 750 760 770
244 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:27:17 2016 done: Fri Nov 4 16:27:17 2016
Total Scan time: 2.400 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]