FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9546, 489 aa
1>>>pF1KB9546 489 - 489 aa - 489 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.0488+/-0.000884; mu= -2.3499+/- 0.053
mean_var=256.5767+/-51.796, 0's: 0 Z-trim(114.9): 44 B-trim: 10 in 1/53
Lambda= 0.080069
statistics sampled from 15459 (15499) to 15459 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.476), width: 16
Scan time: 3.890
The best scores are:                                  opt  bits E(32554)
CCDS46727.1 PHF21B gene_id:112885|Hs108|chr22 ( 489) 3227 385.6 6.4e-107
CCDS56234.1 PHF21B gene_id:112885|Hs108|chr22 ( 477) 3121 373.4 3.1e-103
CCDS63504.1 PHF21B gene_id:112885|Hs108|chr22 ( 327) 2184 265.0  8.6e-71
CCDS14061.1 PHF21B gene_id:112885|Hs108|chr22 ( 531) 2060 250.8  2.6e-66
CCDS31474.1 PHF21A gene_id:51317|Hs108|chr11  ( 634)  995 127.9  3.3e-29
CCDS44578.1 PHF21A gene_id:51317|Hs108|chr11  ( 680)  615  84.0  5.6e-16
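
The best-scores table above can be read programmatically. The following is a minimal sketch, assuming each hit line has the layout seen in this report: accession, free-text description, library sequence length in parentheses, then the opt, bits and E() columns. The names `HIT_RE` and `parse_hit` are hypothetical helpers written for this illustration and are not part of the FASTA package.

```python
import re

# Pattern inferred from the hit lines in this report (an assumption, not a
# format specification): accession, description, "( length)", opt, bits, E().
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)

def parse_hit(line):
    """Return a dict for one best-scores line, or None if it does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "acc": m["acc"],
        "desc": m["desc"],
        "length": int(m["length"]),
        "opt": int(m["opt"]),
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
    }

# Lines copied from the table above; the header line is skipped by the pattern.
SAMPLE = """The best scores are: opt bits E(32554)
CCDS46727.1 PHF21B gene_id:112885|Hs108|chr22 ( 489) 3227 385.6 6.4e-107
CCDS31474.1 PHF21A gene_id:51317|Hs108|chr11 ( 634) 995 127.9 3.3e-29
"""

hits = [h for h in map(parse_hit, SAMPLE.splitlines()) if h is not None]
for h in hits:
    print(h["acc"], h["length"], h["opt"], h["bits"], h["evalue"])
```

Lines that do not fit the pattern, such as the column header, return `None` and are dropped, so the same function can be mapped over a larger slice of the report.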
>>CCDS46727.1 PHF21B gene_id:112885|Hs108|chr22 (489 aa)
initn: 3227 init1: 3227 opt: 3227 Z-score: 2033.0 bits: 385.6 E(32554): 6.4e-107
Smith-Waterman score: 3227; 100.0% identity (100.0% similar) in 489 aa overlap (1-489:1-489)
10 20 30 40 50 60
pF1KB9 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 LLISADNKVIIIQPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 LLISADNKVIIIQPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFM
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 VALGLVTTEHLEEIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANED
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 VALGLVTTEHLEEIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANED
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 PCWKNEITHDEHCAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 PCWKNEITHDEHCAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALK
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB9 KDEGVPWTGMLAIVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 KDEGVPWTGMLAIVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCL
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB9 ELKTSLLARQRGTQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 ELKTSLLARQRGTQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHP
430 440 450 460 470 480
pF1KB9 TVQHPQGHN
:::::::::
CCDS46 TVQHPQGHN
>>CCDS56234.1 PHF21B gene_id:112885|Hs108|chr22 (477 aa)
initn: 3121 init1: 3121 opt: 3121 Z-score: 1967.0 bits: 373.4 E(32554): 3.1e-103
Smith-Waterman score: 3121; 100.0% identity (100.0% similar) in 472 aa overlap (18-489:6-477)
10 20 30 40 50 60
pF1KB9 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR
:::::::::::::::::::::::::::::::::::::::::::
CCDS56 MRRQPQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR
10 20 30 40
70 80 90 100 110 120
pF1KB9 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV
50 60 70 80 90 100
130 140 150 160 170 180
pF1KB9 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP
110 120 130 140 150 160
190 200 210 220 230 240
pF1KB9 LLISADNKVIIIQPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LLISADNKVIIIQPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFM
170 180 190 200 210 220
250 260 270 280 290 300
pF1KB9 VALGLVTTEHLEEIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANED
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 VALGLVTTEHLEEIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANED
230 240 250 260 270 280
310 320 330 340 350 360
pF1KB9 PCWKNEITHDEHCAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PCWKNEITHDEHCAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALK
290 300 310 320 330 340
370 380 390 400 410 420
pF1KB9 KDEGVPWTGMLAIVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 KDEGVPWTGMLAIVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCL
350 360 370 380 390 400
430 440 450 460 470 480
pF1KB9 ELKTSLLARQRGTQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ELKTSLLARQRGTQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHP
410 420 430 440 450 460
pF1KB9 TVQHPQGHN
:::::::::
CCDS56 TVQHPQGHN
470
>>CCDS63504.1 PHF21B gene_id:112885|Hs108|chr22 (327 aa)
initn: 2184 init1: 2184 opt: 2184 Z-score: 1384.4 bits: 265.0 E(32554): 8.6e-71
Smith-Waterman score: 2184; 100.0% identity (100.0% similar) in 327 aa overlap (163-489:1-327)
140 150 160 170 180 190
pF1KB9 LAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQPLLISADNKVIII
::::::::::::::::::::::::::::::
CCDS63 MAPSTAVSVVSDSIKVQPLLISADNKVIII
10 20 30
200 210 220 230 240 250
pF1KB9 QPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 QPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLE
40 50 60 70 80 90
260 270 280 290 300 310
pF1KB9 EIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANEDPCWKNEITHDEH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 EIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANEDPCWKNEITHDEH
100 110 120 130 140 150
320 330 340 350 360 370
pF1KB9 CAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 CAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLA
160 170 180 190 200 210
380 390 400 410 420 430
pF1KB9 IVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 IVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRG
220 230 240 250 260 270
440 450 460 470 480
pF1KB9 TQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHPTVQHPQGHN
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 TQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHPTVQHPQGHN
280 290 300 310 320
>>CCDS14061.1 PHF21B gene_id:112885|Hs108|chr22 (531 aa)
initn: 2032 init1: 2032 opt: 2060 Z-score: 1303.9 bits: 250.8 E(32554): 2.6e-66
Smith-Waterman score: 3070; 91.9% identity (91.9% similar) in 521 aa overlap (11-489:11-531)
10 20 30 40 50 60
pF1KB9 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP
130 140 150 160 170 180
190
pF1KB9 LLISADNK------------------------------------------VIIIQPQVQT
:::::::: ::::::::::
CCDS14 LLISADNKPPPRLLSSPHPATHHCPLHPSSLPLTPPSPSLSPSPLHGIFQVIIIQPQVQT
190 200 210 220 230 240
200 210 220 230 240 250
pF1KB9 QPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLEEIQSKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 QPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLEEIQSKR
250 260 270 280 290 300
260 270 280 290 300 310
pF1KB9 QERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANEDPCWKNEITHDEHCAACKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 QERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANEDPCWKNEITHDEHCAACKR
310 320 330 340 350 360
320 330 340 350 360 370
pF1KB9 GANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLAIVHSYV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 GANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLAIVHSYV
370 380 390 400 410 420
380 390 400 410 420 430
pF1KB9 THKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRGTQSSLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 THKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRGTQSSLD
430 440 450 460 470 480
440 450 460 470 480
pF1KB9 RLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHPTVQHPQGHN
:::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 RLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHPTVQHPQGHN
490 500 510 520 530
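
The 91.9% identity reported for CCDS14061.1 can be reproduced from the overlap coordinates alone. The sketch below assumes, as the match lines in this alignment show, that every aligned residue pair is identical and that the only difference is the single 42-residue insertion in the library sequence. `SW_RE` is a hypothetical pattern written for this report's summary line.

```python
import re

# Pattern for the "Smith-Waterman score" summary line as it appears in this
# report (inferred from the output, not taken from FASTA documentation).
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*(?P<ident>[\d.]+)% identity "
    r"\((?P<sim>[\d.]+)% similar\) in (?P<overlap>\d+) aa overlap "
    r"\((?P<qs>\d+)-(?P<qe>\d+):(?P<ss>\d+)-(?P<se>\d+)\)"
)

line = ("Smith-Waterman score: 3070; 91.9% identity (91.9% similar) "
        "in 521 aa overlap (11-489:11-531)")
m = SW_RE.search(line)

overlap = int(m["overlap"])                   # alignment columns: 521
q_span = int(m["qe"]) - int(m["qs"]) + 1      # query residues aligned: 479
s_span = int(m["se"]) - int(m["ss"]) + 1      # library residues aligned: 521
gap_columns = overlap - q_span                # columns where the query has '-': 42

# Assumption: all aligned pairs are identical, so identities == q_span.
identity_pct = round(100.0 * q_span / overlap, 1)
print(q_span, s_span, gap_columns, identity_pct)
```

Under that assumption the identity is 479 / 521, which rounds to the 91.9% printed in the report.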
>>CCDS31474.1 PHF21A gene_id:51317|Hs108|chr11 (634 aa)
initn: 978 init1: 552 opt: 995 Z-score: 638.0 bits: 127.9 E(32554): 3.3e-29
Smith-Waterman score: 1010; 38.9% identity (68.9% similar) in 447 aa overlap (49-489:213-622)
20 30 40 50 60 70
pF1KB9 NGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQRLAGQGAAVLPQVRPKTLI
:. :: ::. . ::::::: .
CCDS31 SSKVTGPGAEAVQIVAKNTVTLQVQATPPQPIKVPQFIPPPRLTPR-PNFLPQVRPKPVA
190 200 210 220 230 240
80 90 100 110 120 130
pF1KB9 PDSLPVAPGRDRPP--KQPPTFQKATVVSVKNPSPALPTANNTVSHVPAPGSQPQALAEP
...:.::. :: : .:. .... .:. .:::..:.. : . ..: ..:.
CCDS31 QNNIPIAPA--PPPMLAAPQLIQRPVMLTKFTPT-TLPTSQNSIHPVRVVNGQTATIAKT
250 260 270 280 290
140 150 160 170 180 190
pF1KB9 AALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQPLLISADNKVIIIQPQV
.:. :.: :. ..:.. : .:.: :: . .:..
CCDS31 FPMAQ-LTS------IVIATPGTRLA-GPQT----------VQ-----------LSKPSL
300 310 320
200 210 220 230 240 250
pF1KB9 QTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLEEIQS
. : :..:. :.: . ... .: .:::.:.::::.::::: .:::::::
CCDS31 EKQ---TVKSHTETDEKQTESRTITPPAAPKPKREENPQKLAFMVSLGLVTHDHLEEIQS
330 340 350 360 370 380
260 270 280 290 300 310
pF1KB9 KRQERKRRSTANPAYSG-LLETERKRLASNYLNNPLF--LTARANEDPCWKNEITHDEHC
::::::::.::::.::: ..: :::. : .:::. . ::::. : . :.. :
CCDS31 KRQERKRRTTANPVYSGAVFEPERKKSAVTYLNSTMHPGTRKRANEEH-WPKGDIHEDFC
390 400 410 420 430 440
320 330 340 350 360 370
pF1KB9 AACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLAI
..:.....: : :: .:::.::.::::: :::.:.:::::.. :::.:..:: : :::
CCDS31 SVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKKEEAIPWPGTLAI
450 460 470 480 490 500
380 390 400 410 420 430
pF1KB9 VHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRGT
::::...:..:::::::::. .:.:..:..:::.. ..:.....::.:.:...::::.
CCDS31 VHSYIAYKAAKEEEKQKLLKWSSDLKQEREQLEQKVKQLSNSISKCMEMKNTILARQKEM
510 520 530 540 550 560
440 450 460 470 480
pF1KB9 QSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPW-TKPSVAATHPTVQHPQGHN
.:::.... :.:::.: .: . . . .. . . :: : :. ::: . :....
CCDS31 HSSLEKVKQLIRLIHGIDLSKPVDSEATVGAISNGPDCTPPANAATSTPAPSPSSQSCTA
570 580 590 600 610 620
CCDS31 NCNQGEETK
630
>>CCDS44578.1 PHF21A gene_id:51317|Hs108|chr11 (680 aa)
initn: 978 init1: 552 opt: 615 Z-score: 400.3 bits: 84.0 E(32554): 5.6e-16
Smith-Waterman score: 907; 35.3% identity (62.5% similar) in 493 aa overlap (49-489:212-668)
20 30 40 50 60 70
pF1KB9 NGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQRLAGQGAAVLPQVRPKTLI
:. :: ::. . ::::::: .
CCDS44 TSSKVTGPGAEAVQIVAKNTVTLVQATPPQPIKVPQFIPPPRLTPR-PNFLPQVRPKPVA
190 200 210 220 230 240
80 90 100 110 120 130
pF1KB9 PDSLPVAPGRDRPP--KQPPTFQKATVVSVKNPSPALPTANNTVSHVPAPGSQPQALAEP
...:.::. :: : .:. .... .:. .:::..:.. : . ..: ..:.
CCDS44 QNNIPIAPA--PPPMLAAPQLIQRPVMLTKFTPT-TLPTSQNSIHPVRVVNGQTATIAKT
250 260 270 280 290
140 150 160 170 180 190
pF1KB9 AALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQPLLISADNKVIIIQPQV
.:. :.: :. ..:.. : .:.: :: . .:..
CCDS44 FPMAQ-LTS------IVIATPGTRLA-GPQT----------VQ-----------LSKPSL
300 310 320
200 210 220 230 240 250
pF1KB9 QTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLEEIQS
. : :..:. :.: . ... .: .:::.:.::::.::::: .:::::::
CCDS44 EKQ---TVKSHTETDEKQTESRTITPPAAPKPKREENPQKLAFMVSLGLVTHDHLEEIQS
330 340 350 360 370 380
260 270 280 290 300
pF1KB9 KRQERKRRSTANPAYSG-LLETERKRLASNYLNNPLFLTARANEDP--------------
::::::::.::::.::: ..: :::. : .:::. . .: :
CCDS44 KRQERKRRTTANPVYSGAVFEPERKKSAVTYLNSTMHPGTRKRGRPPKYNAVLGFGALTP
390 400 410 420 430 440
310 320
pF1KB9 ---------CWKNEIT-------------------------HDEHCAACKRGANLQPCGT
.:: : :.. :..:.....: : :
CCDS44 TSPQSSHPDSPENEKTETTFTFPAPVQPVSLPSPTSTDGDIHEDFCSVCRKSGQLLMCDT
450 460 470 480 490 500
330 340 350 360 370 380
pF1KB9 CPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLAIVHSYVTHKTVKEEE
: .:::.::.::::: :::.:.:::::.. :::.:..:: : :::::::...:..::::
CCDS44 CSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKKEEAIPWPGTLAIVHSYIAYKAAKEEE
510 520 530 540 550 560
390 400 410 420 430 440
pF1KB9 KQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRGTQSSLDRLRALLRLI
:::::. .:.:..:..:::.. ..:.....::.:.:...::::. .:::.... :.:::
CCDS44 KQKLLKWSSDLKQEREQLEQKVKQLSNSISKCMEMKNTILARQKEMHSSLEKVKQLIRLI
570 580 590 600 610 620
450 460 470 480
pF1KB9 QGEQLLQVTMTTTSPAPLLAGPW-TKPSVAATHPTVQHPQGHN
.: .: . . . .. . . :: : :. ::: . :....
CCDS44 HGIDLSKPVDSEATVGAISNGPDCTPPANAATSTPAPSPSSQSCTANCNQGEETK
630 640 650 660 670 680
489 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 02:05:06 2016 done: Sat Nov 5 02:05:07 2016
Total Scan time: 3.890 Total Display time: 0.050
Function used was FASTA [36.3.4 Apr, 2011]
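
As a consistency check on the summary values, the bit scores and E() values in this report appear to be related by E = D * m * n * 2^(-bits), where D is the number of library sequences (32554), m the query length (489) and n the length of the library sequence. This relation is an assumption tested only against the six hits listed here, not a statement of how FASTA computes E() internally. The function name `evalue_from_bits` is hypothetical.

```python
QUERY_LEN = 489      # from "Query: pF1KB9546, 489 aa"
LIB_SEQS = 32554     # from "18511270 residues in 32554 sequences"

# (library sequence length, bits, reported E()) copied from the best-scores table.
HITS = [
    (489, 385.6, 6.4e-107),
    (477, 373.4, 3.1e-103),
    (327, 265.0, 8.6e-71),
    (531, 250.8, 2.6e-66),
    (634, 127.9, 3.3e-29),
    (680, 84.0, 5.6e-16),
]

def evalue_from_bits(bits, subject_len, query_len=QUERY_LEN, n_seqs=LIB_SEQS):
    """Assumed relation: E = D * m * n * 2**(-bits)."""
    return n_seqs * query_len * subject_len * 2.0 ** (-bits)

# Ratio of recomputed to reported E(); values near 1.0 support the assumption.
ratios = [evalue_from_bits(bits, n) / reported for n, bits, reported in HITS]
for (n, bits, reported), r in zip(HITS, ratios):
    print(f"len={n} bits={bits} reported={reported:.1e} ratio={r:.3f}")
```

Because the bit scores are printed to one decimal place, agreement within a few percent is the most that can be expected from this check.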