FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9546, 489 aa 1>>>pF1KB9546 489 - 489 aa - 489 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.0488+/-0.000884; mu= -2.3499+/- 0.053 mean_var=256.5767+/-51.796, 0's: 0 Z-trim(114.9): 44 B-trim: 10 in 1/53 Lambda= 0.080069 statistics sampled from 15459 (15499) to 15459 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.476), width: 16 Scan time: 3.890 The best scores are: opt bits E(32554) CCDS46727.1 PHF21B gene_id:112885|Hs108|chr22 ( 489) 3227 385.6 6.4e-107 CCDS56234.1 PHF21B gene_id:112885|Hs108|chr22 ( 477) 3121 373.4 3.1e-103 CCDS63504.1 PHF21B gene_id:112885|Hs108|chr22 ( 327) 2184 265.0 8.6e-71 CCDS14061.1 PHF21B gene_id:112885|Hs108|chr22 ( 531) 2060 250.8 2.6e-66 CCDS31474.1 PHF21A gene_id:51317|Hs108|chr11 ( 634) 995 127.9 3.3e-29 CCDS44578.1 PHF21A gene_id:51317|Hs108|chr11 ( 680) 615 84.0 5.6e-16 >>CCDS46727.1 PHF21B gene_id:112885|Hs108|chr22 (489 aa) initn: 3227 init1: 3227 opt: 3227 Z-score: 2033.0 bits: 385.6 E(32554): 6.4e-107 Smith-Waterman score: 3227; 100.0% identity (100.0% similar) in 489 aa overlap (1-489:1-489) 10 20 30 40 50 60 pF1KB9 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 LLISADNKVIIIQPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 LLISADNKVIIIQPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFM 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 VALGLVTTEHLEEIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 VALGLVTTEHLEEIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANED 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 PCWKNEITHDEHCAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 PCWKNEITHDEHCAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB9 KDEGVPWTGMLAIVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 KDEGVPWTGMLAIVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCL 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB9 ELKTSLLARQRGTQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 ELKTSLLARQRGTQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHP 430 440 450 460 470 480 pF1KB9 TVQHPQGHN ::::::::: CCDS46 TVQHPQGHN >>CCDS56234.1 PHF21B gene_id:112885|Hs108|chr22 (477 aa) initn: 3121 init1: 3121 opt: 3121 Z-score: 1967.0 bits: 373.4 E(32554): 3.1e-103 Smith-Waterman score: 3121; 100.0% identity (100.0% similar) in 472 aa overlap (18-489:6-477) 10 20 30 40 50 60 pF1KB9 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR ::::::::::::::::::::::::::::::::::::::::::: CCDS56 MRRQPQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR 10 20 30 40 70 80 90 100 110 120 pF1KB9 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV 50 60 70 80 90 100 130 140 150 160 170 180 pF1KB9 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP 110 120 130 140 150 160 190 200 210 220 230 240 pF1KB9 LLISADNKVIIIQPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 LLISADNKVIIIQPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFM 170 180 190 200 210 220 250 260 270 280 290 300 pF1KB9 VALGLVTTEHLEEIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 VALGLVTTEHLEEIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANED 230 240 250 260 270 280 310 320 330 340 350 360 pF1KB9 PCWKNEITHDEHCAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 PCWKNEITHDEHCAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALK 290 300 310 320 330 340 370 380 390 400 410 420 pF1KB9 KDEGVPWTGMLAIVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 KDEGVPWTGMLAIVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCL 350 360 370 380 390 400 430 440 450 460 470 480 pF1KB9 ELKTSLLARQRGTQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS56 ELKTSLLARQRGTQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHP 410 420 430 440 450 460 pF1KB9 TVQHPQGHN ::::::::: CCDS56 TVQHPQGHN 470 >>CCDS63504.1 PHF21B gene_id:112885|Hs108|chr22 (327 aa) initn: 2184 init1: 2184 opt: 2184 Z-score: 1384.4 bits: 265.0 E(32554): 8.6e-71 Smith-Waterman score: 2184; 100.0% identity (100.0% similar) in 327 aa overlap (163-489:1-327) 140 150 160 170 180 190 pF1KB9 LAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQPLLISADNKVIII :::::::::::::::::::::::::::::: CCDS63 MAPSTAVSVVSDSIKVQPLLISADNKVIII 10 20 30 200 210 220 230 240 250 pF1KB9 QPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 QPQVQTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLE 40 50 60 70 80 90 260 270 280 290 300 310 pF1KB9 EIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANEDPCWKNEITHDEH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 EIQSKRQERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANEDPCWKNEITHDEH 100 110 120 130 140 150 320 330 340 350 360 370 pF1KB9 CAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 CAACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLA 160 170 180 190 200 210 380 390 400 410 420 430 pF1KB9 IVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 IVHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRG 220 230 240 250 260 270 440 450 460 470 480 pF1KB9 TQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHPTVQHPQGHN ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS63 TQSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHPTVQHPQGHN 280 290 300 310 320 >>CCDS14061.1 PHF21B gene_id:112885|Hs108|chr22 (531 aa) initn: 2032 init1: 2032 opt: 2060 Z-score: 1303.9 bits: 250.8 E(32554): 2.6e-66 Smith-Waterman score: 3070; 91.9% identity (91.9% similar) in 521 aa overlap (11-489:11-531) 10 20 30 40 50 60 pF1KB9 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MELQSRPEALAVELARHQNGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 LAGQGAAVLPQVRPKTLIPDSLPVAPGRDRPPKQPPTFQKATVVSVKNPSPALPTANNTV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SHVPAPGSQPQALAEPAALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQP 130 140 150 160 170 180 190 pF1KB9 LLISADNK------------------------------------------VIIIQPQVQT :::::::: :::::::::: CCDS14 LLISADNKPPPRLLSSPHPATHHCPLHPSSLPLTPPSPSLSPSPLHGIFQVIIIQPQVQT 190 200 210 220 230 240 200 210 220 230 240 250 pF1KB9 QPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLEEIQSKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 QPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLEEIQSKR 250 260 270 280 290 300 260 270 280 290 300 310 pF1KB9 QERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANEDPCWKNEITHDEHCAACKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 QERKRRSTANPAYSGLLETERKRLASNYLNNPLFLTARANEDPCWKNEITHDEHCAACKR 310 320 330 340 350 360 320 330 340 350 360 370 pF1KB9 GANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLAIVHSYV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLAIVHSYV 370 380 390 400 410 420 380 390 400 410 420 430 pF1KB9 THKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRGTQSSLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 THKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRGTQSSLD 430 440 450 460 470 480 440 450 460 470 480 pF1KB9 RLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHPTVQHPQGHN ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 RLRALLRLIQGEQLLQVTMTTTSPAPLLAGPWTKPSVAATHPTVQHPQGHN 490 500 510 520 530 >>CCDS31474.1 PHF21A gene_id:51317|Hs108|chr11 (634 aa) initn: 978 init1: 552 opt: 995 Z-score: 638.0 bits: 127.9 E(32554): 3.3e-29 Smith-Waterman score: 1010; 38.9% identity (68.9% similar) in 447 aa overlap (49-489:213-622) 20 30 40 50 60 70 pF1KB9 NGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQRLAGQGAAVLPQVRPKTLI :. :: ::. . ::::::: . CCDS31 SSKVTGPGAEAVQIVAKNTVTLQVQATPPQPIKVPQFIPPPRLTPR-PNFLPQVRPKPVA 190 200 210 220 230 240 80 90 100 110 120 130 pF1KB9 PDSLPVAPGRDRPP--KQPPTFQKATVVSVKNPSPALPTANNTVSHVPAPGSQPQALAEP ...:.::. :: : .:. .... .:. .:::..:.. : . ..: ..:. CCDS31 QNNIPIAPA--PPPMLAAPQLIQRPVMLTKFTPT-TLPTSQNSIHPVRVVNGQTATIAKT 250 260 270 280 290 140 150 160 170 180 190 pF1KB9 AALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQPLLISADNKVIIIQPQV .:. :.: :. ..:.. : .:.: :: . .:.. CCDS31 FPMAQ-LTS------IVIATPGTRLA-GPQT----------VQ-----------LSKPSL 300 310 320 200 210 220 230 240 250 pF1KB9 QTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLEEIQS . : :..:. :.: . ... .: .:::.:.::::.::::: .::::::: CCDS31 EKQ---TVKSHTETDEKQTESRTITPPAAPKPKREENPQKLAFMVSLGLVTHDHLEEIQS 330 340 350 360 370 380 260 270 280 290 300 310 pF1KB9 KRQERKRRSTANPAYSG-LLETERKRLASNYLNNPLF--LTARANEDPCWKNEITHDEHC ::::::::.::::.::: ..: :::. : .:::. . ::::. : . :.. : CCDS31 KRQERKRRTTANPVYSGAVFEPERKKSAVTYLNSTMHPGTRKRANEEH-WPKGDIHEDFC 390 400 410 420 430 440 320 330 340 350 360 370 pF1KB9 AACKRGANLQPCGTCPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLAI ..:.....: : :: .:::.::.::::: :::.:.:::::.. :::.:..:: : ::: CCDS31 SVCRKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKKEEAIPWPGTLAI 450 460 470 480 490 500 380 390 400 410 420 430 pF1KB9 VHSYVTHKTVKEEEKQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRGT ::::...:..:::::::::. .:.:..:..:::.. ..:.....::.:.:...::::. CCDS31 VHSYIAYKAAKEEEKQKLLKWSSDLKQEREQLEQKVKQLSNSISKCMEMKNTILARQKEM 510 520 530 540 550 560 440 450 460 470 480 pF1KB9 QSSLDRLRALLRLIQGEQLLQVTMTTTSPAPLLAGPW-TKPSVAATHPTVQHPQGHN .:::.... :.:::.: .: . . . .. . . :: : :. ::: . :.... CCDS31 HSSLEKVKQLIRLIHGIDLSKPVDSEATVGAISNGPDCTPPANAATSTPAPSPSSQSCTA 570 580 590 600 610 620 CCDS31 NCNQGEETK 630 >>CCDS44578.1 PHF21A gene_id:51317|Hs108|chr11 (680 aa) initn: 978 init1: 552 opt: 615 Z-score: 400.3 bits: 84.0 E(32554): 5.6e-16 Smith-Waterman score: 907; 35.3% identity (62.5% similar) in 493 aa overlap (49-489:212-668) 20 30 40 50 60 70 pF1KB9 NGDLKKQLHERQPRIAALSDKQALGTITAVPVTGPQVSSLQRLAGQGAAVLPQVRPKTLI :. :: ::. . ::::::: . CCDS44 TSSKVTGPGAEAVQIVAKNTVTLVQATPPQPIKVPQFIPPPRLTPR-PNFLPQVRPKPVA 190 200 210 220 230 240 80 90 100 110 120 130 pF1KB9 PDSLPVAPGRDRPP--KQPPTFQKATVVSVKNPSPALPTANNTVSHVPAPGSQPQALAEP ...:.::. :: : .:. .... .:. .:::..:.. : . ..: ..:. CCDS44 QNNIPIAPA--PPPMLAAPQLIQRPVMLTKFTPT-TLPTSQNSIHPVRVVNGQTATIAKT 250 260 270 280 290 140 150 160 170 180 190 pF1KB9 AALASPLSSAGVAYAIISTSPSNAAAMAPSTAVSVVSDSIKVQPLLISADNKVIIIQPQV .:. :.: :. ..:.. : .:.: :: . .:.. CCDS44 FPMAQ-LTS------IVIATPGTRLA-GPQT----------VQ-----------LSKPSL 300 310 320 200 210 220 230 240 250 pF1KB9 QTQPESTAESRPPTEEPSQGAQATKKKKEDRPPTQENPEKIAFMVALGLVTTEHLEEIQS . : :..:. :.: . ... .: .:::.:.::::.::::: .::::::: CCDS44 EKQ---TVKSHTETDEKQTESRTITPPAAPKPKREENPQKLAFMVSLGLVTHDHLEEIQS 330 340 350 360 370 380 260 270 280 290 300 pF1KB9 KRQERKRRSTANPAYSG-LLETERKRLASNYLNNPLFLTARANEDP-------------- ::::::::.::::.::: ..: :::. : .:::. . .: : CCDS44 KRQERKRRTTANPVYSGAVFEPERKKSAVTYLNSTMHPGTRKRGRPPKYNAVLGFGALTP 390 400 410 420 430 440 310 320 pF1KB9 ---------CWKNEIT-------------------------HDEHCAACKRGANLQPCGT .:: : :.. :..:.....: : : CCDS44 TSPQSSHPDSPENEKTETTFTFPAPVQPVSLPSPTSTDGDIHEDFCSVCRKSGQLLMCDT 450 460 470 480 490 500 330 340 350 360 370 380 pF1KB9 CPGAYHLSCLEPPLKTAPKGVWVCPRCQQKALKKDEGVPWTGMLAIVHSYVTHKTVKEEE : .:::.::.::::: :::.:.:::::.. :::.:..:: : :::::::...:..:::: CCDS44 CSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLKKEEAIPWPGTLAIVHSYIAYKAAKEEE 510 520 530 540 550 560 390 400 410 420 430 440 pF1KB9 KQKLLQRGSELQNEHQQLEERDRRLASAVQKCLELKTSLLARQRGTQSSLDRLRALLRLI :::::. .:.:..:..:::.. ..:.....::.:.:...::::. .:::.... :.::: CCDS44 KQKLLKWSSDLKQEREQLEQKVKQLSNSISKCMEMKNTILARQKEMHSSLEKVKQLIRLI 570 580 590 600 610 620 450 460 470 480 pF1KB9 QGEQLLQVTMTTTSPAPLLAGPW-TKPSVAATHPTVQHPQGHN .: .: . . . .. . . :: : :. ::: . :.... CCDS44 HGIDLSKPVDSEATVGAISNGPDCTPPANAATSTPAPSPSSQSCTANCNQGEETK 630 640 650 660 670 680 489 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 02:05:06 2016 done: Sat Nov 5 02:05:07 2016 Total Scan time: 3.890 Total Display time: 0.050 Function used was FASTA [36.3.4 Apr, 2011]