FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3218, 495 aa
  1>>>pF1KE3218 495 - 495 aa - 495 aa
Library: human.CCDS.faa
  18921897 residues in 33420 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.4612+/-0.000864; mu= 12.9022+/- 0.052
 mean_var=100.5383+/-20.929, 0's: 0 Z-trim(108.7): 54 B-trim: 628 in 1/50
 Lambda= 0.127911
 statistics sampled from 10488 (10530) to 10488 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.685), E-opt: 0.2 (0.315), width: 16
 Scan time: 1.610

The best scores are:                                      opt  bits E(33420)
CCDS9809.1 DCAF4 gene_id:26094|Hs109|chr14        ( 495) 3354 629.6 2.5e-180
CCDS55926.1 DCAF4 gene_id:26094|Hs109|chr14       ( 489) 3278 615.5 4.1e-176
CCDS9810.1 DCAF4 gene_id:26094|Hs109|chr14        ( 395) 2642 498.1 7.4e-141
CCDS41968.2 DCAF4 gene_id:26094|Hs109|chr14       ( 435) 2135 404.6 1.2e-112
CCDS33978.1 DCAF4L1 gene_id:285429|Hs109|chr4     ( 396) 2057 390.2 2.3e-108
CCDS6245.1 DCAF4L2 gene_id:138009|Hs109|chr8      ( 395) 2024 384.1 1.6e-106


>>CCDS9809.1 DCAF4 gene_id:26094|Hs109|chr14 (495 aa)
 initn: 3354 init1: 3354 opt: 3354 Z-score: 3351.2 bits: 629.6 E(33420): 2.5e-180
Smith-Waterman score: 3354; 99.4% identity (99.6% similar) in 495 aa overlap (1-495:1-495)

               10        20        30        40        50        60
pF1KE3 MNKSRWQSRRRHGRRSHQQNPWFRLRDSEDRSDSRAAQPAHDSGHGDDESPSTSSGTAGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 MNKSRWQSRRRHGRRSHQQNPWFRLRDSEDRSDSRAAQPAHDSGHGDDESPSTSSGTAGT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 SSVPELPGFYFDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 SSVPELPGFYFDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIAR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 MGFNASSMLRKSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 MGFNASSMLRKSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILAD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 TNSDRLFTVNDVTVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDS
       :::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 TNSDRLFTVNDVKVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE3 HILLCLMGLAETPGCATLLPASLFVNSHPGIDRPGMLCSFRIPGAWSCAWSLNIQANNCF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 HILLCLMGLAETPGCATLLPASLFVNSHPGIDRPGMLCSFRIPGAWSCAWSLNIQANNCF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE3 STGLSRRVLLTNVVTGHRQSFGTNSDVLAQQFAFMAPLLFNGCRSGEIFAIDLRCGNQGK
       :::::::::::::::::::::::::::::::::.::::::::::::::::::::::::::
CCDS98 STGLSRRVLLTNVVTGHRQSFGTNSDVLAQQFALMAPLLFNGCRSGEIFAIDLRCGNQGK
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE3 GWKATRLFHDSAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 GWKATRLFHDSAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLPL
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE3 HVHEEEGILVAVGQDCYTIIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAPG
       :::::::::::::::::: :::::::::::::::::::::::::::::::::::::::::
CCDS98 HVHEEEGILVAVGQDCYTRIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAPG
              430       440       450       460       470       480

              490
pF1KE3 LLMAVGQDLYCYSYS
       :::::::::::::::
CCDS98 LLMAVGQDLYCYSYS
              490

>>CCDS55926.1 DCAF4 gene_id:26094|Hs109|chr14 (489 aa)
 initn: 2890 init1: 1815 opt: 3278 Z-score: 3275.4 bits: 615.5 E(33420): 4.1e-176
Smith-Waterman score: 3278; 98.0% identity (98.0% similar) in 496 aa overlap (1-495:1-489)

               10        20        30        40        50        60
pF1KE3 MNKSRWQSRRRHGRRSHQQNPWFRLRDSEDRSDSRAAQPAHDSGHGDDESPSTSSGTAGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MNKSRWQSRRRHGRRSHQQNPWFRLRDSEDRSDSRAAQPAHDSGHGDDESPSTSSGTAGT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 SSVPELPGFYFDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 SSVPELPGFYFDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIAR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE3 MGFNASSMLRKSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MGFNASSMLRKSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILAD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE3 TNSDRLFTVNDVTVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDS
       :::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 TNSDRLFTVNDVKVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDS
              190       200       210       220       230       240

              250       260        270       280       290
pF1KE3 HILLCLMGLAETPGCATLLPASLFVNSHP-GIDRPGMLCSFRIPGAWSCAWSLNIQANNC
       ::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::::
CCDS55 HILLCLMGLAETPGCATLLPASLFVNSHPAGIDRPGMLCSFRIPGAWSCAWSLNIQANNC
              250       260       270       280       290       300

     300       310       320       330       340       350
pF1KE3 FSTGLSRRVLLTNVVTGHRQSFGTNSDVLAQQFAFMAPLLFNGCRSGEIFAIDLRCGNQG
       ::::::::::::::::::::::::::::::       :::::::::::::::::::::::
CCDS55 FSTGLSRRVLLTNVVTGHRQSFGTNSDVLA-------PLLFNGCRSGEIFAIDLRCGNQG
              310       320       330              340       350

     360       370       380       390       400       410
pF1KE3 KGWKATRLFHDSAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 KGWKATRLFHDSAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLP
           360       370       380       390       400       410

     420       430       440       450       460       470
pF1KE3 LHVHEEEGILVAVGQDCYTIIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAP
       ::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::
CCDS55 LHVHEEEGILVAVGQDCYTRIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAP
           420       430       440       450       460       470

     480       490
pF1KE3 GLLMAVGQDLYCYSYS
       ::::::::::::::::
CCDS55 GLLMAVGQDLYCYSYS
           480

>>CCDS9810.1 DCAF4 gene_id:26094|Hs109|chr14 (395 aa)
 initn: 2642 init1: 2642 opt: 2642 Z-score: 2642.5 bits: 498.1 E(33420): 7.4e-141
Smith-Waterman score: 2642; 99.2% identity (99.5% similar) in 395 aa overlap (101-495:1-395)

               80        90       100       110       120       130
pF1KE3 FDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIARMGFNASSMLR
                                     ::::::::::::::::::::::::::::::
CCDS98                               MESKRLRLLQEEDRRKKIARMGFNASSMLR
                                             10        20        30

              140       150       160       170       180       190
pF1KE3 KSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILADTNSDRLFTVN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 KSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILADTNSDRLFTVN
               40        50        60        70        80        90

              200       210       220       230       240       250
pF1KE3 DVTVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDSHILLCLMGLA
       :: :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 DVKVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDSHILLCLMGLA
              100       110       120       130       140       150

              260       270       280       290       300       310
pF1KE3 ETPGCATLLPASLFVNSHPGIDRPGMLCSFRIPGAWSCAWSLNIQANNCFSTGLSRRVLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 ETPGCATLLPASLFVNSHPGIDRPGMLCSFRIPGAWSCAWSLNIQANNCFSTGLSRRVLL
              160       170       180       190       200       210

              320       330       340       350       360       370
pF1KE3 TNVVTGHRQSFGTNSDVLAQQFAFMAPLLFNGCRSGEIFAIDLRCGNQGKGWKATRLFHD
       :::::::::::::::::::::::.::::::::::::::::::::::::::::::::::::
CCDS98 TNVVTGHRQSFGTNSDVLAQQFALMAPLLFNGCRSGEIFAIDLRCGNQGKGWKATRLFHD
              220       230       240       250       260       270

              380       390       400       410       420       430
pF1KE3 SAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLPLHVHEEEGILV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 SAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLPLHVHEEEGILV
              280       290       300       310       320       330

              440       450       460       470       480       490
pF1KE3 AVGQDCYTIIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAPGLLMAVGQDLY
       :::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 AVGQDCYTRIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAPGLLMAVGQDLY
              340       350       360       370       380       390

pF1KE3 CYSYS
       :::::
CCDS98 CYSYS

>>CCDS41968.2 DCAF4 gene_id:26094|Hs109|chr14 (435 aa)
 initn: 2929 init1: 1539 opt: 2135 Z-score: 2136.3 bits: 404.6 E(33420): 1.2e-112
Smith-Waterman score: 2813; 86.9% identity (87.1% similar) in 496 aa overlap (1-495:1-435)

               10        20        30        40        50        60
pF1KE3 MNKSRWQSRRRHGRRSHQQNPWFRLRDSEDRSDSRAAQPAHDSGHGDDESPSTSSGTAGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 MNKSRWQSRRRHGRRSHQQNPWFRLRDSEDRSDSRAAQPAHDSGHGDDESPSTSSGTAGT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE3 SSVPELPGFYFDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIAR
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 SSVPELPGFYFDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKK---
               70        80        90       100       110

              130       140       150       160       170       180
pF1KE3 MGFNASSMLRKSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILAD
                                                                 ::
CCDS41 ----------------------------------------------------------AD

              190       200       210       220       230       240
pF1KE3 TNSDRLFTVNDVTVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDS
       :::::::::::: :::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 TNSDRLFTVNDVKVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDS
     120       130       140       150       160       170

              250       260        270       280       290
pF1KE3 HILLCLMGLAETPGCATLLPASLFVNSHP-GIDRPGMLCSFRIPGAWSCAWSLNIQANNC
       ::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::::
CCDS41 HILLCLMGLAETPGCATLLPASLFVNSHPAGIDRPGMLCSFRIPGAWSCAWSLNIQANNC
     180       190       200       210       220       230

     300       310       320       330       340       350
pF1KE3 FSTGLSRRVLLTNVVTGHRQSFGTNSDVLAQQFAFMAPLLFNGCRSGEIFAIDLRCGNQG
       ::::::::::::::::::::::::::::::::::.:::::::::::::::::::::::::
CCDS41 FSTGLSRRVLLTNVVTGHRQSFGTNSDVLAQQFALMAPLLFNGCRSGEIFAIDLRCGNQG
     240       250       260       270       280       290

     360       370       380       390       400       410
pF1KE3 KGWKATRLFHDSAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 KGWKATRLFHDSAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLP
     300       310       320       330       340       350

     420       430       440       450       460       470
pF1KE3 LHVHEEEGILVAVGQDCYTIIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAP
       ::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::
CCDS41 LHVHEEEGILVAVGQDCYTRIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAP
     360       370       380       390       400       410

     480       490
pF1KE3 GLLMAVGQDLYCYSYS
       ::::::::::::::::
CCDS41 GLLMAVGQDLYCYSYS
     420       430

>>CCDS33978.1 DCAF4L1 gene_id:285429|Hs109|chr4 (396 aa)
 initn: 2060 init1: 1974 opt: 2057 Z-score: 2059.1 bits: 390.2 E(33420): 2.3e-108
Smith-Waterman score: 2057; 76.5% identity (92.4% similar) in 396 aa overlap (101-495:1-396)

               80        90       100       110       120       130
pF1KE3 FDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIARMGFNASSMLR
                                     ::..:::::.:: . ::.::::::::::::
CCDS33                               MEAERLRLLEEEAKLKKVARMGFNASSMLR
                                             10        20        30

              140       150       160       170       180       190
pF1KE3 KSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILADTNSDRLFTVN
       ::::::::::.: .::.:::.::::::::::::.:::.:::::::.:::.::::.::.::
CCDS33 KSQLGFLNVTSYSRLANELRVSCMERKKVQIRSLDPSSLASDRFNFILASTNSDQLFVVN
               40        50        60        70        80        90

              200       210       220       230       240       250
pF1KE3 DVTVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDSHILLCLMGLA
       .: : :::::::.:..:: :...:.. .:::  ::::.:.::::::.::::.:::. :..
CCDS33 QVEVEGSKYGIISLRTLKIPSFHVYVLRNLYVPNRKVKSLCWASLNQLDSHVLLCFEGIT
              100       110       120       130       140       150

              260       270       280       290       300       310
pF1KE3 ETPGCATLLPASLFVNSHPGIDRPGMLCSFRIPGAWSCAWSLNIQANNCFSTGLSRRVLL
       ..:.::.::::: :.. :  ...:::::::.:: ::::::::: .: .:::.:::..:::
CCDS33 DAPSCAVLLPASRFLSVHTRVNQPGMLCSFQIPEAWSCAWSLNTRAYHCFSAGLSQQVLL
              160       170       180       190       200       210

              320       330       340       350       360       370
pF1KE3 TNVVTGHRQSFGTNSDVLAQQFAFMAPLLFNGCRSGEIFAIDLRCGNQGKGWKATRLFHD
       :.:.:::.::: :.:::::::::  :::::::::::::::::::: :.::::.:::::::
CCDS33 TSVATGHQQSFDTSSDVLAQQFASTAPLLFNGCRSGEIFAIDLRCRNRGKGWRATRLFHD
              220       230       240       250       260       270

              380       390       400       410       420       430
pF1KE3 SAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLPLHVHEEEGILV
       ::::::.:::.:: ::::::.:::::::::.::::::::::::: :::::::::::::.:
CCDS33 SAVTSVQILQEEQCLMASDMTGKIKLWDLRATKCVRQYEGHVNESAYLPLHVHEEEGIVV
              280       290       300       310       320       330

              440       450       460       470        480
pF1KE3 AVGQDCYTIIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGA-PGLLMAVGQDL
       :::::::: :::::::.::::::::: ::. :::::::.::::: ::: ::::::: :::
CCDS33 AVGQDCYTRIWSLHDAHLLRTIPSPYSASEDDIPSVAFASRLGGIRGAAPGLLMAVRQDL
              340       350       360       370       380       390

     490
pF1KE3 YCYSYS
       ::. .:
CCDS33 YCFPFS

>>CCDS6245.1 DCAF4L2 gene_id:138009|Hs109|chr8 (395 aa)
 initn: 2024 init1: 2024 opt: 2024 Z-score: 2026.2 bits: 384.1 E(33420): 1.6e-106
Smith-Waterman score: 2024; 74.4% identity (91.9% similar) in 394 aa overlap (101-494:1-394)

               80        90       100       110       120       130
pF1KE3 FDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRKKIARMGFNASSMLR
                                     ::::: :::.: :..:: .:.:.:: ::::
CCDS62                               MESKRPRLLEEADKQKKTVRVGLNAPSMLR
                                             10        20        30

              140       150       160       170       180       190
pF1KE3 KSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASDRFNLILADTNSDRLFTVN
       :.:::::  .:::..:.:::.:::.::::::.: :::.::::::: :::.::.:.:::::
CCDS62 KNQLGFLRFANYCRIARELRVSCMQRKKVQIHSWDPSSLASDRFNRILANTNTDQLFTVN
               40        50        60        70        80        90

              200       210       220       230       240       250
pF1KE3 DVTVGGSKYGIINLQSLKTPTLKVFMHENLYFTNRKVNSVCWASLNHLDSHILLCLMGLA
       .: .::::::::....: :: :.:. :..::  ::::::.:::::::::::.:::..:::
CCDS62 QVEAGGSKYGIITMRGLTTPELRVYPHKTLYVPNRKVNSMCWASLNHLDSHLLLCFVGLA
              100       110       120       130       140       150

              260       270       280       290       300       310
pF1KE3 ETPGCATLLPASLFVNSHPGIDRPGMLCSFRIPGAWSCAWSLNIQANNCFSTGLSRRVLL
       .::.::.:::::::..: ::. ::::::::.:: ::::::::.:.: . ::::::..:::
CCDS62 DTPSCAVLLPASLFIGSFPGMRRPGMLCSFQIPDAWSCAWSLSIHAYHSFSTGLSQQVLL
              160       170       180       190       200       210

              320       330       340       350       360       370
pF1KE3 TNVVTGHRQSFGTNSDVLAQQFAFMAPLLFNGCRSGEIFAIDLRCGNQGKGWKATRLFHD
       :::::::.:::::.:::::::::.:.:::::::::::::.:::::::::.::::  : ::
CCDS62 TNVVTGHQQSFGTSSDVLAQQFAIMTPLLFNGCRSGEIFGIDLRCGNQGSGWKAICLSHD
              220       230       240       250       260       270

              380       390       400       410       420       430
pF1KE3 SAVTSVRILQDEQYLMASDMAGKIKLWDLRTTKCVRQYEGHVNEYAYLPLHVHEEEGILV
       :::::..:::: :.:..:::.: :::::::.:::: :::::::. ::::.::.::::...
CCDS62 SAVTSLQILQDGQFLVSSDMTGTIKLWDLRATKCVTQYEGHVNNSAYLPVHVNEEEGVVA
              280       290       300       310       320       330

              440       450       460       470       480       490
pF1KE3 AVGQDCYTIIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRGAPGLLMAVGQDLY
       :::::::: ::::. ..:: :::::::::. ::::::::::::: :::::::::: .:::
CCDS62 AVGQDCYTRIWSLRHGHLLTTIPSPYPASENDIPSVAFSSRLGGFRGAPGLLMAVREDLY
              340       350       360       370       380       390

pF1KE3 CYSYS
       :.::
CCDS62 CFSYG


495 residues in 1 query sequences
18921897 residues in 33420 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Apr 23 11:11:42 2019 done: Tue Apr 23 11:11:42 2019
 Total Scan time: 1.610 Total Display time: 0.060

Function used was FASTA [36.3.4 Apr, 2011]