FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5867, 462 aa 1>>>pF1KB5867 462 - 462 aa - 462 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.0105+/-0.000867; mu= -3.3651+/- 0.052 mean_var=239.3353+/-49.593, 0's: 0 Z-trim(115.5): 9 B-trim: 291 in 2/52 Lambda= 0.082903 statistics sampled from 16070 (16078) to 16070 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.79), E-opt: 0.2 (0.494), width: 16 Scan time: 2.450 The best scores are: opt bits E(32554) CCDS94.1 ERRFI1 gene_id:54206|Hs108|chr1 ( 462) 3200 395.3 6.8e-110 CCDS33928.1 TNK2 gene_id:10188|Hs108|chr3 (1038) 481 70.3 1e-11 CCDS77875.1 TNK2 gene_id:10188|Hs108|chr3 (1040) 481 70.3 1e-11 CCDS33927.1 TNK2 gene_id:10188|Hs108|chr3 (1086) 481 70.3 1.1e-11 >>CCDS94.1 ERRFI1 gene_id:54206|Hs108|chr1 (462 aa) initn: 3200 init1: 3200 opt: 3200 Z-score: 2086.4 bits: 395.3 E(32554): 6.8e-110 Smith-Waterman score: 3200; 100.0% identity (100.0% similar) in 462 aa overlap (1-462:1-462) 10 20 30 40 50 60 pF1KB5 MSIAGVAAQEIRVPLKTGFLHNGRAMGNMRKTYWSSRSEFKNNFLNIDPITMAYSLNSSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 MSIAGVAAQEIRVPLKTGFLHNGRAMGNMRKTYWSSRSEFKNNFLNIDPITMAYSLNSSA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 QERLIPLGHASKSAPMNGHCFAENGPSQKSSLPPLLIPPSENLGPHEEDQVVCGFKKLTV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 QERLIPLGHASKSAPMNGHCFAENGPSQKSSLPPLLIPPSENLGPHEEDQVVCGFKKLTV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 NGVCASTPPLTPIKNSPSLFPCAPLCERGSRPLPPLPISEALSLDDTDCEVEFLTSSDTD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 NGVCASTPPLTPIKNSPSLFPCAPLCERGSRPLPPLPISEALSLDDTDCEVEFLTSSDTD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 FLLEDSTLSDFKYDVPGRRSFRGCGQINYAYFDTPAVSAADLSYVSDQNGGVPDPNPPPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 FLLEDSTLSDFKYDVPGRRSFRGCGQINYAYFDTPAVSAADLSYVSDQNGGVPDPNPPPP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 QTHRRLRRSHSGPAGSFNKPAIRISNCCIHRASPNSDEDKPEVPPRVPIPPRPVKPDYRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 QTHRRLRRSHSGPAGSFNKPAIRISNCCIHRASPNSDEDKPEVPPRVPIPPRPVKPDYRR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 WSAEVTSSTYSDEDRPPKVPPREPLSPSNSRTPSPKSLPSYLNGVMPPTQSFAPDPKYVS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 WSAEVTSSTYSDEDRPPKVPPREPLSPSNSRTPSPKSLPSYLNGVMPPTQSFAPDPKYVS 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 SKALQRQNSEGSASKVPCILPIIENGKKVSSTHYYLLPERPPYLDKYEKFFREAEETNGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 SKALQRQNSEGSASKVPCILPIIENGKKVSSTHYYLLPERPPYLDKYEKFFREAEETNGG 370 380 390 400 410 420 430 440 450 460 pF1KB5 AQIQPLPADCGISSATEKPDSKTKMDLGGHVKRKHLSYVVSP :::::::::::::::::::::::::::::::::::::::::: CCDS94 AQIQPLPADCGISSATEKPDSKTKMDLGGHVKRKHLSYVVSP 430 440 450 460 >>CCDS33928.1 TNK2 gene_id:10188|Hs108|chr3 (1038 aa) initn: 669 init1: 222 opt: 481 Z-score: 323.7 bits: 70.3 E(32554): 1e-11 Smith-Waterman score: 665; 33.1% identity (56.1% similar) in 483 aa overlap (2-427:445-889) 10 20 30 pF1KB5 MSIAGVAAQEIRVPLKTGFLHNGRAMGNMRK :.::..::.: ::...:.:.:.. .. :. CCDS33 VIEGRAENYWWRGQNTRTLCVGPFPRNVVTSVAGLSAQDISQPLQNSFIHTGHGDSDPRH 420 430 440 450 460 470 40 50 60 70 80 pF1KB5 TYWSSRSEFKNNFLN--IDPITMAYSLNSSAQERLIPLGHASKSA---------PMNGHC :. ... . .:. .:: . :.. :... :: ..: . :... CCDS33 C-WGFPDRIDELYLGNPMDPPDL-LSVELSTSRPPQHLGGVKKPTYDPVSEDQDPLSSD- 480 490 500 510 520 530 90 100 110 pF1KB5 FAENG---PSQKSSL----PPLLIPPSE---------NLGPHEEDQVV-----CG--FKK : . : :. .: : .: .. .: :. :: :. . . CCDS33 FKRLGLRKPGLPRGLWLAKPSARVPGTKASRGSGAEVTLIDFGEEPVVPALRPCAPSLAQ 540 550 560 570 580 590 120 130 140 150 160 170 pF1KB5 LTVNGVCA---STPPLTPIKNSPSLFPCAPLCERGSRPLPPLPISEALSLDDTDCEVEFL :.... :. ::: .: . : . .:. . .::::: : . .. :. : :. . CCDS33 LAMDA-CSLLDETPPQSPTRALPRPLHPTPVVDWDARPLPPPPAYDDVAQDEDDFEICSI 600 610 620 630 640 650 180 190 200 210 220 230 pF1KB5 TSSDTDFLLEDSTLSDFKYDVPGRRSFRGCGQINYAYFDT---PAVSAADLSYVSDQNGG .::: ::. : :: :::. : : .. :.:: CCDS33 ----------NSTLVG--AGVPAGPS---QGQTNYAFVPEQARPPPPLEDNLFLPPQGGG 660 670 680 690 240 250 260 270 280 pF1KB5 VPDPNPPPPQTHRRLR----RSHSGPAGSFNKPAIRISNCCIHRASPNSDEDKPEVPPRV : . . . :. :. ..:::: :: ::..: :::.::::: CCDS33 KPPSSAQTAEIFQALQQECMRQLQAPAGS---PA--------PSPSPGGD-DKPQVPPRV 700 710 720 730 740 290 300 310 320 330 pF1KB5 PIPPRPVKPDYRRWSA---EVTSSTYSDEDRPPKVPPREPLSPSNSRTPSP------KSL ::::::..: . : : .: . ::.:::::::::..:::::: . : CCDS33 PIPPRPTRPHVQLSPAPPGEEETSQWPGPASPPRVPPREPLSPQGSRTPSPLVPPGSSPL 750 760 770 780 790 800 340 350 360 370 380 390 pF1KB5 PSYLNG----VMPPTQSFAPDPKYVSSKALQRQNSEGSASKVPCILPIIENGKKVSSTHY : :.. .:: ::::: ::::.. ...: . ... ::::::...::::::::: CCDS33 PPRLSSSPGKTMPTTQSFASDPKYATPQVIQAPGPRAG----PCILPIVRDGKKVSSTHY 810 820 830 840 850 400 410 420 430 440 450 pF1KB5 YLLPERPPYLDKYEKFFREAEETNGGAQIQPLPADCGISSATEKPDSKTKMDLGGHVKRK ::::::: ::..:..:.:::. . . ::: CCDS33 YLLPERPSYLERYQRFLREAQSPE---EPTPLPVPLLLPPPSTPAPAAPTATVRPMPQAA 860 870 880 890 900 910 460 pF1KB5 HLSYVVSP CCDS33 LDPKANFSTNNSNPGARPPPPRATARLPQRGCPGDGPEAGRPADKIQMAMVHGVTTEECQ 920 930 940 950 960 970 >>CCDS77875.1 TNK2 gene_id:10188|Hs108|chr3 (1040 aa) initn: 669 init1: 222 opt: 481 Z-score: 323.6 bits: 70.3 E(32554): 1e-11 Smith-Waterman score: 665; 33.1% identity (56.1% similar) in 483 aa overlap (2-427:477-921) 10 20 30 pF1KB5 MSIAGVAAQEIRVPLKTGFLHNGRAMGNMRK :.::..::.: ::...:.:.:.. .. :. CCDS77 VIEGRAENYWWRGQNTRTLCVGPFPRNVVTSVAGLSAQDISQPLQNSFIHTGHGDSDPRH 450 460 470 480 490 500 40 50 60 70 80 pF1KB5 TYWSSRSEFKNNFLN--IDPITMAYSLNSSAQERLIPLGHASKSA---------PMNGHC :. ... . .:. .:: . :.. :... :: ..: . :... CCDS77 C-WGFPDRIDELYLGNPMDPPDL-LSVELSTSRPPQHLGGVKKPTYDPVSEDQDPLSSD- 510 520 530 540 550 560 90 100 110 pF1KB5 FAENG---PSQKSSL----PPLLIPPSE---------NLGPHEEDQVV-----CG--FKK : . : :. .: : .: .. .: :. :: :. . . CCDS77 FKRLGLRKPGLPRGLWLAKPSARVPGTKASRGSGAEVTLIDFGEEPVVPALRPCAPSLAQ 570 580 590 600 610 620 120 130 140 150 160 170 pF1KB5 LTVNGVCA---STPPLTPIKNSPSLFPCAPLCERGSRPLPPLPISEALSLDDTDCEVEFL :.... :. ::: .: . : . .:. . .::::: : . .. :. : :. . CCDS77 LAMDA-CSLLDETPPQSPTRALPRPLHPTPVVDWDARPLPPPPAYDDVAQDEDDFEICSI 630 640 650 660 670 680 180 190 200 210 220 230 pF1KB5 TSSDTDFLLEDSTLSDFKYDVPGRRSFRGCGQINYAYFDT---PAVSAADLSYVSDQNGG .::: ::. : :: :::. : : .. :.:: CCDS77 ----------NSTLVG--AGVPAGPS---QGQTNYAFVPEQARPPPPLEDNLFLPPQGGG 690 700 710 720 240 250 260 270 280 pF1KB5 VPDPNPPPPQTHRRLR----RSHSGPAGSFNKPAIRISNCCIHRASPNSDEDKPEVPPRV : . . . :. :. ..:::: :: ::..: :::.::::: CCDS77 KPPSSAQTAEIFQALQQECMRQLQAPAGS---PA--------PSPSPGGD-DKPQVPPRV 730 740 750 760 770 290 300 310 320 330 pF1KB5 PIPPRPVKPDYRRWSA---EVTSSTYSDEDRPPKVPPREPLSPSNSRTPSP------KSL ::::::..: . : : .: . ::.:::::::::..:::::: . : CCDS77 PIPPRPTRPHVQLSPAPPGEEETSQWPGPASPPRVPPREPLSPQGSRTPSPLVPPGSSPL 780 790 800 810 820 830 340 350 360 370 380 390 pF1KB5 PSYLNG----VMPPTQSFAPDPKYVSSKALQRQNSEGSASKVPCILPIIENGKKVSSTHY : :.. .:: ::::: ::::.. ...: . ... ::::::...::::::::: CCDS77 PPRLSSSPGKTMPTTQSFASDPKYATPQVIQAPGPRAG----PCILPIVRDGKKVSSTHY 840 850 860 870 880 890 400 410 420 430 440 450 pF1KB5 YLLPERPPYLDKYEKFFREAEETNGGAQIQPLPADCGISSATEKPDSKTKMDLGGHVKRK ::::::: ::..:..:.:::. . . ::: CCDS77 YLLPERPSYLERYQRFLREAQSPE---EPTPLPVPLLLPPPSTPAPAAPTATVRPMPQAA 900 910 920 930 940 460 pF1KB5 HLSYVVSP CCDS77 LDPKANFSTNNSNPGARPPPPRATARLPQRGCPGDGPEAGRPADKIQMVEQLFGLGLRPR 950 960 970 980 990 1000 >>CCDS33927.1 TNK2 gene_id:10188|Hs108|chr3 (1086 aa) initn: 669 init1: 222 opt: 481 Z-score: 323.4 bits: 70.3 E(32554): 1.1e-11 Smith-Waterman score: 636; 38.9% identity (57.9% similar) in 321 aa overlap (127-427:681-967) 100 110 120 130 140 150 pF1KB5 IPPSENLGPHEEDQVVCGFKKLTVNGVCASTPPLTPIKNSPSLFPCAPLCERGSRPLPPL ::: .: . : . .:. . .::::: CCDS33 FGEEPVVPALRPCAPSLAQLAMDACSLLDETPPQSPTRALPRPLHPTPVVDWDARPLPPP 660 670 680 690 700 710 160 170 180 190 200 210 pF1KB5 PISEALSLDDTDCEVEFLTSSDTDFLLEDSTLSDFKYDVPGRRSFRGCGQINYAYFDT-- : . .. :. : :. . .::: ::. : :: :::. CCDS33 PAYDDVAQDEDDFEICSI----------NSTL--VGAGVPAGPS---QGQTNYAFVPEQA 720 730 740 750 220 230 240 250 260 pF1KB5 -PAVSAADLSYVSDQNGGVPDPNPPPPQTHRRLR----RSHSGPAGSFNKPAIRISNCCI : : .. :.:: : . . . :. :. ..:::: :: CCDS33 RPPPPLEDNLFLPPQGGGKPPSSAQTAEIFQALQQECMRQLQAPAGS---PA-------- 760 770 780 790 800 270 280 290 300 310 320 pF1KB5 HRASPNSDEDKPEVPPRVPIPPRPVKPDYRRWSA---EVTSSTYSDEDRPPKVPPREPLS ::..: :::.:::::::::::..: . : : .: . ::.:::::::: CCDS33 PSPSPGGD-DKPQVPPRVPIPPRPTRPHVQLSPAPPGEEETSQWPGPASPPRVPPREPLS 810 820 830 840 850 860 330 340 350 360 370 pF1KB5 PSNSRTPSP------KSLPSYLNG----VMPPTQSFAPDPKYVSSKALQRQNSEGSASKV :..:::::: . :: :.. .:: ::::: ::::.. ...: . ... CCDS33 PQGSRTPSPLVPPGSSPLPPRLSSSPGKTMPTTQSFASDPKYATPQVIQAPGPRAG---- 870 880 890 900 910 380 390 400 410 420 430 pF1KB5 PCILPIIENGKKVSSTHYYLLPERPPYLDKYEKFFREAEETNGGAQIQPLPADCGISSAT ::::::...:::::::::::::::: ::..:..:.:::. . . ::: CCDS33 PCILPIVRDGKKVSSTHYYLLPERPSYLERYQRFLREAQSPE---EPTPLPVPLLLPPPS 920 930 940 950 960 970 440 450 460 pF1KB5 EKPDSKTKMDLGGHVKRKHLSYVVSP CCDS33 TPAPAAPTATVRPMPQAALDPKANFSTNNSNPGARPPPPRATARLPQRGCPGDGPEAGRP 980 990 1000 1010 1020 1030 462 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 10:34:48 2016 done: Sat Nov 5 10:34:48 2016 Total Scan time: 2.450 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]