FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA1709, 1025 aa
  1>>>pF1KSDA1709 1025 - 1025 aa - 1025 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.9270+/-0.000984; mu= 18.7541+/- 0.059
 mean_var=85.0197+/-17.195, 0's: 0 Z-trim(105.4): 16 B-trim: 0 in 0/51
 Lambda= 0.139096
 statistics sampled from 8385 (8394) to 8385 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.618), E-opt: 0.2 (0.258), width: 16
 Scan time: 4.250

The best scores are:                                      opt   bits E(32554)
CCDS7889.1 NAT10 gene_id:55226|Hs108|chr11         (1025) 6669 1349.1       0
CCDS44568.1 NAT10 gene_id:55226|Hs108|chr11        ( 953) 6194 1253.7       0

>>CCDS7889.1 NAT10 gene_id:55226|Hs108|chr11            (1025 aa)
 initn: 6669 init1: 6669 opt: 6669 Z-score: 7228.4 bits: 1349.1 E(32554): 0
Smith-Waterman score: 6669; 99.9% identity (100.0% similar) in 1025 aa overlap (1-1025:1-1025)

               10        20        30        40        50        60
pF1KSD MHRKKVDNRIRILIENGVAERQRSLFVVVGDRGKDQVVILHHMLSKATVKARPSVLWCYK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 MHRKKVDNRIRILIENGVAERQRSLFVVVGDRGKDQVVILHHMLSKATVKARPSVLWCYK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KSD KELGFSSHRKKRMRQLQKKIKNGTLNIKQDDPFELFIAATNIRYCYYNETHKILGNTFGM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 KELGFSSHRKKRMRQLQKKIKNGTLNIKQDDPFELFIAATNIRYCYYNETHKILGNTFGM
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KSD CVLQDFEALTPNLLARTVETVEGGGLVVILLRTMNSLKQLYTVTMDVHSRYRTEAHQDVV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 CVLQDFEALTPNLLARTVETVEGGGLVVILLRTMNSLKQLYTVTMDVHSRYRTEAHQDVV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KSD GRFNERFILSLASCKKCLVIDDQLNILPISSHVATMEALPPQTPDESLGPSDLELRELKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 GRFNERFILSLASCKKCLVIDDQLNILPISSHVATMEALPPQTPDESLGPSDLELRELKE
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KSD SLQDTQPVGVLVDCCKTLDQAKAVLKFIEGISEKTLRSTVALTAARGRGKSAALGLAIAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SLQDTQPVGVLVDCCKTLDQAKAVLKFIEGISEKTLRSTVALTAARGRGKSAALGLAIAG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KSD AVAFGYSNIFVTSPSPDNLHTLFEFVFKGFDALQYQEHLDYEIIQSLNPEFNKAVIRVNV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 AVAFGYSNIFVTSPSPDNLHTLFEFVFKGFDALQYQEHLDYEIIQSLNPEFNKAVIRVNV
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KSD FREHRQTIQYIHPADAVKLGQAELVVIDEAAAIPLPLVKSLLGPYLVFMASTINGYEGTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 FREHRQTIQYIHPADAVKLGQAELVVIDEAAAIPLPLVKSLLGPYLVFMASTINGYEGTG
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KSD RSLSLKLIQQLRQQSAQSQVSTTAENKTTTTARLASARTLHEVSLQESIRYAPGDAVEKW
       ::::::::::::::::::::::::::::::::::::::::.:::::::::::::::::::
CCDS78 RSLSLKLIQQLRQQSAQSQVSTTAENKTTTTARLASARTLYEVSLQESIRYAPGDAVEKW
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KSD LNDLLCLDCLNITRIVSGCPLPEACELYYVNRDTLFCYHKASEVFLQRLMALYVASHYKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 LNDLLCLDCLNITRIVSGCPLPEACELYYVNRDTLFCYHKASEVFLQRLMALYVASHYKN
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KSD SPNDLQMLSDAPAHHLFCLLPPVPPTQNALPEVLAVIQVCLEGEISRQSILNSLSRGKKA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SPNDLQMLSDAPAHHLFCLLPPVPPTQNALPEVLAVIQVCLEGEISRQSILNSLSRGKKA
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KSD SGDLIPWTVSEQFQDPDFGGLSGGRVVRIAVHPDYQGMGYGSRALQLLQMYYEGRFPCLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SGDLIPWTVSEQFQDPDFGGLSGGRVVRIAVHPDYQGMGYGSRALQLLQMYYEGRFPCLE
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KSD EKVLETPQEIHTVSSEAVSLLEEVITPRKDLPPLLLKLNERPAERLDYLGVSYGLTPRLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 EKVLETPQEIHTVSSEAVSLLEEVITPRKDLPPLLLKLNERPAERLDYLGVSYGLTPRLL
              670       680       690       700       710       720

              730       740       750       760       770       780
pF1KSD KFWKRAGFVPVYLRQTPNDLTGEHSCIMLKTLTDEDEADQGGWLAAFWKDFRRRFLALLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 KFWKRAGFVPVYLRQTPNDLTGEHSCIMLKTLTDEDEADQGGWLAAFWKDFRRRFLALLS
              730       740       750       760       770       780

              790       800       810       820       830       840
pF1KSD YQFSTFSPSLALNIIQNRNMGKPAQPALSREELEALFLPYDLKRLEMYSRNMVDYHLIMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 YQFSTFSPSLALNIIQNRNMGKPAQPALSREELEALFLPYDLKRLEMYSRNMVDYHLIMD
              790       800       810       820       830       840

              850       860       870       880       890       900
pF1KSD MIPAISRIYFLNQLGDLALSAAQSALLLGIGLQHKSVDQLEKEIELPSGQLMGLFNRIIR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 MIPAISRIYFLNQLGDLALSAAQSALLLGIGLQHKSVDQLEKEIELPSGQLMGLFNRIIR
              850       860       870       880       890       900

              910       920       930       940       950       960
pF1KSD KVVKLFNEVQEKAIEEQMVAAKDVVMEPTMKTLSDDLDEAAKEFQEKHKKEVGKLKSMDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 KVVKLFNEVQEKAIEEQMVAAKDVVMEPTMKTLSDDLDEAAKEFQEKHKKEVGKLKSMDL
              910       920       930       940       950       960

              970       980       990      1000      1010      1020
pF1KSD SEYIIRGDDEEWNEVLNKAGPNASIISLKSDKKRKLEAKQEPKQSKKLKNRETKNKKDMK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SEYIIRGDDEEWNEVLNKAGPNASIISLKSDKKRKLEAKQEPKQSKKLKNRETKNKKDMK
              970       980       990      1000      1010      1020

pF1KSD LKRKK
       :::::
CCDS78 LKRKK

>>CCDS44568.1 NAT10 gene_id:55226|Hs108|chr11           (953 aa)
 initn: 6194 init1: 6194 opt: 6194 Z-score: 6713.7 bits: 1253.7 E(32554): 0
Smith-Waterman score: 6194; 99.9% identity (100.0% similar) in 953 aa overlap (73-1025:1-953)

             50        60        70        80        90       100
pF1KSD MLSKATVKARPSVLWCYKKELGFSSHRKKRMRQLQKKIKNGTLNIKQDDPFELFIAATNI
                                     ::::::::::::::::::::::::::::::
CCDS44                               MRQLQKKIKNGTLNIKQDDPFELFIAATNI
                                             10        20        30

            110       120       130       140       150       160
pF1KSD RYCYYNETHKILGNTFGMCVLQDFEALTPNLLARTVETVEGGGLVVILLRTMNSLKQLYT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RYCYYNETHKILGNTFGMCVLQDFEALTPNLLARTVETVEGGGLVVILLRTMNSLKQLYT
               40        50        60        70        80        90

            170       180       190       200       210       220
pF1KSD VTMDVHSRYRTEAHQDVVGRFNERFILSLASCKKCLVIDDQLNILPISSHVATMEALPPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VTMDVHSRYRTEAHQDVVGRFNERFILSLASCKKCLVIDDQLNILPISSHVATMEALPPQ
              100       110       120       130       140       150

            230       240       250       260       270       280
pF1KSD TPDESLGPSDLELRELKESLQDTQPVGVLVDCCKTLDQAKAVLKFIEGISEKTLRSTVAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TPDESLGPSDLELRELKESLQDTQPVGVLVDCCKTLDQAKAVLKFIEGISEKTLRSTVAL
              160       170       180       190       200       210

            290       300       310       320       330       340
pF1KSD TAARGRGKSAALGLAIAGAVAFGYSNIFVTSPSPDNLHTLFEFVFKGFDALQYQEHLDYE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TAARGRGKSAALGLAIAGAVAFGYSNIFVTSPSPDNLHTLFEFVFKGFDALQYQEHLDYE
              220       230       240       250       260       270

            350       360       370       380       390       400
pF1KSD IIQSLNPEFNKAVIRVNVFREHRQTIQYIHPADAVKLGQAELVVIDEAAAIPLPLVKSLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 IIQSLNPEFNKAVIRVNVFREHRQTIQYIHPADAVKLGQAELVVIDEAAAIPLPLVKSLL
              280       290       300       310       320       330

            410       420       430       440       450       460
pF1KSD GPYLVFMASTINGYEGTGRSLSLKLIQQLRQQSAQSQVSTTAENKTTTTARLASARTLHE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::.:
CCDS44 GPYLVFMASTINGYEGTGRSLSLKLIQQLRQQSAQSQVSTTAENKTTTTARLASARTLYE
              340       350       360       370       380       390

            470       480       490       500       510       520
pF1KSD VSLQESIRYAPGDAVEKWLNDLLCLDCLNITRIVSGCPLPEACELYYVNRDTLFCYHKAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VSLQESIRYAPGDAVEKWLNDLLCLDCLNITRIVSGCPLPEACELYYVNRDTLFCYHKAS
              400       410       420       430       440       450

            530       540       550       560       570       580
pF1KSD EVFLQRLMALYVASHYKNSPNDLQMLSDAPAHHLFCLLPPVPPTQNALPEVLAVIQVCLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EVFLQRLMALYVASHYKNSPNDLQMLSDAPAHHLFCLLPPVPPTQNALPEVLAVIQVCLE
              460       470       480       490       500       510

            590       600       610       620       630       640
pF1KSD GEISRQSILNSLSRGKKASGDLIPWTVSEQFQDPDFGGLSGGRVVRIAVHPDYQGMGYGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 GEISRQSILNSLSRGKKASGDLIPWTVSEQFQDPDFGGLSGGRVVRIAVHPDYQGMGYGS
              520       530       540       550       560       570

            650       660       670       680       690       700
pF1KSD RALQLLQMYYEGRFPCLEEKVLETPQEIHTVSSEAVSLLEEVITPRKDLPPLLLKLNERP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RALQLLQMYYEGRFPCLEEKVLETPQEIHTVSSEAVSLLEEVITPRKDLPPLLLKLNERP
              580       590       600       610       620       630

            710       720       730       740       750       760
pF1KSD AERLDYLGVSYGLTPRLLKFWKRAGFVPVYLRQTPNDLTGEHSCIMLKTLTDEDEADQGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 AERLDYLGVSYGLTPRLLKFWKRAGFVPVYLRQTPNDLTGEHSCIMLKTLTDEDEADQGG
              640       650       660       670       680       690

            770       780       790       800       810       820
pF1KSD WLAAFWKDFRRRFLALLSYQFSTFSPSLALNIIQNRNMGKPAQPALSREELEALFLPYDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 WLAAFWKDFRRRFLALLSYQFSTFSPSLALNIIQNRNMGKPAQPALSREELEALFLPYDL
              700       710       720       730       740       750

            830       840       850       860       870       880
pF1KSD KRLEMYSRNMVDYHLIMDMIPAISRIYFLNQLGDLALSAAQSALLLGIGLQHKSVDQLEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 KRLEMYSRNMVDYHLIMDMIPAISRIYFLNQLGDLALSAAQSALLLGIGLQHKSVDQLEK
              760       770       780       790       800       810

            890       900       910       920       930       940
pF1KSD EIELPSGQLMGLFNRIIRKVVKLFNEVQEKAIEEQMVAAKDVVMEPTMKTLSDDLDEAAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EIELPSGQLMGLFNRIIRKVVKLFNEVQEKAIEEQMVAAKDVVMEPTMKTLSDDLDEAAK
              820       830       840       850       860       870

            950       960       970       980       990      1000
pF1KSD EFQEKHKKEVGKLKSMDLSEYIIRGDDEEWNEVLNKAGPNASIISLKSDKKRKLEAKQEP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EFQEKHKKEVGKLKSMDLSEYIIRGDDEEWNEVLNKAGPNASIISLKSDKKRKLEAKQEP
              880       890       900       910       920       930

           1010      1020
pF1KSD KQSKKLKNRETKNKKDMKLKRKK
       :::::::::::::::::::::::
CCDS44 KQSKKLKNRETKNKKDMKLKRKK
              940       950


1025 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 06:59:52 2016 done: Thu Nov 3 06:59:52 2016
 Total Scan time: 4.250 Total Display time: 0.070

Function used was FASTA [36.3.4 Apr, 2011]