FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA1709, 1025 aa
1>>>pF1KSDA1709 1025 - 1025 aa - 1025 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9270+/-0.000984; mu= 18.7541+/- 0.059
mean_var=85.0197+/-17.195, 0's: 0 Z-trim(105.4): 16 B-trim: 0 in 0/51
Lambda= 0.139096
statistics sampled from 8385 (8394) to 8385 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.618), E-opt: 0.2 (0.258), width: 16
Scan time: 4.250
The best scores are:                                       opt bits E(32554)
CCDS7889.1 NAT10 gene_id:55226|Hs108|chr11         (1025) 6669 1349.1       0
CCDS44568.1 NAT10 gene_id:55226|Hs108|chr11        ( 953) 6194 1253.7       0
>>CCDS7889.1 NAT10 gene_id:55226|Hs108|chr11 (1025 aa)
initn: 6669 init1: 6669 opt: 6669 Z-score: 7228.4 bits: 1349.1 E(32554): 0
Smith-Waterman score: 6669; 99.9% identity (100.0% similar) in 1025 aa overlap (1-1025:1-1025)
10 20 30 40 50 60
pF1KSD MHRKKVDNRIRILIENGVAERQRSLFVVVGDRGKDQVVILHHMLSKATVKARPSVLWCYK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 MHRKKVDNRIRILIENGVAERQRSLFVVVGDRGKDQVVILHHMLSKATVKARPSVLWCYK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD KELGFSSHRKKRMRQLQKKIKNGTLNIKQDDPFELFIAATNIRYCYYNETHKILGNTFGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 KELGFSSHRKKRMRQLQKKIKNGTLNIKQDDPFELFIAATNIRYCYYNETHKILGNTFGM
70 80 90 100 110 120
130 140 150 160 170 180
pF1KSD CVLQDFEALTPNLLARTVETVEGGGLVVILLRTMNSLKQLYTVTMDVHSRYRTEAHQDVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 CVLQDFEALTPNLLARTVETVEGGGLVVILLRTMNSLKQLYTVTMDVHSRYRTEAHQDVV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KSD GRFNERFILSLASCKKCLVIDDQLNILPISSHVATMEALPPQTPDESLGPSDLELRELKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 GRFNERFILSLASCKKCLVIDDQLNILPISSHVATMEALPPQTPDESLGPSDLELRELKE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KSD SLQDTQPVGVLVDCCKTLDQAKAVLKFIEGISEKTLRSTVALTAARGRGKSAALGLAIAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SLQDTQPVGVLVDCCKTLDQAKAVLKFIEGISEKTLRSTVALTAARGRGKSAALGLAIAG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KSD AVAFGYSNIFVTSPSPDNLHTLFEFVFKGFDALQYQEHLDYEIIQSLNPEFNKAVIRVNV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 AVAFGYSNIFVTSPSPDNLHTLFEFVFKGFDALQYQEHLDYEIIQSLNPEFNKAVIRVNV
310 320 330 340 350 360
370 380 390 400 410 420
pF1KSD FREHRQTIQYIHPADAVKLGQAELVVIDEAAAIPLPLVKSLLGPYLVFMASTINGYEGTG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 FREHRQTIQYIHPADAVKLGQAELVVIDEAAAIPLPLVKSLLGPYLVFMASTINGYEGTG
370 380 390 400 410 420
430 440 450 460 470 480
pF1KSD RSLSLKLIQQLRQQSAQSQVSTTAENKTTTTARLASARTLHEVSLQESIRYAPGDAVEKW
::::::::::::::::::::::::::::::::::::::::.:::::::::::::::::::
CCDS78 RSLSLKLIQQLRQQSAQSQVSTTAENKTTTTARLASARTLYEVSLQESIRYAPGDAVEKW
430 440 450 460 470 480
490 500 510 520 530 540
pF1KSD LNDLLCLDCLNITRIVSGCPLPEACELYYVNRDTLFCYHKASEVFLQRLMALYVASHYKN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 LNDLLCLDCLNITRIVSGCPLPEACELYYVNRDTLFCYHKASEVFLQRLMALYVASHYKN
490 500 510 520 530 540
550 560 570 580 590 600
pF1KSD SPNDLQMLSDAPAHHLFCLLPPVPPTQNALPEVLAVIQVCLEGEISRQSILNSLSRGKKA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SPNDLQMLSDAPAHHLFCLLPPVPPTQNALPEVLAVIQVCLEGEISRQSILNSLSRGKKA
550 560 570 580 590 600
610 620 630 640 650 660
pF1KSD SGDLIPWTVSEQFQDPDFGGLSGGRVVRIAVHPDYQGMGYGSRALQLLQMYYEGRFPCLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SGDLIPWTVSEQFQDPDFGGLSGGRVVRIAVHPDYQGMGYGSRALQLLQMYYEGRFPCLE
610 620 630 640 650 660
670 680 690 700 710 720
pF1KSD EKVLETPQEIHTVSSEAVSLLEEVITPRKDLPPLLLKLNERPAERLDYLGVSYGLTPRLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 EKVLETPQEIHTVSSEAVSLLEEVITPRKDLPPLLLKLNERPAERLDYLGVSYGLTPRLL
670 680 690 700 710 720
730 740 750 760 770 780
pF1KSD KFWKRAGFVPVYLRQTPNDLTGEHSCIMLKTLTDEDEADQGGWLAAFWKDFRRRFLALLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 KFWKRAGFVPVYLRQTPNDLTGEHSCIMLKTLTDEDEADQGGWLAAFWKDFRRRFLALLS
730 740 750 760 770 780
790 800 810 820 830 840
pF1KSD YQFSTFSPSLALNIIQNRNMGKPAQPALSREELEALFLPYDLKRLEMYSRNMVDYHLIMD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 YQFSTFSPSLALNIIQNRNMGKPAQPALSREELEALFLPYDLKRLEMYSRNMVDYHLIMD
790 800 810 820 830 840
850 860 870 880 890 900
pF1KSD MIPAISRIYFLNQLGDLALSAAQSALLLGIGLQHKSVDQLEKEIELPSGQLMGLFNRIIR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 MIPAISRIYFLNQLGDLALSAAQSALLLGIGLQHKSVDQLEKEIELPSGQLMGLFNRIIR
850 860 870 880 890 900
910 920 930 940 950 960
pF1KSD KVVKLFNEVQEKAIEEQMVAAKDVVMEPTMKTLSDDLDEAAKEFQEKHKKEVGKLKSMDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 KVVKLFNEVQEKAIEEQMVAAKDVVMEPTMKTLSDDLDEAAKEFQEKHKKEVGKLKSMDL
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KSD SEYIIRGDDEEWNEVLNKAGPNASIISLKSDKKRKLEAKQEPKQSKKLKNRETKNKKDMK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SEYIIRGDDEEWNEVLNKAGPNASIISLKSDKKRKLEAKQEPKQSKKLKNRETKNKKDMK
970 980 990 1000 1010 1020
pF1KSD LKRKK
:::::
CCDS78 LKRKK
>>CCDS44568.1 NAT10 gene_id:55226|Hs108|chr11 (953 aa)
initn: 6194 init1: 6194 opt: 6194 Z-score: 6713.7 bits: 1253.7 E(32554): 0
Smith-Waterman score: 6194; 99.9% identity (100.0% similar) in 953 aa overlap (73-1025:1-953)
             50        60        70        80        90       100
pF1KSD MLSKATVKARPSVLWCYKKELGFSSHRKKRMRQLQKKIKNGTLNIKQDDPFELFIAATNI
                                     ::::::::::::::::::::::::::::::
CCDS44                               MRQLQKKIKNGTLNIKQDDPFELFIAATNI
                                             10        20        30
110 120 130 140 150 160
pF1KSD RYCYYNETHKILGNTFGMCVLQDFEALTPNLLARTVETVEGGGLVVILLRTMNSLKQLYT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RYCYYNETHKILGNTFGMCVLQDFEALTPNLLARTVETVEGGGLVVILLRTMNSLKQLYT
40 50 60 70 80 90
170 180 190 200 210 220
pF1KSD VTMDVHSRYRTEAHQDVVGRFNERFILSLASCKKCLVIDDQLNILPISSHVATMEALPPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VTMDVHSRYRTEAHQDVVGRFNERFILSLASCKKCLVIDDQLNILPISSHVATMEALPPQ
100 110 120 130 140 150
230 240 250 260 270 280
pF1KSD TPDESLGPSDLELRELKESLQDTQPVGVLVDCCKTLDQAKAVLKFIEGISEKTLRSTVAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TPDESLGPSDLELRELKESLQDTQPVGVLVDCCKTLDQAKAVLKFIEGISEKTLRSTVAL
160 170 180 190 200 210
290 300 310 320 330 340
pF1KSD TAARGRGKSAALGLAIAGAVAFGYSNIFVTSPSPDNLHTLFEFVFKGFDALQYQEHLDYE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TAARGRGKSAALGLAIAGAVAFGYSNIFVTSPSPDNLHTLFEFVFKGFDALQYQEHLDYE
220 230 240 250 260 270
350 360 370 380 390 400
pF1KSD IIQSLNPEFNKAVIRVNVFREHRQTIQYIHPADAVKLGQAELVVIDEAAAIPLPLVKSLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 IIQSLNPEFNKAVIRVNVFREHRQTIQYIHPADAVKLGQAELVVIDEAAAIPLPLVKSLL
280 290 300 310 320 330
410 420 430 440 450 460
pF1KSD GPYLVFMASTINGYEGTGRSLSLKLIQQLRQQSAQSQVSTTAENKTTTTARLASARTLHE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::.:
CCDS44 GPYLVFMASTINGYEGTGRSLSLKLIQQLRQQSAQSQVSTTAENKTTTTARLASARTLYE
340 350 360 370 380 390
470 480 490 500 510 520
pF1KSD VSLQESIRYAPGDAVEKWLNDLLCLDCLNITRIVSGCPLPEACELYYVNRDTLFCYHKAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VSLQESIRYAPGDAVEKWLNDLLCLDCLNITRIVSGCPLPEACELYYVNRDTLFCYHKAS
400 410 420 430 440 450
530 540 550 560 570 580
pF1KSD EVFLQRLMALYVASHYKNSPNDLQMLSDAPAHHLFCLLPPVPPTQNALPEVLAVIQVCLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EVFLQRLMALYVASHYKNSPNDLQMLSDAPAHHLFCLLPPVPPTQNALPEVLAVIQVCLE
460 470 480 490 500 510
590 600 610 620 630 640
pF1KSD GEISRQSILNSLSRGKKASGDLIPWTVSEQFQDPDFGGLSGGRVVRIAVHPDYQGMGYGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 GEISRQSILNSLSRGKKASGDLIPWTVSEQFQDPDFGGLSGGRVVRIAVHPDYQGMGYGS
520 530 540 550 560 570
650 660 670 680 690 700
pF1KSD RALQLLQMYYEGRFPCLEEKVLETPQEIHTVSSEAVSLLEEVITPRKDLPPLLLKLNERP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RALQLLQMYYEGRFPCLEEKVLETPQEIHTVSSEAVSLLEEVITPRKDLPPLLLKLNERP
580 590 600 610 620 630
710 720 730 740 750 760
pF1KSD AERLDYLGVSYGLTPRLLKFWKRAGFVPVYLRQTPNDLTGEHSCIMLKTLTDEDEADQGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 AERLDYLGVSYGLTPRLLKFWKRAGFVPVYLRQTPNDLTGEHSCIMLKTLTDEDEADQGG
640 650 660 670 680 690
770 780 790 800 810 820
pF1KSD WLAAFWKDFRRRFLALLSYQFSTFSPSLALNIIQNRNMGKPAQPALSREELEALFLPYDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 WLAAFWKDFRRRFLALLSYQFSTFSPSLALNIIQNRNMGKPAQPALSREELEALFLPYDL
700 710 720 730 740 750
830 840 850 860 870 880
pF1KSD KRLEMYSRNMVDYHLIMDMIPAISRIYFLNQLGDLALSAAQSALLLGIGLQHKSVDQLEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 KRLEMYSRNMVDYHLIMDMIPAISRIYFLNQLGDLALSAAQSALLLGIGLQHKSVDQLEK
760 770 780 790 800 810
890 900 910 920 930 940
pF1KSD EIELPSGQLMGLFNRIIRKVVKLFNEVQEKAIEEQMVAAKDVVMEPTMKTLSDDLDEAAK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EIELPSGQLMGLFNRIIRKVVKLFNEVQEKAIEEQMVAAKDVVMEPTMKTLSDDLDEAAK
820 830 840 850 860 870
950 960 970 980 990 1000
pF1KSD EFQEKHKKEVGKLKSMDLSEYIIRGDDEEWNEVLNKAGPNASIISLKSDKKRKLEAKQEP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EFQEKHKKEVGKLKSMDLSEYIIRGDDEEWNEVLNKAGPNASIISLKSDKKRKLEAKQEP
880 890 900 910 920 930
1010 1020
pF1KSD KQSKKLKNRETKNKKDMKLKRKK
:::::::::::::::::::::::
CCDS44 KQSKKLKNRETKNKKDMKLKRKK
940 950
1025 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 06:59:52 2016 done: Thu Nov 3 06:59:52 2016
Total Scan time: 4.250 Total Display time: 0.070
Function used was FASTA [36.3.4 Apr, 2011]