FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9851, 380 aa
1>>>pF1KB9851 380 - 380 aa - 380 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.7158+/-0.000847; mu= 16.0503+/- 0.051
mean_var=84.7201+/-16.819, 0's: 0 Z-trim(107.8): 12 B-trim: 0 in 0/52
Lambda= 0.139342
statistics sampled from 9824 (9829) to 9824 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.672), E-opt: 0.2 (0.302), width: 16
Scan time: 2.440
The best scores are: opt bits E(32554)
CCDS8010.1 FEN1 gene_id:2237|Hs108|chr11 ( 380) 2484 509.0 2.8e-144
CCDS44336.1 EXO1 gene_id:9156|Hs108|chr1 ( 803) 371 84.4 3.8e-16
CCDS1620.1 EXO1 gene_id:9156|Hs108|chr1 ( 846) 371 84.5 3.9e-16
>>CCDS8010.1 FEN1 gene_id:2237|Hs108|chr11 (380 aa)
initn: 2484 init1: 2484 opt: 2484 Z-score: 2703.7 bits: 509.0 E(32554): 2.8e-144
Smith-Waterman score: 2484; 100.0% identity (100.0% similar) in 380 aa overlap (1-380:1-380)
10 20 30 40 50 60
pF1KB9 MGIQGLAKLIADVAPSAIRENDIKSYFGRKVAIDASMSIYQFLIAVRQGGDVLQNEEGET
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 MGIQGLAKLIADVAPSAIRENDIKSYFGRKVAIDASMSIYQFLIAVRQGGDVLQNEEGET
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 TSHLMGMFYRTIRMMENGIKPVYVFDGKPPQLKSGELAKRSERRAEAEKQLQQAQAAGAE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 TSHLMGMFYRTIRMMENGIKPVYVFDGKPPQLKSGELAKRSERRAEAEKQLQQAQAAGAE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 QEVEKFTKRLVKVTKQHNDECKHLLSLMGIPYLDAPSEAEASCAALVKAGKVYAAATEDM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 QEVEKFTKRLVKVTKQHNDECKHLLSLMGIPYLDAPSEAEASCAALVKAGKVYAAATEDM
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 DCLTFGSPVLMRHLTASEAKKLPIQEFHLSRILQELGLNQEQFVDLCILLGSDYCESIRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 DCLTFGSPVLMRHLTASEAKKLPIQEFHLSRILQELGLNQEQFVDLCILLGSDYCESIRG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 IGPKRAVDLIQKHKSIEEIVRRLDPNKYPVPENWLHKEAHQLFLEPEVLDPESVELKWSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 IGPKRAVDLIQKHKSIEEIVRRLDPNKYPVPENWLHKEAHQLFLEPEVLDPESVELKWSE
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 PNEEELIKFMCGEKQFSEERIRSGVKRLSKSRQGSTQGRLDDFFKVTGSLSSAKRKEPEP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS80 PNEEELIKFMCGEKQFSEERIRSGVKRLSKSRQGSTQGRLDDFFKVTGSLSSAKRKEPEP
310 320 330 340 350 360
370 380
pF1KB9 KGSTKKKAKTGAAGKFKRGK
::::::::::::::::::::
CCDS80 KGSTKKKAKTGAAGKFKRGK
370 380
>>CCDS44336.1 EXO1 gene_id:9156|Hs108|chr1 (803 aa)
initn: 255 init1: 221 opt: 371 Z-score: 403.4 bits: 84.4 E(32554): 3.8e-16
Smith-Waterman score: 371; 29.6% identity (59.1% similar) in 301 aa overlap (1-291:1-290)
10 20 30 40 50 60
pF1KB9 MGIQGLAKLIADVAPSAIRENDIKSYFGRKVAIDASMSIYQFLIAVRQGGDVLQNEEGET
:::::: ..: . : . ...: :. ::.:. ... :: . . .::
CCDS44 MGIQGLLQFIKE----ASEPIHVRKYKGQVVAVDTYCWLHKGAIACAE-----KLAKGEP
10 20 30 40 50
70 80 90 100 110
pF1KB9 TSHLMGMFYRTIRMM-ENGIKPVYVFDGKP-PQLKSGELAKRSERRAEAEKQLQQAQAAG
:.. .:. .. . :. .::::. :::: :. : : ..: .:.:. : : . .
CCDS44 TDRYVGFCMKFVNMLLSHGIKPILVFDGCTLPSKKEVERSRRERRQANLLKGKQLLREGK
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB9 AEQEVEKFTKRLVKVTKQHNDECKHLLSLMGIPYLDAPSEAEASCAALVKAGKVYAAATE
. . : :: : ...:. . . .:. : :: ::.:. : : ::: : : ::
CCDS44 VSEARECFT-RSINITHAMAHKVIKAARSQGVDCLVAPYEADAQLAYLNKAGIVQAIITE
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB9 DMDCLTFG-SPVLMRHLTASEAKKLPIQEFHLSRILQELGLNQEQFVDLCILLGSDYCES
: : :.:: . :... ... .. .. . : : .. ...:.: .::: : :: :
CCDS44 DSDLLAFGCKKVILKMDQFGNGLEIDQARLGMCRQLGDV-FTEEKFRYMCILSGCDYLSS
180 190 200 210 220
240 250 260 270 280 290
pF1KB9 IRGIGPKRAVDLIQ--KHKSIEEIVRRLD---PNKYPVPENWLHK--EAHQLFLEPEVLD
.:::: .: ... .. .: ...... . :::.... .:.. :: :.:
CCDS44 LRGIGLAKACKVLRLANNPDIVKVIKKIGHYLKMNITVPEDYINGFIRANNTFLYQLVFD
230 240 250 260 270 280
300 310 320 330 340 350
pF1KB9 PESVELKWSEPNEEELIKFMCGEKQFSEERIRSGVKRLSKSRQGSTQGRLDDFFKVTGSL
:
CCDS44 PIKRKLIPLNAYEDDVDPETLSYAGQYVDDSIALQIALGNKDINTFEQIDDYNPDTAMPA
290 300 310 320 330 340
>>CCDS1620.1 EXO1 gene_id:9156|Hs108|chr1 (846 aa)
initn: 255 init1: 221 opt: 371 Z-score: 403.1 bits: 84.5 E(32554): 3.9e-16
Smith-Waterman score: 371; 29.6% identity (59.1% similar) in 301 aa overlap (1-291:1-290)
10 20 30 40 50 60
pF1KB9 MGIQGLAKLIADVAPSAIRENDIKSYFGRKVAIDASMSIYQFLIAVRQGGDVLQNEEGET
:::::: ..: . : . ...: :. ::.:. ... :: . . .::
CCDS16 MGIQGLLQFIKE----ASEPIHVRKYKGQVVAVDTYCWLHKGAIACAE-----KLAKGEP
10 20 30 40 50
70 80 90 100 110
pF1KB9 TSHLMGMFYRTIRMM-ENGIKPVYVFDGKP-PQLKSGELAKRSERRAEAEKQLQQAQAAG
:.. .:. .. . :. .::::. :::: :. : : ..: .:.:. : : . .
CCDS16 TDRYVGFCMKFVNMLLSHGIKPILVFDGCTLPSKKEVERSRRERRQANLLKGKQLLREGK
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB9 AEQEVEKFTKRLVKVTKQHNDECKHLLSLMGIPYLDAPSEAEASCAALVKAGKVYAAATE
. . : :: : ...:. . . .:. : :: ::.:. : : ::: : : ::
CCDS16 VSEARECFT-RSINITHAMAHKVIKAARSQGVDCLVAPYEADAQLAYLNKAGIVQAIITE
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB9 DMDCLTFG-SPVLMRHLTASEAKKLPIQEFHLSRILQELGLNQEQFVDLCILLGSDYCES
: : :.:: . :... ... .. .. . : : .. ...:.: .::: : :: :
CCDS16 DSDLLAFGCKKVILKMDQFGNGLEIDQARLGMCRQLGDV-FTEEKFRYMCILSGCDYLSS
180 190 200 210 220
240 250 260 270 280 290
pF1KB9 IRGIGPKRAVDLIQ--KHKSIEEIVRRLD---PNKYPVPENWLHK--EAHQLFLEPEVLD
.:::: .: ... .. .: ...... . :::.... .:.. :: :.:
CCDS16 LRGIGLAKACKVLRLANNPDIVKVIKKIGHYLKMNITVPEDYINGFIRANNTFLYQLVFD
230 240 250 260 270 280
300 310 320 330 340 350
pF1KB9 PESVELKWSEPNEEELIKFMCGEKQFSEERIRSGVKRLSKSRQGSTQGRLDDFFKVTGSL
:
CCDS16 PIKRKLIPLNAYEDDVDPETLSYAGQYVDDSIALQIALGNKDINTFEQIDDYNPDTAMPA
290 300 310 320 330 340
380 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 19:46:16 2016 done: Fri Nov 4 19:46:16 2016
Total Scan time: 2.440 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]