FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4989, 261 aa
1>>>pF1KB4989 261 - 261 aa - 261 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6304+/-0.00107; mu= 14.3133+/- 0.066
mean_var=99.4695+/-19.630, 0's: 0 Z-trim(107.0): 50 B-trim: 299 in 1/48
Lambda= 0.128597
statistics sampled from 9283 (9314) to 9283 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.649), E-opt: 0.2 (0.286), width: 16
Scan time: 2.120
The best scores are: opt bits E(32554)
CCDS6251.1 CALB1 gene_id:793|Hs108|chr8 ( 261) 1717 328.8 2.4e-90
CCDS10899.1 CALB2 gene_id:794|Hs108|chr16 ( 271) 1034 202.1 3.4e-52
CCDS4561.1 SCGN gene_id:10590|Hs108|chr6 ( 276) 561 114.3 9e-26
>>CCDS6251.1 CALB1 gene_id:793|Hs108|chr8 (261 aa)
initn: 1717 init1: 1717 opt: 1717 Z-score: 1735.6 bits: 328.8 E(32554): 2.4e-90
Smith-Waterman score: 1717; 100.0% identity (100.0% similar) in 261 aa overlap (1-261:1-261)
10 20 30 40 50 60
pF1KB4 MAESHLQSSLITASQFFEIWLHFDADGSGYLEGKELQNLIQELQQARKKAGLELSPEMKT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS62 MAESHLQSSLITASQFFEIWLHFDADGSGYLEGKELQNLIQELQQARKKAGLELSPEMKT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 FVDQYGQRDDGKIGIVELAHVLPTEENFLLLFRCQQLKSCEEFMKTWRKYDTDHSGFIET
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS62 FVDQYGQRDDGKIGIVELAHVLPTEENFLLLFRCQQLKSCEEFMKTWRKYDTDHSGFIET
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 EELKNFLKDLLEKANKTVDDTKLAEYTDLMLKLFDSNNDGKLELTEMARLLPVQENFLLK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS62 EELKNFLKDLLEKANKTVDDTKLAEYTDLMLKLFDSNNDGKLELTEMARLLPVQENFLLK
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 FQGIKMCGKEFNKAFELYDQDGNGYIDENELDALLKDLCEKNKQDLDINNITTYKKNIMA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS62 FQGIKMCGKEFNKAFELYDQDGNGYIDENELDALLKDLCEKNKQDLDINNITTYKKNIMA
190 200 210 220 230 240
250 260
pF1KB4 LSDGGKLYRTDLALILCAGDN
:::::::::::::::::::::
CCDS62 LSDGGKLYRTDLALILCAGDN
250 260
>>CCDS10899.1 CALB2 gene_id:794|Hs108|chr16 (271 aa)
initn: 1227 init1: 702 opt: 1034 Z-score: 1050.5 bits: 202.1 E(32554): 3.4e-52
Smith-Waterman score: 1034; 58.7% identity (86.9% similar) in 259 aa overlap (5-258:10-267)
10 20 30 40 50
pF1KB4 MAESHLQSSLITASQFFEIWLHFDADGSGYLEGKELQNLIQELQQARKKAGL---
.:. . .:::::.::: ::::::.::.:::::.:..:::..::: .:.
CCDS10 MAGPQQQPPYLHLAELTASQFLEIWKHFDADGNGYIEGKELENFFQELEKARKGSGMMSK
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB4 --ELSPEMKTFVDQYGQRDDGKIGIVELAHVLPTEENFLLLFRCQQLKSCEEFMKTWRKY
... .:: :...: . .:::: ..:::..::::::::: :: :.. : :::..::::
CCDS10 SDNFGEKMKEFMQKYDKNSDGKIEMAELAQILPTEENFLLCFR-QHVGSSAEFMEAWRKY
70 80 90 100 110
120 130 140 150 160 170
pF1KB4 DTDHSGFIETEELKNFLKDLLEKANKTVDDTKLAEYTDLMLKLFDSNNDGKLELTEMARL
:::.::.::..:::.::.:::.:::. :. :: :::. .:..:: :.:::: :.::.::
CCDS10 DTDRSGYIEANELKGFLSDLLKKANRPYDEPKLQEYTQTILRMFDLNGDGKLGLSEMSRL
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB4 LPVQENFLLKFQGIKMCGKEFNKAFELYDQDGNGYIDENELDALLKDLCEKNKQDLDINN
:::::::::::::.:. ..::: : .::.: .:::::.::::::::: ::::....:..
CCDS10 LPVQENFLLKFQGMKLTSEEFNAIFTFYDKDRSGYIDEHELDALLKDLYEKNKKEMNIQQ
180 190 200 210 220 230
240 250 260
pF1KB4 ITTYKKNIMALSDGGKLYRTDLALILCAGDN
.:.:.:..:.:...::::: :: ..::.
CCDS10 LTNYRKSVMSLAEAGKLYRKDLEIVLCSEPPM
240 250 260 270
>>CCDS4561.1 SCGN gene_id:10590|Hs108|chr6 (276 aa)
initn: 352 init1: 352 opt: 561 Z-score: 576.2 bits: 114.3 E(32554): 9e-26
Smith-Waterman score: 561; 37.6% identity (66.5% similar) in 263 aa overlap (11-259:12-271)
10 20 30 40 50
pF1KB4 MAESHLQSSLITASQFFEIWLHFDADGSGYLEGKELQ----NLIQELQQARKKAGLELS
. :. :...: .:::: .::.: :::. .....: .:
CCDS45 MDSSREPTLGRLDAAGFWQVWQRFDADEKGYIEEKELDAFFLHMLMKLGTDDTVMKANLH
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB4 PEMKTFVDQYGQRDDGKIGIVELAHVLPTE-ENFLLLFRCQQ-LKSCEEFMKTWRKYDTD
. :. ::.: . ::: .. .: :::::::: .. : : :::. :::::.:
CCDS45 KVKQQFMTTQDASKDGRIRMKELAGMFLSEDENFLLLFRRENPLDSSVEFMQIWRKYDAD
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB4 HSGFIETEELKNFLKDLLEKANKTVDDTKLAEYTDLMLKLFDSNNDGKLELTEMARLLPV
:::: . ::.:::.::. . .:.....:: ::: :.:.:: :.::.:.:...::.: .
CCDS45 SSGFISAAELRNFLRDLFLHHKKAISEAKLEEYTGTMMKIFDRNKDGRLDLNDLARILAL
130 140 150 160 170 180
180 190 200 210 220
pF1KB4 QENFLLKFQGIKMCGKE-----FNKAFELYDQDGNGYIDENELDALLKDLCEKNKQDLDI
::::::.:. . :. : :.: : :: . .: .. :.:...::. : . ...
CCDS45 QENFLLQFK-MDACSTEERKRDFEKIFAYYDVSKTGALEGPEVDGFVKDMMELVQPSISG
190 200 210 220 230
230 240 250 260
pF1KB4 NNITTYKKNIMALSD---GGKLYRTDLALILCAGDN
.. ... .. : ::. ...:: :: :
CCDS45 VDLDKFREILLRHCDVNKDGKIQKSELA--LCLGLKINP
240 250 260 270
261 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 06:11:54 2016 done: Sat Nov 5 06:11:54 2016
Total Scan time: 2.120 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]