FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4604, 303 aa
1>>>pF1KB4604 303 - 303 aa - 303 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2752+/-0.000846; mu= 16.7912+/- 0.051
mean_var=91.4351+/-17.357, 0's: 0 Z-trim(109.1): 34 B-trim: 9 in 2/50
Lambda= 0.134128
statistics sampled from 10640 (10664) to 10640 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.71), E-opt: 0.2 (0.328), width: 16
Scan time: 2.550
The best scores are:                                       opt  bits E(32554)
CCDS4318.1 SPARC gene_id:6678|Hs108|chr5           ( 303) 2131 422.2 2.4e-118
CCDS77939.1 SPARCL1 gene_id:8404|Hs108|chr4        ( 539) 1110 224.9 1.1e-58
CCDS3622.1 SPARCL1 gene_id:8404|Hs108|chr4         ( 664) 1110 225.0 1.2e-58
>>CCDS4318.1 SPARC gene_id:6678|Hs108|chr5 (303 aa)
initn: 2131 init1: 2131 opt: 2131 Z-score: 2238.2 bits: 422.2 E(32554): 2.4e-118
Smith-Waterman score: 2131; 100.0% identity (100.0% similar) in 303 aa overlap (1-303:1-303)
               10        20        30        40        50        60
pF1KB4 MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEVSVGANPVQVEVGEFDDGAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEVSVGANPVQVEVGEFDDGAE
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB4 ETEEEVVAENPCQNHHCKHGKVCELDENNTPMCVCQDPTSCPAPIGEFEKVCSNDNKTFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ETEEEVVAENPCQNHHCKHGKVCELDENNTPMCVCQDPTSCPAPIGEFEKVCSNDNKTFD
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB4 SSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLDSELTEFPLRMRDWLKNVLVTLYE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 SSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLDSELTEFPLRMRDWLKNVLVTLYE
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB4 RDEDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELLARDFEKNYNMYIFPVHWQFGQLDQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RDEDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELLARDFEKNYNMYIFPVHWQFGQLDQ
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB4 HPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLDNDKYIALDEWAGCFGIKQKDIDKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 HPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLDNDKYIALDEWAGCFGIKQKDIDKD
              250       260       270       280       290       300
pF1KB4 LVI
       :::
CCDS43 LVI
>>CCDS77939.1 SPARCL1 gene_id:8404|Hs108|chr4 (539 aa)
initn: 971 init1: 612 opt: 1110 Z-score: 1167.3 bits: 224.9 E(32554): 1.1e-58
Smith-Waterman score: 1110; 52.7% identity (78.0% similar) in 296 aa overlap (12-302:246-538)
10 20 30 40
pF1KB4 MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEV
: :: . . . .. : :.:. .. .
CCDS77 GDDDGDDGGTDGPRHSASDDYFIPSQAFLEAERAQSIAYHLKIEEQREKVHEN-ENIGTT
220 230 240 250 260 270
50 60 70 80 90
pF1KB4 SVGANPVQVEVGEFDDGAEETEEE----VVAENPCQNHHCKHGKVCELDENNTPMCVCQD
: . ... .: ... ::: : : : . :.. .::.:..:. :... : :::::
CCDS77 EPGEHQ-EAKKAENSSNEEETSSEGNMRVHAVDSCMSFQCKRGHICKADQQGKPHCVCQD
280 290 300 310 320 330
100 110 120 130 140 150
pF1KB4 PTSCPAPIGEFEKVCSNDNKTFDSSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLD
:..:: : ...::..::.:. ::::.::::: ::::::::.:.:::.: :: :: : :
CCDS77 PVTCP-PTKPLDQVCGTDNQTYASSCHLFATKCRLEGTKKGHQLQLDYFGACKSIPTCTD
340 350 360 370 380 390
160 170 180 190 200 210
pF1KB4 SELTEFPLRMRDWLKNVLVTLYERD-EDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELL
:. .:::::::::::.:. ::: . : . :.:::. .::::. .:::: :::::..::
CCDS77 FEVIQFPLRMRDWLKNILMQLYEANSEHAGYLNEKQRNKVKKIYLDEKRLLAGDHPIDLL
400 410 420 430 440 450
220 230 240 250 260 270
pF1KB4 ARDFEKNYNMYIFPVHWQFGQLDQHPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLD
:::.:::.::..::::::..:::::.: :.:.::::::: :.::::: ::::: :: .
CCDS77 LRDFKKNYHMYVYPVHWQFSELDQHPMDRVLTHSELAPLRASLVPMEHCITRFFEECDPN
460 470 480 490 500 510
280 290 300
pF1KB4 NDKYIALDEWAGCFGIKQKDIDKDLVI
.::.:.: ::. :::::..:::..:.
CCDS77 KDKHITLKEWGHCFGIKEEDIDENLLF
520 530
>>CCDS3622.1 SPARCL1 gene_id:8404|Hs108|chr4 (664 aa)
initn: 1002 init1: 612 opt: 1110 Z-score: 1166.1 bits: 225.0 E(32554): 1.2e-58
Smith-Waterman score: 1110; 52.7% identity (78.0% similar) in 296 aa overlap (12-302:371-663)
10 20 30 40
pF1KB4 MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEV
: :: . . . .. : :.:. .. .
CCDS36 GDDDGDDGGTDGPRHSASDDYFIPSQAFLEAERAQSIAYHLKIEEQREKVHEN-ENIGTT
350 360 370 380 390
50 60 70 80 90
pF1KB4 SVGANPVQVEVGEFDDGAEETEEE----VVAENPCQNHHCKHGKVCELDENNTPMCVCQD
: . ... .: ... ::: : : : . :.. .::.:..:. :... : :::::
CCDS36 EPGEHQ-EAKKAENSSNEEETSSEGNMRVHAVDSCMSFQCKRGHICKADQQGKPHCVCQD
400 410 420 430 440 450
100 110 120 130 140 150
pF1KB4 PTSCPAPIGEFEKVCSNDNKTFDSSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLD
:..:: : ...::..::.:. ::::.::::: ::::::::.:.:::.: :: :: : :
CCDS36 PVTCP-PTKPLDQVCGTDNQTYASSCHLFATKCRLEGTKKGHQLQLDYFGACKSIPTCTD
460 470 480 490 500 510
160 170 180 190 200 210
pF1KB4 SELTEFPLRMRDWLKNVLVTLYERD-EDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELL
:. .:::::::::::.:. ::: . : . :.:::. .::::. .:::: :::::..::
CCDS36 FEVIQFPLRMRDWLKNILMQLYEANSEHAGYLNEKQRNKVKKIYLDEKRLLAGDHPIDLL
520 530 540 550 560 570
220 230 240 250 260 270
pF1KB4 ARDFEKNYNMYIFPVHWQFGQLDQHPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLD
:::.:::.::..::::::..:::::.: :.:.::::::: :.::::: ::::: :: .
CCDS36 LRDFKKNYHMYVYPVHWQFSELDQHPMDRVLTHSELAPLRASLVPMEHCITRFFEECDPN
580 590 600 610 620 630
280 290 300
pF1KB4 NDKYIALDEWAGCFGIKQKDIDKDLVI
.::.:.: ::. :::::..:::..:.
CCDS36 KDKHITLKEWGHCFGIKEEDIDENLLF
640 650 660
303 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 03:04:06 2016 done: Fri Nov 4 03:04:07 2016
Total Scan time: 2.550 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
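
A minimal sketch, assuming Python, for pulling the fields out of the "Smith-Waterman score" summary lines in a report like the one above. The helper name parse_sw_line and the dictionary keys are hypothetical, chosen for illustration; they are not part of the FASTA package. The pattern is written against the line layout seen in this report only and may need adjusting for other FASTA versions or output options.

```python
import re

# Regex for the summary line printed under each ">>" hit header.
# Assumption: the layout matches the lines in this report, e.g.
# "Smith-Waterman score: 1110; 52.7% identity (78.0% similar) in 296 aa overlap (12-302:246-538)"
SW_LINE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*"
    r"(?P<identity>[\d.]+)% identity\s*"
    r"\((?P<similar>[\d.]+)% similar\)\s*in\s*"
    r"(?P<overlap>\d+) aa overlap\s*"
    r"\((?P<q_start>\d+)-(?P<q_end>\d+):(?P<s_start>\d+)-(?P<s_end>\d+)\)"
)


def parse_sw_line(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SW_LINE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    return {
        "score": int(fields["score"]),
        "identity": float(fields["identity"]),   # percent identity
        "similar": float(fields["similar"]),     # percent similarity
        "overlap": int(fields["overlap"]),       # alignment columns
        "q_start": int(fields["q_start"]),       # query coordinates
        "q_end": int(fields["q_end"]),
        "s_start": int(fields["s_start"]),       # library (subject) coordinates
        "s_end": int(fields["s_end"]),
    }


example = ("Smith-Waterman score: 1110; 52.7% identity (78.0% similar) "
           "in 296 aa overlap (12-302:246-538)")
hit = parse_sw_line(example)
# Number of query residues covered by the alignment (end - start + 1).
query_span = hit["q_end"] - hit["q_start"] + 1
print(hit["score"], hit["identity"], query_span)
```

For the SPARCL1 line used here, the query span is 291 residues against 296 overlap columns; the difference corresponds to the gap characters shown in the query rows of that alignment.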