FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB4604, 303 aa
  1>>>pF1KB4604 303 - 303 aa - 303 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2752+/-0.000846; mu= 16.7912+/- 0.051
 mean_var=91.4351+/-17.357, 0's: 0 Z-trim(109.1): 34 B-trim: 9 in 2/50
 Lambda= 0.134128
 statistics sampled from 10640 (10664) to 10640 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.71), E-opt: 0.2 (0.328), width: 16
 Scan time: 2.550

The best scores are:                                 opt bits  E(32554)
CCDS4318.1 SPARC gene_id:6678|Hs108|chr5     ( 303) 2131 422.2 2.4e-118
CCDS77939.1 SPARCL1 gene_id:8404|Hs108|chr4  ( 539) 1110 224.9 1.1e-58
CCDS3622.1 SPARCL1 gene_id:8404|Hs108|chr4   ( 664) 1110 225.0 1.2e-58

>>CCDS4318.1 SPARC gene_id:6678|Hs108|chr5 (303 aa)
 initn: 2131 init1: 2131 opt: 2131 Z-score: 2238.2 bits: 422.2 E(32554): 2.4e-118
Smith-Waterman score: 2131; 100.0% identity (100.0% similar) in 303 aa overlap (1-303:1-303)

               10        20        30        40        50        60
pF1KB4 MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEVSVGANPVQVEVGEFDDGAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEVSVGANPVQVEVGEFDDGAE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB4 ETEEEVVAENPCQNHHCKHGKVCELDENNTPMCVCQDPTSCPAPIGEFEKVCSNDNKTFD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ETEEEVVAENPCQNHHCKHGKVCELDENNTPMCVCQDPTSCPAPIGEFEKVCSNDNKTFD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB4 SSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLDSELTEFPLRMRDWLKNVLVTLYE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 SSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLDSELTEFPLRMRDWLKNVLVTLYE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB4 RDEDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELLARDFEKNYNMYIFPVHWQFGQLDQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RDEDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELLARDFEKNYNMYIFPVHWQFGQLDQ
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB4 HPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLDNDKYIALDEWAGCFGIKQKDIDKD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 HPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLDNDKYIALDEWAGCFGIKQKDIDKD
              250       260       270       280       290       300

pF1KB4 LVI
       :::
CCDS43 LVI

>>CCDS77939.1 SPARCL1 gene_id:8404|Hs108|chr4 (539 aa)
 initn: 971 init1: 612 opt: 1110 Z-score: 1167.3 bits: 224.9 E(32554): 1.1e-58
Smith-Waterman score: 1110; 52.7% identity (78.0% similar) in 296 aa overlap (12-302:246-538)

                                  10        20        30        40
pF1KB4                    MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEV
                                     : :: .   .  . .. : :.:.  ..  .
CCDS77 GDDDGDDGGTDGPRHSASDDYFIPSQAFLEAERAQSIAYHLKIEEQREKVHEN-ENIGTT
         220       230       240       250       260        270

              50        60            70        80        90
pF1KB4 SVGANPVQVEVGEFDDGAEETEEE----VVAENPCQNHHCKHGKVCELDENNTPMCVCQD
         : .  ... .: ... :::  :    : : . :.. .::.:..:. :... : :::::
CCDS77 EPGEHQ-EAKKAENSSNEEETSSEGNMRVHAVDSCMSFQCKRGHICKADQQGKPHCVCQD
          280        290       300       310       320       330

       100       110       120       130       140       150
pF1KB4 PTSCPAPIGEFEKVCSNDNKTFDSSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLD
       :..:: :   ...::..::.:. ::::.::::: ::::::::.:.:::.: :: :: : :
CCDS77 PVTCP-PTKPLDQVCGTDNQTYASSCHLFATKCRLEGTKKGHQLQLDYFGACKSIPTCTD
            340       350       360       370       380       390

       160       170       180        190       200       210
pF1KB4 SELTEFPLRMRDWLKNVLVTLYERD-EDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELL
        :. .:::::::::::.:. ::: . :  . :.:::. .::::. .:::: :::::..::
CCDS77 FEVIQFPLRMRDWLKNILMQLYEANSEHAGYLNEKQRNKVKKIYLDEKRLLAGDHPIDLL
            400       410       420       430       440       450

        220       230       240       250       260       270
pF1KB4 ARDFEKNYNMYIFPVHWQFGQLDQHPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLD
        :::.:::.::..::::::..:::::.:  :.:.::::::: :.::::: ::::: :: .
CCDS77 LRDFKKNYHMYVYPVHWQFSELDQHPMDRVLTHSELAPLRASLVPMEHCITRFFEECDPN
            460       470       480       490       500       510

        280       290       300
pF1KB4 NDKYIALDEWAGCFGIKQKDIDKDLVI
       .::.:.: ::. :::::..:::..:.
CCDS77 KDKHITLKEWGHCFGIKEEDIDENLLF
            520       530

>>CCDS3622.1 SPARCL1 gene_id:8404|Hs108|chr4 (664 aa)
 initn: 1002 init1: 612 opt: 1110 Z-score: 1166.1 bits: 225.0 E(32554): 1.2e-58
Smith-Waterman score: 1110; 52.7% identity (78.0% similar) in 296 aa overlap (12-302:371-663)

                                  10        20        30        40
pF1KB4                    MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEV
                                     : :: .   .  . .. : :.:.  ..  .
CCDS36 GDDDGDDGGTDGPRHSASDDYFIPSQAFLEAERAQSIAYHLKIEEQREKVHEN-ENIGTT
              350       360       370       380       390

              50        60            70        80        90
pF1KB4 SVGANPVQVEVGEFDDGAEETEEE----VVAENPCQNHHCKHGKVCELDENNTPMCVCQD
         : .  ... .: ... :::  :    : : . :.. .::.:..:. :... : :::::
CCDS36 EPGEHQ-EAKKAENSSNEEETSSEGNMRVHAVDSCMSFQCKRGHICKADQQGKPHCVCQD
     400        410       420       430       440       450

       100       110       120       130       140       150
pF1KB4 PTSCPAPIGEFEKVCSNDNKTFDSSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLD
       :..:: :   ...::..::.:. ::::.::::: ::::::::.:.:::.: :: :: : :
CCDS36 PVTCP-PTKPLDQVCGTDNQTYASSCHLFATKCRLEGTKKGHQLQLDYFGACKSIPTCTD
      460        470       480       490       500       510

       160       170       180        190       200       210
pF1KB4 SELTEFPLRMRDWLKNVLVTLYERD-EDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELL
        :. .:::::::::::.:. ::: . :  . :.:::. .::::. .:::: :::::..::
CCDS36 FEVIQFPLRMRDWLKNILMQLYEANSEHAGYLNEKQRNKVKKIYLDEKRLLAGDHPIDLL
       520       530       540       550       560       570

        220       230       240       250       260       270
pF1KB4 ARDFEKNYNMYIFPVHWQFGQLDQHPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLD
        :::.:::.::..::::::..:::::.:  :.:.::::::: :.::::: ::::: :: .
CCDS36 LRDFKKNYHMYVYPVHWQFSELDQHPMDRVLTHSELAPLRASLVPMEHCITRFFEECDPN
       580       590       600       610       620       630

        280       290       300
pF1KB4 NDKYIALDEWAGCFGIKQKDIDKDLVI
       .::.:.: ::. :::::..:::..:.
CCDS36 KDKHITLKEWGHCFGIKEEDIDENLLF
       640       650       660

303 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 03:04:06 2016 done: Fri Nov 4 03:04:07 2016
 Total Scan time: 2.550 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]