FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KA0247, 303 aa 1>>>pF1KA0247 303 - 303 aa - 303 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.7571+/-0.000692; mu= 14.9485+/- 0.042 mean_var=92.9491+/-17.837, 0's: 0 Z-trim(112.8): 31 B-trim: 6 in 1/50 Lambda= 0.133031 statistics sampled from 13465 (13492) to 13465 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.764), E-opt: 0.2 (0.414), width: 16 Scan time: 1.940 The best scores are: opt bits E(32554) CCDS9796.1 SUSD6 gene_id:9766|Hs108|chr14 ( 303) 2044 401.7 3.6e-112 CCDS41471.1 SUSD4 gene_id:55061|Hs108|chr1 ( 490) 448 95.6 8.3e-20 >>CCDS9796.1 SUSD6 gene_id:9766|Hs108|chr14 (303 aa) initn: 2044 init1: 2044 opt: 2044 Z-score: 2127.4 bits: 401.7 E(32554): 3.6e-112 Smith-Waterman score: 2044; 100.0% identity (100.0% similar) in 303 aa overlap (1-303:1-303) 10 20 30 40 50 60 pF1KA0 MCHGRIAPKSTSVFAVASVGHGVFLPLVILCTLLGDGLASVCPLPPEPENGGYICHPRPC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 MCHGRIAPKSTSVFAVASVGHGVFLPLVILCTLLGDGLASVCPLPPEPENGGYICHPRPC 10 20 30 40 50 60 70 80 90 100 110 120 pF1KA0 RDPLTAGSVIEYLCAEGYMLKGDYKYLTCKNGEWKPAMEISCRLNEDKDTHTSLGVPTLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 RDPLTAGSVIEYLCAEGYMLKGDYKYLTCKNGEWKPAMEISCRLNEDKDTHTSLGVPTLS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KA0 IVASTASSVALILLLVVLFVLLQPKLKSFHHSRRDQGVSGDQVSIMVDGVQVALPSYEEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 IVASTASSVALILLLVVLFVLLQPKLKSFHHSRRDQGVSGDQVSIMVDGVQVALPSYEEA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KA0 VYGSSGHCVPPADPRVQIVLSEGSGPSGRSVPREQQLPDQGACSSAGGEDEAPGQSGLCE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 VYGSSGHCVPPADPRVQIVLSEGSGPSGRSVPREQQLPDQGACSSAGGEDEAPGQSGLCE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KA0 AWGSRASETVMVHQATTSSWVAGSGNRQLAHKETADSENSDIQSLLSLTSEEYTDDIPLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS97 AWGSRASETVMVHQATTSSWVAGSGNRQLAHKETADSENSDIQSLLSLTSEEYTDDIPLL 250 260 270 280 290 300 pF1KA0 KEA ::: CCDS97 KEA >>CCDS41471.1 SUSD4 gene_id:55061|Hs108|chr1 (490 aa) initn: 316 init1: 137 opt: 448 Z-score: 469.1 bits: 95.6 E(32554): 8.3e-20 Smith-Waterman score: 466; 35.3% identity (59.8% similar) in 266 aa overlap (41-302:242-488) 20 30 40 50 60 70 pF1KA0 TSVFAVASVGHGVFLPLVILCTLLGDGLASVCPLPPEPENGGYICHPRPCRDPLTAGSVI :::::: .: ..:::::: . . :.:. CCDS41 PGFKLDGSAYLECLQNLIWSSSPPRCLALEVCPLPPMVSHGDFVCHPRPC-ERYNHGTVV 220 230 240 250 260 270 80 90 100 110 120 pF1KA0 EYLCAEGYMLKGDYKYLTCKNGEWKPAMEISCRLNEDK--DTHTSLGVPTLSIVASTASS :. : :: : .::::.::. ::: :.... : .:. .:: .: . : .::: ::.: CCDS41 EFYCDPGYSLTSDYKYITCQYGEWFPSYQVYCIKSEQTWPSTHETL-LTTWKIVAFTATS 280 290 300 310 320 130 140 150 160 170 180 pF1KA0 VALILLLVVLFVLLQPKLKSFHHSRRD-QGVSGDQVSIMVDGVQVALPSYEEAVYGSSGH : :.::::.: ..: :.:. : .. :.: ..:::: : ::::.::: :. . CCDS41 VLLVLLLVILARMFQTKFKAHFPPRGPPRSSSSDPDFVVVDGVPVMLPSYDEAVSGGLSA 330 340 350 360 370 380 190 200 210 220 230 240 pF1KA0 CVPPADPRVQIVLSEGSG-PSGRSVPREQQLPDQGACSSAGGEDEAPGQSGLCEAWGSRA : . : :.: : .: ..: : : ..: : .::.: :.. : . CCDS41 LGPG------YMASVGQGCP----LPVDDQSPP--AYPGSGDTDTGPGESETCDSV-SGS 390 400 410 420 430 250 260 270 280 290 300 pF1KA0 SETVMVHQATTSSWVAGSGNRQLAHKETADSENSDIQSLLSLTSEEYTDDIPLLKEA :: .... . : . . . : .. : . . . .:.:::..: CCDS41 SE--LLQSLYSPPRCQESTHPASDNPDIIASTAEEVAS--TSPGIDIADEIPLMEEDP 440 450 460 470 480 490 303 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Nov 2 18:26:36 2016 done: Wed Nov 2 18:26:36 2016 Total Scan time: 1.940 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]