FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7055, 582 aa
  1>>>pF1KB7055 582 - 582 aa - 582 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.9520+/-0.00139; mu= 2.0279+/- 0.083
 mean_var=192.4974+/-40.872, 0's: 0 Z-trim(105.4): 38 B-trim: 0 in 0/50
 Lambda= 0.092440
 statistics sampled from 8380 (8394) to 8380 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.595), E-opt: 0.2 (0.258), width: 16
 Scan time: 3.400

The best scores are:                                opt  bits E(32554)
CCDS13346.1 SEMG2 gene_id:6407|Hs108|chr20 ( 582)  3849 526.6 3.3e-149
CCDS13345.1 SEMG1 gene_id:6406|Hs108|chr20 ( 462)  2134 297.8 1.9e-80

>>CCDS13346.1 SEMG2 gene_id:6407|Hs108|chr20 (582 aa)
 initn: 3849 init1: 3849 opt: 3849 Z-score: 2792.2 bits: 526.6 E(32554): 3.3e-149
Smith-Waterman score: 3849; 99.7% identity (100.0% similar) in 582 aa overlap (1-582:1-582)

               10        20        30        40        50        60
pF1KB7 MKSIILFVLSLLLILEKQAAVMGQKGGSKGQLPSGSSQFPHGKKGQHYFGQKDQQHTKSK
       ::::::::::::::::::::::::::::::::::::::::::.:::::::::::::::::
CCDS13 MKSIILFVLSLLLILEKQAAVMGQKGGSKGQLPSGSSQFPHGQKGQHYFGQKDQQHTKSK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 GSFSIQHTYHVDINDHDWTRKSQQYDLNALHKATKSKQHLGGSQQLLNYKQEGRDHDKSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GSFSIQHTYHVDINDHDWTRKSQQYDLNALHKATKSKQHLGGSQQLLNYKQEGRDHDKSK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 GHFHMIVIHHKGGQAHHGTQNPSQDQGNSPSGKGLSSQCSNTEKRLWVHGLSKEQASASG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GHFHMIVIHHKGGQAHHGTQNPSQDQGNSPSGKGLSSQCSNTEKRLWVHGLSKEQASASG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 AQKGRTQGGSQSSYVLQTEELVVNKQQRETKNSHQNKGHYQNVVDVREEHSSKLQTSLHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 AQKGRTQGGSQSSYVLQTEELVVNKQQRETKNSHQNKGHYQNVVDVREEHSSKLQTSLHP
              190       200
210 220 230 240 250 260 270 280 290 300 pF1KB7 AHQDRLQHGPKDIFTTQDELLVYNKNQHQTKNLNQDQEHGRKAHKISYPSSRTEERQLHH :::::::::::::::::::::::::::::::::.:::::::::::::::::::::::::: CCDS13 AHQDRLQHGPKDIFTTQDELLVYNKNQHQTKNLSQDQEHGRKAHKISYPSSRTEERQLHH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 GEKSVQKDVSKGSISIQTEEKIHGKSQNQVTIHSQDQEHGHKENKISYQSSSTEERHLNC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GEKSVQKDVSKGSISIQTEEKIHGKSQNQVTIHSQDQEHGHKENKISYQSSSTEERHLNC 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 GEKGIQKGVSKGSISIQTEEQIHGKSQNQVRIPSQAQEYGHKENKISYQSSSTEERRLNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GEKGIQKGVSKGSISIQTEEQIHGKSQNQVRIPSQAQEYGHKENKISYQSSSTEERRLNS 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB7 GEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKMSYQSSSTEERRLNY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKMSYQSSSTEERRLNY 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB7 GGKSTQKDVSQSSISFQIEKLVEGKSQIQTPNPNQDQWSGQNAKGKSGQSADSKQDLLSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GGKSTQKDVSQSSISFQIEKLVEGKSQIQTPNPNQDQWSGQNAKGKSGQSADSKQDLLSH 490 500 510 520 530 540 550 560 570 580 pF1KB7 EQKGRYKQESSESHNIVITEHEVAQDDHLTQQYNEDRNPIST :::::::::::::::::::::::::::::::::::::::::: CCDS13 EQKGRYKQESSESHNIVITEHEVAQDDHLTQQYNEDRNPIST 550 560 570 580 >>CCDS13345.1 SEMG1 gene_id:6406|Hs108|chr20 (462 aa) initn: 2968 init1: 2116 opt: 2134 Z-score: 1557.6 bits: 297.8 E(32554): 1.9e-80 Smith-Waterman score: 2155; 62.2% identity (73.0% similar) in 582 aa overlap (1-582:1-462) 10 20 30 40 50 60 pF1KB7 MKSIILFVLSLLLILEKQAAVMGQKGGSKGQLPSGSSQFPHGKKGQHYFGQKDQQHTKSK :: :.::::::::::::::::::::::::.::: ::::::.::::: ::: .:.:.:: CCDS13 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 GSFSIQHTYHVDINDHDWTRKSQQYDLNALHKATKSKQHLGGSQQLLNYKQEGRDHDKSK ::::::.::::: :::: 
.:::::::::::::.:::..:::::::::. ::::::::::: CCDS13 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 GHFHMIVIHHKGGQAHHGTQNPSQDQGNSPSGKGLSSQCSNTEKRLWVHGLSKEQASASG :::: .:::::::.::.:::::::::::::::::.::: ::::.:::::::::::.:.:: CCDS13 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 AQKGRTQGGSQSSYVLQTEELVVNKQQRETKNSHQNKGHYQNVVDVREEHSSKLQTSLHP ::::: ::::::::::::::::.:::::::::::::::::::::.::::::::.:::: : CCDS13 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 AHQDRLQHGPKDIFTTQDELLVYNKNQHQTKNLNQDQEHGRKAHKISYPSSRTEERQLHH ::::.:::: ::::.::::::::::::::::::::::.:::::.:::: :: ::::.::. CCDS13 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 GEKSVQKDVSKGSISIQTEEKIHGKSQNQVTIHSQDQEHGHKENKISYQSSSTEERHLNC ::..::::::..:: :. CCDS13 GENGVQKDVSQSSI---------------------------------YS----------- 310 370 380 390 400 410 420 pF1KB7 GEKGIQKGVSKGSISIQTEEQIHGKSQNQVRIPSQAQEYGHKENKISYQSSSTEERRLNS CCDS13 ------------------------------------------------------------ 430 440 450 460 470 480 pF1KB7 GEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKMSYQSSSTEERRLNY ::::: .::::.:.:::::.:::..: ::.::::::::::::.: CCDS13 ----------------QTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY 320 330 340 350 360 490 500 510 520 530 540 pF1KB7 GGKSTQKDVSQSSISFQIEKLVEGKSQIQTPNPNQDQWSGQNAKGKSGQSADSKQDLLSH : ...:::::: :: : :::: ::::::.:::.:. : :.::::.::::.. .:::::: CCDS13 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH 370 380 390 400 410 420 550 560 570 580 pF1KB7 EQKGRYKQESSESHNIVITEHEVAQDDHLTQQYNEDRNPIST :::::... : . .::: :.: .: ::.:. :.::::. 
:
CCDS13 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT
              430       440       450       460

582 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 04:18:34 2016 done: Fri Nov 4 04:18:35 2016
 Total Scan time: 3.400 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]