FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9682, 149 aa
  1>>>pF1KB9682 149 - 149 aa - 149 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3364+/-0.000626; mu= 12.0495+/- 0.038
 mean_var=67.7534+/-13.502, 0's: 0 Z-trim(111.4): 29 B-trim: 0 in 0/53
 Lambda= 0.155815
 statistics sampled from 12349 (12373) to 12349 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.755), E-opt: 0.2 (0.38), width: 16
 Scan time: 1.690

The best scores are:                              opt  bits E(32554)
CCDS13186.1 ID1 gene_id:3397|Hs108|chr20  ( 149)  950 221.5 1.5e-58
CCDS13185.1 ID1 gene_id:3397|Hs108|chr20  ( 155)  903 211.0 2.4e-55
CCDS1659.1 ID2 gene_id:3398|Hs108|chr2    ( 134)  286  72.2 1.2e-13
CCDS237.1 ID3 gene_id:3399|Hs108|chr1     ( 119)  275  69.7 5.9e-13
CCDS4544.1 ID4 gene_id:3400|Hs108|chr6    ( 161)  257  65.7 1.3e-11

>>CCDS13186.1 ID1 gene_id:3397|Hs108|chr20 (149 aa)
 initn: 950 init1: 950 opt: 950 Z-score: 1164.6 bits: 221.5 E(32554): 1.5e-58
Smith-Waterman score: 950; 100.0% identity (100.0% similar) in 149 aa overlap (1-149:1-149)

               10        20        30        40        50        60
pF1KB9 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG
               70        80        90       100       110       120

              130       140
pF1KB9 RGLPVRAPLSTLNGEISALTAEVRSRSDH
       :::::::::::::::::::::::::::::
CCDS13 RGLPVRAPLSTLNGEISALTAEVRSRSDH
              130       140

>>CCDS13185.1 ID1 gene_id:3397|Hs108|chr20 (155 aa)
 initn: 903 init1: 903 opt: 903 Z-score: 1107.2 bits: 211.0 E(32554): 2.4e-55
Smith-Waterman score: 903; 100.0% identity
(100.0% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KB9 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG 70 80 90 100 110 120 130 140 pF1KB9 RGLPVRAPLSTLNGEISALTAEVRSRSDH :::::::::::::::::::::: CCDS13 RGLPVRAPLSTLNGEISALTAEAACVPADDRILCR 130 140 150 >>CCDS1659.1 ID2 gene_id:3398|Hs108|chr2 (134 aa) initn: 307 init1: 249 opt: 286 Z-score: 358.6 bits: 72.2 E(32554): 1.2e-13 Smith-Waterman score: 293; 49.5% identity (73.0% similar) in 111 aa overlap (35-140:15-114) 10 20 30 40 50 60 pF1KB9 SGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQQVNV ::..:..::: .. : .:. . CCDS16 MKAFSPVRSVRKNSLSDHSLGISR------SKTP--VDDPM--S 10 20 30 70 80 90 100 110 pF1KB9 LLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGT-----PG :::.:: :::.::::::..:::.::::.:::::::::: :::. :.:. . . :: CCDS16 LLYNMNDCYSKLKELVPSIPQNKKVSKMEILQHVIDYILDLQIALDSHPTIVSLHHQRPG 40 50 60 70 80 90 120 130 140 pF1KB9 GRGLPVRAPLSTLNGEISALTAEVRSRSDH .. :.::.::: .:: :. CCDS16 -QNQASRTPLTTLNTDISILSLQASEFPSELMSNDSKALCG 100 110 120 130 >>CCDS237.1 ID3 gene_id:3399|Hs108|chr1 (119 aa) initn: 248 init1: 192 opt: 275 Z-score: 346.0 bits: 69.7 E(32554): 5.9e-13 Smith-Waterman score: 276; 49.1% identity (69.1% similar) in 110 aa overlap (19-126:1-100) 10 20 30 40 50 pF1KB9 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAG-GAGARLP-ALLD .:: . . : :.: ::::.:.::.: : : .:. : .::: CCDS23 MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLD 10 20 30 40 60 70 80 90 100 110 pF1KB9 EQQVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTP ::: :::::.:::: .:.. ..:.:::::.::::: :::. 
: .: : : CCDS23 ---------DMNHCYSRLRELVPGVPRGTQLSQVEILQRVIDYILDLQVVL-AEPAPGPP 50 60 70 80 90 120 130 140 pF1KB9 GGRGLPVRAPLSTLNGEISALTAEVRSRSDH : ::.. CCDS23 DGPHLPIQTAELTPELVISNDKRSFCH 100 110 >>CCDS4544.1 ID4 gene_id:3400|Hs108|chr6 (161 aa) initn: 300 init1: 227 opt: 257 Z-score: 322.2 bits: 65.7 E(32554): 1.3e-11 Smith-Waterman score: 266; 42.5% identity (64.9% similar) in 134 aa overlap (20-133:12-144) 10 20 30 40 50 pF1KB9 MKVASGSTATAAAGPSCALKAGKTASGAGEV-VRCLSEQSVAI----SRCAGGAGARLPA . . .. :.::. .:::.:.. .. . :..:.:: : CCDS45 MKAVSPVRPSGRKAPSGCGGGELALRCLAEHGHSLGGSAAAAAAAAAARCKA 10 20 30 40 50 60 70 80 90 100 110 pF1KB9 L---LDEQQVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSE :: . : ::: :::::..::::.: :.::::::::::::::: :::: :... CCDS45 AEAAADEPAL-CLQCDMNDCYSRLRRLVPTIPPNKKVSKVEILQHVIDYILDLQLALETH 60 70 80 90 100 110 120 130 140 pF1KB9 SEV----------GTPGGR--GLPVRAPLSTLNGEISALTAEVRSRSDH . :.: . : :.::..:: CCDS45 PALLRQPPPPAPPHHPAGTCPAAPPRTPLTALNTDPAGAVNKQGDSILCR 120 130 140 150 160 149 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 18:17:44 2016 done: Fri Nov 4 18:17:44 2016 Total Scan time: 1.690 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]