FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8218, 185 aa
  1>>>pF1KB8218 185 - 185 aa - 185 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4890+/-0.000656; mu= 11.6620+/- 0.040
 mean_var=58.8374+/-11.583, 0's: 0 Z-trim(110.1): 9 B-trim: 51 in 1/49
 Lambda= 0.167204
 statistics sampled from 11359 (11367) to 11359 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.739), E-opt: 0.2 (0.349), width: 16
 Scan time: 1.600

The best scores are:                                opt  bits E(32554)
CCDS3376.1 NSG1 gene_id:27065|Hs108|chr4    ( 185) 1203 297.9 2.4e-81
CCDS4391.1 HMP19 gene_id:51617|Hs108|chr5   ( 171)  381  99.6 1.1e-21
CCDS7678.1 CALY gene_id:50632|Hs108|chr10   ( 217)  378  98.9 2.2e-21

>>CCDS3376.1 NSG1 gene_id:27065|Hs108|chr4 (185 aa)
 initn: 1203 init1: 1203 opt: 1203 Z-score: 1574.0 bits: 297.9 E(32554): 2.4e-81
Smith-Waterman score: 1203; 100.0% identity (100.0% similar) in 185 aa overlap (1-185:1-185)

               10        20        30        40        50        60
pF1KB8 MVKLGNNFAEKGTKQPLLEDGFDTIPLMTPLDVNQLQFPPPDKVVVKTKTEYEPDRKKGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MVKLGNNFAEKGTKQPLLEDGFDTIPLMTPLDVNQLQFPPPDKVVVKTKTEYEPDRKKGK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 ARPPQIAEFTVSITEGVTERFKVSVLVLFALAFLTCVVFLVVYKVYKYDRACPDGFVLKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 ARPPQIAEFTVSITEGVTERFKVSVLVLFALAFLTCVVFLVVYKVYKYDRACPDGFVLKN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 TQCIPEGLESYYAEQDSSAREKFYTVINHYNLAKQSITRSVSPWMSVLSEEKLSEQETEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 TQCIPEGLESYYAEQDSSAREKFYTVINHYNLAKQSITRSVSPWMSVLSEEKLSEQETEA
              130       140       150       160       170       180

pF1KB8 AEKSA
       :::::
CCDS33 AEKSA

>>CCDS4391.1 HMP19 gene_id:51617|Hs108|chr5 (171 aa)
 initn: 613 init1: 362 opt: 381 Z-score: 502.9 bits: 99.6 E(32554): 1.1e-21
Smith-Waterman score: 656; 56.2% identity (81.2% similar) in 176 aa overlap (1-172:1-164)

               10        20        30        40        50
pF1KB8 MVKLGNNFAEKGTKQPLLEDGFDTIPLMTPLDVNQLQFPPPDKVVVKTKTEYEPDRK-KG
       ::::..: .::::: : .::::.:.::.:::.::.::.: :.::.:::.:::.:..: ::
CCDS43 MVKLNSNPSEKGTKPPSVEDGFQTVPLITPLEVNHLQLPAPEKVIVKTRTEYQPEQKNKG
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KB8 KARPPQIAEFTVSITEGVTERFKVSVLVLFALAFLTCVVFLVVYKVYKYDRACPDGFVLK
       : : :.::::::.:            :: .:::::.:.:::::::.. ::..::.::: :
CCDS43 KFRVPKIAEFTVTI------------LVSLALAFLACIVFLVVYKAFTYDHSCPEGFVYK
               70                    80        90       100

     120       130       140       150       160          170
pF1KB8 NTQCIPEGLESYYAEQDSSAREKFYTVINHYNLAKQSITRSVSPWMS---VLSEEKLSEQ
       . .::: .:..::. :: ..: .:::::.::..:::: .:...::.:   :. : :
CCDS43 HKRCIPASLDAYYSSQDPNSRSRFYTVISHYSVAKQSTARAIGPWLSAAAVIHEPKPPKT
      110       120       130       140       150       160

        180
pF1KB8 ETEAAEKSA

CCDS43 QGH
      170

>>CCDS7678.1 CALY gene_id:50632|Hs108|chr10 (217 aa)
 initn: 319 init1: 165 opt: 378 Z-score: 497.3 bits: 98.9 E(32554): 2.2e-21
Smith-Waterman score: 378; 38.8% identity (68.8% similar) in 170 aa overlap (1-164:1-165)

               10        20          30        40        50
pF1KB8 MVKLGNNFAEKGTKQPLLEDG--FDTIPLMTPLDVNQLQFPPPDKVVVKTKTEYE---PD
       ::::: .:. :  :.:  .::  .:..::..:::..::: : ::.::.::.:::.   ::
CCDS76 MVKLGCSFSGKPGKDPGDQDGAAMDSVPLISPLDISQLQPPLPDQVVIKTQTEYQLSSPD
               10        20        30        40        50        60

          60        70        80        90       100       110
pF1KB8 RKKGKARPPQIAEFTVSITEGVTERFKVSVLVLFALAFLTCVVFLVVYKVYKYDR-ACPD
       ...      :  . .    ::  .:. .. .. ::.:.: ::  :..::.  ::. .:::
CCDS76 QQNFPDLEGQRLNCS-HPEEG--RRLPTARMIAFAMALLGCV--LIMYKAIWYDQFTCPD
               70         80          90         100       110

          120       130       140       150       160       170
pF1KB8 GFVLKNTQCIPEGLESYYAEQDSSAREKFYTVINHYNLAKQSITRSVSPWMSVLSEEKLS
       ::.:..  : :  :: ::.:.:   .... ..:. : :...  :.. . :
CCDS76 GFLLRHKICTPLTLEMYYTEMDPERHRSILAAIGAYPLSRKHGTETPAAWGDGYRAAKEE
         120       130       140       150       160       170

          180
pF1KB8 EQETEAAEKSA

CCDS76 RKGPTQAGAAAAATEPPGKPSAKAEKEAARKAAGSAAPPPAQ
         180       190       200       210

185 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 10:39:36 2016 done: Fri Nov 4 10:39:36 2016
 Total Scan time: 1.600 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]