FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB3711, 75 aa
  1>>>pF1KB3711 75 - 75 aa - 75 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.8183+/-0.000529; mu= 9.2803+/- 0.032
 mean_var=47.1681+/- 9.293, 0's: 0 Z-trim(111.6): 19 B-trim: 0 in 0/54
 Lambda= 0.186745
 statistics sampled from 12524 (12543) to 12524 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.789), E-opt: 0.2 (0.385), width: 16
 Scan time: 1.390

The best scores are:                                      opt  bits E(32554)
CCDS1607.1 GNG4 gene_id:2786|Hs108|chr1           (  75)  489 138.3 4.3e-34
CCDS32082.1 GNG2 gene_id:54331|Hs108|chr14        (  71)  381 109.2 2.3e-25
CCDS8032.1 GNG3 gene_id:2785|Hs108|chr11          (  75)  341  98.4 4.3e-22
CCDS12687.1 GNG8 gene_id:94235|Hs108|chr19        (  70)  321  93.0 1.7e-20
CCDS12091.1 GNG7 gene_id:2788|Hs108|chr19         (  68)  283  82.8   2e-17
CCDS30749.1 GNG12 gene_id:55970|Hs108|chr1        (  72)  280  82.0 3.7e-17
CCDS696.1 GNG5 gene_id:2787|Hs108|chr1            (  68)  211  63.4 1.4e-11
CCDS35107.1 GNG10 gene_id:2790|Hs108|chr9         (  68)  204  61.5 5.1e-11

>>CCDS1607.1 GNG4 gene_id:2786|Hs108|chr1 (75 aa)
 initn: 489 init1: 489 opt: 489 Z-score: 725.6 bits: 138.3 E(32554): 4.3e-34
Smith-Waterman score: 489; 100.0% identity (100.0% similar) in 75 aa overlap (1-75:1-75)

               10        20        30        40        50        60
pF1KB3 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS16 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
               10        20        30        40        50        60

               70
pF1KB3 SENPFREKKFFCTIL
       :::::::::::::::
CCDS16 SENPFREKKFFCTIL
               70

>>CCDS32082.1 GNG2 gene_id:54331|Hs108|chr14 (71 aa)
 initn: 381 init1: 381 opt: 381 Z-score: 568.8 bits: 109.2 E(32554): 2.3e-25
Smith-Waterman score: 381; 77.5% identity (95.8% similar) in 71 aa overlap (5-75:1-71)

               10        20        30        40        50        60
pF1KB3 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
           :..:.:.::.:::: :::::::: .::.:::.:::::.::::::..::::. ::::
CCDS32     MASNNTASIAQARKLVEQLKMEANIDRIKVSKAAADLMAYCEAHAKEDPLLTPVPA
                   10        20        30        40        50

               70
pF1KB3 SENPFREKKFFCTIL
       ::::::::::::.::
CCDS32 SENPFREKKFFCAIL
         60        70

>>CCDS8032.1 GNG3 gene_id:2785|Hs108|chr11 (75 aa)
 initn: 351 init1: 341 opt: 341 Z-score: 510.1 bits: 98.4 E(32554): 4.3e-22
Smith-Waterman score: 341; 69.3% identity (85.3% similar) in 75 aa overlap (1-75:1-75)

               10        20        30        40        50        60
pF1KB3 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
       ::     ::: ::.:::: :::::.:: . :.:::.:::::..::.::. ::::: :::.
CCDS80 MKGETPVNSTMSIGQARKMVEQLKIEASLCRIKVSKAAADLMTYCDAHACEDPLITPVPT
               10        20        30        40        50        60

               70
pF1KB3 SENPFREKKFFCTIL
       ::::::::::::..:
CCDS80 SENPFREKKFFCALL
               70

>>CCDS12687.1 GNG8 gene_id:94235|Hs108|chr19 (70 aa)
 initn: 326 init1: 300 opt: 321 Z-score: 481.5 bits: 93.0 E(32554): 1.7e-20
Smith-Waterman score: 321; 63.4% identity (94.4% similar) in 71 aa overlap (5-75:1-70)

               10        20        30        40        50        60
pF1KB3 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
           ::::  ..:..:::.:::::.:. .::.:::::::.:::.::.:...:::. ::::
CCDS12     MSNN-MAKIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPA
                    10        20        30        40        50

               70
pF1KB3 SENPFREKKFFCTIL
       .:::::.:..::..:
CCDS12 AENPFRDKRLFCVLL
          60        70

>>CCDS12091.1 GNG7 gene_id:2788|Hs108|chr19 (68 aa)
 initn: 277 init1: 277 opt: 283 Z-score: 426.4 bits: 82.8 E(32554): 2e-17
Smith-Waterman score: 283; 60.3% identity (89.7% similar) in 68 aa overlap (8-75:2-68)

               10        20        30        40        50        60
pF1KB3 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
              ..:..:.:::: ::::..:: ..:.:::.::.::..::: :.:.:::.. :::
CCDS12       MSATNNIAQARKLVEQLRIEAGIERIKVSKAASDLMSYCEQHARNDPLLVGVPA
                     10        20        30        40        50

               70
pF1KB3 SENPFREKKFFCTIL
       :::::..::  : ::
CCDS12 SENPFKDKKP-CIIL
           60

>>CCDS30749.1 GNG12 gene_id:55970|Hs108|chr1 (72 aa)
 initn: 272 init1: 272 opt: 280 Z-score: 421.6 bits: 82.0 E(32554): 3.7e-17
Smith-Waterman score: 280; 58.2% identity (89.6% similar) in 67 aa overlap (9-75:7-72)

               10        20        30        40        50        60
pF1KB3 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
               ::..:.:::..:.::..:: ..:.:::.:.:::..::: :.: :::.: .:.
CCDS30   MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPT
                 10        20        30        40        50

               70
pF1KB3 SENPFREKKFFCTIL
       :::::..::  : ::
CCDS30 SENPFKDKKT-CIIL
       60         70

>>CCDS696.1 GNG5 gene_id:2787|Hs108|chr1 (68 aa)
 initn: 189 init1: 175 opt: 211 Z-score: 321.5 bits: 63.4 E(32554): 1.4e-11
Smith-Waterman score: 211; 45.6% identity (79.4% similar) in 68 aa overlap (8-75:2-68)

               10        20        30        40        50        60
pF1KB3 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
              ....:..  .:.:.::..:: ..:::::::::::  .:  ....:::.  : .
CCDS69       MSGSSSVAAMKKVVQQLRLEAGLNRVKVSQAAADLKQFCLQNAQHDPLLTGVSS
                     10        20        30        40        50

               70
pF1KB3 SENPFREKKFFCTIL
       : :::: .:  :..:
CCDS69 STNPFRPQKV-CSFL
           60

>>CCDS35107.1 GNG10 gene_id:2790|Hs108|chr9 (68 aa)
 initn: 230 init1: 187 opt: 204 Z-score: 311.4 bits: 61.5 E(32554): 5.1e-11
Smith-Waterman score: 204; 48.5% identity (75.0% similar) in 68 aa overlap (8-75:2-68)

               10        20        30        40        50        60
pF1KB3 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA
              .: .: :  .. :::::.:: ..:.:::::::.:  ::  .. .: :.. :::
CCDS35       MSSGASASALQRLVEQLKLEAGVERIKVSQAAAELQQYCMQNACKDALLVGVPA
                     10        20        30        40        50

               70
pF1KB3 SENPFREKKFFCTIL
       . ::::: .  :..:
CCDS35 GSNPFREPRS-CALL
           60

75 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 02:38:07 2016 done: Fri Nov 4 02:38:07 2016
 Total Scan time: 1.390 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]