FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5643, 72 aa 1>>>pF1KB5643 72 - 72 aa - 72 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9146+/-0.000592; mu= 8.3300+/- 0.036 mean_var=45.6497+/- 9.066, 0's: 0 Z-trim(109.8): 20 B-trim: 0 in 0/50 Lambda= 0.189826 statistics sampled from 11160 (11179) to 11160 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.765), E-opt: 0.2 (0.343), width: 16 Scan time: 1.330 The best scores are: opt bits E(32554) CCDS30749.1 GNG12 gene_id:55970|Hs108|chr1 ( 72) 451 130.2 1.1e-31 CCDS12091.1 GNG7 gene_id:2788|Hs108|chr19 ( 68) 363 106.1 1.9e-24 CCDS32082.1 GNG2 gene_id:54331|Hs108|chr14 ( 71) 291 86.4 1.7e-18 CCDS1607.1 GNG4 gene_id:2786|Hs108|chr1 ( 75) 280 83.4 1.4e-17 CCDS8032.1 GNG3 gene_id:2785|Hs108|chr11 ( 75) 272 81.2 6.5e-17 CCDS12687.1 GNG8 gene_id:94235|Hs108|chr19 ( 70) 261 78.2 4.9e-16 CCDS696.1 GNG5 gene_id:2787|Hs108|chr1 ( 68) 234 70.8 8e-14 CCDS35107.1 GNG10 gene_id:2790|Hs108|chr9 ( 68) 222 67.5 7.8e-13 >>CCDS30749.1 GNG12 gene_id:55970|Hs108|chr1 (72 aa) initn: 451 init1: 451 opt: 451 Z-score: 682.4 bits: 130.2 E(32554): 1.1e-31 Smith-Waterman score: 451; 100.0% identity (100.0% similar) in 72 aa overlap (1-72:1-72) 10 20 30 40 50 60 pF1KB5 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTSE 10 20 30 40 50 60 70 pF1KB5 NPFKDKKTCIIL :::::::::::: CCDS30 NPFKDKKTCIIL 70 >>CCDS12091.1 GNG7 gene_id:2788|Hs108|chr19 (68 aa) initn: 363 init1: 363 opt: 363 Z-score: 552.6 bits: 106.1 E(32554): 1.9e-24 Smith-Waterman score: 363; 77.6% identity (97.0% similar) in 67 aa overlap (6-72:2-68) 10 20 30 40 50 60 pF1KB5 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTSE ..::::::::. :.:::.::.:::::::::..:::::::.:::.::::.:.:.:: CCDS12 MSATNNIAQARKLVEQLRIEAGIERIKVSKAASDLMSYCEQHARNDPLLVGVPASE 10 20 30 40 50 70 pF1KB5 NPFKDKKTCIIL ::::::: :::: CCDS12 NPFKDKKPCIIL 60 >>CCDS32082.1 GNG2 gene_id:54331|Hs108|chr14 (71 aa) initn: 291 init1: 283 opt: 291 Z-score: 445.7 bits: 86.4 E(32554): 1.7e-18 Smith-Waterman score: 291; 62.3% identity (85.5% similar) in 69 aa overlap (5-72:3-71) 10 20 30 40 50 60 pF1KB5 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTSE . .: .:::::. :.::..::.:.:::::::.::::.::: ::. :::: .:.:: CCDS32 MASNNTASIAQARKLVEQLKMEANIDRIKVSKAAADLMAYCEAHAKEDPLLTPVPASE 10 20 30 40 50 70 pF1KB5 NPFKDKKT-CIIL :::..:: : :: CCDS32 NPFREKKFFCAIL 60 70 >>CCDS1607.1 GNG4 gene_id:2786|Hs108|chr1 (75 aa) initn: 272 init1: 272 opt: 280 Z-score: 429.0 bits: 83.4 E(32554): 1.4e-17 Smith-Waterman score: 280; 58.2% identity (89.6% similar) in 67 aa overlap (7-72:9-75) 10 20 30 40 50 pF1KB5 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPT ::..:.:::..:.::..:: ..:.:::.:.:::..::: :.: :::.: .:. CCDS16 MKEGMSNNSTTSISQARKAVEQLKMEACMDRVKVSQAAADLLAYCEAHVREDPLIIPVPA 10 20 30 40 50 60 60 70 pF1KB5 SENPFKDKKT-CIIL :::::..:: : :: CCDS16 SENPFREKKFFCTIL 70 >>CCDS8032.1 GNG3 gene_id:2785|Hs108|chr11 (75 aa) initn: 267 init1: 267 opt: 272 Z-score: 417.2 bits: 81.2 E(32554): 6.5e-17 Smith-Waterman score: 272; 61.2% identity (83.6% similar) in 67 aa overlap (7-72:9-75) 10 20 30 40 50 pF1KB5 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPT :: .:.:::. :.::..:::. :::::::.::::.::. :: :::. .:: CCDS80 MKGETPVNSTMSIGQARKMVEQLKIEASLCRIKVSKAAADLMTYCDAHACEDPLITPVPT 10 20 30 40 50 60 60 70 pF1KB5 SENPFKDKKT-CIIL :::::..:: : .: CCDS80 SENPFREKKFFCALL 70 >>CCDS12687.1 GNG8 gene_id:94235|Hs108|chr19 (70 aa) initn: 258 init1: 249 opt: 261 Z-score: 401.4 bits: 78.2 E(32554): 4.9e-16 Smith-Waterman score: 261; 50.7% identity (87.7% similar) in 73 aa overlap (1-72:1-70) 10 20 30 40 50 60 pF1KB5 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTSE ::.. : .::.::.::.::.::..:.:.:::.:.:.:...:: ::..:::. .:..: CCDS12 MSNNMA---KIAEARKTVEQLKLEVNIDRMKVSQAAAELLAFCETHAKDDPLVTPVPAAE 10 20 30 40 50 70 pF1KB5 NPFKDKKT-CIIL :::.::. :..: CCDS12 NPFRDKRLFCVLL 60 70 >>CCDS696.1 GNG5 gene_id:2787|Hs108|chr1 (68 aa) initn: 224 init1: 216 opt: 234 Z-score: 361.6 bits: 70.8 E(32554): 8e-14 Smith-Waterman score: 234; 46.3% identity (85.1% similar) in 67 aa overlap (6-72:2-68) 10 20 30 40 50 60 pF1KB5 MSSKTASTNNIAQARRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTSE ......: ...::::::::...:.:::.:.::: ..: ..:. :::: :. .: CCDS69 MSGSSSVAAMKKVVQQLRLEAGLNRVKVSQAAADLKQFCLQNAQHDPLLTGVSSST 10 20 30 40 50 70 pF1KB5 NPFKDKKTCIIL :::. .:.: .: CCDS69 NPFRPQKVCSFL 60 >>CCDS35107.1 GNG10 gene_id:2790|Hs108|chr9 (68 aa) initn: 231 init1: 207 opt: 222 Z-score: 343.9 bits: 67.5 E(32554): 7.8e-13 Smith-Waterman score: 222; 47.8% identity (85.1% similar) in 67 aa overlap (7-72:2-68) 10 20 30 40 50 pF1KB5 MSSKTASTNNIAQA-RRTVQQLRLEASIERIKVSKASADLMSYCEEHARSDPLLIGIPTS :.. :.: .: :.::.:::..::::::.:.:.:..:: ..: .: ::.:.:.. CCDS35 MSSGASASALQRLVEQLKLEAGVERIKVSQAAAELQQYCMQNACKDALLVGVPAG 10 20 30 40 50 60 70 pF1KB5 ENPFKDKKTCIIL :::.. ..: .: CCDS35 SNPFREPRSCALL 60 72 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 04:00:17 2016 done: Fri Nov 4 04:00:17 2016 Total Scan time: 1.330 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]