FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6442, 346 aa
  1>>>pF1KB6442 346 - 346 aa - 346 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.6704+/-0.000812; mu= 19.8931+/- 0.049
 mean_var=65.4782+/-13.073, 0's: 0 Z-trim(107.4): 42  B-trim: 39 in 1/50
 Lambda= 0.158499
 statistics sampled from 9501 (9538) to 9501 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.664), E-opt: 0.2 (0.293), width:  16
 Scan time:  2.710

The best scores are:                                 opt  bits E(32554)
CCDS4871.1 PRPH2 gene_id:5961|Hs108|chr6     ( 346) 2347 545.4 2.6e-155
CCDS8024.1 ROM1 gene_id:6094|Hs108|chr11     ( 351)  802 192.1   6e-49

>>CCDS4871.1 PRPH2 gene_id:5961|Hs108|chr6              (346 aa)
 initn: 2347 init1: 2347 opt: 2347  Z-score: 2901.7  bits: 545.4 E(32554): 2.6e-155
Smith-Waterman score: 2347; 99.1% identity (99.7% similar) in 346 aa overlap (1-346:1-346)

               10        20        30        40        50        60
pF1KB6 MALLKVKFDQKKRVKLAQGLWLMNWFSVLAGIIIFSLGLFLKIELRKRSDVMNNSESHFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 MALLKVKFDQKKRVKLAQGLWLMNWFSVLAGIIIFSLGLFLKIELRKRSDVMNNSESHFV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 PNSLIGMGVLSCVFNSLAGKICYDALDPAKYARWKPWLKPYLAICVLFNIILFLVALCCF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 PNSLIGMGVLSCVFNSLAGKICYDALDPAKYARWKPWLKPYLAICVLFNIILFLVALCCF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 LLRGSLENTLGQGLKNGMKYYRDTDTPGRCFMKKTIDMLQIEFKCCGNNGFRDWFEIQWI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 LLRGSLENTLGQGLKNGMKYYRDTDTPGRCFMKKTIDMLQIEFKCCGNNGFRDWFEIQWI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB6 SNRYLDFSSKEVKDRIKSNVDGRYLVDGVPFSCCNPSSPRPCIQYQITNNSAHYSYDHQT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 SNRYLDFSSKEVKDRIKSNVDGRYLVDGVPFSCCNPSSPRPCIQYQITNNSAHYSYDHQT
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB6 EELNLWVRGCRAALLSYYSSLMNSMGVVTLLIWLFEVTITIGLRYLQTSLDGVSNPEESE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 EELNLWVRGCRAALLSYYSSLMNSMGVVTLLIWLFEVTITIGLRYLQTSLDGVSNPEESE
              250       260       270       280       290       300

              310       320       330       340
pF1KB6 SESEGWLLEKSVPETWKAFLESVKKLGKGNQVEAEGAGAGQAPEAG
       :::.:::::.::::::::::::::::::::::::::: ::::::::
CCDS48 SESQGWLLERSVPETWKAFLESVKKLGKGNQVEAEGADAGQAPEAG
              310       320       330       340

>>CCDS8024.1 ROM1 gene_id:6094|Hs108|chr11              (351 aa)
 initn: 764 init1: 662 opt: 802  Z-score: 992.3  bits: 192.1 E(32554): 6e-49
Smith-Waterman score: 802; 36.2% identity (70.7% similar) in 345 aa overlap (3-343:4-343)

                10        20        30        40        50
pF1KB6  MALLKVKFDQKKRVKLAQGLWLMNWFSVLAGIIIFSLGLFLKIELRKRSDVMNNSESHF
          .: . .  . :..:::::::..:. .::: .:.  .  : ..::. .  .  : .
CCDS80 MAPVLPLVLPLQPRIRLAQGLWLLSWLLALAGGVILLCSGHLLVQLRHLGTFLAPSCQFP
               10        20        30        40        50        60

      60         70         80        90       100       110
pF1KB6 V-PNSLIGMGVLSCVFNSLAG-KICYDALDPAKYARWKPWLKPYLAICVLFNIILFLVAL
       : :.. .. :... . ..:.:      .:. : :  :.  : : :.  .  .  :..:.:
CCDS80 VLPQAALAAGAVA-LGTGLVGVGASRASLNAALYPPWRGVLGPLLVAGTAGGGGLLVVGL
               70         80        90       100       110

       120        130       140       150       160       170
pF1KB6 CCFL-LRGSLENTLGQGLKNGMKYYRDTDTPGRCFMKKTIDMLQIEFKCCGNNGFRDWFE
          : : :::...: .:: ... .:.::..::.:  :. .: ::....::: .:..:::
CCDS80 GLALALPGSLDEALEEGLVTALAHYKDTEVPGHCQAKRLVDELQLRYHCCGRHGYKDWFG
     120       130       140       150       160       170

        180       190       200       210       220       230
pF1KB6 IQWISNRYLDFSSKEVKDRIKSNVDGRYLVDGVPFSCCNPSSPRPCIQYQITNNSAHYSY
       .::.:.:::: ....: :::.:::.: ::.:::::::::: :::::.: ..... ::  .
CCDS80 VQWVSSRYLDPGDRDVADRIQSNVEGLYLTDGVPFSCCNPHSPRPCLQNRLSDSYAHPLF
     180       190       200       210       220       230

        240       250       260       270       280       290
pF1KB6 DHQTEELNLWVRGCRAALLSYYSSLMNSMGVVTLLIWLFEVTITIGLRYLQTSLDGVSNP
       : .  . :::..::. .:: . ..: ...: .  . .:... . .:::::::.:.:...
CCDS80 DPRQPNQNLWAQGCHEVLLEHLQDLAGTLGSMLAVTFLLQALVLLGLRYLQTALEGLGGV
     240       250       260       270       280       290

        300       310        320       330       340
pF1KB6 EESESESEGWLLEKSVPETWK-AFLESVKKLGKGNQVEAEGAGAGQAPEAG
        .. .:..:.:. ... .  : :.:..    : . . . : :  :.::
CCDS80 IDAGGETQGYLFPSGLKDMLKTAWLQG----GVACRPAPEEAPPGEAPPKEDLSEA
     300       310       320           330       340       350

346 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 17:35:30 2016 done: Fri Nov  4 17:35:30 2016
 Total Scan time:  2.710 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]