FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6189, 660 aa
  1>>>pF1KB6189 660 - 660 aa - 660 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.2624+/-0.00109; mu= 14.2735+/- 0.065
 mean_var=73.7572+/-14.812, 0's: 0 Z-trim(102.8): 20 B-trim: 400 in 1/49
 Lambda= 0.149339
 statistics sampled from 7106 (7113) to 7106 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.582), E-opt: 0.2 (0.218), width: 16
 Scan time: 3.460

The best scores are:                                      opt bits E(32554)
CCDS4367.1 RARS gene_id:5917|Hs108|chr5            ( 660) 4331 943.0       0
CCDS5011.1 RARS2 gene_id:57038|Hs108|chr6          ( 578)  666 153.4 8.2e-37

>>CCDS4367.1 RARS gene_id:5917|Hs108|chr5 (660 aa)
 initn: 4331 init1: 4331 opt: 4331 Z-score: 5040.9 bits: 943.0 E(32554): 0
Smith-Waterman score: 4331; 99.8% identity (99.8% similar) in 660 aa overlap (1-660:1-660)

               10        20        30        40        50        60
pF1KB6 MDVLVSECSARLLQQEEEIKSLTAEIDRLKNCGCLGASPNLEQLQEENLKLKYRLNILRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MDVLVSECSARLLQQEEEIKSLTAEIDRLKNCGCLGASPNLEQLQEENLKLKYRLNILRK
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 SLQAERNKPTKNMINIISRLQEVFGHAIKAAYPDLENPPLLVTPSQQAKFGDYQCNSAMG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 SLQAERNKPTKNMINIISRLQEVFGHAIKAAYPDLENPPLLVTPSQQAKFGDYQCNSAMG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 ISQMLKTKEQKVNPREIAENITKHLPDNECIEKVEIAGPGFINVHLRKDFVSEQLTSLLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ISQMLKTKEQKVNPREIAENITKHLPDNECIEKVEIAGPGFINVHLRKDFVSEQLTSLLV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB6 NGVQLPALGENKKVIVDFSSPNIAKEMHVGHLRSTIIGESISRLFEFAGYDVLRLNHVGD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 NGVQLPALGENKKVIVDFSSPNIAKEMHVGHLRSTIIGESISRLFEFAGYDVLRLNHVGD
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB6 WGTQFGMLIAHLQDKFPDYLTVSPPIGDLQVFYKESKKRFDTEEEFKKRAYQCVVLLQGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 WGTQFGMLIAHLQDKFPDYLTVSPPIGDLQVFYKESKKRFDTEEEFKKRAYQCVVLLQGK
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB6 NPDITKAWKLICDVSRQELNKIYDALDVSLIERGESFYQDRMNDIVKEFEDRGFVQVDDG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 NPDITKAWKLICDVSRQELNKIYDALDVSLIERGESFYQDRMNDIVKEFEDRGFVQVDDG
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB6 RKIVFVPGCSIPLTIVKSDGGYTYDTSDLAAIKQRLFEEKADMIIYVVDNGQSVHFQTIF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RKIVFVPGCSIPLTIVKSDGGYTYDTSDLAAIKQRLFEEKADMIIYVVDNGQSVHFQTIF
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB6 AAAQMIGWYDPKVTRVFHAGFGVVLGEDKKKFKTRSGETVRLMDLLGEGLKRSMDKLKEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 AAAQMIGWYDPKVTRVFHAGFGVVLGEDKKKFKTRSGETVRLMDLLGEGLKRSMDKLKEK
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB6 ERDKVLTAEELNAAQTSVAYGYIKYADLSHNRLNDYIFSFDKMLDDRGNTAAYLLYAFTR
       ::::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::
CCDS43 ERDKVLTAEELNAAQTSVAYGCIKYADLSHNRLNDYIFSFDKMLDDRGNTAAYLLYAFTR
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KB6 IRSIARLANIDEEMLQKAARETKILLDHEKEWKLGRCILRFPEILQKILDDLFLHTLCDY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 IRSIARLANIDEEMLQKAARETKILLDHEKEWKLGRCILRFPEILQKILDDLFLHTLCDY
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KB6 IYELATAFTEFYDSCYCVEKDRQTGKILKVNMWRMLLCEAVAAVMAKGFDILGIKPVQRM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 IYELATAFTEFYDSCYCVEKDRQTGKILKVNMWRMLLCEAVAAVMAKGFDILGIKPVQRM
              610       620       630       640       650       660

>>CCDS5011.1 RARS2 gene_id:57038|Hs108|chr6 (578 aa)
 initn: 628 init1: 297 opt: 666 Z-score: 774.3 bits: 153.4 E(32554): 8.2e-37
Smith-Waterman score: 729; 29.3% identity (60.3% similar) in 580 aa overlap (102-660:31-578)

              80        90       100       110       120       130
pF1KB6
NMINIISRLQEVFGHAIKAAYPDLENPPLLVTPSQQAKFGDYQCNSAMGISQMLKTKEQK : ::. . .:.: ......:. ... CCDS50 MACGFRRAIACQLSRVLNLPPENLITSISAVPISQKEEVADFQ----LSVDSLLEKDNDH 10 20 30 40 50 140 150 160 170 180 pF1KB6 VNP--REIAENITKHLPDNECIEKVEIAGPGFINVHLRKDFVSEQ-LTSLLVNGVQLPAL : . :. ....: . . .. .: .: .. ...... : ... .: . CCDS50 SRPDIQVQAKRLAEKLRCDTVVSEIS-TGQRTVNFKINRELLTKTVLQQVIEDGSKYGLK 60 70 80 90 100 110 190 200 210 220 230 240 pF1KB6 GE------NKKVIVDFSSPNIAKEMHVGHLRSTIIGESISRLFEFAGYDVLRLNHVGDWG .: .::..:.:::::.::..:::::::::::. :. : : :..:.:.:..:::: CCDS50 SELFSGLPQKKIVVEFSSPNVAKKFHVGHLRSTIIGNFIANLKEALGHQVIRINYLGDWG 120 130 140 150 160 170 250 260 270 280 290 300 pF1KB6 TQFGMLIAHLQD-KFPDYLTVSPPIGDLQVFYKESKKRFDTEEEFKKRAYQCVVLLQGKN :::.: . .: . . : .: ..:. . .:. : .. : : . :. . CCDS50 MQFGLLGTGFQLFGYEEKLQSNPLQHLFEVYVQVNKEAAD-DKSVAKAAQEFFQRLELGD 180 190 200 210 220 230 310 320 330 340 350 pF1KB6 PDITKAWKLICDVSRQELNKIYDALDVSLIER-GESFYQDRMNDIVKEFEDRGFV-QVDD . . :. . :.: .: ..: : : . : :::::... ....: .:..:.. .. CCDS50 VQALSLWQKFRDLSIEEYIRVYKRLGVYFDEYSGESFYREKSQEVLKLLESKGLLLKTIK 240 250 260 270 280 290 360 370 380 390 400 410 pF1KB6 GRKIVFVPGCSIP---LTIVKSDGGYTYDTSDLAAIKQRLFEEKADMIIYVVDNGQSVHF : .: . : . : :...::: : : :::: .:. . . : .:::.:.::. :: CCDS50 GTAVVDLSGNGDPSSICTVMRSDGTSLYATRDLAAAIDRMDKYNFDTMIYVTDKGQKKHF 300 310 320 330 340 350 420 430 440 450 460 470 pF1KB6 QTIFAAAQMIGWYDPKVTRVFHAGFGVVLGEDKKKFKTRSGETVRLMDLLGEGLKRSMDK : .: ...: :: . : :. :::: : .::: :... : :.:.: : ... CCDS50 QQVFQMLKIMG-YD-WAERCQHVPFGVVQG-----MKTRRGDVTFLEDVLNEIQLRMLQN 360 370 380 390 400 480 490 500 510 520 530 pF1KB6 LKEKERDKVLTAEELNAAQTSVAYGYIKYADLSHNRLNDYIFSFDKMLDDRGNTAAYLLY . . : : . .: ....: :. :.. :.:: ::.:.....::.:...: : CCDS50 MASIKTTKELKNPQETAERVGLAALIIQ--DFKGLLLSDYKFSWDRVFQSRGDTGVFLQY 410 420 430 440 450 460 540 550 560 570 580 590 pF1KB6 AFTRIRSIAR------LANIDEEMLQKAARETKILLDHEKEWKLGRCILRFPEILQKILD . .:..:. . : ... ::. .. .:.: .::: :.: : . 
CCDS50 THARLHSLEETFGCGYLNDFNTACLQEP--QSVSILQH---------LLRFDEVLYKSSQ 470 480 490 500 510 600 610 620 630 640 650 pF1KB6 DLFLHTLCDYIYELATAFTEFYDSCYCVEKDRQTGKILKVNMWRMLLCEAVAAVMAKGFD :. . . .:. :. . . . :: .: :. : .:: .:.:.:. CCDS50 DFQPRHIVSYLLTLSHLAAVAHKTLQI--KDSPP----EVAGARLHLFKAVRSVLANGMK 520 530 540 550 560 660 pF1KB6 ILGIKPVQRM .::: :: :: CCDS50 LLGITPVCRM 570

660 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 13:45:42 2016 done: Sat Nov 5 13:45:43 2016
 Total Scan time: 3.460 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]
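
The per-hit statistics in this report (the ">>" header, the "initn: ... E(32554): ..." line, and the "Smith-Waterman score" line) follow a regular pattern, so they can be pulled out with a regular expression. The following is a minimal Python sketch, not part of FASTA itself: the function name parse_hits and the dictionary keys are hypothetical, and the pattern is only assumed to fit the layout seen in this report, whose two hit records are embedded as sample text.

```python
import re

# The two hit records, copied from the report above.
REPORT = """\
>>CCDS4367.1 RARS gene_id:5917|Hs108|chr5 (660 aa)
 initn: 4331 init1: 4331 opt: 4331 Z-score: 5040.9 bits: 943.0 E(32554): 0
Smith-Waterman score: 4331; 99.8% identity (99.8% similar) in 660 aa overlap (1-660:1-660)
>>CCDS5011.1 RARS2 gene_id:57038|Hs108|chr6 (578 aa)
 initn: 628 init1: 297 opt: 666 Z-score: 774.3 bits: 153.4 E(32554): 8.2e-37
Smith-Waterman score: 729; 29.3% identity (60.3% similar) in 580 aa overlap (102-660:31-578)
"""

# Assumed layout of one hit record; whitespace between fields is flexible,
# so the pattern tolerates both line-broken and flattened text.
HIT_RE = re.compile(
    r">>(?P<acc>[^\s>]+)\s+(?P<desc>[^>\n]*?)\s+\((?P<length>\d+) aa\)\s+"
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<zscore>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\(\d+\):\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Smith-Waterman score:\s*(?P<sw>\d+);\s*(?P<identity>[\d.]+)% identity\s+"
    r"\((?P<similar>[\d.]+)% similar\) in (?P<overlap>\d+) aa overlap\s+"
    r"\((?P<qs>\d+)-(?P<qe>\d+):(?P<ss>\d+)-(?P<se>\d+)\)"
)


def parse_hits(text):
    """Return one dict per hit record found in `text` (hypothetical helper)."""
    hits = []
    for m in HIT_RE.finditer(text):
        hits.append({
            "acc": m["acc"],
            "desc": m["desc"],
            "length": int(m["length"]),
            "opt": int(m["opt"]),
            "zscore": float(m["zscore"]),
            "bits": float(m["bits"]),
            "evalue": float(m["evalue"]),
            "sw_score": int(m["sw"]),
            "identity": float(m["identity"]),
            "similar": float(m["similar"]),
            "overlap": int(m["overlap"]),
            "query_range": (int(m["qs"]), int(m["qe"])),
            "subject_range": (int(m["ss"]), int(m["se"])),
        })
    return hits


if __name__ == "__main__":
    for hit in parse_hits(REPORT):
        print(hit["acc"], hit["bits"], hit["evalue"], hit["identity"])
```

Matching on field labels rather than on column positions is a deliberate choice here, since the column spacing of a report like this one does not always survive copying.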