FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA0625, 2677 aa
  1>>>pF1KA0625 2677 - 2677 aa - 2677 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.1900+/-0.00121; mu= 13.4272+/- 0.073
 mean_var=158.0678+/-31.404, 0's: 0 Z-trim(106.9): 27 B-trim: 40 in 1/49
 Lambda= 0.102012
 statistics sampled from 9229 (9245) to 9229 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.613), E-opt: 0.2 (0.284), width: 16
 Scan time: 6.730

The best scores are:                                  opt   bits E(32554)
CCDS6947.1 SETX gene_id:23064|Hs108|chr9     (2677) 17830 2638.0        0
CCDS12386.1 UPF1 gene_id:5976|Hs108|chr19    (1118)   450   80.0  8.3e-14
CCDS74315.1 UPF1 gene_id:5976|Hs108|chr19    (1129)   450   80.0  8.4e-14
CCDS8187.1 IGHMBP2 gene_id:3508|Hs108|chr11  ( 993)   385   70.4  5.7e-11

>>CCDS6947.1 SETX gene_id:23064|Hs108|chr9 (2677 aa)
 initn: 17830 init1: 17830 opt: 17830 Z-score: 14179.5 bits: 2638.0 E(32554): 0
Smith-Waterman score: 17830; 99.9% identity (100.0% similar) in 2677 aa overlap (1-2677:1-2677)

               10        20        30        40        50        60
pF1KA0 MSTCCWCTPGGASTIDFLKRYASNTPSGEFQTADEDLCYCLECVAEYHKARDELPFLHEV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MSTCCWCTPGGASTIDFLKRYASNTPSGEFQTADEDLCYCLECVAEYHKARDELPFLHEV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA0 LWELETLRLINHFEKSMKAEIGDDDELYIVDNNGEMPLFDITGQDFENKLRVPLLEILKY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LWELETLRLINHFEKSMKAEIGDDDELYIVDNNGEMPLFDITGQDFENKLRVPLLEILKY
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA0 PYLLLHERVNELCVEALCRMEQANCSFQVFDKHPGIYLFLVHPNEMVRRWAILTARNLGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 PYLLLHERVNELCVEALCRMEQANCSFQVFDKHPGIYLFLVHPNEMVRRWAILTARNLGK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA0 VDRDDYYDLQEVLLCLFKVIELGLLESPDIYTSSVLEKGKLILLPSHMYDTTNYKSYWLG
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 VDRDDYYDLQEVLLCLFKVIELGLLESPDIYTSSVLEKGKLILLPSHMYDTTNYKSYWLG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KA0 ICMLLTILEEQAMDSLLLGSDKQNDFMQSILHTMEREADDDSVDPFWPALHCFMVILDRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 ICMLLTILEEQAMDSLLLGSDKQNDFMQSILHTMEREADDDSVDPFWPALHCFMVILDRL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KA0 GSKVWGQLMDPIVAFQTIINNASYNREIRHIRNSSVRTKLEPESYLDDMVTCSQIVYNYN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 GSKVWGQLMDPIVAFQTIINNASYNREIRHIRNSSVRTKLEPESYLDDMVTCSQIVYNYN 310 320 330 340 350 360 370 380 390 400 410 420 pF1KA0 PEKTKKDSGWRTAICPDYCPNMYEEMETLASVLQSDIGQDMRVHNSTFLWFIPFVQSLMD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 PEKTKKDSGWRTAICPDYCPNMYEEMETLASVLQSDIGQDMRVHNSTFLWFIPFVQSLMD 370 380 390 400 410 420 430 440 450 460 470 480 pF1KA0 LKDLGVAYIAQVVNHLYSEVKEVLNQTDAVCDKVTEFFLLILVSVIELHRNKKCLHLLWV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 LKDLGVAYIAQVVNHLYSEVKEVLNQTDAVCDKVTEFFLLILVSVIELHRNKKCLHLLWV 430 440 450 460 470 480 490 500 510 520 530 540 pF1KA0 SSQQWVEAVVKCAKLPTTAFTRSSEKSSGNCSKGTAMISSLSLHSMPSNSVQLAYVQLIR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 SSQQWVEAVVKCAKLPTTAFTRSSEKSSGNCSKGTAMISSLSLHSMPSNSVQLAYVQLIR 490 500 510 520 530 540 550 560 570 580 590 600 pF1KA0 SLLKEGYQLGQQSLCKRFWDKLNLFLRGNLSLGWQLTSQETHELQSCLKQIIRNIKFKAP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 SLLKEGYQLGQQSLCKRFWDKLNLFLRGNLSLGWQLTSQETHELQSCLKQIIRNIKFKAP 550 560 570 580 590 600 610 620 630 640 650 660 pF1KA0 PCNTFVDLTSACKISPASYNKEESEQMGKTSRKDMHCLEASSPTFSKEPMKVQDSVLIKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 PCNTFVDLTSACKISPASYNKEESEQMGKTSRKDMHCLEASSPTFSKEPMKVQDSVLIKA 610 620 630 640 650 660 670 680 690 700 710 720 pF1KA0 DNTIEGDNNEQNYIKDVKLEDHLLAGSCLKQSSKNIFTERAEDQIKISTRKQKSVKEISS 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 DNTIEGDNNEQNYIKDVKLEDHLLAGSCLKQSSKNIFTERAEDQIKISTRKQKSVKEISS 670 680 690 700 710 720 730 740 750 760 770 780 pF1KA0 YTPKDCTSRNGPERGCDRGIIVSTRLLTDSSTDALEKVSTSNEDFSLKDDALAKTSKRKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 YTPKDCTSRNGPERGCDRGIIVSTRLLTDSSTDALEKVSTSNEDFSLKDDALAKTSKRKT 730 740 750 760 770 780 790 800 810 820 830 840 pF1KA0 KVQKDEICAKLSHVIKKQHRKSTLVDNTINLDENLTVSNIESFYSRKDTGVQKGDGFIHN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 KVQKDEICAKLSHVIKKQHRKSTLVDNTINLDENLTVSNIESFYSRKDTGVQKGDGFIHN 790 800 810 820 830 840 850 860 870 880 890 900 pF1KA0 LSLDPSGVLDDKNGEQKSQNNVLPKEKQLKNEELVIFSFHENNCKIQEFHVDGKGLIPFT :::::::::::::::::::::::::::::::::::::::::::::::::::::: ::::: CCDS69 LSLDPSGVLDDKNGEQKSQNNVLPKEKQLKNEELVIFSFHENNCKIQEFHVDGKELIPFT 850 860 870 880 890 900 910 920 930 940 950 960 pF1KA0 EMTNASEKKSSPFKDLMTVPESRDEEMSNSTSVIYSNLTREQAPDISPKSDTLTDSQIDR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 EMTNASEKKSSPFKDLMTVPESRDEEMSNSTSVIYSNLTREQAPDISPKSDTLTDSQIDR 910 920 930 940 950 960 970 980 990 1000 1010 1020 pF1KA0 DLHKLSLLAQASVITFPSDSPQNSSQLQRKVKEDKRCFTANQNNVGDTSRGQVIIISDSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 DLHKLSLLAQASVITFPSDSPQNSSQLQRKVKEDKRCFTANQNNVGDTSRGQVIIISDSD 970 980 990 1000 1010 1020 1030 1040 1050 1060 1070 1080 pF1KA0 DDDDERILSLEKLTKQDKICLEREHPEQHVSTVNSKEEKNPVKEEKTETLFQFEESDSQC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 DDDDERILSLEKLTKQDKICLEREHPEQHVSTVNSKEEKNPVKEEKTETLFQFEESDSQC 1030 1040 1050 1060 1070 1080 1090 1100 1110 1120 1130 1140 pF1KA0 FEFESSSEVFSVWQDHPDDNNSVQDGEKKCLAPIANTTNGQGCTDYVSEVVKKGAEGIEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 FEFESSSEVFSVWQDHPDDNNSVQDGEKKCLAPIANTTNGQGCTDYVSEVVKKGAEGIEE 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 pF1KA0 HTRPRSISVEEFCEIEVKKPKRKRSEKPMAEDPVRPSSSVRNEGQSDTNKRDLVGNDFKS 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 HTRPRSISVEEFCEIEVKKPKRKRSEKPMAEDPVRPSSSVRNEGQSDTNKRDLVGNDFKS 1150 1160 1170 1180 1190 1200 1210 1220 1230 1240 1250 1260 pF1KA0 IDRRTSTPNSRIQRATTVSQKKSSKLCTCTEPIRKVPVSKTPKKTHSDAKKGQNRSSNYL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 IDRRTSTPNSRIQRATTVSQKKSSKLCTCTEPIRKVPVSKTPKKTHSDAKKGQNRSSNYL 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 1320 pF1KA0 SCRTTPAIVPPKKFRECPEPTSTAEKLGLKKGPRKAYELSQRSLDYVAQLRDHGKTVGVV :::::::::::::::.:::::::::::::::::::::::::::::::::::::::::::: CCDS69 SCRTTPAIVPPKKFRQCPEPTSTAEKLGLKKGPRKAYELSQRSLDYVAQLRDHGKTVGVV 1270 1280 1290 1300 1310 1320 1330 1340 1350 1360 1370 1380 pF1KA0 DTRKKTKLISPQNLSVRNNKKLLTSQELQMQRQIRPKSQKNRRRLSDCESTDVKRAGSHT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 DTRKKTKLISPQNLSVRNNKKLLTSQELQMQRQIRPKSQKNRRRLSDCESTDVKRAGSHT 1330 1340 1350 1360 1370 1380 1390 1400 1410 1420 1430 1440 pF1KA0 AQNSDIFVPESDRSDYNCTGGTEVLANSNRKQLIKCMPSEPETIKAKHGSPATDDACPLN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 AQNSDIFVPESDRSDYNCTGGTEVLANSNRKQLIKCMPSEPETIKAKHGSPATDDACPLN 1390 1400 1410 1420 1430 1440 1450 1460 1470 1480 1490 1500 pF1KA0 QCDSVVLNGTVPTNEVIVSTSEDPLGGGDPTARHIEMAALKEGEPDSSSDAEEDNLFLTQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 QCDSVVLNGTVPTNEVIVSTSEDPLGGGDPTARHIEMAALKEGEPDSSSDAEEDNLFLTQ 1450 1460 1470 1480 1490 1500 1510 1520 1530 1540 1550 1560 pF1KA0 NDPEDMDLCSQMENDNYKLIELIHGKDTVEVEEDSVSRPQLESLSGTKCKYKDCLETTKN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 NDPEDMDLCSQMENDNYKLIELIHGKDTVEVEEDSVSRPQLESLSGTKCKYKDCLETTKN 1510 1520 1530 1540 1550 1560 1570 1580 1590 1600 1610 1620 pF1KA0 QGEYCPKHSEVKAADEDVFRKPGLPPPASKPLRPTTKIFSSKSTSRIAGLSKSLETSSAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 QGEYCPKHSEVKAADEDVFRKPGLPPPASKPLRPTTKIFSSKSTSRIAGLSKSLETSSAL 1570 1580 1590 1600 1610 1620 1630 1640 1650 1660 1670 1680 pF1KA0 
SPSLKNKSKGIQSILKVPQPVPLIAQKPVGEMKNSCNVLHPQSPNNSNRQGCKVPFGESK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 SPSLKNKSKGIQSILKVPQPVPLIAQKPVGEMKNSCNVLHPQSPNNSNRQGCKVPFGESK 1630 1640 1650 1660 1670 1680 1690 1700 1710 1720 1730 1740 pF1KA0 YFPSSSPVNILLSSQSVSDTFVKEVLKWKYEMFLNFGQCGPPASLCQSISRPVPVRFHNY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 YFPSSSPVNILLSSQSVSDTFVKEVLKWKYEMFLNFGQCGPPASLCQSISRPVPVRFHNY 1690 1700 1710 1720 1730 1740 1750 1760 1770 1780 1790 1800 pF1KA0 GDYFNVFFPLMVLNTFETVAQEWLNSPNRENFYQLQVRKFPADYIKYWEFAVYLEECELA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 GDYFNVFFPLMVLNTFETVAQEWLNSPNRENFYQLQVRKFPADYIKYWEFAVYLEECELA 1750 1760 1770 1780 1790 1800 1810 1820 1830 1840 1850 1860 pF1KA0 KQLYPKENDLVFLAPERINEEKKDTERNDIQDLHEYHSGYVHKFRRTSVMRNGKTECYLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 KQLYPKENDLVFLAPERINEEKKDTERNDIQDLHEYHSGYVHKFRRTSVMRNGKTECYLS 1810 1820 1830 1840 1850 1860 1870 1880 1890 1900 1910 1920 pF1KA0 IQTQENFPANLNELVNCIVISSLVTTQRKLKAMSLLGSRNQLARAVLNPNPMDFCTKDLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 IQTQENFPANLNELVNCIVISSLVTTQRKLKAMSLLGSRNQLARAVLNPNPMDFCTKDLL 1870 1880 1890 1900 1910 1920 1930 1940 1950 1960 1970 1980 pF1KA0 TTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 TTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRL 1930 1940 1950 1960 1970 1980 1990 2000 2010 2020 2030 2040 pF1KA0 LTENQRKGHSDENSNAKIKQNRVLVCAPSNAAVDELMKKIILEFKEKCKDKKNPLGNCGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 LTENQRKGHSDENSNAKIKQNRVLVCAPSNAAVDELMKKIILEFKEKCKDKKNPLGNCGD 1990 2000 2010 2020 2030 2040 2050 2060 2070 2080 2090 2100 pF1KA0 INLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAMHKRKEFLDYQLDELSRQRAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 INLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAMHKRKEFLDYQLDELSRQRAL 2050 2060 
2070 2080 2090 2100 2110 2120 2130 2140 2150 2160 pF1KA0 CRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKTQSIIILESHIICCTLSTSGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 CRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKTQSIIILESHIICCTLSTSGG 2110 2120 2130 2140 2150 2160 2170 2180 2190 2200 2210 2220 pF1KA0 LLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCNKLILVGDPKQLPPTVISMKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 LLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCNKLILVGDPKQLPPTVISMKA 2170 2180 2190 2200 2210 2220 2230 2240 2250 2260 2270 2280 pF1KA0 QEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRMHPDICLFPSNYVYNRNLKTN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 QEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRMHPDICLFPSNYVYNRNLKTN 2230 2240 2250 2260 2270 2280 2290 2300 2310 2320 2330 2340 pF1KA0 RQTEAIRCSSDWPFQPYLVFDVGDGSERRDNDSYINVQEIKLVMEIIKLIKDKRKDVSFR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 RQTEAIRCSSDWPFQPYLVFDVGDGSERRDNDSYINVQEIKLVMEIIKLIKDKRKDVSFR 2290 2300 2310 2320 2330 2340 2350 2360 2370 2380 2390 2400 pF1KA0 NIGIITHYKAQKTMIQKDLDKEFDRKGPAEVDTVDAFQGRQKDCVIVTCVRANSIQGSIG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 NIGIITHYKAQKTMIQKDLDKEFDRKGPAEVDTVDAFQGRQKDCVIVTCVRANSIQGSIG 2350 2360 2370 2380 2390 2400 2410 2420 2430 2440 2450 2460 pF1KA0 FLASLQRLNVTITRAKYSLFILGHLRTLMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 FLASLQRLNVTITRAKYSLFILGHLRTLMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAV 2410 2420 2430 2440 2450 2460 2470 2480 2490 2500 2510 2520 pF1KA0 KILKLKPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGFAKTSVAASLYHTPSDSKEITL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 KILKLKPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGFAKTSVAASLYHTPSDSKEITL 2470 2480 2490 2500 2510 2520 2530 2540 2550 2560 2570 2580 pF1KA0 TVTSKDPERPPVHDQLQDPRLLKRMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 
TVTSKDPERPPVHDQLQDPRLLKRMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVV 2530 2540 2550 2560 2570 2580 2590 2600 2610 2620 2630 2640 pF1KA0 HQDLSHIQQPAAVVAALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 HQDLSHIQQPAAVVAALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSE 2590 2600 2610 2620 2630 2640 2650 2660 2670 pF1KA0 GEQEKCGSETHHTRRNSRWDKRTLEQEDSSSKKRKLL ::::::::::::::::::::::::::::::::::::: CCDS69 GEQEKCGSETHHTRRNSRWDKRTLEQEDSSSKKRKLL 2650 2660 2670 >>CCDS12386.1 UPF1 gene_id:5976|Hs108|chr19 (1118 aa) initn: 581 init1: 135 opt: 450 Z-score: 361.3 bits: 80.0 E(32554): 8.3e-14 Smith-Waterman score: 629; 26.8% identity (54.1% similar) in 867 aa overlap (1733-2578:292-1011) 1710 1720 1730 1740 1750 1760 pF1KA0 KEVLKWKYEMFLNFGQCGPPASLCQSISRPVPVRFHNYGDYFNVFFPLMVLNTFETVAQE : .:... .: :.: ::. :.. .. CCDS12 INKLEELWKENPSATLEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEA--DYDKK 270 280 290 300 310 1770 1780 1790 1800 1810 1820 pF1KA0 WLNSPNRENFYQLQVRKFPADYIKYWEFAVYLEECELAKQLYPK-ENDLVFLAPERINEE .: ...: . :: :... :.. ..: :: ..:. .. ..: . CCDS12 LKESQTQDN---ITVR---------WDLG--LNKKRIAYFTLPKTDSDMRLMQGDEICLR 320 330 340 350 360 1830 1840 1850 1860 1870 1880 pF1KA0 KKDTERNDIQDLHEYHSGYVHKFRRTSVMRNGKTECYLSIQTQENFPANLNELVNCIVIS : :: .: : .. : : : . .... . :..... . . CCDS12 YKG-------DLAPLWKGIGHVIK---VPDNYGDEIAIELRSSVGAPVEVTHNFQVDFVW 370 380 390 400 410 1890 1900 1910 1920 1930 pF1KA0 SLVTTQRKLKAMSLLGSRNQLARAVLNPNPMDFCTKDLLTTTS--ERIIAY-LRDFNEDQ . .. .: .:.. .. . . . . . . ..:.. . .:. : : :.:..: CCDS12 KSTSFDRMQSALKTFAVDETSVSGYIYHKLLGHEVEDVIIKCQLPKRFTAQGLPDLNHSQ 420 430 440 450 460 470 1940 1950 1960 1970 1980 1990 pF1KA0 KKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRLLTENQRKGHSDENSNAKI :..: : . : . ::.:::::::. : . ..:.: :.:.. CCDS12 VYAVKT----VLQRP----LSLIQGPPGTGKTVTSATIVYHL----ARQGNGP------- 480 490 500 510 2000 2010 2020 2030 2040 2050 pF1KA0 KQNRVLVCAPSNAAVDELMKKIILEFKEKCKDKKNPLGNCGDINLVRLGPEKSINSEVLK :::::::: :::.: .:: . : ...::: :: . 
CCDS12 ----VLVCAPSNIAVDQLTEKI------------HQTG----LKVVRLCA-KS------R 520 530 540 2060 2070 2080 2090 2100 2110 pF1KA0 FSLDSQVNHRMKKELPSHVQAMHKRKEFLDYQLDELSRQRALCRGGREIQRQELDENISK ..:: : :.:.. . .: .. ::.. . : ..: : .:. CCDS12 EAIDS----------PVSFLALHNQIRNMD-SMPELQKLQQL--------KDETGE-LSS 550 560 570 580 2120 2130 2140 2150 2160 2170 pF1KA0 VSKERQELASKIKEVQGRPQKTQSIIILESHIICCTLSTSGGLLLESAFRGQGGVPFSCV ....: . .. : . ..... .:::: .: : . . : . CCDS12 ADEKRYRALKRTAERE---------LLMNADVICCTCVGAGDPRLAK-------MQFRSI 590 600 610 620 630 2180 2190 2200 2210 2220 2230 pF1KA0 IVDEAGQSCEIETLTPLIHRCNKLILVGDPKQLPPTVISMKAQEYGYDQSMMARFCRLLE ..::. :. : : ..:.. ..:::::: :: :.:. :: . : .::. : ::. CCDS12 LIDESTQATEPECMVPVVLGAKQLILVGDHCQLGPVVMCKKAAKAGLSQSL---FERLVV 640 650 660 670 680 690 2240 2250 2260 2270 2280 2290 pF1KA0 ENVEHNMISRLPILQLTVQYRMHPDICLFPSNYVYNRNLKTNRQTEAIRCSS----DWPF ... :: .: ::::::: . :::: :. .:. : : : : .. .:: CCDS12 LGIR-------PI-RLQVQYRMHPALSAFPSNIFYEGSLQ-NGVTAADRVKKGFDFQWP- 700 710 720 730 740 2300 2310 2320 2330 2340 pF1KA0 QPY--LVFDVGDGSER--RDNDSYINVQEIKLVMEII-KLIKDKRKDVSFRNIGIITHYK :: . : : .:.:. .. ::.: : : .: ::.: : .::::: :. CCDS12 QPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKLLKAGAKP---DQIGIITPYE 750 760 770 780 790 2350 2360 2370 2380 2390 2400 pF1KA0 AQKT-MIQK-DLDKEFDRKGPAEVD--TVDAFQGRQKDCVIVTCVRANSIQGSIGFLASL .:.. ..: ... . : ::. .:::::::.:: .:..::::: :: :::: . CCDS12 GQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILSCVRANEHQG-IGFLNDP 800 810 820 830 840 850 2410 2420 2430 2440 2450 2460 pF1KA0 QRLNVTITRAKYSLFILGHLRTLMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAVKILKL .::::..:::.:...:.:. ..: .. ::.:.. ... .... .: :.. ... CCDS12 RRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKVLVEGPLNNLRESLMQFS-- 860 870 880 890 900 910 2470 2480 2490 2500 2510 2520 pF1KA0 KPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGFAKTSVAASLYHTPSDSKEITLTVTSK :: :.:.. :: : :.: . .. :. : . .:.: :... .. . 
CCDS12 KP---RKLVN--TINP-GAR---FMTTAMYDAREAI--IPGSVYDRSSQGRPSSMYFQT- 920 930 940 950 960 2530 2540 2550 2560 2570 2580 pF1KA0 DPERPPVHDQL----QDPRLLKRMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVVH :::. : . :.: . .. . :.: :. :.. : : CCDS12 -------HDQIGMISAGPSHVAAMNIPIPFNLVM-PPMPPPGYFGQANGPAAGRGTPKGK 970 980 990 1000 1010 2590 2600 2610 2620 2630 2640 pF1KA0 QDLSHIQQPAAVVAALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSEG CCDS12 TGRGGRQKNRFGLPGPSQTNLPNSQASQDVASQPFSQGALTQGYISMSQPSQMSQPGLSQ 1020 1030 1040 1050 1060 1070 >>CCDS74315.1 UPF1 gene_id:5976|Hs108|chr19 (1129 aa) initn: 581 init1: 135 opt: 450 Z-score: 361.3 bits: 80.0 E(32554): 8.4e-14 Smith-Waterman score: 626; 29.8% identity (55.3% similar) in 665 aa overlap (1931-2578:479-1022) 1910 1920 1930 1940 1950 1960 pF1KA0 QLARAVLNPNPMDFCTKDLLTTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICL : :.:..: :..: : . : . : CCDS74 SGYIYHKLLGHEVEDVIIKCQLPKRFTAQGLPDLNHSQVYAVKT----VLQRP----LSL 450 460 470 480 490 500 1970 1980 1990 2000 2010 2020 pF1KA0 IHGPPGTGKSKTIVGLLYRLLTENQRKGHSDENSNAKIKQNRVLVCAPSNAAVDELMKKI :.:::::::. : . ..:.: :.:.. :::::::: :::.: .:: CCDS74 IQGPPGTGKTVTSATIVYHL----ARQGNGP-----------VLVCAPSNIAVDQLTEKI 510 520 530 540 2030 2040 2050 2060 2070 2080 pF1KA0 ILEFKEKCKDKKNPLGNCGDINLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAM . : ...::: :: . ..:: : :. CCDS74 ------------HQTG----LKVVRLCA-KS------REAIDS----------PVSFLAL 550 560 570 2090 2100 2110 2120 2130 2140 pF1KA0 HKRKEFLDYQLDELSRQRALCRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKT :.. . .: .. ::.. . : ..: : .:.....: . .. : . CCDS74 HNQIRNMD-SMPELQKLQQL--------KDETGE-LSSADEKRYRALKRTAERE------ 580 590 600 610 2150 2160 2170 2180 2190 2200 pF1KA0 QSIIILESHIICCTLSTSGGLLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCN ..... .:::: .: : . . : ...::. :. : : ..:.. . CCDS74 ---LLMNADVICCTCVGAGDPRLAK-------MQFRSILIDESTQATEPECMVPVVLGAK 620 630 640 650 660 2210 2220 2230 2240 2250 2260 pF1KA0 KLILVGDPKQLPPTVISMKAQEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRM .:::::: :: :.:. :: . : .::. : ::. ... 
:: .: ::::: CCDS74 QLILVGDHCQLGPVVMCKKAAKAGLSQSL---FERLVVLGIR-------PI-RLQVQYRM 670 680 690 700 710 2270 2280 2290 2300 2310 pF1KA0 HPDICLFPSNYVYNRNLKTNRQTEAIRCSS----DWPFQPY--LVFDVGDGSER--RDND :: . :::: :. .:. : : : : .. .:: :: . : : .:.:. .. CCDS74 HPALSAFPSNIFYEGSLQ-NGVTAADRVKKGFDFQWP-QPDKPMFFYVTQGQEEIASSGT 720 730 740 750 760 770 2320 2330 2340 2350 2360 pF1KA0 SYINVQEIKLVMEII-KLIKDKRKDVSFRNIGIITHYKAQKT-MIQK-DLDKEFDRKGPA ::.: : : .: ::.: : .::::: :..:.. ..: ... . : CCDS74 SYLNRTEAANVEKITTKLLKAGAKP---DQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQ 780 790 800 810 820 830 2370 2380 2390 2400 2410 2420 pF1KA0 EVD--TVDAFQGRQKDCVIVTCVRANSIQGSIGFLASLQRLNVTITRAKYSLFILGHLRT ::. .:::::::.:: .:..::::: :: :::: . .::::..:::.:...:.:. .. CCDS74 EVEIASVDAFQGREKDFIILSCVRANEHQG-IGFLNDPRRLNVALTRARYGVIIVGNPKA 840 850 860 870 880 2430 2440 2450 2460 2470 2480 pF1KA0 LMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAVKILKLKPVLQRSLTHPPTIAPEGSRPQ : .. ::.:.. ... .... .: :.. ... :: :.:.. :: : :.: CCDS74 LSKQPLWNHLLNYYKEQKVLVEGPLNNLRESLMQFS--KP---RKLVN--TINP-GARF- 890 900 910 920 930 940 2490 2500 2510 2520 2530 2540 pF1KA0 GGLPSSKLDSGFAKTSVAASLYHTPSDSKEITLTVTSKDPERPPVHDQL----QDPRLLK . .. :. .. . .:.: :... .. . :::. : . CCDS74 --MTTAMYDA--REAIIPGSVYDRSSQGRPSSMYFQT--------HDQIGMISAGPSHVA 950 960 970 980 2550 2560 2570 2580 2590 2600 pF1KA0 RMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVVHQDLSHIQQPAAVVAALSSHKPP :.: . .. . :.: :. :.. 
: : CCDS74 AMNIPIPFNLVM-PPMPPPGYFGQANGPAAGRGTPKGKTGRGGRQKNRFGLPGPSQTNLP 990 1000 1010 1020 1030 1040 2610 2620 2630 2640 2650 2660 pF1KA0 VRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSEGEQEKCGSETHHTRRNSRWDKRT CCDS74 NSQASQDVASQPFSQGALTQGYISMSQPSQMSQPGLSQPELSQDSYLGDEFKSQIDVALS 1050 1060 1070 1080 1090 1100 >>CCDS8187.1 IGHMBP2 gene_id:3508|Hs108|chr11 (993 aa) initn: 404 init1: 91 opt: 385 Z-score: 310.4 bits: 70.4 E(32554): 5.7e-11 Smith-Waterman score: 539; 26.0% identity (52.1% similar) in 768 aa overlap (1934-2673:192-860) 1910 1920 1930 1940 1950 1960 pF1KA0 RAVLNPNPMDFCTKDLLTTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICLIHG .. .::.:. .:. : .. .::: CCDS81 PASSLIEVLFGRSAPSPASEIHPLTFFNTCLDTSQKEAV--LFAL-----SQKELAIIHG 170 180 190 200 210 1970 1980 1990 2000 2010 2020 pF1KA0 PPGTGKSKTIVGLLYRLLTENQRKGHSDENSNAKIKQN-RVLVCAPSNAAVDELMKKIIL ::::::. :.: .. . .::. .:: ::::: :::.:.... : CCDS81 PPGTGKTTTVVEIILQ-----------------AVKQGLKVLCCAPSNIAVDNLVERLAL 220 230 240 250 2030 2040 2050 2060 2070 2080 pF1KA0 EFKEKCKDKKNPLGNCGDINLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAMHK ::.. ::. :.. ..:... :: : ..:. ..:.. :.. : CCDS81 -----CKQRILRLGH--PARLLESIQQHSLDA-VLARSDSAQIVADIRKDID---QVFVK 260 270 280 290 300 2090 2100 2110 2120 2130 2140 pF1KA0 RKEFLDYQLDELSRQRALCRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKTQS :. : .:... :. .. :.:: :::.: :. .. . :.. CCDS81 NKKTQD------KREKSNFRNEIKLLRKEL--------KEREE-AAMLESL------TSA 310 320 330 340 2150 2160 2170 2180 2190 2200 pF1KA0 IIILESHIICCTLSTSGGL-LLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCNK ..: .. :..: : :: .. :. :..:: .:. : ::. . : CCDS81 NVVLATNT---GASADGPLKLLPESY-------FDVVVIDECAQALEASCWIPLL-KARK 350 360 370 380 390 2210 2220 2230 2240 2250 2260 pF1KA0 LILVGDPKQLPPTVISMKAQEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRMH ::.:: ::::::..: :: : . :.: :: :: . .:. . :::::::: CCDS81 CILAGDHKQLPPTTVSHKAALAGLSLSLME---RLAEE-----YGARV-VRTLTVQYRMH 400 410 420 430 440 2270 2280 2290 2300 pF1KA0 PDICLFPSNYVYNRNLKTNRQTEAIRCSSDWPFQ--------PYLVFDV-GDGS---ERR : . :. .: .: : ... : . : : : :. :. 
: : :.. CCDS81 QAIMRWASDTMYLGQL-TAHSSVARHLLRDLPGVAATEETGVPLLLVDTAGCGLFELEEE 450 460 470 480 490 500 2310 2320 2330 2340 2350 2360 pF1KA0 DNDSYINVQEIKLVMEIIKLIKDKRKDVSFRNIGIITHYKAQKTMIQKDLDKEFDRKGPA :..: : :..:: :. . : : :.:.... :. : .....: :. CCDS81 DEQSKGNPGEVRLVSLHIQALVD--AGVPARDIAVVSPYNLQVDLLRQSL---VHRHPEL 510 520 530 540 550 2370 2380 2390 2400 2410 2420 pF1KA0 EVDTVDAFQGRQKDCVIVTCVRANSIQGSIGFLASLQRLNVTITRAKYSLFILGHLRTLM :. .::.::::.:. ::.. ::.: .: .:::: .:.::..:::. . .. ::. CCDS81 EIKSVDGFQGREKEAVILSFVRSNR-KGEVGFLAEDRRINVAVTRARRHVAVICDSRTVN 560 570 580 590 600 610 2430 2440 2450 2460 2470 2480 pF1KA0 ENQHWNQLIQDAQKRGAIIKTCDKNYRHDAVKILKLKPVLQRSLTHPPTIAPEGSRPQGG .. . :.. ..: . . . : : : . : : .: : .::: CCDS81 NHAFLKTLVEYFTQHGEVRTAFE--YLDDIVPENYSHENSQGS-SHAAT------KPQGP 620 630 640 650 660 2490 2500 2510 2520 2530 2540 pF1KA0 LPSSKLDSGFAKTSVAASLYHTPSDSKEITLTVTSKDPERPPVHDQLQDPRLLKRMGIEV :.. : . . :. . .: ...:. : .: .. .:. :.: CCDS81 ATSTRTGSQRQEGGQEAAAPARQGRKKPAGKSLASEAPSQPSLNG--GSPE-----GVES 670 680 690 700 710 720 2550 2560 2570 2580 2590 pF1KA0 KGGI-----FLWDPQPSSPQHPGATPPTGEPGFPVVHQ-----DLSHIQQPAA----VVA . :. .. . . :. .. : . ::: : : .. . ... CCDS81 QDGVDHFRAMIVEFMASKKMQLEFPPSLNSHDRLRVHQIAEEHGLRHDSSGEGKRRFITV 730 740 750 760 770 780 2600 2610 2620 2630 2640 2650 pF1KA0 ALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSEGEQEKCGSETHHTRR . . .: . :::.. . : : . :: :. .. . . : . : CCDS81 SKRAPRPRAALGPPAGTGGPAPLQPVPPTPAQTEQPPREQRGPDQPDLRTLHLERLQRVR 790 800 810 820 830 840 2660 2670 pF1KA0 NSRWDKRTLEQEDSSSKKRKLL ... . . ::. :...: CCDS81 SAQGQPASKEQQASGQQKLPEKKKKKAKGHPATDLPTEEDFEALVSAAVKADNTCGFAKC 850 860 870 880 890 900

2677 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov 2 19:29:20 2016 done: Wed Nov 2 19:29:21 2016
 Total Scan time: 6.730 Total Display time: 0.380

Function used was FASTA [36.3.4 Apr, 2011]