FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA0625, 2677 aa
1>>>pF1KA0625 2677 - 2677 aa - 2677 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.1900+/-0.00121; mu= 13.4272+/- 0.073
mean_var=158.0678+/-31.404, 0's: 0 Z-trim(106.9): 27 B-trim: 40 in 1/49
Lambda= 0.102012
statistics sampled from 9229 (9245) to 9229 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.613), E-opt: 0.2 (0.284), width: 16
Scan time: 6.730
The best scores are:                                  opt   bits E(32554)
CCDS6947.1 SETX gene_id:23064|Hs108|chr9     (2677) 17830 2638.0        0
CCDS12386.1 UPF1 gene_id:5976|Hs108|chr19    (1118)   450   80.0  8.3e-14
CCDS74315.1 UPF1 gene_id:5976|Hs108|chr19    (1129)   450   80.0  8.4e-14
CCDS8187.1 IGHMBP2 gene_id:3508|Hs108|chr11  ( 993)   385   70.4  5.7e-11
>>CCDS6947.1 SETX gene_id:23064|Hs108|chr9 (2677 aa)
initn: 17830 init1: 17830 opt: 17830 Z-score: 14179.5 bits: 2638.0 E(32554): 0
Smith-Waterman score: 17830; 99.9% identity (100.0% similar) in 2677 aa overlap (1-2677:1-2677)
10 20 30 40 50 60
pF1KA0 MSTCCWCTPGGASTIDFLKRYASNTPSGEFQTADEDLCYCLECVAEYHKARDELPFLHEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MSTCCWCTPGGASTIDFLKRYASNTPSGEFQTADEDLCYCLECVAEYHKARDELPFLHEV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA0 LWELETLRLINHFEKSMKAEIGDDDELYIVDNNGEMPLFDITGQDFENKLRVPLLEILKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LWELETLRLINHFEKSMKAEIGDDDELYIVDNNGEMPLFDITGQDFENKLRVPLLEILKY
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA0 PYLLLHERVNELCVEALCRMEQANCSFQVFDKHPGIYLFLVHPNEMVRRWAILTARNLGK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 PYLLLHERVNELCVEALCRMEQANCSFQVFDKHPGIYLFLVHPNEMVRRWAILTARNLGK
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA0 VDRDDYYDLQEVLLCLFKVIELGLLESPDIYTSSVLEKGKLILLPSHMYDTTNYKSYWLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 VDRDDYYDLQEVLLCLFKVIELGLLESPDIYTSSVLEKGKLILLPSHMYDTTNYKSYWLG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA0 ICMLLTILEEQAMDSLLLGSDKQNDFMQSILHTMEREADDDSVDPFWPALHCFMVILDRL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 ICMLLTILEEQAMDSLLLGSDKQNDFMQSILHTMEREADDDSVDPFWPALHCFMVILDRL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA0 GSKVWGQLMDPIVAFQTIINNASYNREIRHIRNSSVRTKLEPESYLDDMVTCSQIVYNYN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 GSKVWGQLMDPIVAFQTIINNASYNREIRHIRNSSVRTKLEPESYLDDMVTCSQIVYNYN
310 320 330 340 350 360
370 380 390 400 410 420
pF1KA0 PEKTKKDSGWRTAICPDYCPNMYEEMETLASVLQSDIGQDMRVHNSTFLWFIPFVQSLMD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 PEKTKKDSGWRTAICPDYCPNMYEEMETLASVLQSDIGQDMRVHNSTFLWFIPFVQSLMD
370 380 390 400 410 420
430 440 450 460 470 480
pF1KA0 LKDLGVAYIAQVVNHLYSEVKEVLNQTDAVCDKVTEFFLLILVSVIELHRNKKCLHLLWV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LKDLGVAYIAQVVNHLYSEVKEVLNQTDAVCDKVTEFFLLILVSVIELHRNKKCLHLLWV
430 440 450 460 470 480
490 500 510 520 530 540
pF1KA0 SSQQWVEAVVKCAKLPTTAFTRSSEKSSGNCSKGTAMISSLSLHSMPSNSVQLAYVQLIR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 SSQQWVEAVVKCAKLPTTAFTRSSEKSSGNCSKGTAMISSLSLHSMPSNSVQLAYVQLIR
490 500 510 520 530 540
550 560 570 580 590 600
pF1KA0 SLLKEGYQLGQQSLCKRFWDKLNLFLRGNLSLGWQLTSQETHELQSCLKQIIRNIKFKAP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 SLLKEGYQLGQQSLCKRFWDKLNLFLRGNLSLGWQLTSQETHELQSCLKQIIRNIKFKAP
550 560 570 580 590 600
610 620 630 640 650 660
pF1KA0 PCNTFVDLTSACKISPASYNKEESEQMGKTSRKDMHCLEASSPTFSKEPMKVQDSVLIKA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 PCNTFVDLTSACKISPASYNKEESEQMGKTSRKDMHCLEASSPTFSKEPMKVQDSVLIKA
610 620 630 640 650 660
670 680 690 700 710 720
pF1KA0 DNTIEGDNNEQNYIKDVKLEDHLLAGSCLKQSSKNIFTERAEDQIKISTRKQKSVKEISS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 DNTIEGDNNEQNYIKDVKLEDHLLAGSCLKQSSKNIFTERAEDQIKISTRKQKSVKEISS
670 680 690 700 710 720
730 740 750 760 770 780
pF1KA0 YTPKDCTSRNGPERGCDRGIIVSTRLLTDSSTDALEKVSTSNEDFSLKDDALAKTSKRKT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 YTPKDCTSRNGPERGCDRGIIVSTRLLTDSSTDALEKVSTSNEDFSLKDDALAKTSKRKT
730 740 750 760 770 780
790 800 810 820 830 840
pF1KA0 KVQKDEICAKLSHVIKKQHRKSTLVDNTINLDENLTVSNIESFYSRKDTGVQKGDGFIHN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KVQKDEICAKLSHVIKKQHRKSTLVDNTINLDENLTVSNIESFYSRKDTGVQKGDGFIHN
790 800 810 820 830 840
850 860 870 880 890 900
pF1KA0 LSLDPSGVLDDKNGEQKSQNNVLPKEKQLKNEELVIFSFHENNCKIQEFHVDGKGLIPFT
:::::::::::::::::::::::::::::::::::::::::::::::::::::: :::::
CCDS69 LSLDPSGVLDDKNGEQKSQNNVLPKEKQLKNEELVIFSFHENNCKIQEFHVDGKELIPFT
850 860 870 880 890 900
910 920 930 940 950 960
pF1KA0 EMTNASEKKSSPFKDLMTVPESRDEEMSNSTSVIYSNLTREQAPDISPKSDTLTDSQIDR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 EMTNASEKKSSPFKDLMTVPESRDEEMSNSTSVIYSNLTREQAPDISPKSDTLTDSQIDR
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KA0 DLHKLSLLAQASVITFPSDSPQNSSQLQRKVKEDKRCFTANQNNVGDTSRGQVIIISDSD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 DLHKLSLLAQASVITFPSDSPQNSSQLQRKVKEDKRCFTANQNNVGDTSRGQVIIISDSD
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KA0 DDDDERILSLEKLTKQDKICLEREHPEQHVSTVNSKEEKNPVKEEKTETLFQFEESDSQC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 DDDDERILSLEKLTKQDKICLEREHPEQHVSTVNSKEEKNPVKEEKTETLFQFEESDSQC
1030 1040 1050 1060 1070 1080
1090 1100 1110 1120 1130 1140
pF1KA0 FEFESSSEVFSVWQDHPDDNNSVQDGEKKCLAPIANTTNGQGCTDYVSEVVKKGAEGIEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 FEFESSSEVFSVWQDHPDDNNSVQDGEKKCLAPIANTTNGQGCTDYVSEVVKKGAEGIEE
1090 1100 1110 1120 1130 1140
1150 1160 1170 1180 1190 1200
pF1KA0 HTRPRSISVEEFCEIEVKKPKRKRSEKPMAEDPVRPSSSVRNEGQSDTNKRDLVGNDFKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 HTRPRSISVEEFCEIEVKKPKRKRSEKPMAEDPVRPSSSVRNEGQSDTNKRDLVGNDFKS
1150 1160 1170 1180 1190 1200
1210 1220 1230 1240 1250 1260
pF1KA0 IDRRTSTPNSRIQRATTVSQKKSSKLCTCTEPIRKVPVSKTPKKTHSDAKKGQNRSSNYL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 IDRRTSTPNSRIQRATTVSQKKSSKLCTCTEPIRKVPVSKTPKKTHSDAKKGQNRSSNYL
1210 1220 1230 1240 1250 1260
1270 1280 1290 1300 1310 1320
pF1KA0 SCRTTPAIVPPKKFRECPEPTSTAEKLGLKKGPRKAYELSQRSLDYVAQLRDHGKTVGVV
:::::::::::::::.::::::::::::::::::::::::::::::::::::::::::::
CCDS69 SCRTTPAIVPPKKFRQCPEPTSTAEKLGLKKGPRKAYELSQRSLDYVAQLRDHGKTVGVV
1270 1280 1290 1300 1310 1320
1330 1340 1350 1360 1370 1380
pF1KA0 DTRKKTKLISPQNLSVRNNKKLLTSQELQMQRQIRPKSQKNRRRLSDCESTDVKRAGSHT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 DTRKKTKLISPQNLSVRNNKKLLTSQELQMQRQIRPKSQKNRRRLSDCESTDVKRAGSHT
1330 1340 1350 1360 1370 1380
1390 1400 1410 1420 1430 1440
pF1KA0 AQNSDIFVPESDRSDYNCTGGTEVLANSNRKQLIKCMPSEPETIKAKHGSPATDDACPLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 AQNSDIFVPESDRSDYNCTGGTEVLANSNRKQLIKCMPSEPETIKAKHGSPATDDACPLN
1390 1400 1410 1420 1430 1440
1450 1460 1470 1480 1490 1500
pF1KA0 QCDSVVLNGTVPTNEVIVSTSEDPLGGGDPTARHIEMAALKEGEPDSSSDAEEDNLFLTQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 QCDSVVLNGTVPTNEVIVSTSEDPLGGGDPTARHIEMAALKEGEPDSSSDAEEDNLFLTQ
1450 1460 1470 1480 1490 1500
1510 1520 1530 1540 1550 1560
pF1KA0 NDPEDMDLCSQMENDNYKLIELIHGKDTVEVEEDSVSRPQLESLSGTKCKYKDCLETTKN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 NDPEDMDLCSQMENDNYKLIELIHGKDTVEVEEDSVSRPQLESLSGTKCKYKDCLETTKN
1510 1520 1530 1540 1550 1560
1570 1580 1590 1600 1610 1620
pF1KA0 QGEYCPKHSEVKAADEDVFRKPGLPPPASKPLRPTTKIFSSKSTSRIAGLSKSLETSSAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 QGEYCPKHSEVKAADEDVFRKPGLPPPASKPLRPTTKIFSSKSTSRIAGLSKSLETSSAL
1570 1580 1590 1600 1610 1620
1630 1640 1650 1660 1670 1680
pF1KA0 SPSLKNKSKGIQSILKVPQPVPLIAQKPVGEMKNSCNVLHPQSPNNSNRQGCKVPFGESK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 SPSLKNKSKGIQSILKVPQPVPLIAQKPVGEMKNSCNVLHPQSPNNSNRQGCKVPFGESK
1630 1640 1650 1660 1670 1680
1690 1700 1710 1720 1730 1740
pF1KA0 YFPSSSPVNILLSSQSVSDTFVKEVLKWKYEMFLNFGQCGPPASLCQSISRPVPVRFHNY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 YFPSSSPVNILLSSQSVSDTFVKEVLKWKYEMFLNFGQCGPPASLCQSISRPVPVRFHNY
1690 1700 1710 1720 1730 1740
1750 1760 1770 1780 1790 1800
pF1KA0 GDYFNVFFPLMVLNTFETVAQEWLNSPNRENFYQLQVRKFPADYIKYWEFAVYLEECELA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 GDYFNVFFPLMVLNTFETVAQEWLNSPNRENFYQLQVRKFPADYIKYWEFAVYLEECELA
1750 1760 1770 1780 1790 1800
1810 1820 1830 1840 1850 1860
pF1KA0 KQLYPKENDLVFLAPERINEEKKDTERNDIQDLHEYHSGYVHKFRRTSVMRNGKTECYLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KQLYPKENDLVFLAPERINEEKKDTERNDIQDLHEYHSGYVHKFRRTSVMRNGKTECYLS
1810 1820 1830 1840 1850 1860
1870 1880 1890 1900 1910 1920
pF1KA0 IQTQENFPANLNELVNCIVISSLVTTQRKLKAMSLLGSRNQLARAVLNPNPMDFCTKDLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 IQTQENFPANLNELVNCIVISSLVTTQRKLKAMSLLGSRNQLARAVLNPNPMDFCTKDLL
1870 1880 1890 1900 1910 1920
1930 1940 1950 1960 1970 1980
pF1KA0 TTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 TTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRL
1930 1940 1950 1960 1970 1980
1990 2000 2010 2020 2030 2040
pF1KA0 LTENQRKGHSDENSNAKIKQNRVLVCAPSNAAVDELMKKIILEFKEKCKDKKNPLGNCGD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LTENQRKGHSDENSNAKIKQNRVLVCAPSNAAVDELMKKIILEFKEKCKDKKNPLGNCGD
1990 2000 2010 2020 2030 2040
2050 2060 2070 2080 2090 2100
pF1KA0 INLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAMHKRKEFLDYQLDELSRQRAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 INLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAMHKRKEFLDYQLDELSRQRAL
2050 2060 2070 2080 2090 2100
2110 2120 2130 2140 2150 2160
pF1KA0 CRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKTQSIIILESHIICCTLSTSGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 CRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKTQSIIILESHIICCTLSTSGG
2110 2120 2130 2140 2150 2160
2170 2180 2190 2200 2210 2220
pF1KA0 LLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCNKLILVGDPKQLPPTVISMKA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCNKLILVGDPKQLPPTVISMKA
2170 2180 2190 2200 2210 2220
2230 2240 2250 2260 2270 2280
pF1KA0 QEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRMHPDICLFPSNYVYNRNLKTN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 QEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRMHPDICLFPSNYVYNRNLKTN
2230 2240 2250 2260 2270 2280
2290 2300 2310 2320 2330 2340
pF1KA0 RQTEAIRCSSDWPFQPYLVFDVGDGSERRDNDSYINVQEIKLVMEIIKLIKDKRKDVSFR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 RQTEAIRCSSDWPFQPYLVFDVGDGSERRDNDSYINVQEIKLVMEIIKLIKDKRKDVSFR
2290 2300 2310 2320 2330 2340
2350 2360 2370 2380 2390 2400
pF1KA0 NIGIITHYKAQKTMIQKDLDKEFDRKGPAEVDTVDAFQGRQKDCVIVTCVRANSIQGSIG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 NIGIITHYKAQKTMIQKDLDKEFDRKGPAEVDTVDAFQGRQKDCVIVTCVRANSIQGSIG
2350 2360 2370 2380 2390 2400
2410 2420 2430 2440 2450 2460
pF1KA0 FLASLQRLNVTITRAKYSLFILGHLRTLMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 FLASLQRLNVTITRAKYSLFILGHLRTLMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAV
2410 2420 2430 2440 2450 2460
2470 2480 2490 2500 2510 2520
pF1KA0 KILKLKPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGFAKTSVAASLYHTPSDSKEITL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KILKLKPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGFAKTSVAASLYHTPSDSKEITL
2470 2480 2490 2500 2510 2520
2530 2540 2550 2560 2570 2580
pF1KA0 TVTSKDPERPPVHDQLQDPRLLKRMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 TVTSKDPERPPVHDQLQDPRLLKRMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVV
2530 2540 2550 2560 2570 2580
2590 2600 2610 2620 2630 2640
pF1KA0 HQDLSHIQQPAAVVAALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 HQDLSHIQQPAAVVAALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSE
2590 2600 2610 2620 2630 2640
2650 2660 2670
pF1KA0 GEQEKCGSETHHTRRNSRWDKRTLEQEDSSSKKRKLL
:::::::::::::::::::::::::::::::::::::
CCDS69 GEQEKCGSETHHTRRNSRWDKRTLEQEDSSSKKRKLL
2650 2660 2670
>>CCDS12386.1 UPF1 gene_id:5976|Hs108|chr19 (1118 aa)
initn: 581 init1: 135 opt: 450 Z-score: 361.3 bits: 80.0 E(32554): 8.3e-14
Smith-Waterman score: 629; 26.8% identity (54.1% similar) in 867 aa overlap (1733-2578:292-1011)
1710 1720 1730 1740 1750 1760
pF1KA0 KEVLKWKYEMFLNFGQCGPPASLCQSISRPVPVRFHNYGDYFNVFFPLMVLNTFETVAQE
: .:... .: :.: ::. :.. ..
CCDS12 INKLEELWKENPSATLEDLEKPGVDEEPQHVLLRYEDAYQYQNIFGPLVKLEA--DYDKK
270 280 290 300 310
1770 1780 1790 1800 1810 1820
pF1KA0 WLNSPNRENFYQLQVRKFPADYIKYWEFAVYLEECELAKQLYPK-ENDLVFLAPERINEE
.: ...: . :: :... :.. ..: :: ..:. .. ..: .
CCDS12 LKESQTQDN---ITVR---------WDLG--LNKKRIAYFTLPKTDSDMRLMQGDEICLR
320 330 340 350 360
1830 1840 1850 1860 1870 1880
pF1KA0 KKDTERNDIQDLHEYHSGYVHKFRRTSVMRNGKTECYLSIQTQENFPANLNELVNCIVIS
: :: .: : .. : : : . .... . :..... . .
CCDS12 YKG-------DLAPLWKGIGHVIK---VPDNYGDEIAIELRSSVGAPVEVTHNFQVDFVW
370 380 390 400 410
1890 1900 1910 1920 1930
pF1KA0 SLVTTQRKLKAMSLLGSRNQLARAVLNPNPMDFCTKDLLTTTS--ERIIAY-LRDFNEDQ
. .. .: .:.. .. . . . . . . ..:.. . .:. : : :.:..:
CCDS12 KSTSFDRMQSALKTFAVDETSVSGYIYHKLLGHEVEDVIIKCQLPKRFTAQGLPDLNHSQ
420 430 440 450 460 470
1940 1950 1960 1970 1980 1990
pF1KA0 KKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRLLTENQRKGHSDENSNAKI
:..: : . : . ::.:::::::. : . ..:.: :.:..
CCDS12 VYAVKT----VLQRP----LSLIQGPPGTGKTVTSATIVYHL----ARQGNGP-------
480 490 500 510
2000 2010 2020 2030 2040 2050
pF1KA0 KQNRVLVCAPSNAAVDELMKKIILEFKEKCKDKKNPLGNCGDINLVRLGPEKSINSEVLK
:::::::: :::.: .:: . : ...::: :: .
CCDS12 ----VLVCAPSNIAVDQLTEKI------------HQTG----LKVVRLCA-KS------R
520 530 540
2060 2070 2080 2090 2100 2110
pF1KA0 FSLDSQVNHRMKKELPSHVQAMHKRKEFLDYQLDELSRQRALCRGGREIQRQELDENISK
..:: : :.:.. . .: .. ::.. . : ..: : .:.
CCDS12 EAIDS----------PVSFLALHNQIRNMD-SMPELQKLQQL--------KDETGE-LSS
550 560 570 580
2120 2130 2140 2150 2160 2170
pF1KA0 VSKERQELASKIKEVQGRPQKTQSIIILESHIICCTLSTSGGLLLESAFRGQGGVPFSCV
....: . .. : . ..... .:::: .: : . . : .
CCDS12 ADEKRYRALKRTAERE---------LLMNADVICCTCVGAGDPRLAK-------MQFRSI
590 600 610 620 630
2180 2190 2200 2210 2220 2230
pF1KA0 IVDEAGQSCEIETLTPLIHRCNKLILVGDPKQLPPTVISMKAQEYGYDQSMMARFCRLLE
..::. :. : : ..:.. ..:::::: :: :.:. :: . : .::. : ::.
CCDS12 LIDESTQATEPECMVPVVLGAKQLILVGDHCQLGPVVMCKKAAKAGLSQSL---FERLVV
640 650 660 670 680 690
2240 2250 2260 2270 2280 2290
pF1KA0 ENVEHNMISRLPILQLTVQYRMHPDICLFPSNYVYNRNLKTNRQTEAIRCSS----DWPF
... :: .: ::::::: . :::: :. .:. : : : : .. .::
CCDS12 LGIR-------PI-RLQVQYRMHPALSAFPSNIFYEGSLQ-NGVTAADRVKKGFDFQWP-
700 710 720 730 740
2300 2310 2320 2330 2340
pF1KA0 QPY--LVFDVGDGSER--RDNDSYINVQEIKLVMEII-KLIKDKRKDVSFRNIGIITHYK
:: . : : .:.:. .. ::.: : : .: ::.: : .::::: :.
CCDS12 QPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKLLKAGAKP---DQIGIITPYE
750 760 770 780 790
2350 2360 2370 2380 2390 2400
pF1KA0 AQKT-MIQK-DLDKEFDRKGPAEVD--TVDAFQGRQKDCVIVTCVRANSIQGSIGFLASL
.:.. ..: ... . : ::. .:::::::.:: .:..::::: :: :::: .
CCDS12 GQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILSCVRANEHQG-IGFLNDP
800 810 820 830 840 850
2410 2420 2430 2440 2450 2460
pF1KA0 QRLNVTITRAKYSLFILGHLRTLMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAVKILKL
.::::..:::.:...:.:. ..: .. ::.:.. ... .... .: :.. ...
CCDS12 RRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKVLVEGPLNNLRESLMQFS--
860 870 880 890 900 910
2470 2480 2490 2500 2510 2520
pF1KA0 KPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGFAKTSVAASLYHTPSDSKEITLTVTSK
:: :.:.. :: : :.: . .. :. : . .:.: :... .. .
CCDS12 KP---RKLVN--TINP-GAR---FMTTAMYDAREAI--IPGSVYDRSSQGRPSSMYFQT-
920 930 940 950 960
2530 2540 2550 2560 2570 2580
pF1KA0 DPERPPVHDQL----QDPRLLKRMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVVH
:::. : . :.: . .. . :.: :. :.. : :
CCDS12 -------HDQIGMISAGPSHVAAMNIPIPFNLVM-PPMPPPGYFGQANGPAAGRGTPKGK
970 980 990 1000 1010
2590 2600 2610 2620 2630 2640
pF1KA0 QDLSHIQQPAAVVAALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSEG
CCDS12 TGRGGRQKNRFGLPGPSQTNLPNSQASQDVASQPFSQGALTQGYISMSQPSQMSQPGLSQ
1020 1030 1040 1050 1060 1070
>>CCDS74315.1 UPF1 gene_id:5976|Hs108|chr19 (1129 aa)
initn: 581 init1: 135 opt: 450 Z-score: 361.3 bits: 80.0 E(32554): 8.4e-14
Smith-Waterman score: 626; 29.8% identity (55.3% similar) in 665 aa overlap (1931-2578:479-1022)
1910 1920 1930 1940 1950 1960
pF1KA0 QLARAVLNPNPMDFCTKDLLTTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICL
: :.:..: :..: : . : . :
CCDS74 SGYIYHKLLGHEVEDVIIKCQLPKRFTAQGLPDLNHSQVYAVKT----VLQRP----LSL
450 460 470 480 490 500
1970 1980 1990 2000 2010 2020
pF1KA0 IHGPPGTGKSKTIVGLLYRLLTENQRKGHSDENSNAKIKQNRVLVCAPSNAAVDELMKKI
:.:::::::. : . ..:.: :.:.. :::::::: :::.: .::
CCDS74 IQGPPGTGKTVTSATIVYHL----ARQGNGP-----------VLVCAPSNIAVDQLTEKI
510 520 530 540
2030 2040 2050 2060 2070 2080
pF1KA0 ILEFKEKCKDKKNPLGNCGDINLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAM
. : ...::: :: . ..:: : :.
CCDS74 ------------HQTG----LKVVRLCA-KS------REAIDS----------PVSFLAL
550 560 570
2090 2100 2110 2120 2130 2140
pF1KA0 HKRKEFLDYQLDELSRQRALCRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKT
:.. . .: .. ::.. . : ..: : .:.....: . .. : .
CCDS74 HNQIRNMD-SMPELQKLQQL--------KDETGE-LSSADEKRYRALKRTAERE------
580 590 600 610
2150 2160 2170 2180 2190 2200
pF1KA0 QSIIILESHIICCTLSTSGGLLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCN
..... .:::: .: : . . : ...::. :. : : ..:.. .
CCDS74 ---LLMNADVICCTCVGAGDPRLAK-------MQFRSILIDESTQATEPECMVPVVLGAK
620 630 640 650 660
2210 2220 2230 2240 2250 2260
pF1KA0 KLILVGDPKQLPPTVISMKAQEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRM
.:::::: :: :.:. :: . : .::. : ::. ... :: .: :::::
CCDS74 QLILVGDHCQLGPVVMCKKAAKAGLSQSL---FERLVVLGIR-------PI-RLQVQYRM
670 680 690 700 710
2270 2280 2290 2300 2310
pF1KA0 HPDICLFPSNYVYNRNLKTNRQTEAIRCSS----DWPFQPY--LVFDVGDGSER--RDND
:: . :::: :. .:. : : : : .. .:: :: . : : .:.:. ..
CCDS74 HPALSAFPSNIFYEGSLQ-NGVTAADRVKKGFDFQWP-QPDKPMFFYVTQGQEEIASSGT
720 730 740 750 760 770
2320 2330 2340 2350 2360
pF1KA0 SYINVQEIKLVMEII-KLIKDKRKDVSFRNIGIITHYKAQKT-MIQK-DLDKEFDRKGPA
::.: : : .: ::.: : .::::: :..:.. ..: ... . :
CCDS74 SYLNRTEAANVEKITTKLLKAGAKP---DQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQ
780 790 800 810 820 830
2370 2380 2390 2400 2410 2420
pF1KA0 EVD--TVDAFQGRQKDCVIVTCVRANSIQGSIGFLASLQRLNVTITRAKYSLFILGHLRT
::. .:::::::.:: .:..::::: :: :::: . .::::..:::.:...:.:. ..
CCDS74 EVEIASVDAFQGREKDFIILSCVRANEHQG-IGFLNDPRRLNVALTRARYGVIIVGNPKA
840 850 860 870 880
2430 2440 2450 2460 2470 2480
pF1KA0 LMENQHWNQLIQDAQKRGAIIKTCDKNYRHDAVKILKLKPVLQRSLTHPPTIAPEGSRPQ
: .. ::.:.. ... .... .: :.. ... :: :.:.. :: : :.:
CCDS74 LSKQPLWNHLLNYYKEQKVLVEGPLNNLRESLMQFS--KP---RKLVN--TINP-GARF-
890 900 910 920 930 940
2490 2500 2510 2520 2530 2540
pF1KA0 GGLPSSKLDSGFAKTSVAASLYHTPSDSKEITLTVTSKDPERPPVHDQL----QDPRLLK
. .. :. .. . .:.: :... .. . :::. : .
CCDS74 --MTTAMYDA--REAIIPGSVYDRSSQGRPSSMYFQT--------HDQIGMISAGPSHVA
950 960 970 980
2550 2560 2570 2580 2590 2600
pF1KA0 RMGIEVKGGIFLWDPQPSSPQHPGATPPTGEPGFPVVHQDLSHIQQPAAVVAALSSHKPP
:.: . .. . :.: :. :.. : :
CCDS74 AMNIPIPFNLVM-PPMPPPGYFGQANGPAAGRGTPKGKTGRGGRQKNRFGLPGPSQTNLP
990 1000 1010 1020 1030 1040
2610 2620 2630 2640 2650 2660
pF1KA0 VRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSEGEQEKCGSETHHTRRNSRWDKRT
CCDS74 NSQASQDVASQPFSQGALTQGYISMSQPSQMSQPGLSQPELSQDSYLGDEFKSQIDVALS
1050 1060 1070 1080 1090 1100
>>CCDS8187.1 IGHMBP2 gene_id:3508|Hs108|chr11 (993 aa)
initn: 404 init1: 91 opt: 385 Z-score: 310.4 bits: 70.4 E(32554): 5.7e-11
Smith-Waterman score: 539; 26.0% identity (52.1% similar) in 768 aa overlap (1934-2673:192-860)
1910 1920 1930 1940 1950 1960
pF1KA0 RAVLNPNPMDFCTKDLLTTTSERIIAYLRDFNEDQKKAIETAYAMVKHSPSVAKICLIHG
.. .::.:. .:. : .. .:::
CCDS81 PASSLIEVLFGRSAPSPASEIHPLTFFNTCLDTSQKEAV--LFAL-----SQKELAIIHG
170 180 190 200 210
1970 1980 1990 2000 2010 2020
pF1KA0 PPGTGKSKTIVGLLYRLLTENQRKGHSDENSNAKIKQN-RVLVCAPSNAAVDELMKKIIL
::::::. :.: .. . .::. .:: ::::: :::.:.... :
CCDS81 PPGTGKTTTVVEIILQ-----------------AVKQGLKVLCCAPSNIAVDNLVERLAL
220 230 240 250
2030 2040 2050 2060 2070 2080
pF1KA0 EFKEKCKDKKNPLGNCGDINLVRLGPEKSINSEVLKFSLDSQVNHRMKKELPSHVQAMHK
::.. ::. :.. ..:... :: : ..:. ..:.. :.. :
CCDS81 -----CKQRILRLGH--PARLLESIQQHSLDA-VLARSDSAQIVADIRKDID---QVFVK
260 270 280 290 300
2090 2100 2110 2120 2130 2140
pF1KA0 RKEFLDYQLDELSRQRALCRGGREIQRQELDENISKVSKERQELASKIKEVQGRPQKTQS
:. : .:... :. .. :.:: :::.: :. .. . :..
CCDS81 NKKTQD------KREKSNFRNEIKLLRKEL--------KEREE-AAMLESL------TSA
310 320 330 340
2150 2160 2170 2180 2190 2200
pF1KA0 IIILESHIICCTLSTSGGL-LLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCNK
..: .. :..: : :: .. :. :..:: .:. : ::. . :
CCDS81 NVVLATNT---GASADGPLKLLPESY-------FDVVVIDECAQALEASCWIPLL-KARK
350 360 370 380 390
2210 2220 2230 2240 2250 2260
pF1KA0 LILVGDPKQLPPTVISMKAQEYGYDQSMMARFCRLLEENVEHNMISRLPILQLTVQYRMH
::.:: ::::::..: :: : . :.: :: :: . .:. . ::::::::
CCDS81 CILAGDHKQLPPTTVSHKAALAGLSLSLME---RLAEE-----YGARV-VRTLTVQYRMH
400 410 420 430 440
2270 2280 2290 2300
pF1KA0 PDICLFPSNYVYNRNLKTNRQTEAIRCSSDWPFQ--------PYLVFDV-GDGS---ERR
: . :. .: .: : ... : . : : : :. :. : : :..
CCDS81 QAIMRWASDTMYLGQL-TAHSSVARHLLRDLPGVAATEETGVPLLLVDTAGCGLFELEEE
450 460 470 480 490 500
2310 2320 2330 2340 2350 2360
pF1KA0 DNDSYINVQEIKLVMEIIKLIKDKRKDVSFRNIGIITHYKAQKTMIQKDLDKEFDRKGPA
:..: : :..:: :. . : : :.:.... :. : .....: :.
CCDS81 DEQSKGNPGEVRLVSLHIQALVD--AGVPARDIAVVSPYNLQVDLLRQSL---VHRHPEL
510 520 530 540 550
2370 2380 2390 2400 2410 2420
pF1KA0 EVDTVDAFQGRQKDCVIVTCVRANSIQGSIGFLASLQRLNVTITRAKYSLFILGHLRTLM
:. .::.::::.:. ::.. ::.: .: .:::: .:.::..:::. . .. ::.
CCDS81 EIKSVDGFQGREKEAVILSFVRSNR-KGEVGFLAEDRRINVAVTRARRHVAVICDSRTVN
560 570 580 590 600 610
2430 2440 2450 2460 2470 2480
pF1KA0 ENQHWNQLIQDAQKRGAIIKTCDKNYRHDAVKILKLKPVLQRSLTHPPTIAPEGSRPQGG
.. . :.. ..: . . . : : : . : : .: : .:::
CCDS81 NHAFLKTLVEYFTQHGEVRTAFE--YLDDIVPENYSHENSQGS-SHAAT------KPQGP
620 630 640 650 660
2490 2500 2510 2520 2530 2540
pF1KA0 LPSSKLDSGFAKTSVAASLYHTPSDSKEITLTVTSKDPERPPVHDQLQDPRLLKRMGIEV
:.. : . . :. . .: ...:. : .: .. .:. :.:
CCDS81 ATSTRTGSQRQEGGQEAAAPARQGRKKPAGKSLASEAPSQPSLNG--GSPE-----GVES
670 680 690 700 710 720
2550 2560 2570 2580 2590
pF1KA0 KGGI-----FLWDPQPSSPQHPGATPPTGEPGFPVVHQ-----DLSHIQQPAA----VVA
. :. .. . . :. .. : . ::: : : .. . ...
CCDS81 QDGVDHFRAMIVEFMASKKMQLEFPPSLNSHDRLRVHQIAEEHGLRHDSSGEGKRRFITV
730 740 750 760 770 780
2600 2610 2620 2630 2640 2650
pF1KA0 ALSSHKPPVRGEPPAASPEASTCQSKCDDPEEELCHRREARAFSEGEQEKCGSETHHTRR
. . .: . :::.. . : : . :: :. .. . . : . :
CCDS81 SKRAPRPRAALGPPAGTGGPAPLQPVPPTPAQTEQPPREQRGPDQPDLRTLHLERLQRVR
790 800 810 820 830 840
2660 2670
pF1KA0 NSRWDKRTLEQEDSSSKKRKLL
... . . ::. :...:
CCDS81 SAQGQPASKEQQASGQQKLPEKKKKKAKGHPATDLPTEEDFEALVSAAVKADNTCGFAKC
850 860 870 880 890 900
2677 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 19:29:20 2016 done: Wed Nov 2 19:29:21 2016
Total Scan time: 6.730 Total Display time: 0.380
Function used was FASTA [36.3.4 Apr, 2011]