FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA0560, 1485 aa
1>>>pF1KA0560 1485 - 1485 aa - 1485 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8429+/-0.00122; mu= 15.5644+/- 0.073
mean_var=90.9695+/-17.758, 0's: 0 Z-trim(102.9): 41 B-trim: 0 in 0/51
Lambda= 0.134470
statistics sampled from 7136 (7146) to 7136 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.559), E-opt: 0.2 (0.22), width: 16
Scan time: 2.860
The best scores are: opt bits E(32554)
CCDS42013.1 AQR gene_id:9716|Hs108|chr15 (1485) 9917 1935.3 0
CCDS12386.1 UPF1 gene_id:5976|Hs108|chr19 (1118) 324 74.2 2.5e-12
CCDS74315.1 UPF1 gene_id:5976|Hs108|chr19 (1129) 324 74.2 2.5e-12
>>CCDS42013.1 AQR gene_id:9716|Hs108|chr15 (1485 aa)
initn: 9917 init1: 9917 opt: 9917 Z-score: 10390.7 bits: 1935.3 E(32554): 0
Smith-Waterman score: 9917; 100.0% identity (100.0% similar) in 1485 aa overlap (1-1485:1-1485)
10 20 30 40 50 60
pF1KA0 MAAPAQPKKIVAPTVSQINAEFVTQLACKYWAPHIKKKSPFDIKVIEDIYEKEIVKSRFA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 MAAPAQPKKIVAPTVSQINAEFVTQLACKYWAPHIKKKSPFDIKVIEDIYEKEIVKSRFA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA0 IRKIMLLEFSQYLENYLWMNYSPEVSSKAYLMSICCMVNEKFRENVPAWEIFKKKPDHFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 IRKIMLLEFSQYLENYLWMNYSPEVSSKAYLMSICCMVNEKFRENVPAWEIFKKKPDHFP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA0 FFFKHILKAALAETDGEFSLHEQTVLLLFLDHCFNSLEVDLIRSQVQQLISLPMWMGLQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 FFFKHILKAALAETDGEFSLHEQTVLLLFLDHCFNSLEVDLIRSQVQQLISLPMWMGLQL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA0 ARLELELKKTPKLRKFWNLIKKNDEKMDPEAREQAYQERRFLSQLIQKFISVLKSVPLSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ARLELELKKTPKLRKFWNLIKKNDEKMDPEAREQAYQERRFLSQLIQKFISVLKSVPLSE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA0 PVTMDKVHYCERFIELMIDLEALLPTRRWFNTILDDSHLLVHCYLSNLVRREEDGHLFSQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PVTMDKVHYCERFIELMIDLEALLPTRRWFNTILDDSHLLVHCYLSNLVRREEDGHLFSQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA0 LLDMLKFYTGFEINDQTGNALTENEMTTIHYDRITSLQRAAFAHFPELYDFALSNVAEVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LLDMLKFYTGFEINDQTGNALTENEMTTIHYDRITSLQRAAFAHFPELYDFALSNVAEVD
310 320 330 340 350 360
370 380 390 400 410 420
pF1KA0 TRESLVKFFGPLSSNTLHQVASYLCLLPTLPKNEDTTFDKEFLLELLVSRHERRISQIQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 TRESLVKFFGPLSSNTLHQVASYLCLLPTLPKNEDTTFDKEFLLELLVSRHERRISQIQQ
370 380 390 400 410 420
430 440 450 460 470 480
pF1KA0 LNQMPLYPTEKIIWDENIVPTEYYSGEGCLALPKLNLQFLTLHDYLLRNFNLFRLESTYE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LNQMPLYPTEKIIWDENIVPTEYYSGEGCLALPKLNLQFLTLHDYLLRNFNLFRLESTYE
430 440 450 460 470 480
490 500 510 520 530 540
pF1KA0 IRQDIEDSVSRMKPWQSEYGGVVFGGWARMAQPIVAFTVVEVAKPNIGENWPTRVRADVT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 IRQDIEDSVSRMKPWQSEYGGVVFGGWARMAQPIVAFTVVEVAKPNIGENWPTRVRADVT
490 500 510 520 530 540
550 560 570 580 590 600
pF1KA0 INLNVRDHIKDEWEGLRKHDVCFLITVRPTKPYGTKFDRRRPFIEQVGLVYVRGCEIQGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 INLNVRDHIKDEWEGLRKHDVCFLITVRPTKPYGTKFDRRRPFIEQVGLVYVRGCEIQGM
550 560 570 580 590 600
610 620 630 640 650 660
pF1KA0 LDDKGRVIEDGPEPRPNLRGESRTFRVFLDPNQYQQDMTNTIQNGAEDVYETFNIIMRRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LDDKGRVIEDGPEPRPNLRGESRTFRVFLDPNQYQQDMTNTIQNGAEDVYETFNIIMRRK
610 620 630 640 650 660
670 680 690 700 710 720
pF1KA0 PKENNFKAVLETIRNLMNTDCVVPDWLHDIILGYGDPSSAHYSKMPNQIATLDFNDTFLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PKENNFKAVLETIRNLMNTDCVVPDWLHDIILGYGDPSSAHYSKMPNQIATLDFNDTFLS
670 680 690 700 710 720
730 740 750 760 770 780
pF1KA0 IEHLKASFPGHNVKVTVEDPALQIPPFRITFPVRSGKGKKRKDADVEDEDTEEAKTLIVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 IEHLKASFPGHNVKVTVEDPALQIPPFRITFPVRSGKGKKRKDADVEDEDTEEAKTLIVE
730 740 750 760 770 780
790 800 810 820 830 840
pF1KA0 PHVIPNRGPYPYNQPKRNTIQFTHTQIEAIRAGMQPGLTMVVGPPGTGKTDVAVQIISNI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PHVIPNRGPYPYNQPKRNTIQFTHTQIEAIRAGMQPGLTMVVGPPGTGKTDVAVQIISNI
790 800 810 820 830 840
850 860 870 880 890 900
pF1KA0 YHNFPEQRTLIVTHSNQALNQLFEKIMALDIDERHLLRLGHGEEELETEKDFSRYGRVNY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 YHNFPEQRTLIVTHSNQALNQLFEKIMALDIDERHLLRLGHGEEELETEKDFSRYGRVNY
850 860 870 880 890 900
910 920 930 940 950 960
pF1KA0 VLARRIELLEEVKRLQKSLGVPGDASYTCETAGYFFLYQVMSRWEEYISKVKNKGSTLPD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 VLARRIELLEEVKRLQKSLGVPGDASYTCETAGYFFLYQVMSRWEEYISKVKNKGSTLPD
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KA0 VTEVSTFFPFHEYFANAPQPIFKGRSYEEDMEIAEGCFRHIKKIFTQLEEFRASELLRSG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 VTEVSTFFPFHEYFANAPQPIFKGRSYEEDMEIAEGCFRHIKKIFTQLEEFRASELLRSG
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KA0 LDRSKYLLVKEAKIIAMTCTHAALKRHDLVKLGFKYDNILMEEAAQILEIETFIPLLLQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LDRSKYLLVKEAKIIAMTCTHAALKRHDLVKLGFKYDNILMEEAAQILEIETFIPLLLQN
1030 1040 1050 1060 1070 1080
1090 1100 1110 1120 1130 1140
pF1KA0 PQDGFSRLKRWIMIGDHHQLPPVIKNMAFQKYSNMEQSLFTRFVRVGVPTVDLDAQGRAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 PQDGFSRLKRWIMIGDHHQLPPVIKNMAFQKYSNMEQSLFTRFVRVGVPTVDLDAQGRAR
1090 1100 1110 1120 1130 1140
1150 1160 1170 1180 1190 1200
pF1KA0 ASLCNLYNWRYKNLGNLPHVQLLPEFSTANAGLLYDFQLINVEDFQGVGESEPNPYFYQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ASLCNLYNWRYKNLGNLPHVQLLPEFSTANAGLLYDFQLINVEDFQGVGESEPNPYFYQN
1150 1160 1170 1180 1190 1200
1210 1220 1230 1240 1250 1260
pF1KA0 LGEAEYVVALFMYMCLLGYPADKISILTTYNGQKHLIRDIINRRCGNNPLIGRPNKVTTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 LGEAEYVVALFMYMCLLGYPADKISILTTYNGQKHLIRDIINRRCGNNPLIGRPNKVTTV
1210 1220 1230 1240 1250 1260
1270 1280 1290 1300 1310 1320
pF1KA0 DRFQGQQNDYILLSLVRTRAVGHLRDVRRLVVAMSRARLGLYIFARVSLFQNCFELTPAF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 DRFQGQQNDYILLSLVRTRAVGHLRDVRRLVVAMSRARLGLYIFARVSLFQNCFELTPAF
1270 1280 1290 1300 1310 1320
1330 1340 1350 1360 1370 1380
pF1KA0 SQLTARPLHLHIIPTEPFPTTRKNGERPSHEVQIIKNMPQMANFVYNMYMHLIQTTHHYH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 SQLTARPLHLHIIPTEPFPTTRKNGERPSHEVQIIKNMPQMANFVYNMYMHLIQTTHHYH
1330 1340 1350 1360 1370 1380
1390 1400 1410 1420 1430 1440
pF1KA0 QTLLQLPPAMVEEGEEVQNQETELETEEEAMTVQADIIPSPTDTSCRQETPAFQTDTTPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS42 QTLLQLPPAMVEEGEEVQNQETELETEEEAMTVQADIIPSPTDTSCRQETPAFQTDTTPS
1390 1400 1410 1420 1430 1440
1450 1460 1470 1480
pF1KA0 ETGATSTPEAIPALSETTPTVVGAVSAPAEANTPQDATSAPEETK
:::::::::::::::::::::::::::::::::::::::::::::
CCDS42 ETGATSTPEAIPALSETTPTVVGAVSAPAEANTPQDATSAPEETK
1450 1460 1470 1480
>>CCDS12386.1 UPF1 gene_id:5976|Hs108|chr19 (1118 aa)
initn: 269 init1: 133 opt: 324 Z-score: 334.8 bits: 74.2 E(32554): 2.5e-12
Smith-Waterman score: 354; 24.2% identity (52.4% similar) in 492 aa overlap (1013-1481:590-1048)
990 1000 1010 1020 1030 1040
pF1KA0 KGRSYEEDMEIAEGCFRHIKKIFTQLEEFRASELLRSGLDRS-KYLLVKEAKIIAMTCTH
:.: .: :. . :. .: .: ::.
CCDS12 ALHNQIRNMDSMPELQKLQQLKDETGELSSADEKRYRALKRTAERELLMNADVICCTCVG
560 570 580 590 600 610
1050 1060 1070 1080 1090 1100
pF1KA0 AALKRHDLVKLGFKYDNILMEEAAQILEIETFIPLLLQNPQDGFSRLKRWIMIGDHHQLP
:. : :.:. :. .::..:..: : : ..:..: :. :..::: ::
CCDS12 AGDPR--LAKMQFR--SILIDESTQATEPECMVPVVLGA--------KQLILVGDHCQLG
620 630 640 650 660
1110 1120 1130 1140 1150 1160
pF1KA0 PVIKNMAFQKYSNMEQSLFTRFVRVGVPTVDLDAQGRARASLCNL-YNWRYKNLGNLPHV
::. : ... :::: :.: .:. . :..: : . .: . : :. :.: .
CCDS12 PVVMCKKAAK-AGLSQSLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFYE--GSLQN-
670 680 690 700 710 720
1170 1180 1190 1200 1210
pF1KA0 QLLPEFSTANAGLLYDFQLINVED--F----QGVGESEPNPYFYQNLGEAEYVVALFMYM
. . .. : .::: . . : :: : . : : :: : . .
CCDS12 -GVTAADRVKKG--FDFQWPQPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKL
730 740 750 760 770 780
1220 1230 1240 1250 1260 1270
pF1KA0 CLLGYPADKISILTTYNGQKHLIRDIINRRCGNNPLIGRPNKVTTVDRFQGQQNDYILLS
: :.:.:.: :.::. . . .. . . . . ....:: :::...:.:.::
CCDS12 LKAGAKPDQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILS
790 800 810 820 830 840
1280 1290 1300 1310 1320 1330
pF1KA0 LVRT---RAVGHLRDVRRLVVAMSRARLGLYIFARVSLFQNCFELTPAFSQLTARPLHLH
::. ...: : : ::: ::..::: :. : . . ... : ...: . .
CCDS12 CVRANEHQGIGFLNDPRRLNVALTRARYGVIIVGNPKALSK----QPLWNHLLNYYKEQK
850 860 870 880 890
1340 1350 1360 1370 1380 1390
pF1KA0 IIPTEPFPTTRKNGERPSHEVQIIKNMPQMANFVYNMYMHLIQTTHHYHQTLLQLPPAMV
.. :. . :.. . :. ...... : :. :: : .: ..
CCDS12 VLVEGPLNNLRESLMQFSKPRKLVNTINPGARFM---------TTAMYDAREAIIPGSVY
900 910 920 930 940
1400 1410 1420 1430 1440
pF1KA0 EEGEEVQNQETELETEEEAMTVQADI-------IPSPTDTSCRQETPAFQTDTTPSETGA
... . . . ..:... ..: :: : . : . . ...
CCDS12 DRSSQGRPSSMYFQTHDQIGMISAGPSHVAAMNIPIPFNLVMPPMPPPGYFGQANGPAAG
950 960 970 980 990 1000
1450 1460 1470 1480
pF1KA0 TSTPEAIPALSETTPTVVGAVSAPAEANTP-----QDATSAPEETK
.::.. . . . : . .:...: : ::..: :
CCDS12 RGTPKGKTGRGGRQKNRFG-LPGPSQTNLPNSQASQDVASQPFSQGALTQGYISMSQPSQ
1010 1020 1030 1040 1050 1060
CCDS12 MSQPGLSQPELSQDSYLGDEFKSQIDVALSQDSTYQGERAYQHGGVTGLSQY
1070 1080 1090 1100 1110
>>CCDS74315.1 UPF1 gene_id:5976|Hs108|chr19 (1129 aa)
initn: 269 init1: 133 opt: 324 Z-score: 334.8 bits: 74.2 E(32554): 2.5e-12
Smith-Waterman score: 354; 24.2% identity (52.4% similar) in 492 aa overlap (1013-1481:601-1059)
990 1000 1010 1020 1030 1040
pF1KA0 KGRSYEEDMEIAEGCFRHIKKIFTQLEEFRASELLRSGLDRS-KYLLVKEAKIIAMTCTH
:.: .: :. . :. .: .: ::.
CCDS74 ALHNQIRNMDSMPELQKLQQLKDETGELSSADEKRYRALKRTAERELLMNADVICCTCVG
580 590 600 610 620 630
1050 1060 1070 1080 1090 1100
pF1KA0 AALKRHDLVKLGFKYDNILMEEAAQILEIETFIPLLLQNPQDGFSRLKRWIMIGDHHQLP
:. : :.:. :. .::..:..: : : ..:..: :. :..::: ::
CCDS74 AGDPR--LAKMQFR--SILIDESTQATEPECMVPVVLGA--------KQLILVGDHCQLG
640 650 660 670
1110 1120 1130 1140 1150 1160
pF1KA0 PVIKNMAFQKYSNMEQSLFTRFVRVGVPTVDLDAQGRARASLCNL-YNWRYKNLGNLPHV
::. : ... :::: :.: .:. . :..: : . .: . : :. :.: .
CCDS74 PVVMCKKAAK-AGLSQSLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFYE--GSLQN-
680 690 700 710 720 730
1170 1180 1190 1200 1210
pF1KA0 QLLPEFSTANAGLLYDFQLINVED--F----QGVGESEPNPYFYQNLGEAEYVVALFMYM
. . .. : .::: . . : :: : . : : :: : . .
CCDS74 -GVTAADRVKKG--FDFQWPQPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKL
740 750 760 770 780 790
1220 1230 1240 1250 1260 1270
pF1KA0 CLLGYPADKISILTTYNGQKHLIRDIINRRCGNNPLIGRPNKVTTVDRFQGQQNDYILLS
: :.:.:.: :.::. . . .. . . . . ....:: :::...:.:.::
CCDS74 LKAGAKPDQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILS
800 810 820 830 840 850
1280 1290 1300 1310 1320 1330
pF1KA0 LVRT---RAVGHLRDVRRLVVAMSRARLGLYIFARVSLFQNCFELTPAFSQLTARPLHLH
::. ...: : : ::: ::..::: :. : . . ... : ...: . .
CCDS74 CVRANEHQGIGFLNDPRRLNVALTRARYGVIIVGNPKALSK----QPLWNHLLNYYKEQK
860 870 880 890 900
1340 1350 1360 1370 1380 1390
pF1KA0 IIPTEPFPTTRKNGERPSHEVQIIKNMPQMANFVYNMYMHLIQTTHHYHQTLLQLPPAMV
.. :. . :.. . :. ...... : :. :: : .: ..
CCDS74 VLVEGPLNNLRESLMQFSKPRKLVNTINPGARFM---------TTAMYDAREAIIPGSVY
910 920 930 940 950
1400 1410 1420 1430 1440
pF1KA0 EEGEEVQNQETELETEEEAMTVQADI-------IPSPTDTSCRQETPAFQTDTTPSETGA
... . . . ..:... ..: :: : . : . . ...
CCDS74 DRSSQGRPSSMYFQTHDQIGMISAGPSHVAAMNIPIPFNLVMPPMPPPGYFGQANGPAAG
960 970 980 990 1000 1010
1450 1460 1470 1480
pF1KA0 TSTPEAIPALSETTPTVVGAVSAPAEANTP-----QDATSAPEETK
.::.. . . . : . .:...: : ::..: :
CCDS74 RGTPKGKTGRGGRQKNRFG-LPGPSQTNLPNSQASQDVASQPFSQGALTQGYISMSQPSQ
1020 1030 1040 1050 1060 1070
CCDS74 MSQPGLSQPELSQDSYLGDEFKSQIDVALSQDSTYQGERAYQHGGVTGLSQY
1080 1090 1100 1110 1120
1485 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 19:16:02 2016 done: Wed Nov 2 19:16:03 2016
Total Scan time: 2.860 Total Display time: 0.130
Function used was FASTA [36.3.4 Apr, 2011]