FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0488, 369 aa
1>>>pF1KE0488 369 - 369 aa - 369 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5900+/-0.000826; mu= 14.7990+/- 0.050
mean_var=69.8883+/-13.500, 0's: 0 Z-trim(107.9): 12 B-trim: 0 in 0/51
Lambda= 0.153416
statistics sampled from 9858 (9863) to 9858 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.686), E-opt: 0.2 (0.303), width: 16
Scan time: 2.570
The best scores are:                                  opt  bits E(32554)
CCDS13834.2 ASPHD2 gene_id:57168|Hs108|chr22  ( 369) 2644 594.2 5.8e-170
CCDS10660.1 ASPHD1 gene_id:253982|Hs108|chr16 ( 390)  994 229.0 5.3e-60
CCDS55234.1 ASPH gene_id:444|Hs108|chr8       ( 729)  365  89.9 7.3e-18
CCDS34898.1 ASPH gene_id:444|Hs108|chr8       ( 758)  365  90.0 7.6e-18
>>CCDS13834.2 ASPHD2 gene_id:57168|Hs108|chr22 (369 aa)
initn: 2644 init1: 2644 opt: 2644 Z-score: 3164.8 bits: 594.2 E(32554): 5.8e-170
Smith-Waterman score: 2644; 100.0% identity (100.0% similar) in 369 aa overlap (1-369:1-369)
               10        20        30        40        50        60
pF1KE0 MVWAPLGPPRTDCLTLLHTPSKDSPKMSLEWLVAWSWSLDGLRDCIATGIQSVRDCDTTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MVWAPLGPPRTDCLTLLHTPSKDSPKMSLEWLVAWSWSLDGLRDCIATGIQSVRDCDTTA
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE0 VITVACLLVLFVWYCYHVGREQPRPYVSVNSLMQAADANGLQNGYVYCQSPECVRCTHNE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VITVACLLVLFVWYCYHVGREQPRPYVSVNSLMQAADANGLQNGYVYCQSPECVRCTHNE
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE0 GLNQKLYHNLQEYAKRYSWSGMGRIHKGIREQGRYLNSRPSIQKPEVFFLPDLPTTPYFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 GLNQKLYHNLQEYAKRYSWSGMGRIHKGIREQGRYLNSRPSIQKPEVFFLPDLPTTPYFS
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KE0 RDAQKHDVEVLERNFQTILCEFETLYKAFSNCSLPQGWKMNSTPSGEWFTFYLVNQGVCV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 RDAQKHDVEVLERNFQTILCEFETLYKAFSNCSLPQGWKMNSTPSGEWFTFYLVNQGVCV
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KE0 PRNCRKCPRTYRLLGSLRTCIGNNVFGNACISVLSPGTVITEHYGPTNIRIRCHLGLKTP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 PRNCRKCPRTYRLLGSLRTCIGNNVFGNACISVLSPGTVITEHYGPTNIRIRCHLGLKTP
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KE0 NGCELVVGGEPQCWAEGRCLLFDDSFLHAAFHEGSAEDGPRVVFMVDLWHPNVAAAERQA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 NGCELVVGGEPQCWAEGRCLLFDDSFLHAAFHEGSAEDGPRVVFMVDLWHPNVAAAERQA
              310       320       330       340       350       360
pF1KE0 LDFIFAPGR
       :::::::::
CCDS13 LDFIFAPGR
>>CCDS10660.1 ASPHD1 gene_id:253982|Hs108|chr16 (390 aa)
initn: 938 init1: 683 opt: 994 Z-score: 1190.7 bits: 229.0 E(32554): 5.3e-60
Smith-Waterman score: 994; 49.8% identity (72.4% similar) in 315 aa overlap (67-367:87-388)
40 50 60 70 80 90
pF1KE0 WSLDGLRDCIATGIQSVRDCDTTAVITVACLLVLFVWYCYHVGREQPRPYVSVNSLMQAA
: ::.::::..: .. ...: ..
CCDS10 APGLLARASLIMLPWPLPLASSALTLLFGALTSLFLWYCYRLGSQD------MQALGAGS
60 70 80 90 100 110
100 110 120 130 140
pF1KE0 DANGLQNGYVYCQ-----SP----ECVRCTHNEGLNQKLYHNLQEYAKRYSWSGMGRIHK
:.:...: : :. :: . . ..::: .. :. ::.::::.::::...
CCDS10 RAGGVRGGPVGCSEAGGPSPGGPGDPGEGPRTEGLVSR---RLRAYARRYSWAGMGRVRR
120 130 140 150 160
150 160 170 180 190 200
pF1KE0 GIREQGRYLNSR-PS---IQKPEVFFLPDLPTTPYFSRDAQKHDVEVLERNFQTILCEFE
. :: .: :. ::.: ..::::::..:. ::::.::::.:: .: .:: .:
CCDS10 A--AQGGPGPGRGPGVLGIQRPGLLFLPDLPSAPFVPRDAQRHDVELLESSFPAILRDFG
170 180 190 200 210 220
210 220 230 240 250 260
pF1KE0 TLYKAFSNCSLP-QGWKMNSTPSGEWFTFYLVNQGVCVPRNCRKCPRTYRLLGSLRTCIG
.. ::. . : .::. .:. . . : . : : : :::.:: .:: : .::. ..
CCDS10 AVSWDFSGTTPPPRGWSPPLAPGC--YQLLLYQAGRCQPSNCRRCPGAYRALRGLRSFMS
230 240 250 260 270 280
270 280 290 300 310 320
pF1KE0 NNVFGNACISVLSPGTVITEHYGPTNIRIRCHLGLKTPNGCELVVGGEPQCWAEGRCLLF
:.:::: .::: ::. . . :::: :.::::::: : ::::::::::::::::.:::
CCDS10 ANTFGNAGFSVLLPGARLEGRCGPTNARVRCHLGLKIPPGCELVVGGEPQCWAEGHCLLV
290 300 310 320 330 340
330 340 350 360
pF1KE0 DDSFLHAAFHEGSAEDGPRVVFMVDLWHPNVAAAERQALDFIFAPGR
::::::.. :.:: ::::::::.:::::::::.::::::::.:::
CCDS10 DDSFLHTVAHNGSPEDGPRVVFIVDLWHPNVAGAERQALDFVFAPDP
350 360 370 380 390
>>CCDS55234.1 ASPH gene_id:444|Hs108|chr8 (729 aa)
initn: 281 init1: 116 opt: 365 Z-score: 434.1 bits: 89.9 E(32554): 7.3e-18
Smith-Waterman score: 370; 30.1% identity (56.2% similar) in 249 aa overlap (125-364:497-729)
100 110 120 130 140
pF1KE0 AADANGLQNGYVYCQSPECVRCTHNEGLNQKLYHNLQEYAKR------YSWSGMGRIHKG
..: .: . .: :.: .: ::
CCDS55 FILKAQNKIAESIPYLKEGIESGDPGTDDGRFYFHLGDAMQRVGNKEAYKWYELG--HK-
470 480 490 500 510 520
150 160 170 180 190 200
pF1KE0 IREQGRYLNSRPSIQKPEVFFLPDLPTTPYFSRDAQKHD--VEVLERNFQTILCEFETLY
.:.. :. . .. . : . :... . :. ::::.. : : ...
CCDS55 ---RGHF----ASVWQRSLYNVNGLKAQPWWTPKETGYTELVKSLERNWKLIRDEGLAVM
530 540 550 560 570
210 220 230 240 250 260
pF1KE0 KAFSNCSLPQGWKMNSTPSGEWFTFYLVNQGVCVPRNCRKCPRTYRLLGSLRTCIGNNVF
.. ::. : .:.: : : .:: :. :.: :: .. :
CCDS55 DKAKGLFLPED--ENLREKGDWSQFTLWQQGRRNENACKGAPKTCTLLEKFPETTGCRR-
580 590 600 610 620 630
270 280 290 300 310 320
pF1KE0 GNACISVLSPGTVITEHYGPTNIRIRCHLGLKTPN-GCELVVGGEPQCWAEGRCLLFDDS
:. :.. ::: . : :::: :.: :::: :. ::.. ..: . : ::. :.::::
CCDS55 GQIKYSIMHPGTHVWPHTGPTNCRLRMHLGLVIPKEGCKIRCANETKTWEEGKVLIFDDS
640 650 660 670 680 690
330 340 350 360
pF1KE0 FLHAAFHEGSAEDGPRVVFMVDLWHPNVAAAERQALDFIFAPGR
: : .....:. :..:.::.:::... .:..: :
CCDS55 FEHEVWQDASSF---RLIFIVDVWHPELTPQQRRSLPAI
700 710 720
>>CCDS34898.1 ASPH gene_id:444|Hs108|chr8 (758 aa)
initn: 281 init1: 116 opt: 365 Z-score: 433.9 bits: 90.0 E(32554): 7.6e-18
Smith-Waterman score: 370; 30.1% identity (56.2% similar) in 249 aa overlap (125-364:526-758)
100 110 120 130 140
pF1KE0 AADANGLQNGYVYCQSPECVRCTHNEGLNQKLYHNLQEYAKR------YSWSGMGRIHKG
..: .: . .: :.: .: ::
CCDS34 FILKAQNKIAESIPYLKEGIESGDPGTDDGRFYFHLGDAMQRVGNKEAYKWYELG--HK-
500 510 520 530 540 550
150 160 170 180 190 200
pF1KE0 IREQGRYLNSRPSIQKPEVFFLPDLPTTPYFSRDAQKHD--VEVLERNFQTILCEFETLY
.:.. :. . .. . : . :... . :. ::::.. : : ...
CCDS34 ---RGHF----ASVWQRSLYNVNGLKAQPWWTPKETGYTELVKSLERNWKLIRDEGLAVM
560 570 580 590 600
210 220 230 240 250 260
pF1KE0 KAFSNCSLPQGWKMNSTPSGEWFTFYLVNQGVCVPRNCRKCPRTYRLLGSLRTCIGNNVF
.. ::. : .:.: : : .:: :. :.: :: .. :
CCDS34 DKAKGLFLPED--ENLREKGDWSQFTLWQQGRRNENACKGAPKTCTLLEKFPETTGCRR-
610 620 630 640 650 660
270 280 290 300 310 320
pF1KE0 GNACISVLSPGTVITEHYGPTNIRIRCHLGLKTPN-GCELVVGGEPQCWAEGRCLLFDDS
:. :.. ::: . : :::: :.: :::: :. ::.. ..: . : ::. :.::::
CCDS34 GQIKYSIMHPGTHVWPHTGPTNCRLRMHLGLVIPKEGCKIRCANETKTWEEGKVLIFDDS
670 680 690 700 710 720
330 340 350 360
pF1KE0 FLHAAFHEGSAEDGPRVVFMVDLWHPNVAAAERQALDFIFAPGR
: : .....:. :..:.::.:::... .:..: :
CCDS34 FEHEVWQDASSF---RLIFIVDVWHPELTPQQRRSLPAI
730 740 750
369 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 04:45:02 2016 done: Thu Nov 3 04:45:03 2016
Total Scan time: 2.570 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]