FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB0950, 407 aa
1>>>pF1KB0950 407 - 407 aa - 407 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.4046+/-0.000791; mu= 8.6532+/- 0.047
mean_var=158.6809+/-31.857, 0's: 0 Z-trim(114.1): 8 B-trim: 70 in 1/51
Lambda= 0.101815
statistics sampled from 14674 (14678) to 14674 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.451), width: 16
Scan time: 3.430
The best scores are: opt bits E(32554)
CCDS12567.1 EGLN2 gene_id:112398|Hs108|chr19 ( 407) 2845 429.4 2.9e-120
CCDS1595.1 EGLN1 gene_id:54583|Hs108|chr1 ( 426) 1147 180.0 3.7e-45
CCDS9646.1 EGLN3 gene_id:112399|Hs108|chr14 ( 239) 1002 158.5 6.1e-39
CCDS76671.1 EGLN3 gene_id:112399|Hs108|chr14 ( 145) 662 108.4 4.5e-24
>>CCDS12567.1 EGLN2 gene_id:112398|Hs108|chr19 (407 aa)
initn: 2845 init1: 2845 opt: 2845 Z-score: 2272.5 bits: 429.4 E(32554): 2.9e-120
Smith-Waterman score: 2845; 100.0% identity (100.0% similar) in 407 aa overlap (1-407:1-407)
10 20 30 40 50 60
pF1KB0 MDSPCQPQPLSQALPQLPGSSSEPLEPEPGRARMGVESYLPCPLLPSYHCPGVPSEASAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MDSPCQPQPLSQALPQLPGSSSEPLEPEPGRARMGVESYLPCPLLPSYHCPGVPSEASAG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB0 SGTPRATATSTTASPLRDGFGGQDGGELRPLQSEGAAALVTKGCQRLAAQGARPEAPKRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SGTPRATATSTTASPLRDGFGGQDGGELRPLQSEGAAALVTKGCQRLAAQGARPEAPKRK
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB0 WAEDGGDAPSPSKRPWARQENQEAEREGGMSCSCSSGSGEASAGLMEEALPSAPERLALD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 WAEDGGDAPSPSKRPWARQENQEAEREGGMSCSCSSGSGEASAGLMEEALPSAPERLALD
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB0 YIVPCMRYYGICVKDSFLGAALGGRVLAEVEALKRGGRLRDGQLVSQRAIPPRSIRGDQI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 YIVPCMRYYGICVKDSFLGAALGGRVLAEVEALKRGGRLRDGQLVSQRAIPPRSIRGDQI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB0 AWVEGHEPGCRSIGALMAHVDAVIRHCAGRLGSYVINGRTKAMVACYPGNGLGYVRHVDN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 AWVEGHEPGCRSIGALMAHVDAVIRHCAGRLGSYVINGRTKAMVACYPGNGLGYVRHVDN
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB0 PHGDGRCITCIYYLNQNWDVKVHGGLLQIFPEGRPVVANIEPLFDRLLIFWSDRRNPHEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PHGDGRCITCIYYLNQNWDVKVHGGLLQIFPEGRPVVANIEPLFDRLLIFWSDRRNPHEV
310 320 330 340 350 360
370 380 390 400
pF1KB0 KPAYATRYAITVWYFDAKERAAAKDKYQLASGQKGVQVPVSQPPTPT
:::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 KPAYATRYAITVWYFDAKERAAAKDKYQLASGQKGVQVPVSQPPTPT
370 380 390 400
>>CCDS1595.1 EGLN1 gene_id:54583|Hs108|chr1 (426 aa)
initn: 1109 init1: 1079 opt: 1147 Z-score: 924.3 bits: 180.0 E(32554): 3.7e-45
Smith-Waterman score: 1158; 50.7% identity (70.4% similar) in 371 aa overlap (35-403:60-417)
10 20 30 40 50 60
pF1KB0 CQPQPLSQALPQLPGSSSEPLEPEPGRARMGVESYLPCPLLPSYHCPGVPSEASAGSGTP
: :. : . : : .: :. :
CCDS15 LLRCSRCRSSFYCCKEHQRQDWKKHKLVCQGSEGALGHGVGPHQHSGPAP---PAAVPPP
30 40 50 60 70 80
70 80 90 100 110 120
pF1KB0 RATATSTT-ASPLRDGFGGQDG-GELRPLQSEGAAALVTKGCQRLAAQGARPEAPKRKWA
:: : :. ::. .:. . :... :: .. :. :: :.. : . :
CCDS15 RAGAREPRKAAARRDNASGDAAKGKVKAKPPADPAAAASP-CR--AAAGGQGSAVAAE-A
90 100 110 120 130 140
130 140 150 160 170 180
pF1KB0 EDGGDAPSPSKRPWARQENQEAEREGGMSCSCSSGSGEASAGLMEEALPSAPERLALDYI
: : . : :.. ... . . . . : :.: : . . ::. .:::.::
CCDS15 EPGKEEP-PARSSLFQEKANLYPPSNTPGDALSPGGGLRPNG-QTKPLPAL--KLALEYI
150 160 170 180 190
190 200 210 220 230 240
pF1KB0 VPCMRYYGICVKDSFLGAALGGRVLAEVEALKRGGRLRDGQLVSQRAIPPRSIRGDQIAW
:::: .:::: :.::: : .. ::.::. :.. :::::::.. ..::::.:.:
CCDS15 VPCMNKHGICVVDDFLGKETGQQIGDEVRALHDTGKFTDGQLVSQKSDSSKDIRGDKITW
200 210 220 230 240 250
250 260 270 280 290 300
pF1KB0 VEGHEPGCRSIGALMAHVDAVIRHCAGRLGSYVINGRTKAMVACYPGNGLGYVRHVDNPH
.::.::::..:: ::. .: .:::: :.:::: :::::::::::::::: :::::::::.
CCDS15 IEGKEPGCETIGLLMSSMDDLIRHCNGKLGSYKINGRTKAMVACYPGNGTGYVRHVDNPN
260 270 280 290 300 310
310 320 330 340 350 360
pF1KB0 GDGRCITCIYYLNQNWDVKVHGGLLQIFPEGRPVVANIEPLFDRLLIFWSDRRNPHEVKP
:::::.:::::::..::.:: ::.:.:::::. :.::: :::::.:::::::::::.:
CCDS15 GDGRCVTCIYYLNKDWDAKVSGGILRIFPEGKAQFADIEPKFDRLLFFWSDRRNPHEVQP
320 330 340 350 360 370
370 380 390 400
pF1KB0 AYATRYAITVWYFDAKERAAAKDKYQLASGQKGVQVPVSQPPTPT
::::::::::::::: ::: :: :: .:.:::.: ...:
CCDS15 AYATRYAITVWYFDADERARAKVKYL--TGEKGVRVELNKPSDSVGKDVF
380 390 400 410 420
>>CCDS9646.1 EGLN3 gene_id:112399|Hs108|chr14 (239 aa)
initn: 1025 init1: 820 opt: 1002 Z-score: 812.6 bits: 158.5 E(32554): 6.1e-39
Smith-Waterman score: 1002; 61.9% identity (86.0% similar) in 215 aa overlap (175-388:12-226)
150 160 170 180 190 200
pF1KB0 EREGGMSCSCSSGSGEASAGLMEEALPSAPERLALDYIVPCMRYYGICVKDSFLGAALGG
:..::.:::::.. :.: :.::: ..:
CCDS96 MPLGHIMRLDLEKIALEYIVPCLHEVGFCYLDNFLGEVVGD
10 20 30 40
210 220 230 240 250 260
pF1KB0 RVLAEVEALKRGGRLRDGQLVSQRA-IPPRSIRGDQIAWVEGHEPGCRSIGALMAHVDAV
:: .:. :. : ::::::.. :: . : .:::::.:. :.: ::..:. :.. .: .
CCDS96 CVLERVKQLHCTGALRDGQLAGPRAGVSKRHLRGDQITWIGGNEEGCEAISFLLSLIDRL
50 60 70 80 90 100
270 280 290 300 310 320
pF1KB0 IRHCAGRLGSYVINGRTKAMVACYPGNGLGYVRHVDNPHGDGRCITCIYYLNQNWDVKVH
. .:..:::.: .. :.::::::::::: :::::::::.:::::::::::::.:::.:.:
CCDS96 VLYCGSRLGKYYVKERSKAMVACYPGNGTGYVRHVDNPNGDGRCITCIYYLNKNWDAKLH
110 120 130 140 150 160
330 340 350 360 370 380
pF1KB0 GGLLQIFPEGRPVVANIEPLFDRLLIFWSDRRNPHEVKPAYATRYAITVWYFDAKERAAA
::.:.:::::. .:..::.:::::.:::::::::::.:.::::::.:::::::.::: :
CCDS96 GGILRIFPEGKSFIADVEPIFDRLLFFWSDRRNPHEVQPSYATRYAMTVWYFDAEERAEA
170 180 190 200 210 220
390 400
pF1KB0 KDKYQLASGQKGVQVPVSQPPTPT
: :..
CCDS96 KKKFRNLTRKTESALTED
230
>>CCDS76671.1 EGLN3 gene_id:112399|Hs108|chr14 (145 aa)
initn: 716 init1: 655 opt: 662 Z-score: 545.6 bits: 108.4 E(32554): 4.5e-24
Smith-Waterman score: 662; 73.9% identity (92.2% similar) in 115 aa overlap (274-388:18-132)
250 260 270 280 290 300
pF1KB0 EGHEPGCRSIGALMAHVDAVIRHCAGRLGSYVINGRTKAMVACYPGNGLGYVRHVDNPHG
:.. .:::::::::: :::::::::.:
CCDS76 MPLGHIMRLDLEKIALEYIVPCLHEAMVACYPGNGTGYVRHVDNPNG
10 20 30 40
310 320 330 340 350 360
pF1KB0 DGRCITCIYYLNQNWDVKVHGGLLQIFPEGRPVVANIEPLFDRLLIFWSDRRNPHEVKPA
::::::::::::.:::.:.:::.:.:::::. .:..::.:::::.:::::::::::.:.
CCDS76 DGRCITCIYYLNKNWDAKLHGGILRIFPEGKSFIADVEPIFDRLLFFWSDRRNPHEVQPS
50 60 70 80 90 100
370 380 390 400
pF1KB0 YATRYAITVWYFDAKERAAAKDKYQLASGQKGVQVPVSQPPTPT
::::::.:::::::.::: :: :..
CCDS76 YATRYAMTVWYFDAEERAEAKKKFRNLTRKTESALTED
110 120 130 140
407 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 17:25:39 2016 done: Sat Nov 5 17:25:39 2016
Total Scan time: 3.430 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]