FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9295, 203 aa
1>>>pF1KB9295 203 - 203 aa - 203 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.2973+/-0.000823; mu= 5.4769+/- 0.050
mean_var=152.9170+/-30.410, 0's: 0 Z-trim(113.4): 24 B-trim: 325 in 1/50
Lambda= 0.103716
statistics sampled from 14037 (14056) to 14037 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.779), E-opt: 0.2 (0.432), width: 16
Scan time: 1.460
The best scores are: opt bits E(32554)
CCDS32314.1 HDGFRP3 gene_id:50810|Hs108|chr15 ( 203) 1376 216.6 8.1e-57
CCDS1156.1 HDGF gene_id:3068|Hs108|chr1 ( 240) 624 104.2 6.9e-23
CCDS83348.1 PSIP1 gene_id:11168|Hs108|chr9 ( 329) 613 102.6 2.7e-22
CCDS6480.1 PSIP1 gene_id:11168|Hs108|chr9 ( 333) 613 102.7 2.8e-22
CCDS6479.1 PSIP1 gene_id:11168|Hs108|chr9 ( 530) 613 102.8 3.9e-22
CCDS59336.1 HDGFRP2 gene_id:84717|Hs108|chr19 ( 670) 606 101.8 9.7e-22
CCDS42472.1 HDGFRP2 gene_id:84717|Hs108|chr19 ( 671) 606 101.8 9.7e-22
CCDS44247.1 HDGF gene_id:3068|Hs108|chr1 ( 256) 470 81.2 6.3e-16
CCDS44248.1 HDGF gene_id:3068|Hs108|chr1 ( 233) 465 80.4 9.8e-16
CCDS34347.1 HDGFL1 gene_id:154150|Hs108|chr6 ( 251) 449 78.0 5.4e-15
>>CCDS32314.1 HDGFRP3 gene_id:50810|Hs108|chr15 (203 aa)
initn: 1376 init1: 1376 opt: 1376 Z-score: 1133.5 bits: 216.6 E(32554): 8.1e-57
Smith-Waterman score: 1376; 100.0% identity (100.0% similar) in 203 aa overlap (1-203:1-203)
10 20 30 40 50 60
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSETEGEGGNTADA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSETEGEGGNTADA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 SSEEEGDRVEEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGDEDDKDCKEEENKSSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 SSEEEGDRVEEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGDEDDKDCKEEENKSSS
130 140 150 160 170 180
190 200
pF1KB9 EGGDAGNDTRNTTSDLQKTSEGT
:::::::::::::::::::::::
CCDS32 EGGDAGNDTRNTTSDLQKTSEGT
190 200
>>CCDS1156.1 HDGF gene_id:3068|Hs108|chr1 (240 aa)
initn: 636 init1: 576 opt: 624 Z-score: 524.4 bits: 104.2 E(32554): 6.9e-23
Smith-Waterman score: 661; 51.5% identity (69.6% similar) in 227 aa overlap (1-203:1-225)
10 20 30 40 50
pF1KB9 MARP-RPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGP
:.: : .::: ::::::::::::::::::::.::.::: :::: .:::::::::::::
CCDS11 MSRSNRQKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANKYQVFFFGTHETAFLGP
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB9 KDLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSE--------TE
::::::.: :.:::: ::::::.:::::::::: :: .:::. :..: : .:
CCDS11 KDLFPYEESKEKFGKPNKRKGFSEGLWEIENNPTVKASGYQSSQKKSCVEEPEPEPEAAE
70 80 90 100 110 120
120 130 140 150 160
pF1KB9 GEG---GNTADASSEEEGDRVEEDGKGKRKNEKAGSKRKKS---YTSKKSSKQSRKSPGD
:.: :: :..::.::: : : .:.::::.. ::. . : : :.... :.
CCDS11 GDGDKKGN-AEGSSDEEGKLVI-DEPAKEKNEKGALKRRAGDLLEDSPKRPKEAENPEGE
130 140 150 160 170
170 180 190 200
pF1KB9 EDDKDCKE---------EENKSSSEGGDAGNDTRNTTSDLQKTSEGT
: . : :.:.. :: :.. . .. . .. :.:
CCDS11 EKEAATLEVERPLPMEVEKNSTPSEPGSGRGPPQEEEEEEDEEEEATKEDAEAPGIRDHE
180 190 200 210 220 230
CCDS11 SL
240
>>CCDS83348.1 PSIP1 gene_id:11168|Hs108|chr9 (329 aa)
initn: 604 init1: 572 opt: 613 Z-score: 513.6 bits: 102.6 E(32554): 2.7e-22
Smith-Waterman score: 613; 55.6% identity (78.8% similar) in 160 aa overlap (7-166:3-155)
10 20 30 40 50 60
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
:..: :::.:::::::::::::.::.:.::::::.:: ::::::::::::::::
CCDS83 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPK
10 20 30 40 50
70 80 90 100 110 120
pF1KB9 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSETEGEGGNTADA
:.:::.: :.:.:: :::::::::::::.::: :::.. :: .::.. .. : . .
CCDS83 DIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETS
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB9 SSEEEGDRVEEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGDEDDKDCKEEENKSSS
:.:. :. : : .:: . . . :. :.....:: ...
CCDS83 VSKEDTDHEE-----KASNEDV--TKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTAT
120 130 140 150 160
190 200
pF1KB9 EGGDAGNDTRNTTSDLQKTSEGT
CCDS83 ASVNLKVSPKRGRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKK
170 180 190 200 210 220
>>CCDS6480.1 PSIP1 gene_id:11168|Hs108|chr9 (333 aa)
initn: 604 init1: 572 opt: 613 Z-score: 513.6 bits: 102.7 E(32554): 2.8e-22
Smith-Waterman score: 613; 55.6% identity (78.8% similar) in 160 aa overlap (7-166:3-155)
10 20 30 40 50 60
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
:..: :::.:::::::::::::.::.:.::::::.:: ::::::::::::::::
CCDS64 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPK
10 20 30 40 50
70 80 90 100 110 120
pF1KB9 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSETEGEGGNTADA
:.:::.: :.:.:: :::::::::::::.::: :::.. :: .::.. .. : . .
CCDS64 DIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETS
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB9 SSEEEGDRVEEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGDEDDKDCKEEENKSSS
:.:. :. : : .:: . . . :. :.....:: ...
CCDS64 VSKEDTDHEE-----KASNEDV--TKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTAT
120 130 140 150 160
190 200
pF1KB9 EGGDAGNDTRNTTSDLQKTSEGT
CCDS64 ASVNLKVSPKRGRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKK
170 180 190 200 210 220
>>CCDS6479.1 PSIP1 gene_id:11168|Hs108|chr9 (530 aa)
initn: 626 init1: 572 opt: 613 Z-score: 510.8 bits: 102.8 E(32554): 3.9e-22
Smith-Waterman score: 613; 55.6% identity (78.8% similar) in 160 aa overlap (7-166:3-155)
10 20 30 40 50 60
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
:..: :::.:::::::::::::.::.:.::::::.:: ::::::::::::::::
CCDS64 MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFFFGTHETAFLGPK
10 20 30 40 50
70 80 90 100 110 120
pF1KB9 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSETEGEGGNTADA
:.:::.: :.:.:: :::::::::::::.::: :::.. :: .::.. .. : . .
CCDS64 DIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQSNASSDVEVEEKETS
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB9 SSEEEGDRVEEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGDEDDKDCKEEENKSSS
:.:. :. : : .:: . . . :. :.....:: ...
CCDS64 VSKEDTDHEE-----KASNEDV--TKAVDITTPKAARRGRKRKAEKQVETEEAGVVTTAT
120 130 140 150 160
190 200
pF1KB9 EGGDAGNDTRNTTSDLQKTSEGT
CCDS64 ASVNLKVSPKRGRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQPKK
170 180 190 200 210 220
>>CCDS59336.1 HDGFRP2 gene_id:84717|Hs108|chr19 (670 aa)
initn: 704 init1: 571 opt: 606 Z-score: 503.8 bits: 101.8 E(32554): 9.7e-22
Smith-Waterman score: 645; 50.0% identity (73.5% similar) in 200 aa overlap (6-188:2-201)
10 20 30 40 50 60
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
:. .: :::::::::::::::::::.. .:::::: :::::::::::::::::::
CCDS59 MPHAFKPGDLVFAKMKGYPHWPARIDDIADGAVKPPPNKYPIFFFGTHETAFLGPK
10 20 30 40 50
70 80 90 100 110
pF1KB9 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSETEGE--GGNTA
::::: . :::.:: :::::::::::::.::: ..... .....: :.. :. :
CCDS59 DLFPYDKCKDKYGKPNKRKGFNEGLWEIQNNPHASYSAPPPVSSSDSEAPEANPADGSDA
60 70 80 90 100 110
120 130 140 150 160
pF1KB9 DASSEEEG-------------DRVEEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGD
: ..:..: ::.: :. . ......: ::: . . ::..::. .:
CCDS59 DEDDEDRGVMAVTAVTATAASDRMESDSDSDKSSDNSGLKRKTPALKMSVSKRARKASSD
120 130 140 150 160 170
170 180 190 200
pF1KB9 EDDKDCK--EEENKSSSEGGDAGNDTRNTTSDLQKTSEGT
:. . . ::::. :: .. .:
CCDS59 LDQASVSPSEEENSESSSESEKTSDQDFTPEKKAAVRAPRRGPLGGRKKKKAPSASDSDS
180 190 200 210 220 230
>>CCDS42472.1 HDGFRP2 gene_id:84717|Hs108|chr19 (671 aa)
initn: 704 init1: 571 opt: 606 Z-score: 503.8 bits: 101.8 E(32554): 9.7e-22
Smith-Waterman score: 645; 50.0% identity (73.5% similar) in 200 aa overlap (6-188:2-201)
10 20 30 40 50 60
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
:. .: :::::::::::::::::::.. .:::::: :::::::::::::::::::
CCDS42 MPHAFKPGDLVFAKMKGYPHWPARIDDIADGAVKPPPNKYPIFFFGTHETAFLGPK
10 20 30 40 50
70 80 90 100 110
pF1KB9 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSETEGE--GGNTA
::::: . :::.:: :::::::::::::.::: ..... .....: :.. :. :
CCDS42 DLFPYDKCKDKYGKPNKRKGFNEGLWEIQNNPHASYSAPPPVSSSDSEAPEANPADGSDA
60 70 80 90 100 110
120 130 140 150 160
pF1KB9 DASSEEEG-------------DRVEEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGD
: ..:..: ::.: :. . ......: ::: . . ::..::. .:
CCDS42 DEDDEDRGVMAVTAVTATAASDRMESDSDSDKSSDNSGLKRKTPALKMSVSKRARKASSD
120 130 140 150 160 170
170 180 190 200
pF1KB9 EDDKDCK--EEENKSSSEGGDAGNDTRNTTSDLQKTSEGT
:. . . ::::. :: .. .:
CCDS42 LDQASVSPSEEENSESSSESEKTSDQDFTPEKKAAVRAPRRGPLGGRKKKKAPSASDSDS
180 190 200 210 220 230
>>CCDS44247.1 HDGF gene_id:3068|Hs108|chr1 (256 aa)
initn: 486 init1: 426 opt: 470 Z-score: 399.5 bits: 81.2 E(32554): 6.3e-16
Smith-Waterman score: 507; 47.3% identity (66.7% similar) in 201 aa overlap (26-203:43-241)
10 20 30 40 50
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETA
: :::.::.::: :::: .:::::::::
CCDS44 LGHLLATKLKRFLLSKGGRRAQIPDVSRATPHTIDEMPEAAVKSTANKYQVFFFGTHETA
20 30 40 50 60 70
60 70 80 90 100
pF1KB9 FLGPKDLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSE------
::::::::::.: :.:::: ::::::.:::::::::: :: .:::. :..: :
CCDS44 FLGPKDLFPYEESKEKFGKPNKRKGFSEGLWEIENNPTVKASGYQSSQKKSCVEEPEPEP
80 90 100 110 120 130
110 120 130 140 150 160
pF1KB9 --TEGEG---GNTADASSEEEGDRVEEDGKGKRKNEKAGSKRKKS---YTSKKSSKQSRK
.::.: :: :..::.::: : : .:.::::.. ::. . : : :....
CCDS44 EAAEGDGDKKGN-AEGSSDEEGKLVI-DEPAKEKNEKGALKRRAGDLLEDSPKRPKEAEN
140 150 160 170 180 190
170 180 190 200
pF1KB9 SPGDEDDKDCKE---------EENKSSSEGGDAGNDTRNTTSDLQKTSEGT
:.: . : :.:.. :: :.. . .. . .. :.:
CCDS44 PEGEEKEAATLEVERPLPMEVEKNSTPSEPGSGRGPPQEEEEEEDEEEEATKEDAEAPGI
200 210 220 230 240 250
CCDS44 RDHESL
>>CCDS44248.1 HDGF gene_id:3068|Hs108|chr1 (233 aa)
initn: 488 init1: 428 opt: 465 Z-score: 396.0 bits: 80.4 E(32554): 9.8e-16
Smith-Waterman score: 502; 47.5% identity (67.0% similar) in 200 aa overlap (27-203:21-218)
10 20 30 40 50 60
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
: :::.::.::: :::: .::::::::::::::
CCDS44 MEQRAGGNRVQTSTLNCAGAAVIDEMPEAAVKSTANKYQVFFFGTHETAFLGPK
10 20 30 40 50
70 80 90 100 110
pF1KB9 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSSE--------TEG
:::::.: :.:::: ::::::.:::::::::: :: .:::. :..: : .::
CCDS44 DLFPYEESKEKFGKPNKRKGFSEGLWEIENNPTVKASGYQSSQKKSCVEEPEPEPEAAEG
60 70 80 90 100 110
120 130 140 150 160
pF1KB9 EG---GNTADASSEEEGDRVEEDGKGKRKNEKAGSKRKKS---YTSKKSSKQSRKSPGDE
.: :: :..::.::: : : .:.::::.. ::. . : : :.... :.:
CCDS44 DGDKKGN-AEGSSDEEGKLVI-DEPAKEKNEKGALKRRAGDLLEDSPKRPKEAENPEGEE
120 130 140 150 160 170
170 180 190 200
pF1KB9 DDKDCKE---------EENKSSSEGGDAGNDTRNTTSDLQKTSEGT
. : :.:.. :: :.. . .. . .. :.:
CCDS44 KEAATLEVERPLPMEVEKNSTPSEPGSGRGPPQEEEEEEDEEEEATKEDAEAPGIRDHES
180 190 200 210 220 230
CCDS44 L
>>CCDS34347.1 HDGFL1 gene_id:154150|Hs108|chr6 (251 aa)
initn: 483 init1: 301 opt: 449 Z-score: 382.6 bits: 78.0 E(32554): 5.4e-15
Smith-Waterman score: 449; 41.2% identity (66.0% similar) in 194 aa overlap (9-195:9-192)
10 20 30 40 50 60
pF1KB9 MARPRPREYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK
::.:::::::.::: ::::::... ..: :.: .:::::::::::.::
CCDS34 MSAYGMPMYKSGDLVFAKLKGYAHWPARIEHM----TQP--NRYQVFFFGTHETAFLSPK
10 20 30 40 50
70 80 90 100 110
pF1KB9 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENNPGVKFTGYQAIQQQSSS-----ETEG-EG
:::::: :.:::: :::.::. ::::::::: :. . ....:. : :. ::
CCDS34 RLFPYKECKEKFGKPNKRRGFSAGLWEIENNPTVQASDCPLASEKGSGDGPWPEPEAAEG
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB9 GNTADASSEEEGDRV-EEDGKGKRKNEKAGSKRKKSYTSKKSSKQSRKSPGDEDDKDCKE
. . . ::.. . : ..::. ::. . . . :. ... :... :
CCDS34 DEDKPTHAGGGGDELGKPDDDKPTEEEKGPLKRSAGDPPEDAPKRPKEAAPDQEE----E
120 130 140 150 160 170
180 190 200
pF1KB9 EENKSSSEGGDAGNDTRNTTSDLQKTSEGT
: . ..:. :. . :. :
CCDS34 AEAERAAEAERAAAAAAATAVDEESPFLVAVENGSAPSEPGLVCEPPQPEEEELREEEVA
180 190 200 210 220 230
203 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 13:17:13 2016 done: Sat Nov 5 13:17:14 2016
Total Scan time: 1.460 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]