FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0454, 294 aa
1>>>pF1KE0454 294 - 294 aa - 294 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9253+/-0.000863; mu= 11.9189+/- 0.052
mean_var=67.2663+/-13.426, 0's: 0 Z-trim(106.3): 17 B-trim: 0 in 0/50
Lambda= 0.156378
statistics sampled from 8920 (8931) to 8920 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.653), E-opt: 0.2 (0.274), width: 16
Scan time: 2.350
The best scores are:                                opt  bits E(32554)
CCDS53830.1 GPN3 gene_id:51184|Hs108|chr12  ( 294) 1985 456.6 9.7e-129
CCDS9147.1 GPN3 gene_id:51184|Hs108|chr12   ( 284) 1809 416.9 8.4e-117
CCDS53831.1 GPN3 gene_id:51184|Hs108|chr12  ( 323) 1805 416.0 1.8e-116
CCDS289.1 GPN2 gene_id:54707|Hs108|chr1     ( 310)  657 157.0 1.6e-38
CCDS1760.2 GPN1 gene_id:11321|Hs108|chr2    ( 388)  288  73.8 2.2e-13
CCDS46248.1 GPN1 gene_id:11321|Hs108|chr2   ( 362)  277  71.3 1.2e-12
>>CCDS53830.1 GPN3 gene_id:51184|Hs108|chr12 (294 aa)
initn: 1985 init1: 1985 opt: 1985 Z-score: 2424.7 bits: 456.6 E(32554): 9.7e-129
Smith-Waterman score: 1985; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294)
10 20 30 40 50 60
pF1KE0 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 IELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 IELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQV
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 NIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 NIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRF
190 200 210 220 230 240
250 260 270 280 290
pF1KE0 LPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 LPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
250 260 270 280 290
>>CCDS9147.1 GPN3 gene_id:51184|Hs108|chr12 (284 aa)
initn: 1809 init1: 1809 opt: 1809 Z-score: 2210.4 bits: 416.9 E(32554): 8.4e-117
Smith-Waterman score: 1884; 96.6% identity (96.6% similar) in 294 aa overlap (1-294:1-284)
10 20 30 40 50 60
pF1KE0 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
:::::::::::::::: ::::::::::::::::::::::::::::::::::
CCDS91 MPRYAQLVMGPAGSGK----------STYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
10 20 30 40 50
70 80 90 100 110 120
pF1KE0 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS91 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQ
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE0 IELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS91 IELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQV
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE0 NIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS91 NIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRF
180 190 200 210 220 230
250 260 270 280 290
pF1KE0 LPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS91 LPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
240 250 260 270 280
>>CCDS53831.1 GPN3 gene_id:51184|Hs108|chr12 (323 aa)
initn: 1805 init1: 1805 opt: 1805 Z-score: 2204.6 bits: 416.0 E(32554): 1.8e-116
Smith-Waterman score: 1805; 99.6% identity (100.0% similar) in 269 aa overlap (26-294:55-323)
10 20 30 40 50
pF1KE0 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEH
.:::::::::::::::::::::::::::::
CCDS53 SPYRSNVCTQTTDRSTWRKDAELYLLSVITQSTYCATMVQHCEALNRSVQVVNLDPAAEH
30 40 50 60 70 80
60 70 80 90 100 110
pF1KE0 FNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 FNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILF
90 100 110 120 130 140
120 130 140 150 160 170
pF1KE0 DCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 DCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISL
150 160 170 180 190 200
180 190 200 210 220 230
pF1KE0 EIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDY
210 220 230 240 250 260
240 250 260 270 280 290
pF1KE0 SMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 SMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
270 280 290 300 310 320
>>CCDS289.1 GPN2 gene_id:54707|Hs108|chr1 (310 aa)
initn: 684 init1: 399 opt: 657 Z-score: 805.2 bits: 157.0 E(32554): 1.6e-38
Smith-Waterman score: 681; 40.5% identity (70.5% similar) in 264 aa overlap (4-264:10-261)
10 20 30 40 50
pF1KE0 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAE
..: :.:: :::: .::: : . .::.: : ::::::: :
CCDS28 MAGAAPTTAFGQAVIGPPGSGK----------TTYCLGMSEFLRALGRRVAVVNLDPANE
10 20 30 40 50
60 70 80 90 100 110
pF1KE0 HFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYIL
. : .:. ::. . ::: :.::.::::::..::::. :.:::. : .. :.:
CCDS28 GLPYECAVDVGELVGLGDVM--DALRLGPNGGLLYCMEYLEANLDWLRAKLDPLRGHYFL
60 70 80 90 100
120 130 140 150 160 170
pF1KE0 FDCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMIS
::::::.:: :: ..... .:. ::..:. .: ::::.. .. :::: . ..:..:.
CCDS28 FDCPGQVELCTHHGALRSIFSQMAQWDLRLTAVHLVDSHYCTDPAKFISVLCTSLATMLH
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE0 LEIPQVNIMTKMDLLSKKAKK--EIEKFLDP-DMYSLLEDSTSDLRSKKFKKLTKAICGL
.:.:..:...::::. . .: ... . . :. ::. .:: .....:.. . :
CCDS28 VELPHINLLSKMDLIEHYGKLAFNLDYYTEVLDLSYLLDHLASDPFFRHYRQLNEKLVQL
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE0 IDDYSMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQEC
:.:::.: :.: . .:.::.. ::: .: : :
CCDS28 IEDYSLVSFIPLNIQDKESIQRVLQAVDKANGYCFRAQEQRSLEAMMSAAMGADFHFSST
230 240 250 260 270 280
pF1KE0 QDE
CCDS28 LGIQEKYLAPSNQSVEQEAMQL
290 300 310
>>CCDS1760.2 GPN1 gene_id:11321|Hs108|chr2 (388 aa)
initn: 183 init1: 85 opt: 288 Z-score: 353.6 bits: 73.8 E(32554): 2.2e-13
Smith-Waterman score: 304; 24.8% identity (58.1% similar) in 298 aa overlap (2-294:30-307)
10 20 30
pF1KE0 MPRYA--QLVMGPAGSGKVRICGDKERKSTYC
::. ::.: ::::: .:.
CCDS17 MRCLYGRVGGARRKMAASAAAAELQASGGPRHPVCLLVLGMAGSGK----------TTFV
10 20 30 40 50
40 50 60 70 80 90
pF1KE0 ATMVQHCEALNRSVQVVNLDPAAEHFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFC
.. : .: . :.:::::... . . :::. .. .::.. .: :::::.:
CCDS17 QRLTGHLHAQGTPPYVINLDPAVHEVPFPANIDIRDTVKYKEVMKQYGL--GPNGGIVTS
60 70 80 90 100
100 110 120 130 140
pF1KE0 MEYFANNFDWLENCLGHVED--DYILFDCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVF
.. ::. :: . . . .... :.:.: :::::..: ... : . : . ..
CCDS17 LNLFATRFDQVMKFIEKAQNMSKYVLIDTPGQIEVFTWSASGTIITEALAS-SFPTVVIY
110 120 130 140 150 160
150 160 170 180 190 200
pF1KE0 LVDSQFMVESFKFISGILAALSAMISLEIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSL
..:.. .. :.:..: : : . . ..: . .:.: :..... .: . : .
CCDS17 VMDTSRSTNPVTFMSNMLYACSILYKTKLPFIVVMNKTDIIDHSFA--VEWMQD---FEA
170 180 190 200 210 220
210 220 230 240 250 260
pF1KE0 LEDSTSDLRSKKFKKLTKAICGLIDD-YSMVRFLPYDQSDEESMNIVLQHIDFAIQYGED
..:. .. .. ..::... ..:. :: .: . . ... .. .. : . :
CCDS17 FQDALNQ-ETTYVSNLTRSMSLVLDEFYSSLRVVGVSAVLGTGLDELFVQVTSAAEEYER
230 240 250 260 270 280
270 280 290
pF1KE0 LEFKEPKEREDESSSMFDEYFQECQDE
:.. :: .: . . :. : :
CCDS17 -EYRPEYERLKKSLANAESQQQREQLERLRKDMGSVALDAGTAKDSLSPVLHPSDLILTR
290 300 310 320 330 340
>>CCDS46248.1 GPN1 gene_id:11321|Hs108|chr2 (362 aa)
initn: 147 init1: 85 opt: 277 Z-score: 340.7 bits: 71.3 E(32554): 1.2e-12
Smith-Waterman score: 277; 23.5% identity (58.8% similar) in 272 aa overlap (26-294:20-281)
10 20 30 40 50 60
pF1KE0 MPRYAQLVMGPAGSGKVRICGDKERKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSV
: . .. : .: . :.:::::... . .
CCDS46 MTGHTRSSLPRCTGIVLLIKLRFSERLTGHLHAQGTPPYVINLDPAVHEVPFPA
10 20 30 40 50
70 80 90 100 110
pF1KE0 MADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVED--DYILFDCP
:::. .. .::.. .: :::::.: .. ::. :: . . . .... :.:.: :
CCDS46 NIDIRDTVKYKEVMKQYGL--GPNGGIVTSLNLFATRFDQVMKFIEKAQNMSKYVLIDTP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE0 GQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIP
::::..: ... : . : . ....:.. .. :.:..: : : . . ..:
CCDS46 GQIEVFTWSASGTIITEALAS-SFPTVVIYVMDTSRSTNPVTFMSNMLYACSILYKTKLP
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE0 QVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDD-YSM
. .:.: :..... .: . : . ..:. .. .. ..::... ..:. ::
CCDS46 FIVVMNKTDIIDHSFA--VEWMQD---FEAFQDALNQ-ETTYVSNLTRSMSLVLDEFYSS
180 190 200 210 220
240 250 260 270 280 290
pF1KE0 VRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKEREDESSSMFDEYFQECQDE
.: . . ... .. .. : . : :.. :: .: . . :. : :
CCDS46 LRVVGVSAVLGTGLDELFVQVTSAAEEYER-EYRPEYERLKKSLANAESQQQREQLERLR
230 240 250 260 270 280
CCDS46 KDMGSVALDAGTAKDSLSPVLHPSDLILTRGTLDEEDEEADSDTDDIDHRVTEESHEEPA
290 300 310 320 330 340
294 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 07:44:28 2016 done: Thu Nov 3 07:44:29 2016
Total Scan time: 2.350 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]