FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1360, 239 aa
1>>>pF1KE1360 239 - 239 aa - 239 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9073+/-0.000688; mu= 12.0810+/- 0.042
mean_var=75.0317+/-14.708, 0's: 0 Z-trim(110.1): 9 B-trim: 13 in 1/50
Lambda= 0.148065
statistics sampled from 11373 (11381) to 11373 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.35), width: 16
Scan time: 1.920
The best scores are:                                 opt  bits E(32554)
CCDS9614.1 PSME2 gene_id:5721|Hs108|chr14    ( 239) 1550 339.8 9.8e-94
CCDS9612.1 PSME1 gene_id:5720|Hs108|chr14    ( 249)  559 128.1 5.4e-30
CCDS61415.1 PSME1 gene_id:5720|Hs108|chr14   ( 233)  486 112.5 2.5e-25
CCDS41930.1 PSME1 gene_id:5720|Hs108|chr14   ( 250)  481 111.4 5.6e-25
CCDS45689.1 PSME3 gene_id:10197|Hs108|chr17  ( 254)  424  99.2 2.6e-21
CCDS59290.1 PSME3 gene_id:10197|Hs108|chr17  ( 265)  424  99.2 2.7e-21
CCDS82133.1 PSME3 gene_id:10197|Hs108|chr17  ( 193)  397  93.4 1.1e-19
CCDS11442.1 PSME3 gene_id:10197|Hs108|chr17  ( 267)  310  74.9 5.9e-14
>>CCDS9614.1 PSME2 gene_id:5721|Hs108|chr14 (239 aa)
initn: 1550 init1: 1550 opt: 1550 Z-score: 1796.3 bits: 339.8 E(32554): 9.8e-94
Smith-Waterman score: 1550; 99.6% identity (99.6% similar) in 239 aa overlap (1-239:1-239)
               10        20        30        40        50        60
pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KE1 RAPLDIPIPDPPPKDDEMETDKQEKKEVPKCGFLPGNEKVLSLLALVKPEVWTLKEKCIL
       :::::::::::::::::::::::::::: :::::::::::::::::::::::::::::::
CCDS96 RAPLDIPIPDPPPKDDEMETDKQEKKEVHKCGFLPGNEKVLSLLALVKPEVWTLKEKCIL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KE1 VITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKYFSERGDAVAKASK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 VITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKYFSERGDAVAKASK
              130       140       150       160       170       180
              190       200       210       220       230
pF1KE1 ETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSNLEKIVNPKGEEKPSMY
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 ETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSNLEKIVNPKGEEKPSMY
              190       200       210       220       230
>>CCDS9612.1 PSME1 gene_id:5720|Hs108|chr14 (249 aa)
initn: 763 init1: 536 opt: 559 Z-score: 652.0 bits: 128.1 E(32554): 5.4e-30
Smith-Waterman score: 746; 48.4% identity (74.0% similar) in 246 aa overlap (7-239:4-249)
10 20 30 40 50 60
pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL
.:.. ::. .:.:::..: ..:..: ..:.:: :. .:.: .:: :.:..:
CCDS96 MAMLRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFLKEPALNEANLSNL
10 20 30 40 50
70 80 90 100
pF1KE1 RAPLDIPIPDPPPKDDEMETDKQEKKEV-------------PKCGFLPGNEKVLSLLALV
.::::::.::: . .. : ::..:: : :: . :::.. :: .
CCDS96 KAPLDIPVPDPVKEKEKEERKKQQEKEDKDEKKKGEDEDKGPPCGPVNCNEKIVVLLQRL
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE1 KPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKY
:::. . :. :: ::.: ::.:::::.::::.::::.: .....::.:.:.: ::::
CCDS96 KPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELMTSLHTKLEGFHTQISKY
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE1 FSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSNLEKI
::::::::.::.:. :: ::: :::: ::: : ..: ::...: :: :: :: .:.::.
CCDS96 FSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVMEIRNAYAVLYDIILKNFEKL
180 190 200 210 220 230
230
pF1KE1 VNPKGEEKPSMY
.:.:: : .:
CCDS96 KKPRGETKGMIY
240
>>CCDS61415.1 PSME1 gene_id:5720|Hs108|chr14 (233 aa)
initn: 685 init1: 458 opt: 486 Z-score: 568.1 bits: 112.5 E(32554): 2.5e-25
Smith-Waterman score: 673; 47.4% identity (73.9% similar) in 230 aa overlap (7-222:4-233)
10 20 30 40 50 60
pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL
.:.. ::. .:.:::..: ..:..: ..:.:: :. .:.: .:: :.:..:
CCDS61 MAMLRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFLKEPALNEANLSNL
10 20 30 40 50
70 80 90 100
pF1KE1 RAPLDIPIPDPPPKDDEMETDKQEKKEV-------------PKCGFLPGNEKVLSLLALV
.::::::.::: . .. : ::..:: : :: . :::.. :: .
CCDS61 KAPLDIPVPDPVKEKEKEERKKQQEKEDKDEKKKGEDEDKGPPCGPVNCNEKIVVLLQRL
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE1 KPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKY
:::. . :. :: ::.: ::.:::::.::::.::::.: .....::.:.:.: ::::
CCDS61 KPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELMTSLHTKLEGFHTQISKY
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE1 FSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLR-AFYAELYHIISSNLEK
::::::::.::.:. :: ::: :::: ::: : ..: ::...: :. .: .. ::
CCDS61 FSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVMEIRNAYVRRLCYMTSS
180 190 200 210 220 230
230
pF1KE1 IVNPKGEEKPSMY
>>CCDS41930.1 PSME1 gene_id:5720|Hs108|chr14 (250 aa)
initn: 685 init1: 458 opt: 481 Z-score: 561.9 bits: 111.4 E(32554): 5.6e-25
Smith-Waterman score: 668; 48.2% identity (74.1% similar) in 220 aa overlap (7-213:4-223)
10 20 30 40 50 60
pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL
.:.. ::. .:.:::..: ..:..: ..:.:: :. .:.: .:: :.:..:
CCDS41 MAMLRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFLKEPALNEANLSNL
10 20 30 40 50
70 80 90 100
pF1KE1 RAPLDIPIPDPPPKDDEMETDKQEKKEV-------------PKCGFLPGNEKVLSLLALV
.::::::.::: . .. : ::..:: : :: . :::.. :: .
CCDS41 KAPLDIPVPDPVKEKEKEERKKQQEKEDKDEKKKGEDEDKGPPCGPVNCNEKIVVLLQRL
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE1 KPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTTISKY
:::. . :. :: ::.: ::.:::::.::::.::::.: .....::.:.:.: ::::
CCDS41 KPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELMTSLHTKLEGFHTQISKY
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE1 FSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSNLEKI
::::::::.::.:. :: ::: :::: ::: : ..: ::...: :
CCDS41 FSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVMEIRNAYVRRQGQGRGGQRQL
180 190 200 210 220 230
230
pF1KE1 VNPKGEEKPSMY
CCDS41 SQATHSLTLQARG
240 250
>>CCDS45689.1 PSME3 gene_id:10197|Hs108|chr17 (254 aa)
initn: 580 init1: 397 opt: 424 Z-score: 496.0 bits: 99.2 E(32554): 2.6e-21
Smith-Waterman score: 535; 33.6% identity (66.8% similar) in 250 aa overlap (7-239:5-254)
10 20 30 40 50 60
pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL
.... :.. .:. ::. . .:::... :.:.:.. :...:.: ::. :::..
CCDS45 MASLLKVDQEVKLKVDSFRERITSEAEDLVANFFPKKLLELDSFLKEPILNIHDLTQI
10 20 30 40 50
70 80 90 100
pF1KE1 RAPLDIPIPDP---PPKDDEMETDKQEKKEVPKC--------------GFLPGNEKVLSL
.. ...:.::: . : .. .:... .: :.: .:......
CCDS45 HSDMNLPVPDPILLTNSHDGLDGPTYKKRRLDECEEAFQGTKVFVMPNGMLKSNQQLVDI
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE1 LALVKPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVKTKVEAFQTT
. ::::. : ::: : :.: :::.:::::.:::.:::... .. .:.... ..
CCDS45 IEKVKPEIRLLIEKCNTVKMWVQLLIPRIEDGNNFGVSIQEETVAELRTVESEAASYLDQ
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE1 ISKYFSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYAELYHIISSN
::.:. :. :.: .: :: ::: : : :: : :: .. .:: :. :. .: .:
CCDS45 ISRYYITRAKLVSKIAKYPHVEDYRRTVTEIDEKEYISLRLIISELRNQYVTLHDMILKN
180 190 200 210 220 230
230
pF1KE1 LEKIVNPKGEEKPSMY
.::: :.. . ..:
CCDS45 IEKIKRPRSSNAETLY
240 250
>>CCDS59290.1 PSME3 gene_id:10197|Hs108|chr17 (265 aa)
initn: 571 init1: 397 opt: 424 Z-score: 495.7 bits: 99.2 E(32554): 2.7e-21
Smith-Waterman score: 526; 34.4% identity (66.4% similar) in 241 aa overlap (16-239:25-265)
10 20 30 40 50
pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDS
.:. ::. . .:::... :.:.:.. :...:.:
CCDS59 MEKWILKKIKYLQSGGLSASYYSYKVDSFRERITSEAEDLVANFFPKKLLELDSFLKEPI
10 20 30 40 50 60
60 70 80 90
pF1KE1 LNVADLTSLRAPLDIPIPDP---PPKDDEMETDKQEKKEVPKC--------------GFL
::. :::.... ...:.::: . : .. .:... .: :.:
CCDS59 LNIHDLTQIHSDMNLPVPDPILLTNSHDGLDGPTYKKRRLDECEEAFQGTKVFVMPNGML
70 80 90 100 110 120
100 110 120 130 140 150
pF1KE1 PGNEKVLSLLALVKPEVWTLKEKCILVITWIQHLIPKIEDGNDFGVAIQEKVLERVNAVK
.:....... ::::. : ::: : :.: :::.:::::.:::.:::... .. .:.
CCDS59 KSNQQLVDIIEKVKPEIRLLIEKCNTVKMWVQLLIPRIEDGNNFGVSIQEETVAELRTVE
130 140 150 160 170 180
160 170 180 190 200 210
pF1KE1 TKVEAFQTTISKYFSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLRAFYA
... .. ::.:. :. :.: .: :: ::: : : :: : :: .. .:: :.
CCDS59 SEAASYLDQISRYYITRAKLVSKIAKYPHVEDYRRTVTEIDEKEYISLRLIISELRNQYV
190 200 210 220 230 240
220 230
pF1KE1 ELYHIISSNLEKIVNPKGEEKPSMY
:. .: .:.::: :.. . ..:
CCDS59 TLHDMILKNIEKIKRPRSSNAETLY
250 260
>>CCDS82133.1 PSME3 gene_id:10197|Hs108|chr17 (193 aa)
initn: 450 init1: 397 opt: 397 Z-score: 466.7 bits: 93.4 E(32554): 1.1e-19
Smith-Waterman score: 405; 34.2% identity (63.2% similar) in 193 aa overlap (64-239:1-193)
40 50 60 70 80 90
pF1KE1 RFLPQKIIYLNQLLQEDSLNVADLTSLRAPLDIPIPDP---PPKDDEMETDKQEKKEVPK
...:.::: . : .. .:... .
CCDS82 MNLPVPDPILLTNSHDGLDGPTYKKRRLDE
10 20 30
100 110 120 130
pF1KE1 C--------------GFLPGNEKVLSLLALVKPEVWTLKEKCILVITWIQHLIPKIEDGN
: :.: .:....... ::::. : ::: : :.: :::.:::::
CCDS82 CEEAFQGTKVFVMPNGMLKSNQQLVDIIEKVKPEIRLLIEKCNTVKMWVQLLIPRIEDGN
40 50 60 70 80 90
140 150 160 170 180 190
pF1KE1 DFGVAIQEKVLERVNAVKTKVEAFQTTISKYFSERGDAVAKASKETHVMDYRALVHERDE
.:::.:::... .. .:.... .. ::.:. :. :.: .: :: ::: : : ::
CCDS82 NFGVSIQEETVAELRTVESEAASYLDQISRYYITRAKLVSKIAKYPHVEDYRRTVTEIDE
100 110 120 130 140 150
200 210 220 230
pF1KE1 AAYGELRAMVLDLRAFYAELYHIISSNLEKIVNPKGEEKPSMY
: :: .. .:: :. :. .: .:.::: :.. . ..:
CCDS82 KEYISLRLIISELRNQYVTLHDMILKNIEKIKRPRSSNAETLY
160 170 180 190
>>CCDS11442.1 PSME3 gene_id:10197|Hs108|chr17 (267 aa)
initn: 469 init1: 286 opt: 310 Z-score: 364.0 bits: 74.9 E(32554): 5.9e-14
Smith-Waterman score: 503; 31.9% identity (63.9% similar) in 263 aa overlap (7-239:5-267)
10 20 30 40 50 60
pF1KE1 MAKPCGVRLSGEARKQVEVFRQNLFQEAEEFLYRFLPQKIIYLNQLLQEDSLNVADLTSL
.... :.. .:. ::. . .:::... :.:.:.. :...:.: ::. :::..
CCDS11 MASLLKVDQEVKLKVDSFRERITSEAEDLVANFFPKKLLELDSFLKEPILNIHDLTQI
10 20 30 40 50
70 80 90 100
pF1KE1 RAPLDIPIPDP---PPKDDEMETDKQEKKEVPKC--------------GFLPGNEKVLSL
.. ...:.::: . : .. .:... .: :.: .:......
CCDS11 HSDMNLPVPDPILLTNSHDGLDGPTYKKRRLDECEEAFQGTKVFVMPNGMLKSNQQLVDI
60 70 80 90 100 110
110 120 130 140 150
pF1KE1 LALVKPEVWTLKEKC-------------ILVITWIQHLIPKIEDGNDFGVAIQEKVLERV
. ::::. : ::: . : :.: :::.:::::.:::.:::... ..
CCDS11 IEKVKPEIRLLIEKCNTPSGKGPHICFDLQVKMWVQLLIPRIEDGNNFGVSIQEETVAEL
120 130 140 150 160 170
160 170 180 190 200 210
pF1KE1 NAVKTKVEAFQTTISKYFSERGDAVAKASKETHVMDYRALVHERDEAAYGELRAMVLDLR
.:.... .. ::.:. :. :.: .: :: ::: : : :: : :: .. .::
CCDS11 RTVESEAASYLDQISRYYITRAKLVSKIAKYPHVEDYRRTVTEIDEKEYISLRLIISELR
180 190 200 210 220 230
220 230
pF1KE1 AFYAELYHIISSNLEKIVNPKGEEKPSMY
:. :. .: .:.::: :.. . ..:
CCDS11 NQYVTLHDMILKNIEKIKRPRSSNAETLY
240 250 260
239 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 02:28:23 2016 done: Mon Nov 7 02:28:23 2016
Total Scan time: 1.920 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]