FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0193, 359 aa
1>>>pF1KE0193 359 - 359 aa - 359 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2169+/-0.000689; mu= 16.5413+/- 0.041
mean_var=59.5728+/-12.012, 0's: 0 Z-trim(109.0): 11 B-trim: 113 in 1/47
Lambda= 0.166169
statistics sampled from 10613 (10620) to 10613 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.707), E-opt: 0.2 (0.326), width: 16
Scan time: 2.840
The best scores are: opt bits E(32554)
CCDS46365.1 LMAN2L gene_id:81562|Hs108|chr2 ( 359) 2468 599.8 1.2e-171
CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2 ( 348) 1206 297.2 1.4e-80
CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5 ( 356) 706 177.3 1.7e-44
CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18 ( 510) 319 84.6 2e-16
CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15 ( 526) 286 76.7 4.9e-14
>>CCDS46365.1 LMAN2L gene_id:81562|Hs108|chr2 (359 aa)
initn: 2468 init1: 2468 opt: 2468 Z-score: 3195.1 bits: 599.8 E(32554): 1.2e-171
Smith-Waterman score: 2468; 100.0% identity (100.0% similar) in 359 aa overlap (1-359:1-359)
10 20 30 40 50 60
pF1KE0 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 YQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 YQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 GKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQEAQKRRYSPGVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 GKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQEAQKRRYSPGVQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 RVFPYISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDID
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 RVFPYISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDID
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 GKHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 GKHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDV
250 260 270 280 290 300
310 320 330 340 350
pF1KE0 FLPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS46 FLPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY
310 320 330 340 350
>>CCDS2023.1 LMAN2L gene_id:81562|Hs108|chr2 (348 aa)
initn: 1210 init1: 1206 opt: 1206 Z-score: 1560.3 bits: 297.2 E(32554): 1.4e-80
Smith-Waterman score: 2361; 96.9% identity (96.9% similar) in 359 aa overlap (1-359:1-348)
10 20 30 40 50 60
pF1KE0 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 YQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 YQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 GKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQEAQKRRYSPGVQ
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 GKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQE-----------
130 140 150 160
190 200 210 220 230 240
pF1KE0 RVFPYISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDID
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 RVFPYISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDID
170 180 190 200 210 220
250 260 270 280 290 300
pF1KE0 GKHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 GKHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDV
230 240 250 260 270 280
310 320 330 340 350
pF1KE0 FLPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS20 FLPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY
290 300 310 320 330 340
>>CCDS4417.1 LMAN2 gene_id:10960|Hs108|chr5 (356 aa)
initn: 1293 init1: 617 opt: 706 Z-score: 912.3 bits: 177.3 E(32554): 1.7e-44
Smith-Waterman score: 1280; 52.4% identity (75.4% similar) in 374 aa overlap (1-359:1-356)
10 20 30 40 50
pF1KE0 MAATLGPLGSWQQWRRCLSARDG--------SRMLLLLLLLGSGQGPQQVGAGQTFEYLK
::: : . : ::::. : : . :.::::::: . .. :.. :.::
CCDS44 MAAE-GWIWRWGWGRRCLG-RPGLLGPGPGPTTPLFLLLLLGSVTA--DITDGNS-EHLK
10 20 30 40 50
60 70 80 90 100 110
pF1KE0 REHSLSKPYQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQ
::::: :::::::..: ::...:..:. .::.::::: .::.:..::. ::::.:::..
CCDS44 REHSLIKPYQGVGSSSMPLWDFQGSTMLTSQYVRLTPDERSKEGSIWNHQPCFLKDWEMH
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE0 VHFKIHGQGKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQEAQK
::::.:: :::::::::.:.:::.::. ::::::. :.: ::..:.:::::.:
CCDS44 VHFKVHGTGKKNLHGDGIALWYTRDRLVPGPVFGSKDNFHGLAIFLDTYPNDET------
120 130 140 150 160
180 190 200 210 220 230
pF1KE0 RRYSPGVQRVFPYISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRH
..:::::::.::::::::::: .::: :::.:::: :: .::::..:: . .
CCDS44 ------TERVFPYISVMVNNGSLSYDHSKDGRWTELAGCTADFRNRDHDTFLAVRYSRGR
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE0 LTIMMDIDGKHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPE
::.: :.. :.::..::.. ::::: :::::.:. :::::::::.::.:::.: ::.::.
CCDS44 LTVMTDLEDKNEWKNCIDITGVRLPTGYYFGASAGTGDLSDNHDIISMKLFQLMVEHTPD
230 240 250 260 270 280
300 310 320 330 340
pF1KE0 EEKLHRDVFLPSVDNMKLPEMTAPLP-------PLSGLALFLIVFFSLVFSVFAIVIGII
::.. . :::. .: :. .. : ::.: .::... .:. : :.: .
CCDS44 EESIDWTKIEPSVNFLKSPKDNVDDPTGNFRSGPLTGWRVFLLLLCALLGIVVCAVVGAV
290 300 310 320 330 340
350
pF1KE0 LYNKWQEQSRKRFY
...: ::.. ::::
CCDS44 VFQKRQERN-KRFY
350
>>CCDS11974.1 LMAN1 gene_id:3998|Hs108|chr18 (510 aa)
initn: 363 init1: 157 opt: 319 Z-score: 408.5 bits: 84.6 E(32554): 2e-16
Smith-Waterman score: 506; 32.3% identity (60.3% similar) in 282 aa overlap (15-286:6-268)
10 20 30 40 50
pF1KE0 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGS---GQG----PQQVGAGQTFEYLKR
.: : :: . ::: :: :.: : . . :::
CCDS11 MAGSRQRGLRARVRPLFCALLLSLGRFVRGDGVGGDPAVALPHRRFEY---
10 20 30 40
60 70 80 90 100 110
pF1KE0 EHSLSKPYQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQV
..:.. :. . :. .: :::. .. ::..:...:..:..:... ...::..:
CCDS11 KYSFKGPHLVQSDGTVPFWAHAGNAIPSSDQIRVAPSLKSQRGSVWTKTKAAFENWEVEV
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE0 HFKIHGQGKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQEAQKR
:.. :.:. . .:::::::.... :::::. : . :.:.: :.. :. :...
CCDS11 TFRVTGRGR--IGADGLAIWYAENQGLEGPVFGSADLWNGVGIFFDSFDNDGKKNN----
110 120 130 140 150 160
180 190 200 210 220 230
pF1KE0 RYSPGVQRVFPYISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHL
: : . :::.. :::. :: :..: :: : . : : . :
CCDS11 ----------PAIVIIGNNGQIHYDHQNDGASQALASCQRDFRNKPYPVRAKITYYQNTL
170 180 190 200 210
240 250 260 270 280 290
pF1KE0 TIMMD---IDGKHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERT
:.:.. :.... : .: .. .: .:: :. :: :.:.:::.:. :.::
CCDS11 TVMINNGFTPDKNDYEFCAKVENMIIPAQGHFGISAATGGLADDHDVLSFLTFQLTEPGK
220 230 240 250 260 270
300 310 320 330 340 350
pF1KE0 PEEEKLHRDVFLPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKW
CCDS11 EPPTPDKEISEKEKEKYQEEFEHFQQELDKKKEEFQKGHPDLQGQPAEEIFESVGDRELR
280 290 300 310 320 330
>>CCDS10270.1 LMAN1L gene_id:79748|Hs108|chr15 (526 aa)
initn: 254 init1: 130 opt: 286 Z-score: 365.5 bits: 76.7 E(32554): 4.9e-14
Smith-Waterman score: 409; 28.7% identity (58.5% similar) in 272 aa overlap (23-292:8-258)
10 20 30 40 50 60
pF1KE0 MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKP
: . :::::: . . ::: . :.. :
CCDS10 MPAVSGPGPLFCLLLLLLDPHSPETGCPPLRRFEY---KLSFKGP
10 20 30 40
70 80 90 100 110 120
pF1KE0 YQGVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQ
.. .. .:. :.:.. . .::::.:....::.:.:. . ::..:.... :
CCDS10 RLALPGAGIPFWSHHGDAILGLEEVRLTPSMRNRSGAVWSRASVPFSAWEVEVQMRVTGL
50 60 70 80 90 100
130 140 150 160 170 180
pF1KE0 GKKNLHGDGLAIWYTKDRMQPGPVFGNMDKFVGLGVFVDTYPNEEKQQEAQKRRYSPGVQ
:... ..:.:.:::. : . : :.:.. .. :.:.: :. : :. :.
CCDS10 GRRG--AQGMAVWYTRGRGHVGSVLGGLASWDGIGIFFDS-PAEDTQDS-----------
110 120 130 140
190 200 210 220 230
pF1KE0 RVFPYISAMVNNGSLSYDHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMD--
: : .....: . .. :: ::.: :: . : : ..: . ..
CCDS10 ---PAIRVLASDGHIPSEQPGDGASQGLGSCHWDFRNRPHPFRARITYWGQRLRMSLNSG
150 160 170 180 190 200
240 250 260 270 280 290
pF1KE0 IDGKHEWRDCIEVPGVRLPRGYYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHR
. . . :..: . : : .::.:. :: :.:.:::.:. : :. : .::
CCDS10 LTPSDPGEFCVDVGPLLLVPGGFFGVSAATGTLADDHDVLSFLTFSLS-EPSPEVPPQPF
210 220 230 240 250 260
300 310 320 330 340 350
pF1KE0 DVFLPSVDNMKLPEMTAPLPPLSGLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRF
CCDS10 LEMQQLRLARQLEGLWARLGLGTREDVTPKSDSEAQGEGERLFDLEETLGRHRRILQALR
270 280 290 300 310 320
359 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 21:44:18 2016 done: Thu Nov 3 21:44:18 2016
Total Scan time: 2.840 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]