FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4992, 281 aa
1>>>pF1KB4992 281 - 281 aa - 281 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9590+/-0.000689; mu= 13.4749+/- 0.042
mean_var=87.0540+/-17.082, 0's: 0 Z-trim(111.9): 7 B-trim: 0 in 0/51
Lambda= 0.137461
statistics sampled from 12720 (12727) to 12720 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.755), E-opt: 0.2 (0.391), width: 16
Scan time: 2.210
The best scores are:                                 opt  bits E(32554)
CCDS1731.1 MAPRE3 gene_id:22924|Hs108|chr2   ( 281) 1909 387.7  5e-108
CCDS45851.1 MAPRE2 gene_id:10982|Hs108|chr18 ( 284) 1142 235.6 3.1e-62
CCDS45850.1 MAPRE2 gene_id:10982|Hs108|chr18 ( 315) 1142 235.6 3.4e-62
CCDS11910.1 MAPRE2 gene_id:10982|Hs108|chr18 ( 327) 1142 235.6 3.5e-62
CCDS58619.1 MAPRE2 gene_id:10982|Hs108|chr18 ( 274)  932 193.9   1e-49
CCDS13208.1 MAPRE1 gene_id:22919|Hs108|chr20 ( 268)  885 184.6 6.5e-47
>>CCDS1731.1 MAPRE3 gene_id:22924|Hs108|chr2 (281 aa)
initn: 1909 init1: 1909 opt: 1909 Z-score: 2052.9 bits: 387.7 E(32554): 5e-108
Smith-Waterman score: 1909; 100.0% identity (100.0% similar) in 281 aa overlap (1-281:1-281)
10 20 30 40 50 60
pF1KB4 MAVNVYSTSVTSENLSRHDMLAWVNDSLHLNYTKIEQLCSGAAYCQFMDMLFPGCVHLRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 MAVNVYSTSVTSENLSRHDMLAWVNDSLHLNYTKIEQLCSGAAYCQFMDMLFPGCVHLRK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 VKFQAKLEHEYIHNFKVLQAAFKKMGVDKIIPVEKLVKGKFQDNFEFIQWFKKFFDANYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 VKFQAKLEHEYIHNFKVLQAAFKKMGVDKIIPVEKLVKGKFQDNFEFIQWFKKFFDANYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 GKDYNPLLARQGQDVAPPPNPGDQIFNKSKKLIGTAVPQRTSPTGPKNMQTSGRLSNVAP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 GKDYNPLLARQGQDVAPPPNPGDQIFNKSKKLIGTAVPQRTSPTGPKNMQTSGRLSNVAP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 PCILRKNPPSARNGGHETDAQILELNQQLVDLKLTVDGLEKERDFYFSKLRDIELICQEH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS17 PCILRKNPPSARNGGHETDAQILELNQQLVDLKLTVDGLEKERDFYFSKLRDIELICQEH
190 200 210 220 230 240
250 260 270 280
pF1KB4 ESENSPVISGIIGILYATEEGFAPPEDDEIEEHQQEDQDEY
:::::::::::::::::::::::::::::::::::::::::
CCDS17 ESENSPVISGIIGILYATEEGFAPPEDDEIEEHQQEDQDEY
250 260 270 280
>>CCDS45851.1 MAPRE2 gene_id:10982|Hs108|chr18 (284 aa)
initn: 1197 init1: 880 opt: 1142 Z-score: 1230.8 bits: 235.6 E(32554): 3.1e-62
Smith-Waterman score: 1142; 59.2% identity (83.0% similar) in 289 aa overlap (1-281:1-284)
10 20 30 40 50 60
pF1KB4 MAVNVYSTSVTSENLSRHDMLAWVNDSLHLNYTKIEQLCSGAAYCQFMDMLFPGCVHLRK
:::::::::.:.:..::::..::::: . :::::.::::::::::::::::::::. :.:
CCDS45 MAVNVYSTSITQETMSRHDIIAWVNDIVSLNYTKVEQLCSGAAYCQFMDMLFPGCISLKK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 VKFQAKLEHEYIHNFKVLQAAFKKMGVDKIIPVEKLVKGKFQDNFEFIQWFKKFFDANYD
::::::::::::::::.:::.::.:.:::.:::::::::.::::..::::::::.:::::
CCDS45 VKFQAKLEHEYIHNFKLLQASFKRMNVDKVIPVEKLVKGRFQDNLDFIQWFKKFYDANYD
70 80 90 100 110 120
130 140 150 160 170
pF1KB4 GKDYNPLLARQGQDVAPPPNPGDQIFNKSKKLIGTAVPQ----RTSPTGPKNMQTSGRLS
::.:.:. ::::::. :::.::.:::: :: . : ..::.. : .: .: :
CCDS45 GKEYDPVEARQGQDAIPPPDPGEQIFNLPKKSHHANSPTAGAAKSSPAA-KPGSTPSRPS
130 140 150 160 170
180 190 200 210 220 230
pF1KB4 NVAPPCILRKNPPSARNGGHETDAQILELNQQLVDLKLTVDGLEKERDFYFSKLRDIELI
.. .. :: .. .. ..:...::.:. .:::...:.::::::::.:::.:::.
CCDS45 SAKRA----SSSGSASKSDKDLETQVIQLNEQVHSLKLALEGVEKERDFYFGKLREIELL
180 190 200 210 220 230
240 250 260 270 280
pF1KB4 CQEHESENSPVISGIIGILYATEEGFAPPEDDEIEE--HQQE--DQDEY
:::: .::. ... .. ::::.:: . :. : :: :.:. .:.::
CCDS45 CQEHGQENDDLVQRLMDILYASEEHEGHTEEPEAEEQAHEQQPPQQEEY
240 250 260 270 280
>>CCDS45850.1 MAPRE2 gene_id:10982|Hs108|chr18 (315 aa)
initn: 1197 init1: 880 opt: 1142 Z-score: 1230.1 bits: 235.6 E(32554): 3.4e-62
Smith-Waterman score: 1142; 59.2% identity (83.0% similar) in 289 aa overlap (1-281:32-315)
10 20 30
pF1KB4 MAVNVYSTSVTSENLSRHDMLAWVNDSLHL
:::::::::.:.:..::::..::::: . :
CCDS45 KQNRDQKCPVSQRNSSFQQPGRKPGCSSWGMAVNVYSTSITQETMSRHDIIAWVNDIVSL
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB4 NYTKIEQLCSGAAYCQFMDMLFPGCVHLRKVKFQAKLEHEYIHNFKVLQAAFKKMGVDKI
::::.::::::::::::::::::::. :.:::::::::::::::::.:::.::.:.:::.
CCDS45 NYTKVEQLCSGAAYCQFMDMLFPGCISLKKVKFQAKLEHEYIHNFKLLQASFKRMNVDKV
70 80 90 100 110 120
100 110 120 130 140 150
pF1KB4 IPVEKLVKGKFQDNFEFIQWFKKFFDANYDGKDYNPLLARQGQDVAPPPNPGDQIFNKSK
:::::::::.::::..::::::::.:::::::.:.:. ::::::. :::.::.:::: :
CCDS45 IPVEKLVKGRFQDNLDFIQWFKKFYDANYDGKEYDPVEARQGQDAIPPPDPGEQIFNLPK
130 140 150 160 170 180
160 170 180 190 200
pF1KB4 KLIGTAVPQ----RTSPTGPKNMQTSGRLSNVAPPCILRKNPPSARNGGHETDAQILELN
: . : ..::.. : .: .: :.. .. :: .. .. ..:...::
CCDS45 KSHHANSPTAGAAKSSPAA-KPGSTPSRPSSAKRA----SSSGSASKSDKDLETQVIQLN
190 200 210 220 230
210 220 230 240 250 260
pF1KB4 QQLVDLKLTVDGLEKERDFYFSKLRDIELICQEHESENSPVISGIIGILYATEEGFAPPE
.:. .:::...:.::::::::.:::.:::.:::: .::. ... .. ::::.:: . :
CCDS45 EQVHSLKLALEGVEKERDFYFGKLREIELLCQEHGQENDDLVQRLMDILYASEEHEGHTE
240 250 260 270 280 290
270 280
pF1KB4 DDEIEE--HQQE--DQDEY
. : :: :.:. .:.::
CCDS45 EPEAEEQAHEQQPPQQEEY
300 310
>>CCDS11910.1 MAPRE2 gene_id:10982|Hs108|chr18 (327 aa)
initn: 1197 init1: 880 opt: 1142 Z-score: 1229.9 bits: 235.6 E(32554): 3.5e-62
Smith-Waterman score: 1142; 59.2% identity (83.0% similar) in 289 aa overlap (1-281:44-327)
10 20 30
pF1KB4 MAVNVYSTSVTSENLSRHDMLAWVNDSLHL
:::::::::.:.:..::::..::::: . :
CCDS11 NNNDIIQDNNGTIIPFRKHTVRGERSYSWGMAVNVYSTSITQETMSRHDIIAWVNDIVSL
20 30 40 50 60 70
40 50 60 70 80 90
pF1KB4 NYTKIEQLCSGAAYCQFMDMLFPGCVHLRKVKFQAKLEHEYIHNFKVLQAAFKKMGVDKI
::::.::::::::::::::::::::. :.:::::::::::::::::.:::.::.:.:::.
CCDS11 NYTKVEQLCSGAAYCQFMDMLFPGCISLKKVKFQAKLEHEYIHNFKLLQASFKRMNVDKV
80 90 100 110 120 130
100 110 120 130 140 150
pF1KB4 IPVEKLVKGKFQDNFEFIQWFKKFFDANYDGKDYNPLLARQGQDVAPPPNPGDQIFNKSK
:::::::::.::::..::::::::.:::::::.:.:. ::::::. :::.::.:::: :
CCDS11 IPVEKLVKGRFQDNLDFIQWFKKFYDANYDGKEYDPVEARQGQDAIPPPDPGEQIFNLPK
140 150 160 170 180 190
160 170 180 190 200
pF1KB4 KLIGTAVPQ----RTSPTGPKNMQTSGRLSNVAPPCILRKNPPSARNGGHETDAQILELN
: . : ..::.. : .: .: :.. .. :: .. .. ..:...::
CCDS11 KSHHANSPTAGAAKSSPAA-KPGSTPSRPSSAKRA----SSSGSASKSDKDLETQVIQLN
200 210 220 230 240
210 220 230 240 250 260
pF1KB4 QQLVDLKLTVDGLEKERDFYFSKLRDIELICQEHESENSPVISGIIGILYATEEGFAPPE
.:. .:::...:.::::::::.:::.:::.:::: .::. ... .. ::::.:: . :
CCDS11 EQVHSLKLALEGVEKERDFYFGKLREIELLCQEHGQENDDLVQRLMDILYASEEHEGHTE
250 260 270 280 290 300
270 280
pF1KB4 DDEIEE--HQQE--DQDEY
. : :: :.:. .:.::
CCDS11 EPEAEEQAHEQQPPQQEEY
310 320
>>CCDS58619.1 MAPRE2 gene_id:10982|Hs108|chr18 (274 aa)
initn: 987 init1: 670 opt: 932 Z-score: 1005.9 bits: 193.9 E(32554): 1e-49
Smith-Waterman score: 932; 56.4% identity (81.2% similar) in 250 aa overlap (40-281:30-274)
10 20 30 40 50 60
pF1KB4 VTSENLSRHDMLAWVNDSLHLNYTKIEQLCSGAAYCQFMDMLFPGCVHLRKVKFQAKLEH
.:::::::::::::::. :.::::::::::
CCDS58 MARTTTTSSRIITGPSFLSGSTQCAGSVPTGAAYCQFMDMLFPGCISLKKVKFQAKLEH
10 20 30 40 50
70 80 90 100 110 120
pF1KB4 EYIHNFKVLQAAFKKMGVDKIIPVEKLVKGKFQDNFEFIQWFKKFFDANYDGKDYNPLLA
:::::::.:::.::.:.:::.:::::::::.::::..::::::::.:::::::.:.:. :
CCDS58 EYIHNFKLLQASFKRMNVDKVIPVEKLVKGRFQDNLDFIQWFKKFYDANYDGKEYDPVEA
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB4 RQGQDVAPPPNPGDQIFNKSKKLIGTAVPQ----RTSPTGPKNMQTSGRLSNVAPPCILR
:::::. :::.::.:::: :: . : ..::.. : .: .: :..
CCDS58 RQGQDAIPPPDPGEQIFNLPKKSHHANSPTAGAAKSSPAA-KPGSTPSRPSSAKRA----
120 130 140 150 160 170
190 200 210 220 230 240
pF1KB4 KNPPSARNGGHETDAQILELNQQLVDLKLTVDGLEKERDFYFSKLRDIELICQEHESENS
.. :: .. .. ..:...::.:. .:::...:.::::::::.:::.:::.:::: .::.
CCDS58 SSSGSASKSDKDLETQVIQLNEQVHSLKLALEGVEKERDFYFGKLREIELLCQEHGQEND
180 190 200 210 220 230
250 260 270 280
pF1KB4 PVISGIIGILYATEEGFAPPEDDEIEE--HQQE--DQDEY
... .. ::::.:: . :. : :: :.:. .:.::
CCDS58 DLVQRLMDILYASEEHEGHTEEPEAEEQAHEQQPPQQEEY
240 250 260 270
>>CCDS13208.1 MAPRE1 gene_id:22919|Hs108|chr20 (268 aa)
initn: 1157 init1: 838 opt: 885 Z-score: 955.7 bits: 184.6 E(32554): 6.5e-47
Smith-Waterman score: 1159; 65.4% identity (81.6% similar) in 283 aa overlap (1-281:1-268)
10 20 30 40 50 60
pF1KB4 MAVNVYSTSVTSENLSRHDMLAWVNDSLHLNYTKIEQLCSGAAYCQFMDMLFPGCVHLRK
::::::::::::.::::::::::.:.::.:: :::::::::::::::::::::: . :.:
CCDS13 MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 VKFQAKLEHEYIHNFKVLQAAFKKMGVDKIIPVEKLVKGKFQDNFEFIQWFKKFFDANYD
::::::::::::.:::.:::.::.:::::::::.:::::::::::::.::::::::::::
CCDS13 VKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYD
70 80 90 100 110 120
130 140 150 160 170
pF1KB4 GKDYNPLLARQGQDVAPPPNPGDQIFNKSKKLI--GTAVPQRTSPTGPKNMQTSGRLSNV
::::.:. :::::..: :. .:: :: . ..:.::: : . : .. .
CCDS13 GKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQR-----PISTQRTAAAPK-
130 140 150 160 170
180 190 200 210 220 230
pF1KB4 APPCILRKNPPSARNGGHETDAQILELNQQLVDLKLTVDGLEKERDFYFSKLRDIELICQ
: : ..:::: .. :: : . :: ::. :::::. :::::::::.:::.::::::
CCDS13 AGPGVVRKNP-GVGNG----DDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQ
180 190 200 210 220
240 250 260 270 280
pF1KB4 EHESENSPVISGIIGILYATEEGFAPPEDDEIEEHQQEDQDEY
:.:.::.::.. :. :::::.:::. :. : ::.:.::
CCDS13 ENEGENDPVLQRIVDILYATDEGFVIPD----EGGPQEEQEEY
230 240 250 260
281 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 06:12:32 2016 done: Sat Nov 5 06:12:32 2016
Total Scan time: 2.210 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]