FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5840, 482 aa
1>>>pF1KB5840 482 - 482 aa - 482 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.9573+/-0.00088; mu= 12.4880+/- 0.053
mean_var=149.7292+/-31.208, 0's: 0 Z-trim(111.9): 22 B-trim: 801 in 1/51
Lambda= 0.104814
statistics sampled from 12756 (12778) to 12756 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.735), E-opt: 0.2 (0.393), width: 16
Scan time: 3.520
The best scores are:                                      opt bits E(32554)
CCDS5860.1 MKRN1 gene_id:23608|Hs108|chr7         ( 482) 3331 515.3 5.7e-146
CCDS47725.1 MKRN1 gene_id:23608|Hs108|chr7        ( 329) 2255 352.5 4.1e-97
CCDS10013.1 MKRN3 gene_id:7681|Hs108|chr15        ( 507) 1440 229.4 7e-60
CCDS63545.1 MKRN2 gene_id:23609|Hs108|chr3        ( 373) 1038 168.5 1.1e-41
CCDS33702.1 MKRN2 gene_id:23609|Hs108|chr3        ( 416) 1038 168.5 1.2e-41
>>CCDS5860.1 MKRN1 gene_id:23608|Hs108|chr7 (482 aa)
initn: 3331 init1: 3331 opt: 3331 Z-score: 2734.2 bits: 515.3 E(32554): 5.7e-146
Smith-Waterman score: 3331; 99.8% identity (100.0% similar) in 482 aa overlap (1-482:1-482)
               10        20        30        40        50        60
pF1KB5 MAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB5 CRYFMHGVCKEGDNCRYSHDLSDSPYSVVCKYFQRGYCIYGDRCRYEHSKPLKQEEATAT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 CRYFMHGVCKEGDNCRYSHDLSDSPYSVVCKYFQRGYCIYGDRCRYEHSKPLKQEEATAT
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB5 ELTTKSSLAASSSLSSIVGPLVEMNTGEAESRNSNFATVGAGSEDWVNAIEFVPGQPYCG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ELTTKSSLAASSSLSSIVGPLVEMNTGEAESRNSNFATVGAGSEDWVNAIEFVPGQPYCG
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB5 RTAPSCTEAPLQGSVTKEESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 RTAPSCTEAPLQGSVTKEESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCG
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB5 LQLLHPMDAAQRSQHIKSCIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFG
       ::.:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LQVLHPMDAAQRSQHIKSCIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFG
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB5 ILSNCNHTYCLKCIRKWRSAKQFESKIIKSCPECRITSNFVIPSEYWVEEKEEKQKLILK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ILSNCNHTYCLKCIRKWRSAKQFESKIIKSCPECRITSNFVIPSEYWVEEKEEKQKLILK
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KB5 YKEAMSNKACRYFDEGRGSCPFGGNCFYKHAYPDGRREEPQRQKVGTSSRYRAQRRNHFW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 YKEAMSNKACRYFDEGRGSCPFGGNCFYKHAYPDGRREEPQRQKVGTSSRYRAQRRNHFW
              370       380       390       400       410       420
              430       440       450       460       470       480
pF1KB5 ELIEERENSNPFDNDEEEVVTFELGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ELIEERENSNPFDNDEEEVVTFELGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDL
              430       440       450       460       470       480
pF1KB5 DL
       ::
CCDS58 DL
>>CCDS47725.1 MKRN1 gene_id:23608|Hs108|chr7 (329 aa)
initn: 2255 init1: 2255 opt: 2255 Z-score: 1857.0 bits: 352.5 E(32554): 4.1e-97
Smith-Waterman score: 2255; 99.7% identity (100.0% similar) in 329 aa overlap (1-329:1-329)
               10        20        30        40        50        60
pF1KB5 MAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVT
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB5 CRYFMHGVCKEGDNCRYSHDLSDSPYSVVCKYFQRGYCIYGDRCRYEHSKPLKQEEATAT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 CRYFMHGVCKEGDNCRYSHDLSDSPYSVVCKYFQRGYCIYGDRCRYEHSKPLKQEEATAT
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB5 ELTTKSSLAASSSLSSIVGPLVEMNTGEAESRNSNFATVGAGSEDWVNAIEFVPGQPYCG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 ELTTKSSLAASSSLSSIVGPLVEMNTGEAESRNSNFATVGAGSEDWVNAIEFVPGQPYCG
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB5 RTAPSCTEAPLQGSVTKEESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RTAPSCTEAPLQGSVTKEESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCG
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB5 LQLLHPMDAAQRSQHIKSCIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFG
       ::.:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LQVLHPMDAAQRSQHIKSCIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFG
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KB5 ILSNCNHTYCLKCIRKWRSAKQFESKIIKSCPECRITSNFVIPSEYWVEEKEEKQKLILK
       :::::::::::::::::::::::::::::
CCDS47 ILSNCNHTYCLKCIRKWRSAKQFESKIIK
              310       320
>>CCDS10013.1 MKRN3 gene_id:7681|Hs108|chr15 (507 aa)
initn: 1419 init1: 1125 opt: 1440 Z-score: 1188.5 bits: 229.4 E(32554): 7e-60
Smith-Waterman score: 1618; 51.5% identity (70.9% similar) in 505 aa overlap (2-482:20-507)
10 20 30
pF1KB5 MAEAATPGTTATT------SGAGAAAATA---AAASPTPIPT
:::: :... :: .:: .: :: . .:.:.
CCDS10 MEEPAAPSEAHEAAGAQAGAEAAREGVSGPDLPVCEPSGESAAPDSALPHAARGWAPFPV
10 20 30 40 50 60
40 50 60 70 80
pF1KB5 VTAPS-LGAGG------GGGGS------DGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHD
. .:. : :: .:::. . :.: ::::. :::..:: ::::.:::::::
CCDS10 APVPAHLRRGGLRPAPASGGGAWPSPLPSRSSGIWTKQIICRYYIHGQCKEGENCRYSHD
70 80 90 100 110 120
90 100 110 120 130
pF1KB5 LSDSPYSVVCKYFQRGYCIYGDRCRYEHSKPLKQEEATATELTTKSSLAASSSLS-SIVG
:: ... : : : .: :: : : :.:::: ..:
CCDS10 LSGRKMATEGGVSPPGASAGGGPSTAAHIEPPTQEVAEAPP--------AASSLSLPVIG
130 140 150 160 170
140 150 160 170 180 190
pF1KB5 PLVEMNTGEAESRNSNF-ATVGAGSEDWVNAIEFVPGQPYCGRTAPSCTEAPLQGSVTKE
.: . ::: :.. :. ::: :.:..:::::::::: :: . : :::::.:
CCDS10 SAAERGFFEAERDNADRGAAGGAGVESWADAIEFVPGQPYRGRWVASAPEAPLQSS----
180 190 200 210 220
200 210 220 230 240 250
pF1KB5 ESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCGLQLLHPMDAAQRSQHIKS
:.:..: :: . ..: ::. : : ::.:.::::: ::::::: ::::::::: .:...
CCDS10 ETERKQMAVGSGLRFCYYASRGVCFRGESCMYLHGDICDMCGLQTLHPMDAAQREEHMRA
230 240 250 260 270 280
260 270 280 290 300 310
pF1KB5 CIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFGILSNCNHTYCLKCIRKWR
:::::::::::::::::. : :::::::::::::::..:::::::::::..:..:::.::
CCDS10 CIEAHEKDMELSFAVQRGMDKVCGICMEVVYEKANPNDRRFGILSNCNHSFCIRCIRRWR
290 300 310 320 330 340
320 330 340 350 360 370
pF1KB5 SAKQFESKIIKSCPECRITSNFVIPSEYWVEEKEEKQKLILKYKEAMSNKACRYFDEGRG
::.:::..:.::::.::.::..:::::.::::.::::::: .::::::::::::: ::::
CCDS10 SARQFENRIVKSCPQCRVTSELVIPSEFWVEEEEEKQKLIQQYKEAMSNKACRYFAEGRG
350 360 370 380 390 400
380 390 400 410 420 430
pF1KB5 SCPFGGNCFYKHAYPDGRREEPQRQKVGTSSRYRAQRRNHFWELIEERENSNPFDNDEEE
.:::: .::::: ::.: .:: :. : : : . : .. :.. . . ..:
CCDS10 NCPFGDTCFYKHEYPEGWGDEPPGPGGGSFSAYWHQ----LVEPVRMGEGNMLYKSIKKE
410 420 430 440 450 460
440 450 460 470 480
pF1KB5 VVTFELGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDLDL
.:...:. .:. . . ::: :::.:::.: :::....: :
CCDS10 LVVLRLASLLFKRFLSL-RDELPFSEDQWDLLHYELEEYFNLIL
470 480 490 500
>>CCDS63545.1 MKRN2 gene_id:23609|Hs108|chr3 (373 aa)
initn: 1016 init1: 975 opt: 1038 Z-score: 861.7 bits: 168.5 E(32554): 1.1e-41
Smith-Waterman score: 1060; 44.2% identity (71.1% similar) in 360 aa overlap (104-451:8-365)
80 90 100 110 120 130
pF1KB5 NCRYSHDLSDSPYSVVCKYFQRGYCIYGDRCRYEHSKPLKQEE-ATATELTTKSSLAASS
:::.:..: :..: . : : :
CCDS63 MSTKQITCRYDHTRPSAAAGGAVGTMAHSVPSPAFHS
10 20 30
140 150 160 170 180
pF1KB5 SL--SSIVGPLVEMNTGEAESRNSNFATVG----AGSEDWVNAIEFVPGQPYCGRTAPSC
: ... .:. :. : .:.. .. .: . . .: . :. ::
CCDS63 PHPPSEVTASIVKTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVSNPGSCSDPQPSP
40 50 60 70 80 90
190 200 210 220 230 240
pF1KB5 TEAP---LQGSVTKEESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCGLQL
: :.. . .. . ... ...:::::::.::::.:. ::::::. :..: ::.
CCDS63 EMKPHSYLDAIRSGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEVCEICRLQV
100 110 120 130 140 150
250 260 270 280 290 300
pF1KB5 LHPMDAAQRSQHIKSCIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFGILS
:::.: ::. : : :. . :..:: .:: : :.: ::.:::::. :::. :::::::::
CCDS63 LHPFDPEQRKAHEKICMLTFEHEMEKAFAFQASQDKVCSICMEVILEKASASERRFGILS
160 170 180 190 200 210
310 320 330 340 350 360
pF1KB5 NCNHTYCLKCIRKWRSAKQFESKIIKSCPECRITSNFVIPSEYWVEEKEEKQKLILKYKE
::::::::.:::.:: :::::. :::::::::. :.::::: ::::....:..:: .:.
CCDS63 NCNHTYCLSCIRQWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQ
220 230 240 250 260 270
370 380 390 400 410 420
pF1KB5 AMSNKACRYFDEGRGSCPFGGNCFYKHAYPDGRREEPQRQKVGTSSRYRAQRRN--HFWE
.:..:::.::..:.:.::::..:.:.::::::: ::.. . ::. .. : ..:.
CCDS63 GMGKKACKYFEQGKGTCPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWD
280 290 300 310 320 330
430 440 450 460 470 480
pF1KB5 LIEERENSNPFDNDEEEVVTFELGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDLD
.::.::. . .: :.: :::.... :
CCDS63 FIENRESRHVPNN--EDVDMTELGDLFMHLSGVESSEP
340 350 360 370
>>CCDS33702.1 MKRN2 gene_id:23609|Hs108|chr3 (416 aa)
initn: 1285 init1: 975 opt: 1038 Z-score: 861.1 bits: 168.5 E(32554): 1.2e-41
Smith-Waterman score: 1329; 46.8% identity (73.3% similar) in 408 aa overlap (56-451:3-408)
30 40 50 60 70 80
pF1KB5 ASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHDLSDSP
:::.::::::::::.::..: .::::..:
CCDS33 MSTKQITCRYFMHGVCREGSQCLFSHDLANSK
10 20 30
90 100 110 120 130 140
pF1KB5 YSVVCKYFQRGYCIYGDRCRYEHSKPLKQEE-ATATELTTKSSLAASSSL--SSIVGPLV
:..:::.:.::: :: ::::.:..: :..: . : : : : ... .:
CCDS33 PSTICKYYQKGYCAYGTRCRYDHTRPSAAAGGAVGTMAHSVPSPAFHSPHPPSEVTASIV
40 50 60 70 80 90
150 160 170 180 190
pF1KB5 EMNTGEAESRNSNFATVG----AGSEDWVNAIEFVPGQPYCGRTAPSCTEAP---LQGSV
. :. : .:.. .. .: . . .: . :. :: : :..
CCDS33 KTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVSNPGSCSDPQPSPEMKPHSYLDAIR
100 110 120 130 140 150
200 210 220 230 240 250
pF1KB5 TKEESEKEQTAVETKKQLCPYAAVGECRYGENCVYLHGDSCDMCGLQLLHPMDAAQRSQH
. .. . ... ...:::::::.::::.:. ::::::. :..: ::.:::.: ::. :
CCDS33 SGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEVCEICRLQVLHPFDPEQRKAH
160 170 180 190 200 210
260 270 280 290 300 310
pF1KB5 IKSCIEAHEKDMELSFAVQRSKDMVCGICMEVVYEKANPSERRFGILSNCNHTYCLKCIR
: :. . :..:: .:: : :.: ::.:::::. :::. :::::::::::::::::.:::
CCDS33 EKICMLTFEHEMEKAFAFQASQDKVCSICMEVILEKASASERRFGILSNCNHTYCLSCIR
220 230 240 250 260 270
320 330 340 350 360 370
pF1KB5 KWRSAKQFESKIIKSCPECRITSNFVIPSEYWVEEKEEKQKLILKYKEAMSNKACRYFDE
.:: :::::. :::::::::. :.::::: ::::....:..:: .:..:..:::.::..
CCDS33 QWRCAKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQGMGKKACKYFEQ
280 290 300 310 320 330
380 390 400 410 420 430
pF1KB5 GRGSCPFGGNCFYKHAYPDGRREEPQRQKVGTSSRYRAQRRN--HFWELIEERENSNPFD
:.:.::::..:.:.::::::: ::.. . ::. .. : ..:..::.::. . .
CCDS33 GKGTCPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWDFIENRESRHVPN
340 350 360 370 380 390
440 450 460 470 480
pF1KB5 NDEEEVVTFELGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDLDL
: :.: :::.... :
CCDS33 N--EDVDMTELGDLFMHLSGVESSEP
400 410
482 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 15:08:19 2016 done: Sat Nov 5 15:08:20 2016
Total Scan time: 3.520 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]