FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7587, 224 aa
1>>>pF1KB7587 224 - 224 aa - 224 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.7024+/-0.000263; mu= 14.6219+/- 0.017
mean_var=104.0911+/-21.787, 0's: 0 Z-trim(122.7): 50 B-trim: 2111 in 1/54
Lambda= 0.125709
statistics sampled from 41280 (41342) to 41280 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.816), E-opt: 0.2 (0.485), width: 16
Scan time: 5.700
The best scores are: opt bits E(85289)
NP_002470 (OMIM: 159980) myogenin [Homo sapiens] ( 224) 1536 287.8 9.8e-78
NP_002460 (OMIM: 159991,614408) myogenic factor 6 ( 242) 527 104.9 1.3e-22
NP_005584 (OMIM: 159990) myogenic factor 5 [Homo s ( 255) 471 94.7 1.5e-19
NP_002469 (OMIM: 159970) myoblast determination pr ( 320) 422 85.9 8.3e-17
NP_835455 (OMIM: 607194,609069,615935) pancreas tr ( 328) 167 39.7 0.0071
>>NP_002470 (OMIM: 159980) myogenin [Homo sapiens] (224 aa)
initn: 1536 init1: 1536 opt: 1536 Z-score: 1516.6 bits: 287.8 E(85289): 9.8e-78
Smith-Waterman score: 1536; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224)
10 20 30 40 50 60
pF1KB7 MELYETSPYFYQEPRFYDGENYLPVHLQGFEPPGYERTELTLSPEAPGPLEDKGLGTPEH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MELYETSPYFYQEPRFYDGENYLPVHLQGFEPPGYERTELTLSPEAPGPLEDKGLGTPEH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 CPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFEALKRSTLLNPNQRLPKVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 CPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFEALKRSTLLNPNQRLPKVE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 ILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPSECSSHSASCSPEWGSALEFS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 ILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPSECSSHSASCSPEWGSALEFS
130 140 150 160 170 180
190 200 210 220
pF1KB7 ANPGDHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFPDETMPN
::::::::::::::::::::::::::::::::::::::::::::
NP_002 ANPGDHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFPDETMPN
190 200 210 220
>>NP_002460 (OMIM: 159991,614408) myogenic factor 6 [Hom (242 aa)
initn: 542 init1: 463 opt: 527 Z-score: 527.2 bits: 104.9 E(85289): 1.3e-22
Smith-Waterman score: 570; 45.6% identity (64.0% similar) in 250 aa overlap (1-222:3-240)
10 20 30 40
pF1KB7 MELYETSPYFYQEPRFYDGENYLPVHLQGFE----PPGYERTELTLSP----------
:.:.::. ::. . :::: : :: .: : : .. ::::
NP_002 MMMDLFETGSYFF----YLDGEN---VTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAG
10 20 30 40 50
50 60 70 80 90 100
pF1KB7 -EAPGP---LEDKGLGTPEHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEA
.. : : :: : ::::::: ::::.:::::. .:::.::::::.:::::.:::
NP_002 SDSSGEEHVLAPPGL-QPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEA
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB7 FEALKRSTLLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPS
:::::: :. ::::::::::::::::.:::::: :: :.:.:. . : : :.
NP_002 FEALKRRTVANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQEL--GVDPFSYRPK
120 130 140 150 160 170
170 180 190 200 210
pF1KB7 ECSSHSA----SCSPEWGSA------LEFSANPGDHLLTADPTDAHNLHSLTSIVDSITV
. . ..: .:: .: :. : ..:. : . : . . .:. :.::::::.
NP_002 QENLEGADFLRTCSSQWPSVSDHSRGLVITAKEGG--ASIDSSASSSLRCLSSIVDSISS
180 190 200 210 220
220
pF1KB7 EDVSVAFPDETMPN
:. .. .:..
NP_002 EERKLPCVEEVVEK
230 240
>>NP_005584 (OMIM: 159990) myogenic factor 5 [Homo sapie (255 aa)
initn: 461 init1: 393 opt: 471 Z-score: 472.0 bits: 94.7 E(85289): 1.5e-19
Smith-Waterman score: 471; 42.9% identity (64.8% similar) in 219 aa overlap (4-209:9-215)
10 20 30 40 50
pF1KB7 MELYETSPYFYQ-----EPRFYDGENYLPVHLQGFEPPGYERTELTLSPEAPGPL
. : :::. :. :....: .. .: : ...:: :
NP_005 MDVMDGCQFSPSEYFYDGSCIPSPEGEFGDEFVP-RVAAF---GAHKAELQ------GSD
10 20 30 40 50
60 70 80 90 100
pF1KB7 EDKGLGTP--EHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFEALKRST
::. . .: .: :.:: ::::.:::::...:::.:::.::.:::::::.:::.::: :
NP_005 EDEHVRAPTGHHQAGHCLMWACKACKRKSTTMDRRKAATMRERRRLKKVNQAFETLKRCT
60 70 80 90 100 110
110 120 130 140 150 160
pF1KB7 LLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVP-SECSSHSA
:::::::::::::.::.::: :: :: .: : : . .: : :.::.
NP_005 TTNPNQRLPKVEILRNAIRYIESLQELLR--EQVENYYSLPGQSCSEPTSPTSNCSDGMP
120 130 140 150 160
170 180 190 200 210 220
pF1KB7 SC-SPEWG---SALEFSANPG-DHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFPDETM
: :: :. :... : ... ..: .. .: :..::: ::
NP_005 ECNSPVWSRKSSTFDSIYCPDVSNVYATDKNSLSSLDCLSNIVDRITSSEQPGLPLQDLA
170 180 190 200 210 220
pF1KB7 PN
NP_005 SLSPVASTDSQPATPGASSSRLIYHVL
230 240 250
>>NP_002469 (OMIM: 159970) myoblast determination protei (320 aa)
initn: 444 init1: 382 opt: 422 Z-score: 422.7 bits: 85.9 E(85289): 8.3e-17
Smith-Waterman score: 434; 41.7% identity (64.3% similar) in 199 aa overlap (4-183:23-217)
10 20 30
pF1KB7 MELYETSPYFYQEP-------RFYDGENYLPVHLQGF-EPP
. :. ::..: ::.. . .:. .. .:
NP_002 MELLSPPLRDVDLTAPDGSLCSFATTDDFYDDPCFDSPDLRFFEDLDPRLMHVGALLKPE
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB7 GYERTELTLSPEAPGPLEDKGLGTP--EHCPGQCLPWACKVCKRKSVSVDRRRAATLREK
. . .. : ::: ::. . .: .: :.:: ::::.::::....:::.:::.::.
NP_002 EHSHFPAAVHP-APGAREDEHVRAPSGHHQAGRCLLWACKACKRKTTNADRRKAATMRER
70 80 90 100 110
100 110 120 130 140
pF1KB7 RRLKKVNEAFEALKRSTLLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLR---Y
:::.:::::::.::: : :::::::::::::.::.::: ::::: . . :
NP_002 RRLSKVNEAFETLKRCTSSNPNQRLPKVEILRNAIRYIEGLQALLRDQDAAPPGAAAAFY
120 130 140 150 160 170
150 160 170 180 190 200
pF1KB7 RGG------GGPQPGVPSECSSHSASCSPEWGSALEFSANPGDHLLTADPTDAHNLHSLT
: :: . . :. :: ..:: . ...:. :
NP_002 APGPLPPGRGGEHYSGDSDASSPRSNCSD---GMMDYSGPPSGARRRNCYEGAYYNEAPS
180 190 200 210 220 230
210 220
pF1KB7 SIVDSITVEDVSVAFPDETMPN
NP_002 EPRPGKSAAVSSLDCLSSIVERISTESPAAPALLLADVPSESPPRRQEAAAPSEGESSGD
240 250 260 270 280 290
>>NP_835455 (OMIM: 607194,609069,615935) pancreas transc (328 aa)
initn: 145 init1: 95 opt: 167 Z-score: 172.6 bits: 39.7 E(85289): 0.0071
Smith-Waterman score: 167; 44.9% identity (66.7% similar) in 78 aa overlap (83-159:165-236)
60 70 80 90 100 110
pF1KB7 KGLGTPEHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFEALKRSTLLNP
:.::..::.::....:.:::.:. :
NP_835 GARLRGLSGAAAAAARRRRRVRSEAELQQLRQAANVRERRRMQSINDAFEGLRSHIPTLP
140 150 160 170 180 190
120 130 140 150 160 170
pF1KB7 -NQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPSECSSHSASCSP
..:: ::. :: :: ::. .:: : : :: ::::. : :
NP_835 YEKRLSKVDTLRLAIGYIN----FLSELVQA--DLPLRGGGAGGCGGPGGGGRLGGDSPG
200 210 220 230 240
180 190 200 210 220
pF1KB7 EWGSALEFSANPGDHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFPDETMPN
NP_835 SQAQKVIICHRGTRSPSPSDPDYGLPPLAGHSLSWTDEKQLKEQNIIRTAKVWTPEDPRK
250 260 270 280 290 300
224 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 08:52:22 2016 done: Fri Nov 4 08:52:23 2016
Total Scan time: 5.700 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]