FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7530, 255 aa
1>>>pF1KB7530 255 - 255 aa - 255 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9483+/-0.000712; mu= 13.0081+/- 0.043
mean_var=93.2452+/-18.572, 0's: 0 Z-trim(112.2): 68 B-trim: 11 in 1/50
Lambda= 0.132819
statistics sampled from 12900 (12969) to 12900 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.767), E-opt: 0.2 (0.398), width: 16
Scan time: 2.540
The best scores are:                              opt  bits E(32554)
CCDS9020.1 MYF5 gene_id:4617|Hs108|chr12  ( 255) 1745 343.7 7.2e-95
CCDS7826.1 MYOD1 gene_id:4654|Hs108|chr11 ( 320)  564 117.5 1.1e-26
CCDS9019.1 MYF6 gene_id:4618|Hs108|chr12  ( 242)  499 104.9 5.2e-23
CCDS1433.1 MYOG gene_id:4656|Hs108|chr1   ( 224)  471  99.6   2e-21
>>CCDS9020.1 MYF5 gene_id:4617|Hs108|chr12 (255 aa)
initn: 1745 init1: 1745 opt: 1745 Z-score: 1816.7 bits: 343.7 E(32554): 7.2e-95
Smith-Waterman score: 1745; 100.0% identity (100.0% similar) in 255 aa overlap (1-255:1-255)
               10        20        30        40        50        60
pF1KB7 MDVMDGCQFSPSEYFYDGSCIPSPEGEFGDEFVPRVAAFGAHKAELQGSDEDEHVRAPTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 MDVMDGCQFSPSEYFYDGSCIPSPEGEFGDEFVPRVAAFGAHKAELQGSDEDEHVRAPTG
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 HHQAGHCLMWACKACKRKSTTMDRRKAATMRERRRLKKVNQAFETLKRCTTTNPNQRLPK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 HHQAGHCLMWACKACKRKSTTMDRRKAATMRERRRLKKVNQAFETLKRCTTTNPNQRLPK
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 VEILRNAIRYIESLQELLREQVENYYSLPGQSCSEPTSPTSNCSDGMPECNSPVWSRKSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 VEILRNAIRYIESLQELLREQVENYYSLPGQSCSEPTSPTSNCSDGMPECNSPVWSRKSS
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 TFDSIYCPDVSNVYATDKNSLSSLDCLSNIVDRITSSEQPGLPLQDLASLSPVASTDSQP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS90 TFDSIYCPDVSNVYATDKNSLSSLDCLSNIVDRITSSEQPGLPLQDLASLSPVASTDSQP
              190       200       210       220       230       240
              250
pF1KB7 ATPGASSSRLIYHVL
       :::::::::::::::
CCDS90 ATPGASSSRLIYHVL
              250
>>CCDS7826.1 MYOD1 gene_id:4654|Hs108|chr11 (320 aa)
initn: 758 init1: 550 opt: 564 Z-score: 592.2 bits: 117.5 E(32554): 1.1e-26
Smith-Waterman score: 767; 48.7% identity (68.5% similar) in 279 aa overlap (5-247:17-294)
10 20 30 40
pF1KB7 MDVMDG--CQFSPSEYFYDGSCIPSPEGEFGDEFVPRVAAFGAH-KAE
:: :.:. .. ::: :. ::. .: ... ::. :: : :
CCDS78 MELLSPPLRDVDLTAPDGSLCSFATTDDFYDDPCFDSPDLRFFEDLDPRLMHVGALLKPE
10 20 30 40 50 60
50 60 70 80 90
pF1KB7 LQ-----------GSDEDEHVRAPTGHHQAGHCLMWACKACKRKSTTMDRRKAATMRERR
. :. ::::::::.::::::.::.:::::::::.:. ::::::::::::
CCDS78 EHSHFPAAVHPAPGAREDEHVRAPSGHHQAGRCLLWACKACKRKTTNADRRKAATMRERR
70 80 90 100 110 120
100 110 120 130 140
pF1KB7 RLKKVNQAFETLKRCTTTNPNQRLPKVEILRNAIRYIESLQELLREQ-------VENYYS
::.:::.:::::::::..::::::::::::::::::::.:: :::.: . .:.
CCDS78 RLSKVNEAFETLKRCTSSNPNQRLPKVEILRNAIRYIEGLQALLRDQDAAPPGAAAAFYA
130 140 150 160 170 180
150 160 170 180 190
pF1KB7 L----PGQSC------SEPTSPTSNCSDGMPECNSP-VWSRKSSTFDSIYCPDVSNVYAT
::.. :. .:: ::::::: . ..: .:. . ... : .. .
CCDS78 PGPLPPGRGGEHYSGDSDASSPRSNCSDGMMDYSGPPSGARRRNCYEGAYYNEAPSEPRP
190 200 210 220 230 240
200 210 220 230 240 250
pF1KB7 DKNS-LSSLDCLSNIVDRITSSEQPGLP---LQDLASLSPVASTDSQPATPGASSSRLIY
:.. .:::::::.::.:: :.:.:. : : :. : :: .. . : ::
CCDS78 GKSAAVSSLDCLSSIVERI-STESPAAPALLLADVPSESPPRRQEAAAPSEGESSGDPTQ
250 260 270 280 290
pF1KB7 HVL
CCDS78 SPDAAPQCPAGANPNPIYQVL
300 310 320
>>CCDS9019.1 MYF6 gene_id:4618|Hs108|chr12 (242 aa)
initn: 484 init1: 396 opt: 499 Z-score: 526.7 bits: 104.9 E(32554): 5.2e-23
Smith-Waterman score: 499; 50.5% identity (69.6% similar) in 184 aa overlap (48-223:53-234)
20 30 40 50 60 70
pF1KB7 GSCIPSPEGEFGDEFVPRVAAFGAHKAELQGSDE--DEHVRAPTG---HHQAGHCLMWAC
::: .::: :: : : :.::.:::
CCDS90 QPLEVAEGSPLYPGSDGTLSPCQDQMPPEAGSDSSGEEHVLAPPGLQPPHCPGQCLIWAC
30 40 50 60 70 80
80 90 100 110 120 130
pF1KB7 KACKRKSTTMDRRKAATMRERRRLKKVNQAFETLKRCTTTNPNQRLPKVEILRNAIRYIE
:.:::::. :::::::.::::::::.:.:::.::: :..:::::::::::::.:: :::
CCDS90 KTCKRKSAPTDRRKAATLRERRRLKKINEAFEALKRRTVANPNQRLPKVEILRSAISYIE
90 100 110 120 130 140
140 150 160 170 180
pF1KB7 SLQELLR--EQVENYYSLPGQSCS-EPTSPTSNCSDGMPECNSPVWSRKSSTFDSIYCPD
::.::. .: :.. : . : .: . . . .: . :.: : :. ..
CCDS90 RLQDLLHRLDQQEKMQELGVDPFSYRPKQENLEGADFLRTCSSQ-WPSVSDHSRGLVITA
150 160 170 180 190 200
190 200 210 220 230 240
pF1KB7 VSNVYATDKNSLSSLDCLSNIVDRITSSEQPGLPLQDLASLSPVASTDSQPATPGASSSR
. . :... ::: :::.::: : :::. ::
CCDS90 KEGGASIDSSASSSLRCLSSIVDSI-SSEERKLPCVEEVVEK
210 220 230 240
250
pF1KB7 LIYHVL
>>CCDS1433.1 MYOG gene_id:4656|Hs108|chr1 (224 aa)
initn: 461 init1: 393 opt: 471 Z-score: 498.1 bits: 99.6 E(32554): 2e-21
Smith-Waterman score: 471; 42.9% identity (64.8% similar) in 219 aa overlap (9-215:4-209)
10 20 30 40 50
pF1KB7 MDVMDGCQFSPSEYFYDGSCIPSPEGEFGDEFVP-RVAAF---GAHKAELQ------GSD
. : :::. :. :....: .. .: : ...:: :
CCDS14 MELYETSPYFYQ-----EPRFYDGENYLPVHLQGFEPPGYERTELTLSPEAPGPL
10 20 30 40 50
60 70 80 90 100 110
pF1KB7 EDEHVRAPTGHHQAGHCLMWACKACKRKSTTMDRRKAATMRERRRLKKVNQAFETLKRCT
::. . .: .: :.:: ::::.:::::...:::.:::.::.:::::::.:::.::: :
CCDS14 EDKGLGTP--EHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFEALKRST
60 70 80 90 100
120 130 140 150 160
pF1KB7 TTNPNQRLPKVEILRNAIRYIESLQELLR--EQVENYYSLPGQSCSEPTSPTSNCSDGMP
:::::::::::::.::.::: :: :: .: : : . .: : :.::.
CCDS14 LLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVP-SECSSHSA
110 120 130 140 150 160
170 180 190 200 210 220
pF1KB7 ECNSPVWSRKSSTFDSIYCPDVSNVYATDKNSLSSLDCLSNIVDRITSSEQPGLPLQDLA
: :: :. :... : ... ..: .. .: :..::: ::
CCDS14 SC-SPEWG---SALEFSANPG-DHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFPDETM
170 180 190 200 210 220
230 240 250
pF1KB7 SLSPVASTDSQPATPGASSSRLIYHVL
CCDS14 PN
255 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 04:02:06 2016 done: Sat Nov 5 04:02:06 2016
Total Scan time: 2.540 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]