FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7587, 224 aa
  1>>>pF1KB7587 224 - 224 aa - 224 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.8355+/-0.000608; mu= 13.8612+/- 0.037
 mean_var=98.1400+/-20.336, 0's: 0 Z-trim(115.5): 39  B-trim: 703 in 1/51
 Lambda= 0.129465
 statistics sampled from 15972 (16012) to 15972 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.824), E-opt: 0.2 (0.492), width:  16
 Scan time:  2.030

The best scores are:                                      opt  bits E(32554)
CCDS1433.1 MYOG gene_id:4656|Hs108|chr1            ( 224) 1536 295.9 1.4e-80
CCDS9019.1 MYF6 gene_id:4618|Hs108|chr12           ( 242)  527 107.4   8e-24
CCDS9020.1 MYF5 gene_id:4617|Hs108|chr12           ( 255)  471  97.0 1.2e-20
CCDS7826.1 MYOD1 gene_id:4654|Hs108|chr11          ( 320)  422  87.9 7.9e-18

>>CCDS1433.1 MYOG gene_id:4656|Hs108|chr1                (224 aa)
 initn: 1536 init1: 1536 opt: 1536  Z-score: 1560.2  bits: 295.9 E(32554): 1.4e-80
Smith-Waterman score: 1536; 100.0% identity (100.0% similar) in 224 aa overlap (1-224:1-224)

               10        20        30        40        50        60
pF1KB7 MELYETSPYFYQEPRFYDGENYLPVHLQGFEPPGYERTELTLSPEAPGPLEDKGLGTPEH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 MELYETSPYFYQEPRFYDGENYLPVHLQGFEPPGYERTELTLSPEAPGPLEDKGLGTPEH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 CPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFEALKRSTLLNPNQRLPKVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 CPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFEALKRSTLLNPNQRLPKVE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 ILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPSECSSHSASCSPEWGSALEFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS14 ILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPSECSSHSASCSPEWGSALEFS
              130       140       150       160       170       180

              190       200       210       220
pF1KB7 ANPGDHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFPDETMPN
       ::::::::::::::::::::::::::::::::::::::::::::
CCDS14 ANPGDHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFPDETMPN
              190       200       210       220

>>CCDS9019.1 MYF6 gene_id:4618|Hs108|chr12               (242 aa)
 initn: 542 init1: 463 opt: 527  Z-score: 541.2  bits: 107.4 E(32554): 8e-24
Smith-Waterman score: 570; 45.6% identity (64.0% similar) in 250 aa overlap (1-222:3-240)

                 10        20        30            40
pF1KB7   MELYETSPYFYQEPRFYDGENYLPVHLQGFE----PPGYERTELTLSP----------
         :.:.::. ::.    . ::::   : :: .:     : :  .. ::::
CCDS90 MMMDLFETGSYFF----YLDGEN---VTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEAG
               10               20        30        40        50

               50        60        70        80        90       100
pF1KB7 -EAPGP---LEDKGLGTPEHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEA
        .. :    :   ::  : ::::::: ::::.:::::. .:::.::::::.:::::.:::
CCDS90 SDSSGEEHVLAPPGL-QPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEA
            60         70        80        90       100       110

              110       120       130       140       150       160
pF1KB7 FEALKRSTLLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVPS
       :::::: :. ::::::::::::::::.:::::: ::  :.:.:.  .   :  :    :.
CCDS90 FEALKRRTVANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQEL--GVDPFSYRPK
            120       130       140       150       160         170

                  170             180       190       200       210
pF1KB7 ECSSHSA----SCSPEWGSA------LEFSANPGDHLLTADPTDAHNLHSLTSIVDSITV
       . . ..:    .:: .: :.      : ..:. :    . : . . .:. :.::::::.
CCDS90 QENLEGADFLRTCSSQWPSVSDHSRGLVITAKEGG--ASIDSSASSSLRCLSSIVDSISS
              180       190       200         210       220

              220
pF1KB7 EDVSVAFPDETMPN
       :. ..   .:..
CCDS90 EERKLPCVEEVVEK
      230       240

>>CCDS9020.1 MYF5 gene_id:4617|Hs108|chr12               (255 aa)
 initn: 461 init1: 393 opt: 471  Z-score: 484.3  bits: 97.0 E(32554): 1.2e-20
Smith-Waterman score: 471; 42.9% identity (64.8% similar) in 219 aa overlap (4-209:9-215)

                    10             20        30        40        50
pF1KB7      MELYETSPYFYQ-----EPRFYDGENYLPVHLQGFEPPGYERTELTLSPEAPGPL
               .  : :::.      :.   :....: .. .:   : ...::       :
CCDS90 MDVMDGCQFSPSEYFYDGSCIPSPEGEFGDEFVP-RVAAF---GAHKAELQ------GSD
               10        20        30            40              50

                 60        70        80        90       100
pF1KB7 EDKGLGTP--EHCPGQCLPWACKVCKRKSVSVDRRRAATLREKRRLKKVNEAFEALKRST
       ::. . .:  .:  :.:: ::::.:::::...:::.:::.::.:::::::.:::.::: :
CCDS90 EDEHVRAPTGHHQAGHCLMWACKACKRKSTTMDRRKAATMRERRRLKKVNQAFETLKRCT
               60        70        80        90       100       110

      110       120       130       140       150        160
pF1KB7 LLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLRYRGGGGPQPGVP-SECSSHSA
         :::::::::::::.::.::: :: ::   .: :      : .  .:  : :.::.
CCDS90 TTNPNQRLPKVEILRNAIRYIESLQELLR--EQVENYYSLPGQSCSEPTSPTSNCSDGMP
              120       130         140       150       160

        170          180        190       200       210       220
pF1KB7 SC-SPEWG---SALEFSANPG-DHLLTADPTDAHNLHSLTSIVDSITVEDVSVAFPDETM
        : :: :.   :...    :  ... ..: ..  .:  :..::: ::
CCDS90 ECNSPVWSRKSSTFDSIYCPDVSNVYATDKNSLSSLDCLSNIVDRITSSEQPGLPLQDLA
      170       180       190       200       210       220

pF1KB7 PN

CCDS90 SLSPVASTDSQPATPGASSSRLIYHVL
      230       240       250

>>CCDS7826.1 MYOD1 gene_id:4654|Hs108|chr11              (320 aa)
 initn: 444 init1: 382 opt: 422  Z-score: 433.5  bits: 87.9 E(32554): 7.9e-18
Smith-Waterman score: 434; 41.7% identity (64.3% similar) in 199 aa overlap (4-183:23-217)

                                  10               20        30
pF1KB7                    MELYETSPYFYQEP-------RFYDGENYLPVHLQGF-EPP
                             . :.  ::..:       ::..  .   .:. .. .:
CCDS78 MELLSPPLRDVDLTAPDGSLCSFATTDDFYDDPCFDSPDLRFFEDLDPRLMHVGALLKPE
               10        20        30        40        50        60

            40        50          60        70        80        90
pF1KB7 GYERTELTLSPEAPGPLEDKGLGTP--EHCPGQCLPWACKVCKRKSVSVDRRRAATLREK
        . .   .. : :::  ::. . .:  .:  :.:: ::::.::::....:::.:::.::.
CCDS78 EHSHFPAAVHP-APGAREDEHVRAPSGHHQAGRCLLWACKACKRKTTNADRRKAATMRER
               70         80        90       100       110

             100       110       120       130       140
pF1KB7 RRLKKVNEAFEALKRSTLLNPNQRLPKVEILRSAIQYIERLQALLSSLNQEERDLR---Y
       :::.:::::::.::: :  :::::::::::::.::.::: ::::: . .          :
CCDS78 RRLSKVNEAFETLKRCTSSNPNQRLPKVEILRNAIRYIEGLQALLRDQDAAPPGAAAAFY
     120       130       140       150       160       170

      150             160       170       180       190       200
pF1KB7 RGG------GGPQPGVPSECSSHSASCSPEWGSALEFSANPGDHLLTADPTDAHNLHSLT
         :      :: . .  :. ::  ..::    . ...:. :
CCDS78 APGPLPPGRGGEHYSGDSDASSPRSNCSD---GMMDYSGPPSGARRRNCYEGAYYNEAPS
     180       190       200          210       220       230

            210       220
pF1KB7 SIVDSITVEDVSVAFPDETMPN

CCDS78 EPRPGKSAAVSSLDCLSSIVERISTESPAAPALLLADVPSESPPRRQEAAAPSEGESSGD
        240       250       260       270       280       290


224 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 08:52:22 2016 done: Fri Nov  4 08:52:22 2016
 Total Scan time:  2.030 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]