FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0937, 428 aa 1>>>pF1KE0937 428 - 428 aa - 428 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.4965+/-0.000905; mu= -9.3073+/- 0.054 mean_var=351.3008+/-71.899, 0's: 0 Z-trim(117.1): 81 B-trim: 181 in 1/52 Lambda= 0.068428 statistics sampled from 17723 (17805) to 17723 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.82), E-opt: 0.2 (0.547), width: 16 Scan time: 3.610 The best scores are: opt bits E(32554) CCDS14283.1 ELK1 gene_id:2002|Hs108|chrX ( 428) 2899 299.4 4.4e-81 CCDS1456.1 ELK4 gene_id:2005|Hs108|chr1 ( 431) 701 82.4 9.2e-16 CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX ( 95) 608 72.8 1.6e-13 >>CCDS14283.1 ELK1 gene_id:2002|Hs108|chrX (428 aa) initn: 2899 init1: 2899 opt: 2899 Z-score: 1569.2 bits: 299.4 E(32554): 4.4e-81 Smith-Waterman score: 2899; 100.0% identity (100.0% similar) in 428 aa overlap (1-428:1-428) 10 20 30 40 50 60 pF1KE0 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 SRALRYYYDKNIIRKVSGQKFVYKFVSYPEVAGCSTEDCPPQPEVSVTSTMPNVAPAAIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 SRALRYYYDKNIIRKVSGQKFVYKFVSYPEVAGCSTEDCPPQPEVSVTSTMPNVAPAAIH 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 AAPGDTVSGKPGTPKGAGMAGPGGLARSSRNEYMRSGLYSTFTIQSLQPQPPPHPRPAVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 AAPGDTVSGKPGTPKGAGMAGPGGLARSSRNEYMRSGLYSTFTIQSLQPQPPPHPRPAVV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 LPSAAPAGAAAPPSGSRSTSPSPLEACLEAEEAGLPLQVILTPPEAPNLKSEELNVEPGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 LPSAAPAGAAAPPSGSRSTSPSPLEACLEAEEAGLPLQVILTPPEAPNLKSEELNVEPGL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 GRALPPEVKVEGPKEELEVAGERGFVPETTKAEPEVPPQEGVPARLPAVVMDTAGQAGGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GRALPPEVKVEGPKEELEVAGERGFVPETTKAEPEVPPQEGVPARLPAVVMDTAGQAGGH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 AASSPEISQPQKGRKPRDLELPLSPSLLGGPGPERTPGSGSGSGLQAPGPALTPSLLPTH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 AASSPEISQPQKGRKPRDLELPLSPSLLGGPGPERTPGSGSGSGLQAPGPALTPSLLPTH 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE0 TLTPVLLTPSSLPPSIHFWSTLSPIAPRSPAKLSFQFPSSGSAQVHIPSISVDGLSTPVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 TLTPVLLTPSSLPPSIHFWSTLSPIAPRSPAKLSFQFPSSGSAQVHIPSISVDGLSTPVV 370 380 390 400 410 420 pF1KE0 LSPGPQKP :::::::: CCDS14 LSPGPQKP >>CCDS1456.1 ELK4 gene_id:2005|Hs108|chr1 (431 aa) initn: 725 init1: 338 opt: 701 Z-score: 396.5 bits: 82.4 E(32554): 9.2e-16 Smith-Waterman score: 786; 37.4% identity (57.7% similar) in 463 aa overlap (1-427:1-430) 10 20 30 40 50 60 pF1KE0 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL :: ..::::::::::.. : :.: ::: :: .:::..:::::::::.:::: ::::::: CCDS14 MDSAITLWQFLLQLLQKPQNKHMICWTSNDG-QFKLLQAEEVARLWGIRKNKPNMNYDKL 10 20 30 40 50 70 80 90 100 110 120 pF1KE0 SRALRYYYDKNIIRKVSGQKFVYKFVSYPEVAGCSTEDCPPQPEVSVTSTMPNVAPAAIH :::::::: ::::.::.:::::::::::::. . :. . . .. . . CCDS14 SRALRYYYVKNIIKKVNGQKFVYKFVSYPEIL-----NMDPMTVGRIEGDCESLNFSEVS 60 70 80 90 100 110 130 140 150 160 pF1KE0 AAPGDTVSGKPGTPKGAGMAGPGGLARSSRNEYMRSGLYSTFTIQSL------------- .. :. .: : ::. ::::.:..:::::.::..:: CCDS14 SSSKDVENGGKDKPPQ-----PGA-KTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKT 120 130 140 150 160 170 180 190 200 210 pF1KE0 --------QPQPPPHPRPAVV----LPSAAPA--GAAAPPSGSRSTSPSPLEACLEAEEA . . : .: :.:. :: : .:: : . : ::: : ..: :. CCDS14 ENPAEKLAEKKSPQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSS-EETIQALET 170 180 190 200 210 220 220 230 240 250 260 270 pF1KE0 GLPLQVILTPPEAPNLKSEELNVEPGLGRALPPEVKVEGPKEELEVAGERGFVPETTKAE :. :. :.:.. . . : : .. : .: : : . .. CCDS14 -------LVSPKLPSLEAPTSASNVMTAFATTPPISSIPPLQE----PPRTPSPPLS-SH 230 240 250 260 270 280 290 300 310 320 pF1KE0 PEVPPQ-EGV---PARLPAVVMDTAGQAGGHAASSPEISQPQKGRKPRDLELPLSPSLLG :.. . ..: : .:: . . . . .... ....::. ::: .:.:. CCDS14 PDIDTDIDSVASQPMELPENLSLEPKDQDSVLLEKDKVNNSSRSKKPKGLEL--APTLVI 280 290 300 310 320 330 330 340 350 360 370 380 pF1KE0 GPGPERTPGSGSGSGLQAPGPALTPSLLPTHTLTPVLLTPSSLPPSIHFWSTLSPIAPRS . : : : : .:::... . ::..:::: : ::::::::::.:: : CCDS14 TSSDPSPLGILSPS---LPTASLTPAFF---SQTPIILTPSPLLSSIHFWSTLSPVAPLS 340 350 360 370 380 390 400 410 420 pF1KE0 PAKLS-----FQFPSSGSAQVHIPSISVDGLSTPVVLSPGPQKP ::.:. ::::: ... . ..:: ::: .:: :: CCDS14 PARLQGANTLFQFPSVLNSHGPFTLSGLDGPSTPGPFSPDLQKT 390 400 410 420 430 >>CCDS59165.1 ELK1 gene_id:2002|Hs108|chrX (95 aa) initn: 607 init1: 607 opt: 608 Z-score: 356.1 bits: 72.8 E(32554): 1.6e-13 Smith-Waterman score: 608; 95.8% identity (97.9% similar) in 95 aa overlap (1-95:1-94) 10 20 30 40 50 60 pF1KE0 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS59 MDPSVTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 SRALRYYYDKNIIRKVSGQKFVYKFVSYPEVAGCSTEDCPPQPEVSVTSTMPNVAPAAIH :::::::::::::::::::::::::::::: . :. CCDS59 SRALRYYYDKNIIRKVSGQKFVYKFVSYPE-SHCAP 70 80 90 130 140 150 160 170 180 pF1KE0 AAPGDTVSGKPGTPKGAGMAGPGGLARSSRNEYMRSGLYSTFTIQSLQPQPPPHPRPAVV 428 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 04:36:22 2016 done: Sat Nov 5 04:36:22 2016 Total Scan time: 3.610 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]