FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9657, 202 aa 1>>>pF1KB9657 202 - 202 aa - 202 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.9260+/-0.000927; mu= 5.9165+/- 0.057 mean_var=315.4300+/-64.106, 0's: 0 Z-trim(116.5): 51 B-trim: 0 in 0/52 Lambda= 0.072214 statistics sampled from 17094 (17136) to 17094 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.822), E-opt: 0.2 (0.526), width: 16 Scan time: 1.770 The best scores are: opt bits E(32554) CCDS5367.1 TWIST1 gene_id:7291|Hs108|chr7 ( 202) 1356 153.3 9.1e-38 CCDS46558.1 TWIST2 gene_id:117581|Hs108|chr2 ( 160) 655 80.2 7.7e-16 >>CCDS5367.1 TWIST1 gene_id:7291|Hs108|chr7 (202 aa) initn: 1356 init1: 1356 opt: 1356 Z-score: 791.5 bits: 153.3 E(32554): 9.1e-38 Smith-Waterman score: 1356; 100.0% identity (100.0% similar) in 202 aa overlap (1-202:1-202) 10 20 30 40 50 60 pF1KB9 MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRSSRRSAGGGAGPGGAAGGGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRSSRRSAGGGAGPGGAAGGGV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 GGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGSSSGGGSPQSYEELQTQRVMANVRERQR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 GGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGSSSGGGSPQSYEELQTQRVMANVRERQR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 TQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDELDSKMASCSYVAH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS53 TQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDELDSKMASCSYVAH 130 140 150 160 170 180 190 200 pF1KB9 ERLSYAFSVWRMEGAWSMSASH :::::::::::::::::::::: CCDS53 ERLSYAFSVWRMEGAWSMSASH 190 200 >>CCDS46558.1 TWIST2 gene_id:117581|Hs108|chr2 (160 aa) initn: 744 init1: 640 opt: 655 Z-score: 397.8 bits: 80.2 E(32554): 7.7e-16 Smith-Waterman score: 750; 66.7% identity (75.6% similar) in 201 aa overlap (2-202:1-160) 10 20 30 40 50 60 pF1KB9 MMQDVSSSPVSPADDSLSNSEEEPDRQQPPSGKRGGRKRRSSRRSAGGGAGPGGAAGGGV :.. :::::::.: ::..:::: .:: : :: ::::: :..:. CCDS46 MEEGSSSPVSPVD-SLGTSEEELERQ-P---KRFGRKRRYSKKSS-------------- 10 20 30 40 70 80 90 100 110 120 pF1KB9 GGGDEPGSPAQGKRGKKSAGCGGGGGAGGGGGSSSGGGSPQSYEELQTQRVMANVRERQR : :::. :::::: :. : ::.::::.::..:::::::: CCDS46 ----EDGSPTPGKRGKK------------------GSPSAQSFEELQSQRILANVRERQR 50 60 70 130 140 150 160 170 180 pF1KB9 TQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDELDSKMASCSYVAH :::::::::::::::::::::::::::::::::::::::::::::::.:.::.::::::: CCDS46 TQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNKMTSCSYVAH 80 90 100 110 120 130 190 200 pF1KB9 ERLSYAFSVWRMEGAWSMSASH :::::::::::::::::::::: CCDS46 ERLSYAFSVWRMEGAWSMSASH 140 150 160 202 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 14:19:56 2016 done: Sat Nov 5 14:19:56 2016 Total Scan time: 1.770 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]