FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8734, 551 aa
  1>>>pF1KB8734 551 - 551 aa - 551 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 10.8125+/-0.000935; mu= -5.9677+/- 0.057
 mean_var=267.6187+/-54.248, 0's: 0 Z-trim(114.4): 48 B-trim: 30 in 1/52
 Lambda= 0.078400
 statistics sampled from 14966 (15007) to 14966 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.758), E-opt: 0.2 (0.461), width: 16
 Scan time: 4.050

The best scores are:                                       opt bits  E(32554)
CCDS69579.1 MLLT3 gene_id:4300|Hs108|chr9          ( 565) 2580 304.9 1.6e-82
CCDS6494.1 MLLT3 gene_id:4300|Hs108|chr9           ( 568) 2580 304.9 1.6e-82
CCDS12160.1 MLLT1 gene_id:4298|Hs108|chr19         ( 559)  990 125.1 2.2e-28

>>CCDS69579.1 MLLT3 gene_id:4300|Hs108|chr9 (565 aa)
 initn: 2580 init1: 2580 opt: 2580 Z-score: 1594.9 bits: 304.9 E(32554): 1.6e-82
Smith-Waterman score: 3552; 97.0% identity (97.0% similar) in 564 aa overlap (5-551:2-565)

               10        20        30        40        50        60
pF1KB8 MASSCAVQVKLELGHRAQVRKKPTVEGFTHDWMVFVRGPEHSNIQHFVEKVVFHLHESFP
           ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69    MCAVQVKLELGHRAQVRKKPTVEGFTHDWMVFVRGPEHSNIQHFVEKVVFHLHESFP
                  10        20        30        40        50

               70        80        90       100       110       120
pF1KB8 RPKRVCKDPPYKVEESGYAGFILPIEVYFKNKEEPRKVRFDYDLFLHLEGHPPVNHLRCE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 RPKRVCKDPPYKVEESGYAGFILPIEVYFKNKEEPRKVRFDYDLFLHLEGHPPVNHLRCE
        60        70        80        90       100       110

              130       140       150       160       170
pF1KB8 KLTFNNPTEDFRRKLLKAGGDPNRSIHTSSSSSSSSSSSSSSSSSSSSSSSSS-------
       :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KLTFNNPTEDFRRKLLKAGGDPNRSIHTSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
       120       130       140       150       160       170

                     180       190       200       210       220
pF1KB8 ----------TSFSKPHKLMKEHKEKPSKDSREHKSAFKEPSRDHNKSSKESSKKPKENK
                 ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 SSSSSSSSSSTSFSKPHKLMKEHKEKPSKDSREHKSAFKEPSRDHNKSSKESSKKPKENK
       180       190       200       210       220       230

           230       240       250       260       270       280
pF1KB8 PLKEEKIVPKMAFKEPKPMSKEPKPDSNLLTITSGQDKKAPSKRPPISDSEELSAKKRKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 PLKEEKIVPKMAFKEPKPMSKEPKPDSNLLTITSGQDKKAPSKRPPISDSEELSAKKRKK
       240       250       260       270       280       290

           290       300       310       320       330       340
pF1KB8 SSSEALFKSFSSAPPLILTCSADKKQIKDKSHVKMGKVKIESETSEKKKSTLPPFDDIVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 SSSEALFKSFSSAPPLILTCSADKKQIKDKSHVKMGKVKIESETSEKKKSTLPPFDDIVD
       300       310       320       330       340       350

           350       360       370       380       390       400
pF1KB8 PNDSDVEENISSKSDSEQPSPASSSSSSSSSFTPSQTRQQGPLRSIMKDLHSDDNEEESD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 PNDSDVEENISSKSDSEQPSPASSSSSSSSSFTPSQTRQQGPLRSIMKDLHSDDNEEESD
       360       370       380       390       400       410

           410       420       430       440       450       460
pF1KB8 EVEDNDNDSEMERPVNRGGSRSRRVSLSDGSDSESSSASSPLHHEPPPPLLKTNNNQILE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 EVEDNDNDSEMERPVNRGGSRSRRVSLSDGSDSESSSASSPLHHEPPPPLLKTNNNQILE
       420       430       440       450       460       470

           470       480       490       500       510       520
pF1KB8 VKSPIKQSKSDKQIKNGECDKAYLDELVELHRRLMTLRERHILQQIVNLIEETGHFHITN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 VKSPIKQSKSDKQIKNGECDKAYLDELVELHRRLMTLRERHILQQIVNLIEETGHFHITN
       480       490       500       510       520       530

           530       540       550
pF1KB8 TTFDFDLCSLDKTTVRKLQSYLETSGTS
       ::::::::::::::::::::::::::::
CCDS69 TTFDFDLCSLDKTTVRKLQSYLETSGTS
       540       550       560

>>CCDS6494.1 MLLT3 gene_id:4300|Hs108|chr9 (568 aa)
 initn: 2580 init1: 2580 opt: 2580 Z-score: 1594.9 bits: 304.9 E(32554): 1.6e-82
Smith-Waterman score: 3574; 97.0% identity (97.0% similar) in 568 aa overlap (1-551:1-568)

               10        20        30        40        50        60
pF1KB8 MASSCAVQVKLELGHRAQVRKKPTVEGFTHDWMVFVRGPEHSNIQHFVEKVVFHLHESFP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 MASSCAVQVKLELGHRAQVRKKPTVEGFTHDWMVFVRGPEHSNIQHFVEKVVFHLHESFP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 RPKRVCKDPPYKVEESGYAGFILPIEVYFKNKEEPRKVRFDYDLFLHLEGHPPVNHLRCE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 RPKRVCKDPPYKVEESGYAGFILPIEVYFKNKEEPRKVRFDYDLFLHLEGHPPVNHLRCE
               70        80        90       100       110       120

              130       140       150       160       170
pF1KB8 KLTFNNPTEDFRRKLLKAGGDPNRSIHTSSSSSSSSSSSSSSSSSSSSSSSSS-------
       :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 KLTFNNPTEDFRRKLLKAGGDPNRSIHTSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
              130       140       150       160       170       180

                     180       190       200       210       220
pF1KB8 ----------TSFSKPHKLMKEHKEKPSKDSREHKSAFKEPSRDHNKSSKESSKKPKENK
                 ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 SSSSSSSSSSTSFSKPHKLMKEHKEKPSKDSREHKSAFKEPSRDHNKSSKESSKKPKENK
              190       200       210       220       230       240

           230       240       250       260       270       280
pF1KB8 PLKEEKIVPKMAFKEPKPMSKEPKPDSNLLTITSGQDKKAPSKRPPISDSEELSAKKRKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 PLKEEKIVPKMAFKEPKPMSKEPKPDSNLLTITSGQDKKAPSKRPPISDSEELSAKKRKK
              250       260       270       280       290       300

           290       300       310       320       330       340
pF1KB8 SSSEALFKSFSSAPPLILTCSADKKQIKDKSHVKMGKVKIESETSEKKKSTLPPFDDIVD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 SSSEALFKSFSSAPPLILTCSADKKQIKDKSHVKMGKVKIESETSEKKKSTLPPFDDIVD
              310       320       330       340       350       360

           350       360       370       380       390       400
pF1KB8 PNDSDVEENISSKSDSEQPSPASSSSSSSSSFTPSQTRQQGPLRSIMKDLHSDDNEEESD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 PNDSDVEENISSKSDSEQPSPASSSSSSSSSFTPSQTRQQGPLRSIMKDLHSDDNEEESD
              370       380       390       400       410       420

           410       420       430       440       450       460
pF1KB8 EVEDNDNDSEMERPVNRGGSRSRRVSLSDGSDSESSSASSPLHHEPPPPLLKTNNNQILE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 EVEDNDNDSEMERPVNRGGSRSRRVSLSDGSDSESSSASSPLHHEPPPPLLKTNNNQILE
              430       440       450       460       470       480

           470       480       490       500       510       520
pF1KB8 VKSPIKQSKSDKQIKNGECDKAYLDELVELHRRLMTLRERHILQQIVNLIEETGHFHITN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 VKSPIKQSKSDKQIKNGECDKAYLDELVELHRRLMTLRERHILQQIVNLIEETGHFHITN
              490       500       510       520       530       540

           530       540       550
pF1KB8 TTFDFDLCSLDKTTVRKLQSYLETSGTS
       ::::::::::::::::::::::::::::
CCDS64 TTFDFDLCSLDKTTVRKLQSYLETSGTS
              550       560

>>CCDS12160.1 MLLT1 gene_id:4298|Hs108|chr19 (559 aa)
 initn: 1485 init1: 831 opt: 990 Z-score: 623.0 bits: 125.1 E(32554): 2.2e-28
Smith-Waterman score: 1792; 54.7% identity (74.0% similar) in 578 aa overlap (1-550:1-559)

               10        20        30        40        50        60
pF1KB8 MASSCAVQVKLELGHRAQVRKKPTVEGFTHDWMVFVRGPEHSNIQHFVEKVVFHLHESFP
       :  ..:.:::.::::::::.:::::.:::::::::::::::. .:::::::::: ::.:::
CCDS12 MDNQCTVQVRLELGHRAQLRKKPTTEGFTHDWMVFVRGPEQCDIQHFVEKVVFWLHDSFP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 RPKRVCKDPPYKVEESGYAGFILPIEVYFKNKEEPRKVRFDYDLFLHLEGHPPVNHLRCE
       .:.::::.::::::::::::::.::::.:::::::::: : :::::.:::.:::::::::
CCDS12 KPRRVCKEPPYKVEESGYAGFIMPIEVHFKNKEEPRKVCFTYDLFLNLEGNPPVNHLRCE
               70        80        90       100       110       120

              130       140                      150       160
pF1KB8 KLTFNNPTEDFRRKLLKAGG--------------DPNRSI-HTSSSSSSSSSSSSSSSSS
       :::::::: .:: :::.:::              .:.  .  :   :. :. .... : .
CCDS12 KLTFNNPTTEFRYKLLRAGGVMVMPEGADTVSRPSPDYPMLPTIPLSAFSDPKKTKPSHG
              130       140       150       160       170       180

         170       180       190       200       210       220
pF1KB8 SSSSSSSSTSFSKPHKLMKEHKEKPSKDSREHKSAFKEPSRDHNKSSKESSKKPKENKPL
       :..... :.. :::::. :::.:.: ::: : ::. ::  :.. ::::..:.:  :..
CCDS12 SKDANKESSKTSKPHKVTKEHRERPRKDS-ESKSSSKELEREQAKSSKDTSRKLGEGRLP
              190       200        210       220       230

         230        240       250       260             270
pF1KB8 KEEKIVP-KMAFKEPKPMSKEPKPDSNLLTITSGQDKKAP------SKRPPISDSEELSA
       ::::  : : ::::::   :: : .:.  .  .:     :      ::::  .:: . ::
CCDS12 KEEKAPPPKAAFKEPKMALKETKLEST--SPKGGPPPPPPPPPRASSKRPATADSPKPSA
     240       250       260         270       280       290

      280       290       300        310       320       330
pF1KB8 KKRKKSSSEALFKSFSSAPPLILTCS-ADKKQIKDKSHVKMGKVKIESETSEKKKSTLPP
       ::.:::::..  .. ...:    . : .:::  :::: ..  ::: :::  : ::.
CCDS12 KKQKKSSSKGSRSAPGTSPRTSSSSSFSDKKPAKDKSSTRGEKVKAESEPREAKKA----
       300       310       320       330       340       350

       340       350       360       370           380       390
pF1KB8 FDDIVDPNDSDVEENISSKSDSEQPSPASSSSSSSSS----FTPSQTRQQGPLRSIMKDL
           .. ..:. :.. : ::.: : ::..:::::.::    : :::...::::::...::
CCDS12 ----LEVEESNSEDEASFKSESAQSSPSNSSSSSDSSSDSDFEPSQNHSQGPLRSMVEDL
               360       370       380       390       400

           400       410       420       430       440       450
pF1KB8 HSDDNEEESDEVEDNDNDSEMERPVNRGGSRSRRVSLSDGSDSESSSASSPLHHEPPPPL
       .:    ::::: .:...  :    .: :  :. :.:.:: :.:..:. ::   .:::::
CCDS12 QS----EESDE-DDSSSGEEAAGKTNPG--RDSRLSFSD-SESDNSADSSLPSREPPPPQ
     410            420       430         440        450       460

            460       470       480       490       500       510
pF1KB8 LKTN-NNQILEVKSPIKQSKSDKQIKNGECDKAYLDELVELHRRLMTLRERHILQQIVNL
            :...   .:: . :: .: .:.:  :::: :::::::::::.::::..:::::::
CCDS12 KPPPPNSKVSGRRSPESCSKPEKILKKGTYDKAYTDELVELHRRLMALRERNVLQQIVNL
             470       480       490       500       510       520

            520       530       540       550
pF1KB8 IEETGHFHITNTTFDFDLCSLDKTTVRKLQSYLETSGTS
       :::::::..::::::::: :::.:::::::: ::. .:
CCDS12 IEETGHFNVTNTTFDFDLFSLDETTVRKLQSCLEAVAT
             530       540       550


551 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 15:04:12 2016 done: Fri Nov 4 15:04:12 2016
 Total Scan time: 4.050 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]