FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB3015, 736 aa 1>>>pF1KB3015 736 - 736 aa - 736 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.9746+/-0.00102; mu= 13.4626+/- 0.062 mean_var=135.9511+/-27.424, 0's: 0 Z-trim(108.7): 151 B-trim: 0 in 0/51 Lambda= 0.109998 statistics sampled from 10252 (10408) to 10252 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.678), E-opt: 0.2 (0.32), width: 16 Scan time: 4.210 The best scores are: opt bits E(32554) CCDS13585.1 BACH1 gene_id:571|Hs108|chr21 ( 736) 4929 794.3 0 CCDS5026.1 BACH2 gene_id:60468|Hs108|chr6 ( 841) 612 109.2 2.6e-23 >>CCDS13585.1 BACH1 gene_id:571|Hs108|chr21 (736 aa) initn: 4929 init1: 4929 opt: 4929 Z-score: 4235.2 bits: 794.3 E(32554): 0 Smith-Waterman score: 4929; 100.0% identity (100.0% similar) in 736 aa overlap (1-736:1-736) 10 20 30 40 50 60 pF1KB3 MSLSENSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSSYFH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MSLSENSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSSYFH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 SRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHNIEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHNIEE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 SCFQFLKFKFLDSTADQQECPRKKCFSSHCQKTDLKLSLLDQRDLETDEVEEFLENKNVQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 SCFQFLKFKFLDSTADQQECPRKKCFSSHCQKTDLKLSLLDQRDLETDEVEEFLENKNVQ 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 TPQCKLRRYQGNAKASPPLQDSASQTYESMCLEKDAALALPSLCPKYRKFQKAFGTDRVR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TPQCKLRRYQGNAKASPPLQDSASQTYESMCLEKDAALALPSLCPKYRKFQKAFGTDRVR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 TGESSVKDIHASVQPNERSENECLGGVPECRDLQVMLKCDESKLAMEPEETKKDPASQCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TGESSVKDIHASVQPNERSENECLGGVPECRDLQVMLKCDESKLAMEPEETKKDPASQCP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 TEKSEVTPFPHNSSIDPHGLYSLSLLHTYDQYGDLNFAGMQNTTVLTEKPLSGTDVQEKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TEKSEVTPFPHNSSIDPHGLYSLSLLHTYDQYGDLNFAGMQNTTVLTEKPLSGTDVQEKT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 FGESQDLPLKSDLGTREDSSVASSDRSSVEREVAEHLAKGFWSDICSTDTPCQMQLSPAV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 FGESQDLPLKSDLGTREDSSVASSDRSSVEREVAEHLAKGFWSDICSTDTPCQMQLSPAV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB3 AKDGSEQISQKRSECPWLGIRISESPEPGQRTFTTLSSVNCPFISTLSTEGCSSNLEIGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 AKDGSEQISQKRSECPWLGIRISESPEPGQRTFTTLSSVNCPFISTLSTEGCSSNLEIGN 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB3 DDYVSEPQQEPCPYACVISLGDDSETDTEGDSESCSAREQECEVKLPFNAQRIISLSRND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 DDYVSEPQQEPCPYACVISLGDDSETDTEGDSESCSAREQECEVKLPFNAQRIISLSRND 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB3 FQSLLKMHKLTPEQLDCIHDIRRRSKNRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 FQSLLKMHKLTPEQLDCIHDIRRRSKNRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLL 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB3 KERDHILSTLGETKQNLTGLCQKVCKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 KERDHILSTLGETKQNLTGLCQKVCKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTP 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB3 DGELALPSIFSLSDRPPAVLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 DGELALPSIFSLSDRPPAVLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGG 670 680 690 700 710 720 730 pF1KB3 ISDFCQQMTDKCTTDE :::::::::::::::: CCDS13 ISDFCQQMTDKCTTDE 730 >>CCDS5026.1 BACH2 gene_id:60468|Hs108|chr6 (841 aa) initn: 1234 init1: 565 opt: 612 Z-score: 531.9 bits: 109.2 E(32554): 2.6e-23 Smith-Waterman score: 1169; 35.2% identity (58.5% similar) in 764 aa overlap (1-662:1-750) 10 20 30 40 50 pF1KB3 MSLSE---NSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSS ::..: . ...:::.:: ::.::.::::::::.:::::..:: ..:::::.:::::: CCDS50 MSVDEKPDSPMYVYESTVHCTNILLGLNDQRKKDILCDVTLIVERKEFRAHRAVLAACSE 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB3 YFHSRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHN :: . .:::. ..: ..::::::..:: ::.::::::::.::.::. :: .:.::: .:: CCDS50 YFWQALVGQTKNDLVVSLPEEVTARGFGPLLQFAYTAKLLLSRENIREVIRCAEFLRMHN 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB3 IEESCFQFLKFKFLDSTADQQECPRKKCFSSHCQKT--DLKLSLLDQRDLETDEVEEFLE .:.:::.::. ..:.: : :: .. ::. : . : ...: : . .. : CCDS50 LEDSCFSFLQTQLLNSEDGLFVC-RK---DAACQRPHEDCENSAGEEEDEEEETMDS--E 130 140 150 160 170 180 190 200 210 220 pF1KB3 NKNVQTPQCKL--RRYQGNAKASP---------PLQDSASQTYESMCLEKDAALALPSLC . .. :. .. . . .: : : : : ..: :: :::: : CCDS50 TAKMACPRDQMLPEPISFEAAAIPVAEKEEALLPEPDVPTDTKESS--EKDALTQYP--- 180 190 200 210 220 230 240 250 260 pF1KB3 PKYRKFQKA---------------FGTD-RVRTGESSVKDIHASVQ-----PNERSENE- .:.:.: : :.. : .. .:.: : : :.:..:.: CCDS50 -RYKKYQLACTKNVYNASSHSTSGFASTFREDNSSNSLKPGLARGQIKSEPPSEENEEES 230 240 250 260 270 280 270 280 290 300 310 pF1KB3 ---CLGG-VPECRDLQVMLKCDESKLAMEPEETKKDPASQCPTEKSEVTP------FPHN ::.: :. .: .. :... . : : :. .: ..: : . CCDS50 ITLCLSGDEPDAKDRAGDVEMDRKQPSPAPTPTAPAGAACLERSRSVASPSCLRSLFSIT 290 300 310 320 330 340 320 330 340 pF1KB3 SSIDPHGLYSLSLLH-------TYDQ---YGDLN-----FAGMQNTTVLTEK-------- .:.. :: : : : .:. :::. :.: . . .: CCDS50 KSVELSGLPSTSQQHFARSPACPFDKGITQGDLKTDYTPFTGNYGQPHVGQKEVSNFTMG 350 360 370 380 390 400 350 360 370 380 390 400 pF1KB3 -PLSGTDVQE--KTFGE--SQDLPLKSDLGTREDSSVAS-SDRSSVEREVAEHLAKGFWS :: : .. : :: ... ..:. . ..:: : : ::......: . ::.: CCDS50 SPLRGPGLEALCKQEGELDRRSVIFSSSACDQVSTSVHSYSGVSSLDKDLSEPVPKGLWV 410 420 430 440 450 460 410 420 430 440 450 460 pF1KB3 DICSTDTPCQMQLSPAVAKDGSEQISQKRSECPWLGIRIS-ESP--EPGQRTFTTLSSVN .. : .. : . . :: . :.. .:: : :: .. :: . CCDS50 GAGQSLPSSQAYSHGGLMADHLPGRMRPNTSCP-VPIKVCPRSPPLETRTRTSSSCSSYS 470 480 490 500 510 520 470 480 490 pF1KB3 ---------------CPFISTLSTEGC----SSNLE---IGNDDYVSEPQQEPCPYACVI : : :. ..: . . : .:. : . : : . CCDS50 YAEDGSGGSPCSLPLCEFSSSPCSQGARFLATEHQEPGLMGDGMYNQVRPQIKCEQSYGT 530 540 550 560 570 580 500 510 520 530 540 550 pF1KB3 SLGDDSETDTEGDSESCSAREQECEVKLPFNAQRIISLSRNDFQSLLKMHKLTPEQLDCI . .:.: . .:.::::: .... :::::: ...: .: ::::: ..:::::: :::. : CCDS50 NSSDESGSFSEADSESCPVQDRGQEVKLPFPVDQITDLPRNDFQMMIKMHKLTSEQLEFI 590 600 610 620 630 640 560 570 580 590 600 610 pF1KB3 HDIRRRSKNRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLLKERDHILSTLGETKQNLT ::.::::::::::::::::::::::::: ::.:: :::.::.::... . .:: .:.. CCDS50 HDVRRRSKNRIAAQRCRKRKLDCIQNLECEIRKLVCEKEKLLSERNQLKACMGELLDNFS 650 660 670 680 690 700 620 630 640 650 660 670 pF1KB3 GLCQKVCKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTPDGELALPSIFSLSDRPPA : :.::.. : :::: : .: . :... . . . .: : CCDS50 CLSQEVCRDIQ-SPEQIQALHRYCPVLRPMDLPTASSINPAPLGAEQNIAASQCAVGENV 710 720 730 740 750 760 680 690 700 710 720 730 pF1KB3 VLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGGISDFCQQMTDKCTTDE CCDS50 PCCLEPGAAPPGPPWAPSNTSENCTSGRRLEGTDPGTFSERGPPLEPRSQTVTVDFCQEM 770 780 790 800 810 820 736 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 20:30:30 2016 done: Thu Nov 3 20:30:31 2016 Total Scan time: 4.210 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]