FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7939, 342 aa
  1>>>pF1KB7939 342 - 342 aa - 342 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.1433+/-0.000854; mu= 8.1549+/- 0.050
 mean_var=160.5269+/-37.316, 0's: 0 Z-trim(112.0): 27 B-trim: 722 in 1/50
 Lambda= 0.101228
 statistics sampled from 12786 (12810) to 12786 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.752), E-opt: 0.2 (0.394), width: 16
 Scan time: 2.740

The best scores are:                                      opt  bits E(32554)
CCDS2855.1 PHF7 gene_id:51533|Hs108|chr3           ( 342) 2502 377.1 1.1e-104
CCDS2854.1 PHF7 gene_id:51533|Hs108|chr3           ( 381) 1649 252.6 3.9e-67
CCDS9638.1 G2E3 gene_id:55632|Hs108|chr14          ( 706)  659 108.3 2e-23
CCDS76669.1 G2E3 gene_id:55632|Hs108|chr14         ( 660)  563  94.2 3.2e-19

>>CCDS2855.1 PHF7 gene_id:51533|Hs108|chr3 (342 aa)
 initn: 2502 init1: 2502 opt: 2502 Z-score: 1992.7 bits: 377.1 E(32554): 1.1e-104
Smith-Waterman score: 2502; 100.0% identity (100.0% similar) in 342 aa overlap (1-342:1-342)

               10        20        30        40        50        60
pF1KB7 MKTVKEKKECQRLRKSAKTRRVTQRKPSSGPVCWLCLREPGDPEKLGEFLQKDNISVHYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MKTVKEKKECQRLRKSAKTRRVTQRKPSSGPVCWLCLREPGDPEKLGEFLQKDNISVHYF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 CLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCLRNFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 CLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCLRNFH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 LPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEESCILCCEDLSQQSVENIQSPCC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 LPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEESCILCCEDLSQQSVENIQSPCC
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 SQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDRRWCLILCATCGSH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDRRWCLILCATCGSH
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 GTHRDCSSLRSNSKKWECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFCRDNTLEENP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 GTHRDCSSLRSNSKKWECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFCRDNTLEENP
              250       260       270       280       290       300

              310       320       330       340
pF1KB7 GLSWTDWPEPSLLEKPESSRGRRSYSWRSKGVRITNSCKKSK
       ::::::::::::::::::::::::::::::::::::::::::
CCDS28 GLSWTDWPEPSLLEKPESSRGRRSYSWRSKGVRITNSCKKSK
              310       320       330       340

>>CCDS2854.1 PHF7 gene_id:51533|Hs108|chr3 (381 aa)
 initn: 2488 init1: 1646 opt: 1649 Z-score: 1318.8 bits: 252.6 E(32554): 3.9e-67
Smith-Waterman score: 2366; 89.6% identity (89.6% similar) in 374 aa overlap (1-335:1-374)

               10        20        30        40        50        60
pF1KB7 MKTVKEKKECQRLRKSAKTRRVTQRKPSSGPVCWLCLREPGDPEKLGEFLQKDNISVHYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MKTVKEKKECQRLRKSAKTRRVTQRKPSSGPVCWLCLREPGDPEKLGEFLQKDNISVHYF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 CLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCLRNFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 CLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCLRNFH
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 LPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEESCILCCEDLSQQSVENIQSPCC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 LPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEESCILCCEDLSQQSVENIQSPCC
              130       140       150       160       170       180

              190       200       210       220
pF1KB7 SQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDR-------------
       :::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDRDAAWELEPGAFSD
              190       200       210       220       230       240

                                 230       240       250       260
pF1KB7 --------------------------RWCLILCATCGSHGTHRDCSSLRSNSKKWECEEC
                                 ::::::::::::::::::::::::::::::::::
CCDS28 LYQRYQHCDAPICLYEQGRDSFEDEGRWCLILCATCGSHGTHRDCSSLRSNSKKWECEEC
              250       260       270       280       290       300

             270       280       290       300       310       320
pF1KB7 SPAAATDYIPENSGDIPCCSSTFHPEEHFCRDNTLEENPGLSWTDWPEPSLLEKPESSRG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SPAAATDYIPENSGDIPCCSSTFHPEEHFCRDNTLEENPGLSWTDWPEPSLLEKPESSRG
              310       320       330       340       350       360

             330       340
pF1KB7 RRSYSWRSKGVRITNSCKKSK
       ::::::::::::::
CCDS28 RRSYSWRSKGVRITNSCKKSK
              370       380

>>CCDS9638.1 G2E3 gene_id:55632|Hs108|chr14 (706 aa)
 initn: 580 init1: 382 opt: 659 Z-score: 534.0 bits: 108.3 E(32554): 2e-23
Smith-Waterman score: 716; 36.8% identity (55.5% similar) in 353 aa overlap (22-326:1-347)

10 20 30 40 50
pF1KB7 MKTVKEKKECQRLRKSAKTRRVTQRKP--SSGPVCWLCLREPGDPEKLGEFLQKD--NIS
... :: :.. .: .: .. :.: :: :. :..
CCDS96 MNESKPGDSQNLACVFCRKHDDCPNKYGEKKTKEKWNLT
10 20 30

60 70 80 90 100 110
pF1KB7 VHYFCLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCL
:::.::..:: . :::. ..: .::: :::.::. :::. : ::::.::.:.: .:
CCDS96 VHYYCLLMSSGIWQRGKEEEGVYGFLIEDIRKEVNRASKLKCCVCKKNGASIGCVAPRCK
40 50 60 70 80 90

120 130 140 150 160 170
pF1KB7 RNFHLPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEE-SCILCCEDLSQQSVENI
:..:.::: .: :. :: :.. ::: :::.: : .. : : .: : . ::
CCDS96 RSYHFPCGLQRECIFQFTGNFASFCWDHRPVQIITSNNYRESLPCTICLEFIEPIPSYNI
100 110 120 130 140 150

180 190 200 210 220
pF1KB7 -QSPCCSQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIP---------
.::::..: .:: :.: : ... ::.: ::: : .::::::::::
CCDS96 LRSPCCKNAWFHRDCLQVQAINAGVFFFRCTICNNSDIFQKEMLRMGIHIPEKDASWELE
160 170 180 190 200 210

230 240 250
pF1KB7 ------------------------------DRRWCLILCATCGSHGTHRDCSSLRSNSKK
: .: . : ::: ::: :::::: ..
CCDS96 ENAYQELLQHYERCDVRRCRCKEGRDYNAPDSKWEIKRCQCCGSSGTHLACSSLRSWEQN
220 230 240 250 260 270

260 270 280 290 300 310
pF1KB7 WECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFC--RDNTLEEN-PGLSWTDWPEPSL
::: :: : :::.. .. :. . : :::. : : . :
CCDS96 WECLECRG------IIYNSGEFQKAKKHVLPNSNNVGITDCLLEESSPKLPRQSPGSQSK
280 290 300 310 320 330

320 330 340
pF1KB7 LEKPESSRGRRSYSWRSKGVRITNSCKKSK
..:. ::. :
CCDS96 DLLRQGSKFRRNVSTLLIELGFQIKKKTKRLYINKANIWNSALDAFRNRNFNPSYAIEVA
340 350 360 370 380 390

>>CCDS76669.1 G2E3 gene_id:55632|Hs108|chr14 (660 aa)
 initn: 530 init1: 332 opt: 563 Z-score: 458.6 bits: 94.2 E(32554): 3.2e-19
Smith-Waterman score: 620; 37.1% identity (53.7% similar) in 307 aa overlap (64-326:1-301)

40 50 60 70 80 90
pF1KB7 WLCLREPGDPEKLGEFLQKDNISVHYFCLILSSKLPQRGQSNRGFHGFLPEDIKKEAARA
.:: . :::. ..: .::: :::.::. ::
CCDS76 MSSGIWQRGKEEEGVYGFLIEDIRKEVNRA
10 20 30

100 110 120 130 140 150
pF1KB7 SRKICFVCKKKGAAINCQKDQCLRNFHLPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHG
:. : ::::.::.:.: .: :..:.::: .: :. :: :.. ::: :::.: : .
CCDS76 SKLKCCVCKKNGASIGCVAPRCKRSYHFPCGLQRECIFQFTGNFASFCWDHRPVQIITSN
40 50 60 70 80 90

160 170 180 190 200 210
pF1KB7 HVGEE-SCILCCEDLSQQSVENI-QSPCCSQAIYHRKCIQKYAHTSAKHFFKCPQCNNRK
. : : .: : . :: .::::..: .:: :.: : ... ::.: :::
CCDS76 NYRESLPCTICLEFIEPIPSYNILRSPCCKNAWFHRDCLQVQAINAGVFFFRCTICNNSD
100 110 120 130 140 150

220 230
pF1KB7 EFPQEMLRMGIHIP---------------------------------------DRRWCLI
: .:::::::::: : .: .
CCDS76 IFQKEMLRMGIHIPEKDASWELEENAYQELLQHYERCDVRRCRCKEGRDYNAPDSKWEIK
160 170 180 190 200 210

240 250 260 270 280 290
pF1KB7 LCATCGSHGTHRDCSSLRSNSKKWECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFC-
: ::: ::: :::::: ..::: :: : :::.. .. :. .
CCDS76 RCQCCGSSGTHLACSSLRSWEQNWECLECRG------IIYNSGEFQKAKKHVLPNSNNVG
220 230 240 250 260

300 310 320 330 340
pF1KB7 -RDNTLEEN-PGLSWTDWPEPSLLEKPESSRGRRSYSWRSKGVRITNSCKKSK
: :::. : : . : ..:. ::. :
CCDS76 ITDCLLEESSPKLPRQSPGSQSKDLLRQGSKFRRNVSTLLIELGFQIKKKTKRLYINKAN
270 280 290 300 310 320

CCDS76 IWNSALDAFRNRNFNPSYAIEVAYVIENDNFGSEHPGSKQEFLSLLMQHLENSSLFEGSL
330 340 350 360 370 380

342 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 14:41:18 2016 done: Sat Nov 5 14:41:19 2016
 Total Scan time: 2.740 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]