FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA1333, 706 aa
  1>>>pF1KA1333 706 - 706 aa - 706 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.2566+/-0.000947; mu= 15.5276+/- 0.057
 mean_var=89.1096+/-18.388, 0's: 0 Z-trim(106.5): 39 B-trim: 0 in 0/51
 Lambda= 0.135866
 statistics sampled from 8993 (9024) to 8993 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.644), E-opt: 0.2 (0.277), width: 16
 Scan time: 3.110

The best scores are:                               opt  bits E(32554)
CCDS9638.1 G2E3 gene_id:55632|Hs108|chr14  ( 706) 4894 969.8        0
CCDS76669.1 G2E3 gene_id:55632|Hs108|chr14 ( 660) 4551 902.6        0
CCDS2854.1 PHF7 gene_id:51533|Hs108|chr3   ( 381)  920 190.7  3.4e-48
CCDS2855.1 PHF7 gene_id:51533|Hs108|chr3   ( 342)  659 139.5  7.8e-33
CCDS14639.1 PHF6 gene_id:84295|Hs108|chrX  ( 365)  295  68.2  2.5e-11
CCDS14640.1 PHF6 gene_id:84295|Hs108|chrX  ( 312)  290  67.2  4.3e-11

>>CCDS9638.1 G2E3 gene_id:55632|Hs108|chr14 (706 aa)
 initn: 4894 init1: 4894 opt: 4894 Z-score: 5184.5 bits: 969.8 E(32554): 0
Smith-Waterman score: 4894; 100.0% identity (100.0% similar) in 706 aa overlap (1-706:1-706)

               10        20        30        40        50        60
pF1KA1 MNESKPGDSQNLACVFCRKHDDCPNKYGEKKTKEKWNLTVHYYCLLMSSGIWQRGKEEEG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 MNESKPGDSQNLACVFCRKHDDCPNKYGEKKTKEKWNLTVHYYCLLMSSGIWQRGKEEEG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA1 VYGFLIEDIRKEVNRASKLKCCVCKKNGASIGCVAPRCKRSYHFPCGLQRECIFQFTGNF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 VYGFLIEDIRKEVNRASKLKCCVCKKNGASIGCVAPRCKRSYHFPCGLQRECIFQFTGNF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA1 ASFCWDHRPVQIITSNNYRESLPCTICLEFIEPIPSYNILRSPCCKNAWFHRDCLQVQAI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 ASFCWDHRPVQIITSNNYRESLPCTICLEFIEPIPSYNILRSPCCKNAWFHRDCLQVQAI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA1 NAGVFFFRCTICNNSDIFQKEMLRMGIHIPEKDASWELEENAYQELLQHYERCDVRRCRC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 NAGVFFFRCTICNNSDIFQKEMLRMGIHIPEKDASWELEENAYQELLQHYERCDVRRCRC
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA1 KEGRDYNAPDSKWEIKRCQCCGSSGTHLACSSLRSWEQNWECLECRGIIYNSGEFQKAKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 KEGRDYNAPDSKWEIKRCQCCGSSGTHLACSSLRSWEQNWECLECRGIIYNSGEFQKAKK
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA1 HVLPNSNNVGITDCLLEESSPKLPRQSPGSQSKDLLRQGSKFRRNVSTLLIELGFQIKKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 HVLPNSNNVGITDCLLEESSPKLPRQSPGSQSKDLLRQGSKFRRNVSTLLIELGFQIKKK
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KA1 TKRLYINKANIWNSALDAFRNRNFNPSYAIEVAYVIENDNFGSEHPGSKQEFLSLLMQHL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 TKRLYINKANIWNSALDAFRNRNFNPSYAIEVAYVIENDNFGSEHPGSKQEFLSLLMQHL
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KA1 ENSSLFEGSLSKNLSLNSQALKENLYYEAGKMLAISLVHGGPSPGFFSKTLFNCLVYGPE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 ENSSLFEGSLSKNLSLNSQALKENLYYEAGKMLAISLVHGGPSPGFFSKTLFNCLVYGPE
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KA1 NTQPILDDVSDFDVAQIIIRINTATTVADLKSIINECYNYLELIGCLRLITTLSDKYMLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 NTQPILDDVSDFDVAQIIIRINTATTVADLKSIINECYNYLELIGCLRLITTLSDKYMLV
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KA1 KDILGYHVIQRVHTPFESFKQGLKTLGVLEKIQAYPEAFCSILCHKPESLSAKILSELFT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 KDILGYHVIQRVHTPFESFKQGLKTLGVLEKIQAYPEAFCSILCHKPESLSAKILSELFT
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KA1 VHTLPDVKALGFWNSYLQAVEDGKSTTTMEDILIFATGCSSIPPAGFKPTPSIECLHVDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 VHTLPDVKALGFWNSYLQAVEDGKSTTTMEDILIFATGCSSIPPAGFKPTPSIECLHVDF
              610       620       630       640       650       660

              670       680       690       700
pF1KA1 PVGNKCNNCLAIPITNTYKEFQENMDFTIRNTLRLEKEESSHYIGH
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS96 PVGNKCNNCLAIPITNTYKEFQENMDFTIRNTLRLEKEESSHYIGH
              670       680       690       700

>>CCDS76669.1 G2E3 gene_id:55632|Hs108|chr14 (660 aa)
 initn: 4551 init1: 4551 opt: 4551 Z-score: 4821.6 bits: 902.6 E(32554): 0
Smith-Waterman score: 4551; 100.0% identity (100.0% similar) in 660 aa overlap (47-706:1-660)

         20        30        40        50        60        70
pF1KA1 CRKHDDCPNKYGEKKTKEKWNLTVHYYCLLMSSGIWQRGKEEEGVYGFLIEDIRKEVNRA
                                     ::::::::::::::::::::::::::::::
CCDS76                               MSSGIWQRGKEEEGVYGFLIEDIRKEVNRA
                                             10        20        30

         80        90       100       110       120       130
pF1KA1 SKLKCCVCKKNGASIGCVAPRCKRSYHFPCGLQRECIFQFTGNFASFCWDHRPVQIITSN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 SKLKCCVCKKNGASIGCVAPRCKRSYHFPCGLQRECIFQFTGNFASFCWDHRPVQIITSN
               40        50        60        70        80        90

        140       150       160       170       180       190
pF1KA1 NYRESLPCTICLEFIEPIPSYNILRSPCCKNAWFHRDCLQVQAINAGVFFFRCTICNNSD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 NYRESLPCTICLEFIEPIPSYNILRSPCCKNAWFHRDCLQVQAINAGVFFFRCTICNNSD
              100       110       120       130       140       150

        200       210       220       230       240       250
pF1KA1 IFQKEMLRMGIHIPEKDASWELEENAYQELLQHYERCDVRRCRCKEGRDYNAPDSKWEIK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 IFQKEMLRMGIHIPEKDASWELEENAYQELLQHYERCDVRRCRCKEGRDYNAPDSKWEIK
              160       170       180       190       200       210

        260       270       280       290       300       310
pF1KA1 RCQCCGSSGTHLACSSLRSWEQNWECLECRGIIYNSGEFQKAKKHVLPNSNNVGITDCLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 RCQCCGSSGTHLACSSLRSWEQNWECLECRGIIYNSGEFQKAKKHVLPNSNNVGITDCLL
              220       230       240       250       260       270

        320       330       340       350       360       370
pF1KA1 EESSPKLPRQSPGSQSKDLLRQGSKFRRNVSTLLIELGFQIKKKTKRLYINKANIWNSAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 EESSPKLPRQSPGSQSKDLLRQGSKFRRNVSTLLIELGFQIKKKTKRLYINKANIWNSAL
              280       290       300       310       320       330

        380       390       400       410       420       430
pF1KA1 DAFRNRNFNPSYAIEVAYVIENDNFGSEHPGSKQEFLSLLMQHLENSSLFEGSLSKNLSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 DAFRNRNFNPSYAIEVAYVIENDNFGSEHPGSKQEFLSLLMQHLENSSLFEGSLSKNLSL
              340       350       360       370       380       390

        440       450       460       470       480       490
pF1KA1 NSQALKENLYYEAGKMLAISLVHGGPSPGFFSKTLFNCLVYGPENTQPILDDVSDFDVAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 NSQALKENLYYEAGKMLAISLVHGGPSPGFFSKTLFNCLVYGPENTQPILDDVSDFDVAQ
              400       410       420       430       440       450

        500       510       520       530       540       550
pF1KA1 IIIRINTATTVADLKSIINECYNYLELIGCLRLITTLSDKYMLVKDILGYHVIQRVHTPF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 IIIRINTATTVADLKSIINECYNYLELIGCLRLITTLSDKYMLVKDILGYHVIQRVHTPF
              460       470       480       490       500       510

        560       570       580       590       600       610
pF1KA1 ESFKQGLKTLGVLEKIQAYPEAFCSILCHKPESLSAKILSELFTVHTLPDVKALGFWNSY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 ESFKQGLKTLGVLEKIQAYPEAFCSILCHKPESLSAKILSELFTVHTLPDVKALGFWNSY
              520       530       540       550       560       570

        620       630       640       650       660       670
pF1KA1 LQAVEDGKSTTTMEDILIFATGCSSIPPAGFKPTPSIECLHVDFPVGNKCNNCLAIPITN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 LQAVEDGKSTTTMEDILIFATGCSSIPPAGFKPTPSIECLHVDFPVGNKCNNCLAIPITN
              580       590       600       610       620       630

        680       690       700
pF1KA1 TYKEFQENMDFTIRNTLRLEKEESSHYIGH
       ::::::::::::::::::::::::::::::
CCDS76 TYKEFQENMDFTIRNTLRLEKEESSHYIGH
              640       650       660

>>CCDS2854.1 PHF7 gene_id:51533|Hs108|chr3 (381 aa)
 initn: 854 init1: 486 opt: 920 Z-score: 978.8 bits: 190.7 E(32554): 3.4e-48
Smith-Waterman score: 920; 41.1% identity (64.0% similar) in 353 aa overlap (1-347:22-365)

10 20 30 pF1KA1 MNESKPGDSQNLACVFCRKHDDCPNKYGEKKTKEKWNLT ... :: :.. .: .: .. :.: :: :. :.. CCDS28 MKTVKEKKECQRLRKSAKTRRVTQRKP--SSGPVCWLCLREPGDPEKLGEFLQKD--NIS 10 20 30 40 50 40 50 60 70 80 90 pF1KA1 VHYYCLLMSSGIWQRGKEEEGVYGFLIEDIRKEVNRASKLKCCVCKKNGASIGCVAPRCK :::.::..:: . :::. ..: .::: :::.::. :::. : ::::.::.:.: .: CCDS28 VHYFCLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCL 60 70 80 90 100 110 100 110 120 130 140 150 pF1KA1 RSYHFPCGLQRECIFQFTGNFASFCWDHRPVQIITSNNYRESLPCTICLEFIEPIPSYNI :..:.::: .: :. :: :.. ::: :::.: : .. : : .: : .
:: CCDS28 RNFHLPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEE-SCILCCEDLSQQSVENI 120 130 140 150 160 170 160 170 180 190 200 210 pF1KA1 LRSPCCKNAWFHRDCLQVQAINAGVFFFRCTICNNSDIFQKEMLRMGIHIPEKDASWELE .::::..: .:: :.: : ... ::.: ::: : .::::::::::..::.:::: CCDS28 -QSPCCSQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIPDRDAAWELE 180 190 200 210 220 230 220 230 240 250 260 270 pF1KA1 ENAYQELLQHYERCDVRRCRCKEGRDYNAPDSKWEIKRCQCCGSSGTHLACSSLRSWEQN .:...: :.:..::. : ..::: ...: . : ::: ::: :::::: .. CCDS28 PGAFSDLYQRYQHCDAPICLYEQGRDSFEDEGRWCLILCATCGSHGTHRDCSSLRSNSKK 240 250 260 270 280 290 280 290 300 310 320 330 pF1KA1 WECLECRG------IIYNSGEFQKAKKHVLPNSNNVGITDCLLEESSPKLPRQSPGSQSK ::: :: : :::.. .. :. . : :::. : : . : CCDS28 WECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFC--RDNTLEEN-PGLSWTDWPEPSL 300 310 320 330 340 350 340 350 360 370 380 390 pF1KA1 DLLRQGSKFRRNVSTLLIELGFQIKKKTKRLYINKANIWNSALDAFRNRNFNPSYAIEVA ..:. ::. : CCDS28 LEKPESSRGRRSYSWRSKGVRITNSCKKSK 360 370 380 >>CCDS2855.1 PHF7 gene_id:51533|Hs108|chr3 (342 aa) initn: 580 init1: 382 opt: 659 Z-score: 703.0 bits: 139.5 E(32554): 7.8e-33 Smith-Waterman score: 716; 36.8% identity (55.5% similar) in 353 aa overlap (1-347:22-326) 10 20 30 pF1KA1 MNESKPGDSQNLACVFCRKHDDCPNKYGEKKTKEKWNLT ... :: :.. .: .: .. :.: :: :. :.. CCDS28 MKTVKEKKECQRLRKSAKTRRVTQRKP--SSGPVCWLCLREPGDPEKLGEFLQKD--NIS 10 20 30 40 50 40 50 60 70 80 90 pF1KA1 VHYYCLLMSSGIWQRGKEEEGVYGFLIEDIRKEVNRASKLKCCVCKKNGASIGCVAPRCK :::.::..:: . :::. ..: .::: :::.::. :::. : ::::.::.:.: .: CCDS28 VHYFCLILSSKLPQRGQSNRGFHGFLPEDIKKEAARASRKICFVCKKKGAAINCQKDQCL 60 70 80 90 100 110 100 110 120 130 140 150 pF1KA1 RSYHFPCGLQRECIFQFTGNFASFCWDHRPVQIITSNNYRESLPCTICLEFIEPIPSYNI :..:.::: .: :. :: :.. ::: :::.: : .. : : .: : . :: CCDS28 RNFHLPCGQERGCLSQFFGEYKSFCDKHRPTQNIQHGHVGEE-SCILCCEDLSQQSVENI 120 130 140 150 160 170 160 170 180 190 200 210 pF1KA1 LRSPCCKNAWFHRDCLQVQAINAGVFFFRCTICNNSDIFQKEMLRMGIHIPEKDASWELE .::::..: .:: :.: : ... 
::.: ::: : .:::::::::: CCDS28 -QSPCCSQAIYHRKCIQKYAHTSAKHFFKCPQCNNRKEFPQEMLRMGIHIP--------- 180 190 200 210 220 220 230 240 250 260 270 pF1KA1 ENAYQELLQHYERCDVRRCRCKEGRDYNAPDSKWEIKRCQCCGSSGTHLACSSLRSWEQN : .: . : ::: ::: :::::: .. CCDS28 ------------------------------DRRWCLILCATCGSHGTHRDCSSLRSNSKK 230 240 250 280 290 300 310 320 330 pF1KA1 WECLECRG------IIYNSGEFQKAKKHVLPNSNNVGITDCLLEESSPKLPRQSPGSQSK ::: :: : :::.. .. :. . : :::. : : . : CCDS28 WECEECSPAAATDYIPENSGDIPCCSSTFHPEEHFC--RDNTLEEN-PGLSWTDWPEPSL 260 270 280 290 300 310 340 350 360 370 380 390 pF1KA1 DLLRQGSKFRRNVSTLLIELGFQIKKKTKRLYINKANIWNSALDAFRNRNFNPSYAIEVA ..:. ::. : CCDS28 LEKPESSRGRRSYSWRSKGVRITNSCKKSK 320 330 340 >>CCDS14639.1 PHF6 gene_id:84295|Hs108|chrX (365 aa) initn: 265 init1: 237 opt: 295 Z-score: 317.0 bits: 68.2 E(32554): 2.5e-11 Smith-Waterman score: 295; 31.4% identity (65.4% similar) in 156 aa overlap (3-153:6-153) 10 20 30 40 50 pF1KA1 MNESKPGDSQNLACVFCRKHDDCPNKYGEKKTKEKWNLTVHYYCLLMSSGIWQRGKE :.: : ... : ::... : .. :. .:. ....:. :.:.::.. . .. CCDS14 MSSSVEQKKGPTRQRKCGFCKSNRD--KECGQLLISENQKVAAHHKCMLFSSALVSSHSD 10 20 30 40 50 60 70 80 90 100 110 pF1KA1 EEGVYGFLIEDIRKEVNRASKLKCCVCKKNGASIGCVAPRCKRSYHFPCGLQ-----REC .:.. :: :::..::..:..:: : .:. ::.::: . :.:.::. :.:. :: CCDS14 NESLGGFSIEDVQKEIKRGTKLMCSLCHCPGATIGCDVKTCHRTYHYHCALHDKAQIREK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KA1 IFQFTGNFASFCWDHRPVQIITSNNYRESLPCTICLEFIEPIPSYNILRSPCCKNAWFHR : : . .: :. :..: . .: .. . 
.:: CCDS14 PSQ--GIYMVYCRKHKK----TAHNSEADLEESFNEHELEPSSPKSKKKSRKGRPRKTNF 120 130 140 150 160 170 180 190 200 210 220 230 pF1KA1 DCLQVQAINAGVFFFRCTICNNSDIFQKEMLRMGIHIPEKDASWELEENAYQELLQHYER CCDS14 KGLSEDTRSTSSHGTDEMESSSYRDRSPHRSSPSDTRPKCGFCHVGEEENEARGKLHIFN 180 190 200 210 220 230

>>CCDS14640.1 PHF6 gene_id:84295|Hs108|chrX (312 aa)
 initn: 299 init1: 237 opt: 290 Z-score: 312.7 bits: 67.2 E(32554): 4.3e-11
Smith-Waterman score: 290; 33.6% identity (67.9% similar) in 131 aa overlap (3-128:6-132)

10 20 30 40 50 pF1KA1 MNESKPGDSQNLACVFCRKHDDCPNKYGEKKTKEKWNLTVHYYCLLMSSGIWQRGKE :.: : ... : ::... : .. :. .:. ....:. :.:.::.. . .. CCDS14 MSSSVEQKKGPTRQRKCGFCKSNRD--KECGQLLISENQKVAAHHKCMLFSSALVSSHSD 10 20 30 40 50 60 70 80 90 100 110 pF1KA1 EEGVYGFLIEDIRKEVNRASKLKCCVCKKNGASIGCVAPRCKRSYHFPCGLQ-----REC .:.. :: :::..::..:..:: : .:. ::.::: . :.:.::. :.:. :: CCDS14 NESLGGFSIEDVQKEIKRGTKLMCSLCHCPGATIGCDVKTCHRTYHYHCALHDKAQIREK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KA1 IFQFTGNFASFCWDHRPVQIITSNNYRESLPCTICLEFIEPIPSYNILRSPCCKNAWFHR : : . .: :. CCDS14 PSQ--GIYMVYCRKHKKTAHNSEAADLEESFNEHELEPSSPKSKKKSRKGRPRKTNFKGL 120 130 140 150 160 170

706 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov 2 21:00:15 2016 done: Wed Nov 2 21:00:16 2016
 Total Scan time: 3.110 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]