FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7253, 419 aa 1>>>pF1KB7253 419 - 419 aa - 419 aa Library: human.CCDS.faa 18921897 residues in 33420 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.0243+/-0.000908; mu= -3.4570+/- 0.055 mean_var=259.9831+/-52.560, 0's: 0 Z-trim(115.1): 19 B-trim: 169 in 2/51 Lambda= 0.079543 statistics sampled from 15895 (15910) to 15895 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.787), E-opt: 0.2 (0.476), width: 16 Scan time: 1.560 The best scores are: opt bits E(33420) CCDS10155.1 PYGO1 gene_id:26108|Hs109|chr15 ( 419) 2981 354.9 8.8e-98 CCDS81885.1 PYGO1 gene_id:26108|Hs109|chr15 ( 419) 2883 343.6 2.1e-94 CCDS1075.1 PYGO2 gene_id:90780|Hs109|chr1 ( 406) 556 76.6 5.1e-14 >>CCDS10155.1 PYGO1 gene_id:26108|Hs109|chr15 (419 aa) initn: 2981 init1: 2981 opt: 2981 Z-score: 1869.2 bits: 354.9 E(33420): 8.8e-98 Smith-Waterman score: 2981; 100.0% identity (100.0% similar) in 419 aa overlap (1-419:1-419) 10 20 30 40 50 60 pF1KB7 MPAENSPAPAYKVSSHGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEYAPPPN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 MPAENSPAPAYKVSSHGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEYAPPPN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 PNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFRMPPHVPPRMSSPYCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFRMPPHVPPRMSSPYCG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 PYSLRNQPHPFPQNPLGMGFNRPHAFNFGPHDNSSFGNPSYNNALSQNVNMPNQHFRQNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 PYSLRNQPHPFPQNPLGMGFNRPHAFNFGPHDNSSFGNPSYNNALSQNVNMPNQHFRQNP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 AENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 AENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 FTQGATKNTNQNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 FTQGATKNTNQNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEVNDDQDAILCE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEVNDDQDAILCE 310 320 330 340 350 360 370 380 390 400 410 pF1KB7 ASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTRETFGPSAVGSDA ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS10 ASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTRETFGPSAVGSDA 370 380 390 400 410 >>CCDS81885.1 PYGO1 gene_id:26108|Hs109|chr15 (419 aa) initn: 2883 init1: 2883 opt: 2883 Z-score: 1808.4 bits: 343.6 E(33420): 2.1e-94 Smith-Waterman score: 2883; 97.1% identity (97.6% similar) in 419 aa overlap (1-419:1-419) 10 20 30 40 50 60 pF1KB7 MPAENSPAPAYKVSSHGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEYAPPPN : ::. : .:::::::::::::::::::::::::::::::::::::::::::: CCDS81 MSAEQEKDPISLKRVRGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEYAPPPN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 PNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFRMPPHVPPRMSSPYCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 PNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFRMPPHVPPRMSSPYCG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 PYSLRNQPHPFPQNPLGMGFNRPHAFNFGPHDNSSFGNPSYNNALSQNVNMPNQHFRQNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 PYSLRNQPHPFPQNPLGMGFNRPHAFNFGPHDNSSFGNPSYNNALSQNVNMPNQHFRQNP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 AENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 AENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQD 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 FTQGATKNTNQNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 FTQGATKNTNQNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEVNDDQDAILCE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEVNDDQDAILCE 310 320 330 340 350 360 370 380 390 400 410 pF1KB7 ASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTRETFGPSAVGSDA ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS81 ASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTRETFGPSAVGSDA 370 380 390 400 410 >>CCDS1075.1 PYGO2 gene_id:90780|Hs109|chr1 (406 aa) initn: 634 init1: 407 opt: 556 Z-score: 365.4 bits: 76.6 E(33420): 5.1e-14 Smith-Waterman score: 874; 37.6% identity (59.2% similar) in 431 aa overlap (3-418:2-405) 10 20 30 40 50 pF1KB7 MPAENSPAPAYKVSSHGGD-------SGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLS : ..: : :. . :: : : :.:. ::.::.::.:::::.. :. CCDS10 MAASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNTQGPAYSHLT 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 EYAPPPNPNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFR-MPPHVPP :.::::.: :::::.:::.:... . : .. :.:: : :::. . : .::: CCDS10 EFAPPPTPMVDHLVASNPFEDDFG--APKVGVAAPPFLGSPVP-FGGFRVQGGMAGQVPP 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB7 RMSSPYCG-PYSLRNQPHPFPQNPLGMGFNRP-HAFNFGPHDNSSFGNPSYNNALSQNVN .:. : : :: :: ::: ::.: .:: : .. .. : : .: . .:. :.:: . CCDS10 GYSTGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQPFNQPLGQNFS 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB7 MPNQHFRQNPAENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFG :. .. .:. .:. :. . ...: : .: .. : : :: CCDS10 PPSGQMMPGPVGGFG---PMISPTMGQPPRAE----------LGPPSLSQRFAQPGAPFG 180 190 200 210 220 240 250 260 270 280 pF1KB7 QAKAPPPKQDFTQGATKNTNQNSSAHP-PHLNMDDTVNQSNIELKNVNRNNAVNQENSRS : : : :: . :.: : : .. .... . : ..: :: . CCDS10 ----PSPLQRPGQG-LPSLPPNTSPFPGPDPGFPGPGGEDGGKPLNPPASTAFPQEPHSG 230 240 250 260 270 290 300 310 320 330 340 pF1KB7 SSTEATNNNPANGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDP----VYPCGI : . :.:.: . :. . :. :: . .:.. : :. .: ::::: CCDS10 SPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAG------GGSGPQPPPGLVYPCGA 280 290 300 310 320 330 350 360 370 380 390 400 pF1KB7 CTNEVNDDQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMR : .::::::::::::::::::::: ::::::.::::::.::::::.:: :. :..: . CCDS10 CRSEVNDDQDAILCEASCQKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVY 340 350 360 370 380 390 410 pF1KB7 TRETFGPSAVGSDA :: .: ....: CCDS10 IREGMGQLVAANDG 400 419 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Oct 24 21:57:08 2019 done: Thu Oct 24 21:57:08 2019 Total Scan time: 1.560 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]