FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7253, 419 aa
1>>>pF1KB7253 419 - 419 aa - 419 aa
Library: human.CCDS.faa
18921897 residues in 33420 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.0243+/-0.000908; mu= -3.4570+/- 0.055
mean_var=259.9831+/-52.560, 0's: 0 Z-trim(115.1): 19 B-trim: 169 in 2/51
Lambda= 0.079543
statistics sampled from 15895 (15910) to 15895 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.787), E-opt: 0.2 (0.476), width: 16
Scan time: 1.560
The best scores are: opt bits E(33420)
CCDS10155.1 PYGO1 gene_id:26108|Hs109|chr15 ( 419) 2981 354.9 8.8e-98
CCDS81885.1 PYGO1 gene_id:26108|Hs109|chr15 ( 419) 2883 343.6 2.1e-94
CCDS1075.1 PYGO2 gene_id:90780|Hs109|chr1 ( 406) 556 76.6 5.1e-14
>>CCDS10155.1 PYGO1 gene_id:26108|Hs109|chr15 (419 aa)
initn: 2981 init1: 2981 opt: 2981 Z-score: 1869.2 bits: 354.9 E(33420): 8.8e-98
Smith-Waterman score: 2981; 100.0% identity (100.0% similar) in 419 aa overlap (1-419:1-419)
10 20 30 40 50 60
pF1KB7 MPAENSPAPAYKVSSHGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEYAPPPN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MPAENSPAPAYKVSSHGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEYAPPPN
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 PNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFRMPPHVPPRMSSPYCG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 PNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFRMPPHVPPRMSSPYCG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 PYSLRNQPHPFPQNPLGMGFNRPHAFNFGPHDNSSFGNPSYNNALSQNVNMPNQHFRQNP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 PYSLRNQPHPFPQNPLGMGFNRPHAFNFGPHDNSSFGNPSYNNALSQNVNMPNQHFRQNP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 AENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 AENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQD
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 FTQGATKNTNQNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 FTQGATKNTNQNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB7 NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEVNDDQDAILCE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEVNDDQDAILCE
310 320 330 340 350 360
370 380 390 400 410
pF1KB7 ASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTRETFGPSAVGSDA
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 ASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTRETFGPSAVGSDA
370 380 390 400 410
>>CCDS81885.1 PYGO1 gene_id:26108|Hs109|chr15 (419 aa)
initn: 2883 init1: 2883 opt: 2883 Z-score: 1808.4 bits: 343.6 E(33420): 2.1e-94
Smith-Waterman score: 2883; 97.1% identity (97.6% similar) in 419 aa overlap (1-419:1-419)
10 20 30 40 50 60
pF1KB7 MPAENSPAPAYKVSSHGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEYAPPPN
: ::. : .::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MSAEQEKDPISLKRVRGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLSEYAPPPN
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 PNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFRMPPHVPPRMSSPYCG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 PNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFRMPPHVPPRMSSPYCG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 PYSLRNQPHPFPQNPLGMGFNRPHAFNFGPHDNSSFGNPSYNNALSQNVNMPNQHFRQNP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 PYSLRNQPHPFPQNPLGMGFNRPHAFNFGPHDNSSFGNPSYNNALSQNVNMPNQHFRQNP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 AENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 AENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQD
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 FTQGATKNTNQNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 FTQGATKNTNQNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB7 NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEVNDDQDAILCE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEVNDDQDAILCE
310 320 330 340 350 360
370 380 390 400 410
pF1KB7 ASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTRETFGPSAVGSDA
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 ASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMRTRETFGPSAVGSDA
370 380 390 400 410
>>CCDS1075.1 PYGO2 gene_id:90780|Hs109|chr1 (406 aa)
initn: 634 init1: 407 opt: 556 Z-score: 365.4 bits: 76.6 E(33420): 5.1e-14
Smith-Waterman score: 874; 37.6% identity (59.2% similar) in 431 aa overlap (3-418:2-405)
10 20 30 40 50
pF1KB7 MPAENSPAPAYKVSSHGGD-------SGLDGLGGPGVQLGSPDKKKRKANTQGPSFPPLS
: ..: : :. . :: : : :.:. ::.::.::.:::::.. :.
CCDS10 MAASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNTQGPAYSHLT
10 20 30 40 50
60 70 80 90 100 110
pF1KB7 EYAPPPNPNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGGYSTFR-MPPHVPP
:.::::.: :::::.:::.:... . : .. :.:: : :::. . : .:::
CCDS10 EFAPPPTPMVDHLVASNPFEDDFG--APKVGVAAPPFLGSPVP-FGGFRVQGGMAGQVPP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB7 RMSSPYCG-PYSLRNQPHPFPQNPLGMGFNRP-HAFNFGPHDNSSFGNPSYNNALSQNVN
.:. : : :: :: ::: ::.: .:: : .. .. : : .: . .:. :.:: .
CCDS10 GYSTGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQPFNQPLGQNFS
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB7 MPNQHFRQNPAENFSQIPPQNASQVSNPDLASNFVPGNNSNFTSPLESNHSFIPPPNTFG
:. .. .:. .:. :. . ...: : .: .. : : ::
CCDS10 PPSGQMMPGPVGGFG---PMISPTMGQPPRAE----------LGPPSLSQRFAQPGAPFG
180 190 200 210 220
240 250 260 270 280
pF1KB7 QAKAPPPKQDFTQGATKNTNQNSSAHP-PHLNMDDTVNQSNIELKNVNRNNAVNQENSRS
: : : :: . :.: : : .. .... . : ..: :: .
CCDS10 ----PSPLQRPGQG-LPSLPPNTSPFPGPDPGFPGPGGEDGGKPLNPPASTAFPQEPHSG
230 240 250 260 270
290 300 310 320 330 340
pF1KB7 SSTEATNNNPANGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDP----VYPCGI
: . :.:.: . :. . :. :: . .:.. : :. .: :::::
CCDS10 SPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAG------GGSGPQPPPGLVYPCGA
280 290 300 310 320 330
350 360 370 380 390 400
pF1KB7 CTNEVNDDQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQLMR
: .::::::::::::::::::::: ::::::.::::::.::::::.:: :. :..: .
CCDS10 CRSEVNDDQDAILCEASCQKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVY
340 350 360 370 380 390
410
pF1KB7 TRETFGPSAVGSDA
:: .: ....:
CCDS10 IREGMGQLVAANDG
400
419 residues in 1 query sequences
18921897 residues in 33420 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Oct 24 21:57:08 2019 done: Thu Oct 24 21:57:08 2019
Total Scan time: 1.560 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]