FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0677, 263 aa
1>>>pF1KE0677 263 - 263 aa - 263 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8597+/-0.000666; mu= 10.7452+/- 0.041
mean_var=160.3211+/-33.081, 0's: 0 Z-trim(116.4): 178 B-trim: 40 in 1/51
Lambda= 0.101293
statistics sampled from 16835 (17038) to 16835 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.827), E-opt: 0.2 (0.523), width: 16
Scan time: 2.570
The best scores are:                                opt  bits E(32554)
CCDS44388.2 DRGX gene_id:644168|Hs108|chr10 ( 263) 1786 271.7 3.7e-73
CCDS31468.1 ALX4 gene_id:60529|Hs108|chr11  ( 411)  422  72.6 5.1e-13
CCDS14215.1 ARX gene_id:170302|Hs108|chrX   ( 562)  403  69.9 4.3e-12
>>CCDS44388.2 DRGX gene_id:644168|Hs108|chr10 (263 aa)
initn: 1786 init1: 1786 opt: 1786 Z-score: 1426.9 bits: 271.7 E(32554): 3.7e-73
Smith-Waterman score: 1786; 100.0% identity (100.0% similar) in 263 aa overlap (1-263:1-263)
               10        20        30        40        50        60
pF1KE0 MFYFHCPPQLEGTATFGNHSSGDFDDGFLRRKQRRNRTTFTLQQLEALEAVFAQTHYPDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MFYFHCPPQLEGTATFGNHSSGDFDDGFLRRKQRRNRTTFTLQQLEALEAVFAQTHYPDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FTREELAMKINLTEARVQVWFQNRRAKWRKTERGASDQEPGAKEPMAEVTPPPVRNINSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 FTREELAMKINLTEARVQVWFQNRRAKWRKTERGASDQEPGAKEPMAEVTPPPVRNINSP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 PPGDQARSKKEALEAQQSLGRTVGPAGPFFPSCLPGTLLNTATYAQALSHVASLKGGPLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PPGDQARSKKEALEAQQSLGRTVGPAGPFFPSCLPGTLLNTATYAQALSHVASLKGGPLC
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 SCCVPDPMGLSFLPTYGCQSNRTASVATLRMKAREHSEAVLQSANLLPSTSSSPGPVAKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SCCVPDPMGLSFLPTYGCQSNRTASVATLRMKAREHSEAVLQSANLLPSTSSSPGPVAKP
              190       200       210       220       230       240

              250       260
pF1KE0 APPDGSQEKTSPTKEQSEAEKSV
       :::::::::::::::::::::::
CCDS44 APPDGSQEKTSPTKEQSEAEKSV
              250       260

>>CCDS31468.1 ALX4 gene_id:60529|Hs108|chr11 (411 aa)
initn: 409 init1: 330 opt: 422 Z-score: 347.2 bits: 72.6 E(32554): 5.1e-13
Smith-Waterman score: 422; 42.2% identity (62.3% similar) in 199 aa overlap (32-220:213-407)
10 20 30 40 50 60
pF1KE0 FYFHCPPQLEGTATFGNHSSGDFDDGFLRRKQRRNRTTFTLQQLEALEAVFAQTHYPDVF
:.:::::::: ::: :: :: .::::::.
CCDS31 KEAGVKGPQDRASSDLPSPLEKADSESNKGKKRRNRTTFTSYQLEELEKVFQKTHYPDVY
190 200 210 220 230 240
70 80 90 100 110 120
pF1KE0 TREELAMKINLTEARVQVWFQNRRAKWRKTERGASDQEPGAKEPMAEVTPPPVRNINSPP
.::.:::. .::::::::::::::::::: :: .. :. .. : : .: :
CCDS31 AREQLAMRTDLTEARVQVWFQNRRAKWRKRERFGQMQQVRTHFSTAYELPLLTRAENYAQ
250 260 270 280 290 300
130 140 150 160 170
pF1KE0 PGDQARSKKEALEAQQSLGRTVGPAGPFFPSCL------PGTLLNTAT----YAQALSHV
. . ... : . . : : : :.:. ::. ...: . : :::
CCDS31 IQNPSWLGNNG--AASPVPACVVPCDPV-PACMSPHAHPPGSGASSVTDFLSVSGAGSHV
310 320 330 340 350
180 190 200 210 220 230
pF1KE0 ASLKGGPLCSCCVPDPMGLSFLPTYGCQSNRTASVATLRMKAREHSEAVLQSANLLPSTS
.. . : : . .: ::. : . .:.:.:.:::::.::: :.
CCDS31 GQTHMGSLFGAASLSP-GLNGYELNGEPDRKTSSIAALRMKAKEHSAAISWAT
360 370 380 390 400 410
240 250 260
pF1KE0 SSPGPVAKPAPPDGSQEKTSPTKEQSEAEKSV
>>CCDS14215.1 ARX gene_id:170302|Hs108|chrX (562 aa)
initn: 420 init1: 348 opt: 403 Z-score: 330.5 bits: 69.9 E(32554): 4.3e-12
Smith-Waterman score: 456; 40.9% identity (61.1% similar) in 247 aa overlap (20-232:314-557)
10 20 30 40
pF1KE0 MFYFHCPPQLEGTATFGNHSSG-DFDDGFLRRKQRRNRTTFTLQQLEAL
:.: : ..:.:.::::: ::::: ::: :
CCDS14 TEGGELSPKEELLLHPEDAEGKDGEDSVCLSAGSDSEEGLLKRKQRRYRTTFTSYQLEEL
290 300 310 320 330 340
50 60 70 80 90 100
pF1KE0 EAVFAQTHYPDVFTREELAMKINLTEARVQVWFQNRRAKWRKTER-GASDQEPGAKEP--
: .: .::::::::::::::...::::::::::::::::::: :. ::. . :: :
CCDS14 ERAFQKTHYPDVFTREELAMRLDLTEARVQVWFQNRRAKWRKREKAGAQTHPPGLPFPGP
350 360 370 380 390 400
110 120 130 140 150 160
pF1KE0 MAEVTP-PPVRNINSPPPGDQARSKKEALEAQQSLGRTVGPAGPFFP--SCLP--GTLLN
.. . : : . . :: : .. : : . . .. :. : : . :: :. :.
CCDS14 LSATHPLSPYLDASPFPPHHPALDS--AWTAAAAAAAAAFPSLPPPPGSASLPPSGAPLG
410 420 430 440 450 460
170 180 190
pF1KE0 TATY--AQALSHVASLKGG---------PLCSCCV-------PDPM-------GLSFLPT
.:. : .. : : .. . :: : . : : : :.
CCDS14 LSTFLGAAVFRHPAFISPAFGRLFSTMAPLTSASTAAALLRQPTPAVEGAVASGALADPA
470 480 490 500 510 520
200 210 220 230 240 250
pF1KE0 YGCQSNRTASVATLRMKAREHSEAVLQSANLLPSTSSSPGPVAKPAPPDGSQEKTSPTKE
. . :..:.:.::.::.::. : : . :.::.::.
CCDS14 TAAADRRASSIAALRLKAKEHA-AQLTQLNILPGTSTGKEVC
530 540 550 560
260
pF1KE0 QSEAEKSV
263 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 04:16:59 2016 done: Sat Nov 5 04:16:59 2016
Total Scan time: 2.570 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
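
The hit lines listed under "The best scores are:" follow a regular layout (accession, description, library sequence length in parentheses, opt score, bit score, E-value), so they can be pulled out with a regular expression. The following is a minimal sketch only, not part of FASTA itself: the names `Hit`, `HIT_LINE`, and `parse_hit_line` are hypothetical, and the pattern assumes the column layout seen in this report.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'best scores' summary (hypothetical container)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed layout: accession, free-text description, "( length)",
# then opt, bits, and E-value separated by whitespace.
HIT_LINE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<e>\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None for any other line."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=float(m["e"]),
    )


if __name__ == "__main__":
    row = "CCDS31468.1 ALX4 gene_id:60529|Hs108|chr11 ( 411) 422 72.6 5.1e-13"
    print(parse_hit_line(row))
```

Lines that do not match, such as the table header or the `>>` alignment headers, return `None`, so the function can be applied to every line of a report and the results filtered.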