FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0677, 263 aa
  1>>>pF1KE0677 263 - 263 aa - 263 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.8597+/-0.000666; mu= 10.7452+/- 0.041
 mean_var=160.3211+/-33.081, 0's: 0 Z-trim(116.4): 178 B-trim: 40 in 1/51
 Lambda= 0.101293
 statistics sampled from 16835 (17038) to 16835 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.827), E-opt: 0.2 (0.523), width: 16
 Scan time: 2.570

The best scores are:                                opt  bits E(32554)
CCDS44388.2 DRGX gene_id:644168|Hs108|chr10 ( 263) 1786 271.7 3.7e-73
CCDS31468.1 ALX4 gene_id:60529|Hs108|chr11  ( 411)  422  72.6 5.1e-13
CCDS14215.1 ARX gene_id:170302|Hs108|chrX   ( 562)  403  69.9 4.3e-12

>>CCDS44388.2 DRGX gene_id:644168|Hs108|chr10 (263 aa)
 initn: 1786 init1: 1786 opt: 1786 Z-score: 1426.9 bits: 271.7 E(32554): 3.7e-73
Smith-Waterman score: 1786; 100.0% identity (100.0% similar) in 263 aa overlap (1-263:1-263)

               10        20        30        40        50        60
pF1KE0 MFYFHCPPQLEGTATFGNHSSGDFDDGFLRRKQRRNRTTFTLQQLEALEAVFAQTHYPDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MFYFHCPPQLEGTATFGNHSSGDFDDGFLRRKQRRNRTTFTLQQLEALEAVFAQTHYPDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FTREELAMKINLTEARVQVWFQNRRAKWRKTERGASDQEPGAKEPMAEVTPPPVRNINSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 FTREELAMKINLTEARVQVWFQNRRAKWRKTERGASDQEPGAKEPMAEVTPPPVRNINSP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 PPGDQARSKKEALEAQQSLGRTVGPAGPFFPSCLPGTLLNTATYAQALSHVASLKGGPLC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PPGDQARSKKEALEAQQSLGRTVGPAGPFFPSCLPGTLLNTATYAQALSHVASLKGGPLC
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 SCCVPDPMGLSFLPTYGCQSNRTASVATLRMKAREHSEAVLQSANLLPSTSSSPGPVAKP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SCCVPDPMGLSFLPTYGCQSNRTASVATLRMKAREHSEAVLQSANLLPSTSSSPGPVAKP
              190       200       210       220       230       240

              250       260
pF1KE0 APPDGSQEKTSPTKEQSEAEKSV
       :::::::::::::::::::::::
CCDS44 APPDGSQEKTSPTKEQSEAEKSV
              250       260

>>CCDS31468.1 ALX4 gene_id:60529|Hs108|chr11 (411 aa)
 initn: 409 init1: 330 opt: 422 Z-score: 347.2 bits: 72.6 E(32554): 5.1e-13
Smith-Waterman score: 422; 42.2% identity (62.3% similar) in 199 aa overlap (32-220:213-407)

10 20 30 40 50 60
pF1KE0 FYFHCPPQLEGTATFGNHSSGDFDDGFLRRKQRRNRTTFTLQQLEALEAVFAQTHYPDVF
:.:::::::: ::: :: :: .::::::.
CCDS31 KEAGVKGPQDRASSDLPSPLEKADSESNKGKKRRNRTTFTSYQLEELEKVFQKTHYPDVY
190 200 210 220 230 240

70 80 90 100 110 120
pF1KE0 TREELAMKINLTEARVQVWFQNRRAKWRKTERGASDQEPGAKEPMAEVTPPPVRNINSPP
.::.:::. .::::::::::::::::::: :: .. :. .. : : .: :
CCDS31 AREQLAMRTDLTEARVQVWFQNRRAKWRKRERFGQMQQVRTHFSTAYELPLLTRAENYAQ
250 260 270 280 290 300

130 140 150 160 170
pF1KE0 PGDQARSKKEALEAQQSLGRTVGPAGPFFPSCL------PGTLLNTAT----YAQALSHV
. . ... : . . : : : :.:. ::. ...: . : :::
CCDS31 IQNPSWLGNNG--AASPVPACVVPCDPV-PACMSPHAHPPGSGASSVTDFLSVSGAGSHV
310 320 330 340 350

180 190 200 210 220 230
pF1KE0 ASLKGGPLCSCCVPDPMGLSFLPTYGCQSNRTASVATLRMKAREHSEAVLQSANLLPSTS
.. . : : . .: ::. : . .:.:.:.:::::.::: :.
CCDS31 GQTHMGSLFGAASLSP-GLNGYELNGEPDRKTSSIAALRMKAKEHSAAISWAT
360 370 380 390 400 410

240 250 260
pF1KE0 SSPGPVAKPAPPDGSQEKTSPTKEQSEAEKSV

>>CCDS14215.1 ARX gene_id:170302|Hs108|chrX (562 aa)
 initn: 420 init1: 348 opt: 403 Z-score: 330.5 bits: 69.9 E(32554): 4.3e-12
Smith-Waterman score: 456; 40.9% identity (61.1% similar) in 247 aa overlap (20-232:314-557)

10 20 30 40
pF1KE0 MFYFHCPPQLEGTATFGNHSSG-DFDDGFLRRKQRRNRTTFTLQQLEAL
:.: : ..:.:.::::: ::::: ::: :
CCDS14 TEGGELSPKEELLLHPEDAEGKDGEDSVCLSAGSDSEEGLLKRKQRRYRTTFTSYQLEEL
290 300 310 320 330 340

50 60 70 80 90 100
pF1KE0 EAVFAQTHYPDVFTREELAMKINLTEARVQVWFQNRRAKWRKTER-GASDQEPGAKEP--
: .: .::::::::::::::...::::::::::::::::::: :. ::. . :: :
CCDS14 ERAFQKTHYPDVFTREELAMRLDLTEARVQVWFQNRRAKWRKREKAGAQTHPPGLPFPGP
350 360 370 380 390 400

110 120 130 140 150 160
pF1KE0 MAEVTP-PPVRNINSPPPGDQARSKKEALEAQQSLGRTVGPAGPFFP--SCLP--GTLLN
.. . : : . . :: : .. : : . . .. :. : : . :: :. :.
CCDS14 LSATHPLSPYLDASPFPPHHPALDS--AWTAAAAAAAAAFPSLPPPPGSASLPPSGAPLG
410 420 430 440 450 460

170 180 190
pF1KE0 TATY--AQALSHVASLKGG---------PLCSCCV-------PDPM-------GLSFLPT
.:. : .. : : .. . :: : . : : : :.
CCDS14 LSTFLGAAVFRHPAFISPAFGRLFSTMAPLTSASTAAALLRQPTPAVEGAVASGALADPA
470 480 490 500 510 520

200 210 220 230 240 250
pF1KE0 YGCQSNRTASVATLRMKAREHSEAVLQSANLLPSTSSSPGPVAKPAPPDGSQEKTSPTKE
. . :..:.:.::.::.::. : : . :.::.::.
CCDS14 TAAADRRASSIAALRLKAKEHA-AQLTQLNILPGTSTGKEVC
530 540 550 560

260
pF1KE0 QSEAEKSV

263 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 04:16:59 2016 done: Sat Nov 5 04:16:59 2016
 Total Scan time: 2.570 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]