FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8945, 304 aa
  1>>>pF1KB8945 304 - 304 aa - 304 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.3569+/-0.000822; mu= 9.5948+/- 0.050
 mean_var=231.8799+/-48.456, 0's: 0 Z-trim(116.0): 157 B-trim: 113 in 1/52
 Lambda= 0.084225
 statistics sampled from 16366 (16542) to 16366 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.804), E-opt: 0.2 (0.508), width: 16
 Scan time: 2.860

The best scores are:                                      opt  bits E(32554)
CCDS3494.1 GSX2 gene_id:170825|Hs108|chr4          ( 304) 2106 267.9 6.7e-72
CCDS9326.1 GSX1 gene_id:219409|Hs108|chr13         ( 264)  516  74.7 8.8e-14

>>CCDS3494.1 GSX2 gene_id:170825|Hs108|chr4 (304 aa)
 initn: 2106 init1: 2106 opt: 2106 Z-score: 1404.4 bits: 267.9 E(32554): 6.7e-72
Smith-Waterman score: 2106; 99.7% identity (100.0% similar) in 304 aa overlap (1-304:1-304)

               10        20        30        40        50        60
pF1KB8 MSRSFYVDSLIIKDTSRPAPSLPEPHPGPDFFIPLGMPPPLVMSVSGPGCPSRKSGAFCV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MSRSFYVDSLIIKDTSRPAPSLPEPHPGPDFFIPLGMPPPLVMSVSGPGCPSRKSGAFCV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 CPLCVTSHLHSSRGSVGAGSGGAGAGVTGAGGSGVAGAAGALPLLKSQFSSAPGDAQFCP
       ::::::::::::::::::::::::::::::::::::::::::::::.:::::::::::::
CCDS34 CPLCVTSHLHSSRGSVGAGSGGAGAGVTGAGGSGVAGAAGALPLLKGQFSSAPGDAQFCP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 RVNHAHHHHHPPQHHHHHHQPQQPGSAAAAAAAAAAAAAAAALGHPQHHAPVCTATTYNV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 RVNHAHHHHHPPQHHHHHHQPQQPGSAAAAAAAAAAAAAAAALGHPQHHAPVCTATTYNV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 ADPRRFHCLTMGGSDASQVPNGKRMRTAFTSTQLLELEREFSSNMYLSRLRRIEIATYLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 ADPRRFHCLTMGGSDASQVPNGKRMRTAFTSTQLLELEREFSSNMYLSRLRRIEIATYLN
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 LSEKQVKIWFQNRRVKHKKEGKGTQRNSHAGCKCVGSQVHYARSEDEDSLSPASANDDKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 LSEKQVKIWFQNRRVKHKKEGKGTQRNSHAGCKCVGSQVHYARSEDEDSLSPASANDDKE
              250       260       270       280       290       300

pF1KB8 ISPL
       ::::
CCDS34 ISPL

>>CCDS9326.1 GSX1 gene_id:219409|Hs108|chr13 (264 aa)
 initn: 649 init1: 415 opt: 516 Z-score: 360.9 bits: 74.7 E(32554): 8.8e-14
Smith-Waterman score: 725; 45.0% identity (64.8% similar) in 318 aa overlap (1-302:1-261)

               10        20        30        40         50
pF1KB8 MSRSFYVDSLIIKDTSRPAPSLPEPHPGPDFFIPLGMPPPLVMSVSGPG-CPSRKSGAFC
       : ::: ::::.......     ::  : : :  : ..::: ..   .:: : .::.: .:
CCDS93 MPRSFLVDSLVLREAGEKKA--PEGSPPPLF--PYAVPPPHALHGLSPGACHARKAGLLC
               10        20            30        40        50

      60         70        80        90       100       110
pF1KB8 VCPLCVT-SHLHSSRGSVGAGSGGAGAGVTGAGGSGVAGAAGALPLLKSQFSSAPGDAQF
       ::::::: :.::.  :                          ::::::..:   :  .:.
CCDS93 VCPLCVTASQLHGPPGP------------------------PALPLLKASFP--PFGSQY
         60        70                                80          90

      120       130       140       150       160       170
pF1KB8 CPRVNHAHHHHHPPQHHHHHHQPQQPGSAAAAAAAAAAAAAAAALGHPQHHAPVCTATTY
       :          : :  ..  :.  .::    .: . ::::::::: .          :.:
CCDS93 C----------HAPLGRQ--HSAVSPG----VAHGPAAAAAAAALYQ----------TSY
                          100           110       120

      180       190       200       210       220       230
pF1KB8 NVADPRRFHCLTMGGSDASQVPNGKRMRTAFTSTQLLELEREFSSNMYLSRLRRIEIATY
        . :::.:::... .: ..:.:..:::::::::::::::::::.::::::::::::::::
CCDS93 PLPDPRQFHCISVDSS-SNQLPSSKRMRTAFTSTQLLELEREFASNMYLSRLRRIEIATY
          130       140        150       160       170       180

      240       250       260                   270        280
pF1KB8 LNLSEKQVKIWFQNRRVKHKKEGKGTQR------------NSHAGCKCVG-SQVHYARSE
       :::::::::::::::::::::::::...            ..  ::::.. :... ....
CCDS93 LNLSEKQVKIWFQNRRVKHKKEGKGSNHRGGGGGGAGGGGSAPQGCKCASLSSAKCSEDD
           190       200       210       220       230       240

         290        300
pF1KB8 DEDSLSPASAN-DDKEISPL
       ::  .::.:.. ::....
CCDS93 DELPMSPSSSGKDDRDLTVTP
           250       260


304 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 16:36:52 2016 done: Fri Nov 4 16:36:52 2016
 Total Scan time: 2.860 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]