FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9695, 264 aa 1>>>pF1KB9695 264 - 264 aa - 264 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.0890+/-0.000699; mu= 10.7060+/- 0.043 mean_var=200.7837+/-40.965, 0's: 0 Z-trim(117.4): 163 B-trim: 0 in 0/56 Lambda= 0.090513 statistics sampled from 18013 (18188) to 18013 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.844), E-opt: 0.2 (0.559), width: 16 Scan time: 2.260 The best scores are: opt bits E(32554) CCDS9326.1 GSX1 gene_id:219409|Hs108|chr13 ( 264) 1834 250.7 7.8e-67 CCDS3494.1 GSX2 gene_id:170825|Hs108|chr4 ( 304) 516 78.7 5.5e-15 >>CCDS9326.1 GSX1 gene_id:219409|Hs108|chr13 (264 aa) initn: 1834 init1: 1834 opt: 1834 Z-score: 1313.4 bits: 250.7 E(32554): 7.8e-67 Smith-Waterman score: 1834; 100.0% identity (100.0% similar) in 264 aa overlap (1-264:1-264) 10 20 30 40 50 60 pF1KB9 MPRSFLVDSLVLREAGEKKAPEGSPPPLFPYAVPPPHALHGLSPGACHARKAGLLCVCPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 MPRSFLVDSLVLREAGEKKAPEGSPPPLFPYAVPPPHALHGLSPGACHARKAGLLCVCPL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 CVTASQLHGPPGPPALPLLKASFPPFGSQYCHAPLGRQHSAVSPGVAHGPAAAAAAAALY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 CVTASQLHGPPGPPALPLLKASFPPFGSQYCHAPLGRQHSAVSPGVAHGPAAAAAAAALY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 QTSYPLPDPRQFHCISVDSSSNQLPSSKRMRTAFTSTQLLELEREFASNMYLSRLRRIEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 QTSYPLPDPRQFHCISVDSSSNQLPSSKRMRTAFTSTQLLELEREFASNMYLSRLRRIEI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 ATYLNLSEKQVKIWFQNRRVKHKKEGKGSNHRGGGGGGAGGGGSAPQGCKCASLSSAKCS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS93 ATYLNLSEKQVKIWFQNRRVKHKKEGKGSNHRGGGGGGAGGGGSAPQGCKCASLSSAKCS 190 200 210 220 230 240 250 260 pF1KB9 EDDDELPMSPSSSGKDDRDLTVTP :::::::::::::::::::::::: CCDS93 EDDDELPMSPSSSGKDDRDLTVTP 250 260 >>CCDS3494.1 GSX2 gene_id:170825|Hs108|chr4 (304 aa) initn: 701 init1: 415 opt: 516 Z-score: 382.5 bits: 78.7 E(32554): 5.5e-15 Smith-Waterman score: 675; 44.6% identity (64.3% similar) in 294 aa overlap (21-261:25-302) 10 20 30 40 50 pF1KB9 MPRSFLVDSLVLREAGEKKAPEGSPPPLFPYAVPPPHALHGLSPGACHARKAGLLC :. .: ..: ..::: .. .:: : .::.: .: CCDS34 MSRSFYVDSLIIKDTSRPAPSLPEPHPGPDFFIPLGMPPPLVMSVSGPG-CPSRKSGAFC 10 20 30 40 50 60 70 80 90 pF1KB9 VCPLCVTASQLHGPPGP------------------------PALPLLKASFP--PFGSQY ::::::: :.::. : ::::::..: : .:. CCDS34 VCPLCVT-SHLHSSRGSVGAGSGGAGAGVTGAGGSGVAGAAGALPLLKGQFSSAPGDAQF 60 70 80 90 100 110 100 110 120 pF1KB9 C----------HAPLGRQH--SAVSPGVAHGPAAAAAAAALYQ--------------TSY : : : ..: . .:: : . :::::::: :.: CCDS34 CPRVNHAHHHHHPPQHHHHHHQPQQPGSAAAAAAAAAAAAAAAALGHPQHHAPVCTATTY 120 130 140 150 160 170 130 140 150 160 170 180 pF1KB9 PLPDPRQFHCISVDSS-SNQLPSSKRMRTAFTSTQLLELEREFASNMYLSRLRRIEIATY . :::.:::... .: ..:.:..:::::::::::::::::::.:::::::::::::::: CCDS34 NVADPRRFHCLTMGGSDASQVPNGKRMRTAFTSTQLLELEREFSSNMYLSRLRRIEIATY 180 190 200 210 220 230 190 200 210 220 230 240 pF1KB9 LNLSEKQVKIWFQNRRVKHKKEGKGSNHRGGGGGGAGGGGSAPQGCKCASLSSAKCSEDD :::::::::::::::::::::::::... .. ::::.. :... .... CCDS34 LNLSEKQVKIWFQNRRVKHKKEGKGTQR------------NSHAGCKCVG-SQVHYARSE 240 250 260 270 280 250 260 pF1KB9 DELPMSPSSSGKDDRDLTVTP :: .::.:.. ::.... CCDS34 DEDSLSPASAN-DDKEISPL 290 300 264 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 18:23:15 2016 done: Fri Nov 4 18:23:16 2016 Total Scan time: 2.260 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]