FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9833, 345 aa 1>>>pF1KB9833 345 - 345 aa - 345 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.3705+/-0.000901; mu= -5.1335+/- 0.055 mean_var=268.9228+/-53.870, 0's: 0 Z-trim(115.4): 16 B-trim: 0 in 0/54 Lambda= 0.078210 statistics sampled from 15997 (16008) to 15997 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.787), E-opt: 0.2 (0.492), width: 16 Scan time: 2.470 The best scores are: opt bits E(32554) CCDS11576.1 TOB1 gene_id:10140|Hs108|chr17 ( 345) 2334 275.8 3.6e-74 CCDS14015.1 TOB2 gene_id:10766|Hs108|chr22 ( 344) 1159 143.2 3e-34 >>CCDS11576.1 TOB1 gene_id:10140|Hs108|chr17 (345 aa) initn: 2334 init1: 2334 opt: 2334 Z-score: 1445.0 bits: 275.8 E(32554): 3.6e-74 Smith-Waterman score: 2334; 100.0% identity (100.0% similar) in 345 aa overlap (1-345:1-345) 10 20 30 40 50 60 pF1KB9 MQLEIQVALNFIISYLYNKLPRRRVNIFGEELERLLKKKYEGHWYPEKPYKGSGFRCIHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MQLEIQVALNFIISYLYNKLPRRRVNIFGEELERLLKKKYEGHWYPEKPYKGSGFRCIHI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 GEKVDPVIEQASKESGLDIDDVRGNLPQDLSVWIDPFEVSYQIGEKGPVKVLYVDDNNEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GEKVDPVIEQASKESGLDIDDVRGNLPQDLSVWIDPFEVSYQIGEKGPVKVLYVDDNNEN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 GCELDKEIKNSFNPEAQVFMPISDPASSVSSSPSPPFGHSAAVSPTFMPRSTQPLTFTTA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 GCELDKEIKNSFNPEAQVFMPISDPASSVSSSPSPPFGHSAAVSPTFMPRSTQPLTFTTA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 TFAATKFGSTKMKNSGRSNKVARTSPINLGLNVNDLLKQKAISSSMHSLYGLGLGSQQQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 TFAATKFGSTKMKNSGRSNKVARTSPINLGLNVNDLLKQKAISSSMHSLYGLGLGSQQQP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 QQQQQPAQPPPPPPPPQQQQQQKTSALSPNAKEFIFPNMQGQGSSTNGMFPGDSPLNLSP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QQQQQPAQPPPPPPPPQQQQQQKTSALSPNAKEFIFPNMQGQGSSTNGMFPGDSPLNLSP 250 260 270 280 290 300 310 320 330 340 pF1KB9 LQYSNAFDVFAAYGGLNEKSFVDGLNFSLNNMQYSNQQFQPVMAN ::::::::::::::::::::::::::::::::::::::::::::: CCDS11 LQYSNAFDVFAAYGGLNEKSFVDGLNFSLNNMQYSNQQFQPVMAN 310 320 330 340 >>CCDS14015.1 TOB2 gene_id:10766|Hs108|chr22 (344 aa) initn: 1084 init1: 680 opt: 1159 Z-score: 728.5 bits: 143.2 E(32554): 3e-34 Smith-Waterman score: 1197; 55.7% identity (73.5% similar) in 377 aa overlap (1-345:1-344) 10 20 30 40 50 60 pF1KB9 MQLEIQVALNFIISYLYNKLPRRRVNIFGEELERLLKKKYEGHWYPEKPYKGSGFRCIHI :::::.::::::::::::::::::...:::::::::::::::::::::: :::::::.:: CCDS14 MQLEIKVALNFIISYLYNKLPRRRADLFGEELERLLKKKYEGHWYPEKPLKGSGFRCVHI 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 GEKVDPVIEQASKESGLDIDDVRGNLPQDLSVWIDPFEVSYQIGEKGPVKVLYVDDNNEN :: ::::.: :.:.::: ..:::.:.:..:::::::::::::::::: :::::.::.. CCDS14 GEMVDPVVELAAKRSGLAVEDVRANVPEELSVWIDPFEVSYQIGEKGAVKVLYLDDSE-- 70 80 90 100 110 130 140 150 160 170 pF1KB9 GC---ELDKEIKNSFNPEAQVFMPISDPASSVSSSPSPPFGHSAAVSPTFMPRSTQPLTF :: :::::::.::::.::::.::.. ::.:.:::: ::.: ::::.:::.::.:: CCDS14 GCGAPELDKEIKSSFNPDAQVFVPIGSQDSSLSNSPSPSFGQSP--SPTFIPRSAQPITF 120 130 140 150 160 170 180 190 200 210 220 pF1KB9 TTATFAATKFGSTKMKNSGRSNK---VARTS---------PINLGLNVNDLLKQKAISSS :::.::::::::::::..: . . :: .. : .:.:::.:..: : CCDS14 TTASFAATKFGSTKMKKGGGAASGGGVASSGAGGQQPPQQPRMARSPTNSLLKHKSLSLS 180 190 200 210 220 230 230 240 250 260 270 pF1KB9 MHSLYGLGLGSQQQPQQQQQPAQPPPPPPPPQQQQQQKTSALSPNAKEFIF-----PNM- :::: . . : ::.: ::::::::.. :.. CCDS14 MHSLNFITAN------------------PAPQSQ-------LSPNAKEFVYNGGGSPSLF 240 250 260 270 280 290 300 310 320 pF1KB9 ----QGQGSSTNGMFPGDSPLNLSPLQYSNAFDVFAAYGG------LNEKSFVDGLNFSL .::::.: : : :.. . . :..::. ..:: :.. ::.::...: CCDS14 FDAADGQGSGTPGPFGGSGAGTCN----SSSFDMAQVFGGGANSLFLEKTPFVEGLSYNL 280 290 300 310 320 330 340 pF1KB9 NNMQYSNQQFQPV-MAN :.::: .:::::: .:: CCDS14 NTMQYPSQQFQPVVLAN 330 340 345 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 19:34:24 2016 done: Fri Nov 4 19:34:24 2016 Total Scan time: 2.470 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]