FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9475, 343 aa
  1>>>pF1KB9475 343 - 343 aa - 343 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.0811+/-0.000783; mu= 13.3852+/- 0.047
 mean_var=92.3811+/-18.367, 0's: 0 Z-trim(109.7): 8 B-trim: 7 in 1/52
 Lambda= 0.133439
 statistics sampled from 11094 (11096) to 11094 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.718), E-opt: 0.2 (0.341), width: 16
 Scan time: 2.940

The best scores are:                                 opt  bits E(32554)
CCDS83050.1 UIMC1 gene_id:51720|Hs108|chr5  ( 553)  2037 402.1 5.6e-112
CCDS4408.1 UIMC1 gene_id:51720|Hs108|chr5   ( 719)  2032 401.2 1.3e-111

>>CCDS83050.1 UIMC1 gene_id:51720|Hs108|chr5 (553 aa)
 initn: 2039 init1: 1224 opt: 2037 Z-score: 2123.8 bits: 402.1 E(32554): 5.6e-112
Smith-Waterman score: 2037; 92.2% identity (95.1% similar) in 344 aa overlap (6-343:211-553)

                                        10        20        30
pF1KB9                          MLPLPDLDLWPLDRLPSPIKRKPQTLGSLKSSQGI
                                     ....  .::  .   .. :  :. .:::::
CCDS83 EPWDHTEKTEEEPVSGSSGSWDQSSQPVFENVNVKSFDRCTGHSAEHTQC-GKPQSSQGI
              190       200       210       220       230

          40        50        60        70        80        90
pF1KB9 VEETSEEGNSVPASQSVAALTSKRSLVLMPESSAEEITVCPETQLSSSETFDLEREVSPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 VEETSEEGNSVPASQSVAALTSKRSLVLMPESSAEEITVCPETQLSSSETFDLEREVSPG
     240       250       260       270       280       290

         100       110       120       130       140       150
pF1KB9 SRDILDGVRIIMADKEVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 SRDILDGVRIIMADKEVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCN
     300       310       320       330       340       350

         160       170       180       190       200       210
pF1KB9 GLMEEDTVLTRRQKEAKTKSDSGTAAQTSLDIDKNEKCYLCKSLVPFREYQCHVDSCLQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 GLMEEDTVLTRRQKEAKTKSDSGTAAQTSLDIDKNEKCYLCKSLVPFREYQCHVDSCLQL
     360       370       380       390       400       410

               220       230       240       250       260
pF1KB9 AKA------EGSGRACSTVEGKWQQRLKNPKEKGHSEGRLLSFLEQSEHKTSDADIKSSE
       :::      :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 AKADQGDGPEGSGRACSTVEGKWQQRLKNPKEKGHSEGRLLSFLEQSEHKTSDADIKSSE
     420       430       440       450       460       470

     270       280       290       300       310       320
pF1KB9 TGAFRVPSPGMEEAGCSREMQSSFTRRDLNESPVKSFVSISEATDCLVDFKKQVTVQPGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS83 TGAFRVPSPGMEEAGCSREMQSSFTRRDLNESPVKSFVSISEATDCLVDFKKQVTVQPGS
     480       490       500       510       520       530

     330       340
pF1KB9 RTRTKAGRGRRRKF
       ::::::::::::::
CCDS83 RTRTKAGRGRRRKF
     540       550

>>CCDS4408.1 UIMC1 gene_id:51720|Hs108|chr5 (719 aa)
 initn: 2039 init1: 1224 opt: 2032 Z-score: 2117.0 bits: 401.2 E(32554): 1.3e-111
Smith-Waterman score: 2032; 95.2% identity (97.0% similar) in 331 aa overlap (19-343:390-719)

                           10        20        30        40
pF1KB9             MLPLPDLDLWPLDRLPSPIKRKPQTLGSLKSSQGIVEETSEEGNSVPA
                                     ....: :  . .::::::::::::::::::
CCDS44 ERQESRASDWHSKTKDFQESSIKSLKEKLLLEEEPTTSHG-QSSQGIVEETSEEGNSVPA
     360       370       380       390        400       410

       50        60        70        80        90       100
pF1KB9 SQSVAALTSKRSLVLMPESSAEEITVCPETQLSSSETFDLEREVSPGSRDILDGVRIIMA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SQSVAALTSKRSLVLMPESSAEEITVCPETQLSSSETFDLEREVSPGSRDILDGVRIIMA
      420       430       440       450       460       470

      110       120       130       140       150       160
pF1KB9 DKEVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCNGLMEEDTVLTRRQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 DKEVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCNGLMEEDTVLTRRQ
      480       490       500       510       520       530

      170       180       190       200       210             220
pF1KB9 KEAKTKSDSGTAAQTSLDIDKNEKCYLCKSLVPFREYQCHVDSCLQLAKA------EGSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::      ::::
CCDS44 KEAKTKSDSGTAAQTSLDIDKNEKCYLCKSLVPFREYQCHVDSCLQLAKADQGDGPEGSG
      540       550       560       570       580       590

            230       240       250       260       270       280
pF1KB9 RACSTVEGKWQQRLKNPKEKGHSEGRLLSFLEQSEHKTSDADIKSSETGAFRVPSPGMEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RACSTVEGKWQQRLKNPKEKGHSEGRLLSFLEQSEHKTSDADIKSSETGAFRVPSPGMEE
      600       610       620       630       640       650

            290       300       310       320       330       340
pF1KB9 AGCSREMQSSFTRRDLNESPVKSFVSISEATDCLVDFKKQVTVQPGSRTRTKAGRGRRRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 AGCSREMQSSFTRRDLNESPVKSFVSISEATDCLVDFKKQVTVQPGSRTRTKAGRGRRRK
      660       670       680       690       700       710

pF1KB9 F
       :
CCDS44 F

343 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 23:44:23 2016 done: Thu Nov 3 23:44:24 2016
 Total Scan time: 2.940 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]