FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6448, 349 aa
  1>>>pF1KB6448 349 - 349 aa - 349 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6039+/-0.00082; mu= 15.0369+/- 0.049
 mean_var=68.6856+/-13.858, 0's: 0 Z-trim(107.5): 23 B-trim: 0 in 0/51
 Lambda= 0.154754
 statistics sampled from 9590 (9607) to 9590 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.678), E-opt: 0.2 (0.295), width: 16
 Scan time: 2.420

The best scores are:                                 opt  bits E(32554)
CCDS7498.1 HIF1AN gene_id:55662|Hs108|chr10  ( 349) 2427 550.7 6.8e-157
CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3  ( 488)  325  81.4 1.7e-15
CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16   ( 416)  302  76.3 5.2e-14
CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16   ( 454)  302  76.3 5.6e-14

>>CCDS7498.1 HIF1AN gene_id:55662|Hs108|chr10 (349 aa)
 initn: 2427 init1: 2427 opt: 2427 Z-score: 2930.2 bits: 550.7 E(32554): 6.8e-157
Smith-Waterman score: 2427; 100.0% identity (100.0% similar) in 349 aa overlap (1-349:1-349)

               10        20        30        40        50        60
pF1KB6 MAATAAEAVASGSGEPREEAGALGPAWDESQLRSYSFPTRPIPRLSQSDPRAEELIENEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 MAATAAEAVASGSGEPREEAGALGPAWDESQLRSYSFPTRPIPRLSQSDPRAEELIENEE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 PVVLTDTNLVYPALKWDLEYLQENIGNGDFSVYSASTHKFLYYDEKKMANFQNFKPRSNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 PVVLTDTNLVYPALKWDLEYLQENIGNGDFSVYSASTHKFLYYDEKKMANFQNFKPRSNR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 EEMKFHEFVEKLQDIQQRGGEERLYLQQTLNDTVGRKIVMDFLGFNWNWINKQQGKRGWG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 EEMKFHEFVEKLQDIQQRGGEERLYLQQTLNDTVGRKIVMDFLGFNWNWINKQQGKRGWG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB6 QLTSNLLLIGMEGNVTPAHYDEQQNFFAQIKGYKRCILFPPDQFECLYPYPVHHPCDRQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 QLTSNLLLIGMEGNVTPAHYDEQQNFFAQIKGYKRCILFPPDQFECLYPYPVHHPCDRQS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB6 QVDFDNPDYERFPNFQNVVGYETVVGPGDVLYIPMYWWHHIESLLNGGITITVNFWYKGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 QVDFDNPDYERFPNFQNVVGYETVVGPGDVLYIPMYWWHHIESLLNGGITITVNFWYKGA
              250       260       270       280       290       300

              310       320       330       340
pF1KB6 PTPKRIEYPLKAHQKVAIMRNIEKMLGEALGNPQEVGPLLNTMIKGRYN
       :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 PTPKRIEYPLKAHQKVAIMRNIEKMLGEALGNPQEVGPLLNTMIKGRYN
              310       320       330       340

>>CCDS3017.1 HSPBAP1 gene_id:79663|Hs108|chr3 (488 aa)
 initn: 282 init1: 160 opt: 325 Z-score: 391.7 bits: 81.4 E(32554): 1.7e-15
Smith-Waterman score: 329; 27.0% identity (55.3% similar) in 300 aa overlap (51-341:32-309)

       30 40 50 60 70
pF1KB6 GALGPAWDESQLRSYSFPTRPIPRLSQSDPRAEELIEN-EEPVVLTDTNLVYPALKWDLE
       .:.:.: . ..:... . . .:: .:. .
CCDS30 AAGSEATTPVIVAAGAGGEEGEHVKPFKPEKAKEIIMSLQQPAIFCNMVFDWPARHWNAK
       10 20 30 40 50 60

       80 90 100 110 120 130
pF1KB6 YLQENIGNGDFSVYSASTHKFLYYDEKKMANFQNFKPRSNREEMKFHEFVEKLQDIQQRG
       ::.. : .. .: . :.:.. .:. : : ..::. : .. .
CCDS30 YLSQ--------VLHGKQIRF-RMGMKSMSTVPQFETTCNYVEATLEEFLTWNCDQSSIS
       70 80 90 100 110

       140 150 160 170 180 190
pF1KB6 GEERLYLQQTLNDTVGRKIVMDF------LGFNWNWINKQQGKRGWGQLTSNLLLIGMEG
       : : : .. . . : ... : . .: . :. :: .. : :: :
CCDS30 GPFRDYDHSKFWAYADYKYFVSLFEDKTDLFQDVKWSDFGFPGRN-GQEST--LWIGSLG
       120 130 140 150 160

       200 210 220 230 240 250
pF1KB6 NVTPAHYDEQQ-NFFAQIKGYKRCILFPPDQFECLYPYPV-HHPCDRQSQVDFDNPDYER
       :: : : :. :..: :: ::::.. ::: . .. . :... ::: .:
CCDS30 AHTPCHLDSYGCNLVFQVQGRKRWHLFPPEDTPFLYPTRIPYEESSVFSKINVVNPDLKR
       170 180 190 200 210 220

       260 270 280 290 300 310
pF1KB6 FPNFQNVVGYETVVGPGDVLYIPMYWWHHIESLLNGGITITVNFWYKGAPTPKRIEYPLK
       ::.:... . ....::.::..: .:::..::. .:...: : . .:
CCDS30 FPQFRKAQRHAVTLSPGQVLFVPRHWWHYVESI--DPVTVSINSWIE-------LEEDHL
       230 240 250 260 270 280

       320 330 340
pF1KB6 AHQKVAIMRNIEKMLGEALGNPQEVGPLLNTMIKGRYN
       :. . :: : . : : :::.. ::
CCDS30 ARVEEAITRMLVCALKTAE-NPQNTRAWLNPTEVEETSHAVNCCYLNAAVSAFFDRCRTS
       290 300 310 320 330

>>CCDS10627.1 KDM8 gene_id:79831|Hs108|chr16 (416 aa)
 initn: 333 init1: 165 opt: 302 Z-score: 365.0 bits: 76.3 E(32554): 5.2e-14
Smith-Waterman score: 335; 28.2% identity (56.1% similar) in 262 aa overlap (40-297:182-415)

       10 20 30 40 50 60
pF1KB6 ASGSGEPREEAGALGPAWDESQLRSYSFPTRPIPRLSQSDPRA--EELIENEEPVVLTDT
       . .::: . . . :... .::.: .
CCDS10 KRPARGSLPEQPCTKKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGRPVILKGV
       160 170 180 190 200 210

       70 80 90 100 110 120
pF1KB6 NLVYPAL-KWDLEYLQENIGNGDFSVYSASTHKFLYYDEKKMANFQNFKPRSNREEMKFH
       .: . ::.:::.:: : : .: : ::. .. : .
CCDS10 ADHWPCMQKWSLEYIQEIAGCRTVPVEVGSR----YTDEEW-----------SQTLMTVN
       220 230 240 250

       130 140 150 160 170 180
pF1KB6 EFVEKLQDIQQRGGEERLYL-QQTLNDTVGRKIVMDFLGFNWNWINKQQGKRGWGQLTSN
       ::. : . : . :: :. : : . .. .:. .. .. . . ..: :
CCDS10 EFISKYIVNEPR---DVGYLAQHQLFDQIP-ELKQDISIPDYCSLGDGEEE----EITIN
       260 270 280 290 300

       190 200 210 220 230 240
pF1KB6 LLLIGMEGNVTPAHYDEQQNFFAQIKGYKRCILFPPDQFECLYPYPVHHPCDRQSQVDFD
       . : .:...: : : ::::..:. : : :. :.. :::. .: :::: .
CCDS10 AWF-GPQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTHL-LHNTSQVDVE
       310 320 330 340 350 360

       250 260 270 280 290 300
pF1KB6 NPDYERFPNFQNVVGYETVVGPGDVLYIPMYWWHHIESLLNGGITITVNFWYKGAPTPKR
       ::: :.::.: .. ...::..:.::. .::....: ....:.::.
CCDS10 NPDLEKFPKFAKAPFLSCILSPGEILFIPVKYWHYVRAL---DLSFSVSFWWS
       370 380 390 400 410

       310 320 330 340
pF1KB6 IEYPLKAHQKVAIMRNIEKMLGEALGNPQEVGPLLNTMIKGRYN

>>CCDS45448.1 KDM8 gene_id:79831|Hs108|chr16 (454 aa)
 initn: 333 init1: 165 opt: 302 Z-score: 364.4 bits: 76.3 E(32554): 5.6e-14
Smith-Waterman score: 335; 28.2% identity (56.1% similar) in 262 aa overlap (40-297:220-453)

       10 20 30 40 50 60
pF1KB6 ASGSGEPREEAGALGPAWDESQLRSYSFPTRPIPRLSQSDPRA--EELIENEEPVVLTDT
       . .::: . . . :... .::.: .
CCDS45 KRPARGSLPEQPCTKKARADHGLIPDVKLEKTVPRLHRPSLQHFREQFLVPGRPVILKGV
       190 200 210 220 230 240

       70 80 90 100 110 120
pF1KB6 NLVYPAL-KWDLEYLQENIGNGDFSVYSASTHKFLYYDEKKMANFQNFKPRSNREEMKFH
       .: . ::.:::.:: : : .: : ::. .. : .
CCDS45 ADHWPCMQKWSLEYIQEIAGCRTVPVEVGSR----YTDEEW-----------SQTLMTVN
       250 260 270 280 290

       130 140 150 160 170 180
pF1KB6 EFVEKLQDIQQRGGEERLYL-QQTLNDTVGRKIVMDFLGFNWNWINKQQGKRGWGQLTSN
       ::. : . : . :: :. : : . .. .:. .. .. . . ..: :
CCDS45 EFISKYIVNEPR---DVGYLAQHQLFDQIP-ELKQDISIPDYCSLGDGEEE----EITIN
       300 310 320 330 340

       190 200 210 220 230 240
pF1KB6 LLLIGMEGNVTPAHYDEQQNFFAQIKGYKRCILFPPDQFECLYPYPVHHPCDRQSQVDFD
       . : .:...: : : ::::..:. : : :. :.. :::. .: :::: .
CCDS45 AWF-GPQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTHL-LHNTSQVDVE
       350 360 370 380 390 400

       250 260 270 280 290 300
pF1KB6 NPDYERFPNFQNVVGYETVVGPGDVLYIPMYWWHHIESLLNGGITITVNFWYKGAPTPKR
       ::: :.::.: .. ...::..:.::. .::....: ....:.::.
CCDS45 NPDLEKFPKFAKAPFLSCILSPGEILFIPVKYWHYVRAL---DLSFSVSFWWS
       410 420 430 440 450

       310 320 330 340
pF1KB6 IEYPLKAHQKVAIMRNIEKMLGEALGNPQEVGPLLNTMIKGRYN

349 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 17:32:57 2016 done: Fri Nov 4 17:32:57 2016
 Total Scan time: 2.420 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]