FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8980, 376 aa
  1>>>pF1KB8980 376 - 376 aa - 376 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.9068+/-0.000802; mu= 1.9726+/- 0.049
 mean_var=202.6803+/-41.877, 0's: 0 Z-trim(115.4): 150 B-trim: 871 in 2/51
 Lambda= 0.090088
 statistics sampled from 15790 (15956) to 15790 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.49), width: 16
 Scan time: 3.000

The best scores are:                                 opt  bits E(32554)
CCDS5403.1 HOXA2 gene_id:3199|Hs108|chr7     ( 376) 2513 338.5   6e-93
CCDS11527.1 HOXB2 gene_id:3212|Hs108|chr17   ( 356)  866 124.4 1.6e-28

>>CCDS5403.1 HOXA2 gene_id:3199|Hs108|chr7 (376 aa)
 initn: 2513 init1: 2513 opt: 2513 Z-score: 1782.2 bits: 338.5 E(32554): 6e-93
Smith-Waterman score: 2513; 100.0% identity (100.0% similar) in 376 aa overlap (1-376:1-376)

               10        20        30        40        50        60
pF1KB8 MNYEFEREIGFINSQPSLAECLTSFPPVADTFQSSSIKTSTLSHSTLIPPPFEQTIPSLN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MNYEFEREIGFINSQPSLAECLTSFPPVADTFQSSSIKTSTLSHSTLIPPPFEQTIPSLN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 PGSHPRHGAGGRPKPSPAGSRGSPVPAGALQPPEYPWMKEKKAAKKTALLPAAAAAATAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PGSHPRHGAGGRPKPSPAGSRGSPVPAGALQPPEYPWMKEKKAAKKTALLPAAAAAATAA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 ATGPACLSHKESLEIADGSGGGSRRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIAALL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ATGPACLSHKESLEIADGSGGGSRRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIAALL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 DLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCKSLEDSEKVEEDEEEKTLFEQALSVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCKSLEDSEKVEEDEEEKTLFEQALSVS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 GALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVSPLTSNEKNLKHFQHQSPTVPNCLST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 GALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVSPLTSNEKNLKHFQHQSPTVPNCLST
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 MGQNCGAGLNNDSPEALEVPSLQDFSVFSTDSCLQLSDAVSPSLPGSLDSPVDISADSLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MGQNCGAGLNNDSPEALEVPSLQDFSVFSTDSCLQLSDAVSPSLPGSLDSPVDISADSLD
              310       320       330       340       350       360

              370
pF1KB8 FFTDTLTTIDLQHLNY
       ::::::::::::::::
CCDS54 FFTDTLTTIDLQHLNY
              370

>>CCDS11527.1 HOXB2 gene_id:3212|Hs108|chr17 (356 aa)
 initn: 921 init1: 716 opt: 866 Z-score: 625.7 bits: 124.4 E(32554): 1.6e-28
Smith-Waterman score: 1037; 50.3% identity (67.5% similar) in 378 aa overlap (1-372:1-354)

       10 20 30 40 50 60
pF1KB8 MNYEFEREIGFINSQPSLAECLTSFPPVADTFQSSSIKTSTLSHSTLIPPPFEQTIPSLN
       ::.::::::::::::::::::::::: : .:::.:::: ::: :::::::.:::.
CCDS11 MNFEFEREIGFINSQPSLAECLTSFPAVLETFQTSSIKESTLIPP---PPPFEQTFPSLQ
       10 20 30 40 50

       70 80 90 100 110
pF1KB8 PGSHP--RHGAGGRPKPSPAGSRGSPVPAGALQP-PEYPWMKEKKAAKKTALLPAAAAAA
       ::. : . .:: : : : : ::.:::::::.::: . .. . :
CCDS11 PGASTLQRPRSQKRAEDGPALPPPPPPPLPAAPPAPEFPWMKEKKSAKKPSQSATSPSPA
       60 70 80 90 100 110

       120 130 140 150 160 170
pF1KB8 TAAATGPACLSHKESLEIADGSGGGSRRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIA
       ..:. . . : ..: . ...:::.::::::::::::::::::::::::::::::::::
CCDS11 ASAVPASGVGSPADGLGLPEAGGGGARRLRTAYTNTQLLELEKEFHFNKYLCRPRRVEIA
       120 130 140 150 160 170

       180 190 200 210 220 230
pF1KB8 ALLDLTERQVKVWFQNRRMKHKRQTQCKENQNSEGKCK-SLEDSEKVEEDEEEKTLFEQA
       :::::::::::::::::::::::::: .: ..: : .::: . . :: :
CCDS11 ALLDLTERQVKVWFQNRRMKHKRQTQHREPPDGEPACPGALED---ICDPAEEP-----A
       180 190 200 210 220

       240 250 260 270 280 290
pF1KB8 LSVSGALLEREGYTFQQNALSQQQAPNGHNGDSQSFPVSPLTSNEKNLKHFQHQSPTVPN
       : .: : .. . . .:.. ..: . . : ... . . :.
CCDS11 ASPGGPSASRAAW--EACCHPPEVVPGALSADPRPLAV-----------RLEGAGASSPG
       230 240 250 260 270

       300 310 320 330 340 350
pF1KB8 C-LSTMGQNCGAGLNNDSPEA-LEVPSLQDFSVFSTDSCLQLSDAVSPSLPGSLDSPVDI
       : : : . : .: . . : : :.. :..::::::: ..:::: ::::::: .
CCDS11 CALRGAGGLEPGPLPEDVFSGRQDSPFLPDLNFFAADSCLQLSGGLSPSLQGSLDSPVPF
       280 290 300 310 320 330

       360 370
pF1KB8 SADSLDFFTDTLTTIDLQHLNY
       : . :::::.:: .::::
CCDS11 SEEELDFFTSTLCAIDLQFP
       340 350

376 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 16:49:30 2016 done: Fri Nov 4 16:49:30 2016
 Total Scan time: 3.000 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]
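
Each row under "The best scores are:" carries an accession, a description, the library sequence length in parentheses, and then the opt score, bit score and E-value. The sketch below shows one way to pull those fields out of such a row in Python. It assumes every row has the same shape as the two rows in this report; the `Hit` record, the `parse_hit` function and the regular expression are illustrative names, not part of FASTA.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the "best scores" table (hypothetical record type)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row shape: "<accession> <description> ( <length>) <opt> <bits> <E>"
HIT_ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<e>\S+)\s*$"
)


def parse_hit(row: str) -> Optional[Hit]:
    """Return the parsed fields, or None when the row does not fit the assumed shape."""
    m = HIT_ROW.match(row.strip())
    if m is None:
        return None
    return Hit(
        accession=m["acc"],
        description=m["desc"],
        length=int(m["len"]),
        opt=int(m["opt"]),
        bits=float(m["bits"]),
        evalue=float(m["e"]),
    )


# The two rows reported in this search.
rows = [
    "CCDS5403.1 HOXA2 gene_id:3199|Hs108|chr7 ( 376) 2513 338.5 6e-93",
    "CCDS11527.1 HOXB2 gene_id:3212|Hs108|chr17 ( 356) 866 124.4 1.6e-28",
]
hits = [parse_hit(r) for r in rows]
```

Returning `None` for rows that do not match keeps the sketch safe to run over a whole report, where most lines are not table rows.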