FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8934, 284 aa 1>>>pF1KB8934 284 - 284 aa - 284 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4940+/-0.000823; mu= 8.3448+/- 0.050 mean_var=225.5429+/-47.007, 0's: 0 Z-trim(115.0): 114 B-trim: 810 in 1/52 Lambda= 0.085400 statistics sampled from 15409 (15533) to 15409 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.802), E-opt: 0.2 (0.477), width: 16 Scan time: 2.760 The best scores are: opt bits E(32554) CCDS11536.1 HOXB13 gene_id:10481|Hs108|chr17 ( 284) 1992 257.4 8.7e-69 CCDS8865.1 HOXC13 gene_id:3229|Hs108|chr12 ( 330) 848 116.5 2.6e-26 CCDS5412.1 HOXA13 gene_id:3209|Hs108|chr7 ( 388) 714 100.1 2.7e-21 CCDS2264.2 HOXD13 gene_id:3239|Hs108|chr2 ( 343) 669 94.5 1.1e-19 >>CCDS11536.1 HOXB13 gene_id:10481|Hs108|chr17 (284 aa) initn: 1992 init1: 1992 opt: 1992 Z-score: 1348.4 bits: 257.4 E(32554): 8.7e-69 Smith-Waterman score: 1992; 100.0% identity (100.0% similar) in 284 aa overlap (1-284:1-284) 10 20 30 40 50 60 pF1KB8 MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 KQCHPCPGVPQGTSPAPVPYGYFGGGYYSCRVSRSSLKPCAQAATLAAYPAETPTAGEEY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 KQCHPCPGVPQGTSPAPVPYGYFGGGYYSCRVSRSSLKPCAQAATLAAYPAETPTAGEEY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 PSRPTEFAFYPGYPGTYQPMASYLDVSVVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 PSRPTEFAFYPGYPGTYQPMASYLDVSVVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 QMCCQGEQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS11 QMCCQGEQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKF 190 200 210 220 230 240 250 260 270 280 pF1KB8 ITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP :::::::::::::::::::::::::::::::::::::::::::: CCDS11 ITKDKRRKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP 250 260 270 280 >>CCDS8865.1 HOXC13 gene_id:3229|Hs108|chr12 (330 aa) initn: 787 init1: 351 opt: 848 Z-score: 585.9 bits: 116.5 E(32554): 2.6e-26 Smith-Waterman score: 848; 48.2% identity (72.0% similar) in 282 aa overlap (3-279:51-323) 10 20 30 pF1KB8 MEPGNYATLDGAKDIEGLLGAGGGRNLVAHSP ::. ..:: . . :. :.:. : : CCDS88 DSAAESGIGGGGGGGGGGTGGAGGGCSGASPGKAPSMDG---LGSSCPASHCRDLLPH-P 30 40 50 60 70 40 50 60 70 80 90 pF1KB8 LTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCHPCPGVPQGTSPAPVPYGY-FGGGYYSCR . ..: :: : . . :.: . : .:: : : .: .: : . ::: :::.::.:: CCDS88 VLGRPPAPLGAPQ-GAVYTDIP-APEAARQCAP-PPAPPTSSSATLGYGYPFGGSYYGCR 80 90 100 110 120 130 100 110 120 130 140 pF1KB8 VSRS---SLKPCAQAATLAAYPAETPT-AGEEYPSRPTEFAFYPGYPGTYQPMASYLDVS .:.. . :::: :: . . :.. :: ::::::.. ..:: : .::::: CCDS88 LSHNVNLQQKPCAYHPG-DKYPEPSGALPGDDLSSRAKEFAFYPSFASSYQAMPGYLDVS 140 150 160 170 180 190 150 160 170 180 190 200 pF1KB8 VVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNSQMCCQGEQNPPGPFWKAAFADSSGQHP :: ... :::::.:.::..:: :::..::.::. :. ::. . .::. : : .: CCDS88 VVPGISGHPEPRHDALIPVEGYQHWALSNGWDSQVYCSKEQSQSAHLWKSPFPDVVPLQP 200 210 220 230 240 250 210 220 230 240 250 260 pF1KB8 PDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNR .. ..:::::::.::.: ::.:::.::::.:::::.:::.:::.:.:::::.::::::: CCDS88 -EVSSYRRGRKKRVPYTKVQLKELEKEYAASKFITKEKRRRISATTNLSERQVTIWFQNR 260 270 280 290 300 310 270 280 pF1KB8 RVKEKKVLAKVKNSATP :::::::..: : CCDS88 RVKEKKVVSKSKAPHLHST 320 330 >>CCDS5412.1 HOXA13 gene_id:3209|Hs108|chr7 (388 aa) initn: 868 init1: 457 opt: 714 Z-score: 495.9 bits: 100.1 E(32554): 2.7e-21 Smith-Waterman score: 871; 57.3% identity (75.2% similar) in 246 aa overlap (54-282:144-388) 30 40 50 60 70 80 pF1KB8 GRNLVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCHPCPGVPQGTS-PAPVPYGY :..:: ::: :: .. :..: :: .:::: CCDS54 PSAAAAAAAAAAAAAAAAAASSSGGPGPAGPAGAEAAKQCSPCSAAAQSSSGPAALPYGY 120 130 140 150 160 170 90 100 110 120 130 pF1KB8 FGGGYYSC-RVSR--SSLKPCAQAATLAAYPAETP----TAG---EEYPSRPTEFAFY-P ::.::: : :.. ...: ::: :. :: : . ::: ::. :: ::::: CCDS54 FGSGYYPCARMGPHPNAIKSCAQPASAAAAAAFADKYMDTAGPAAEEFSSRAKEFAFYHQ 180 190 200 210 220 230 140 150 160 170 180 pF1KB8 GYP-GTY---QPMASYLDVSVVQTLGAPGEPRHDSL-LPVDSYQSWALAGGWNSQMCCQG :: : : ::: .:::. :: ::.::: ::. : ::..::: ::: .:::.:: : CCDS54 GYAAGPYHHHQPMPGYLDMPVVPGLGGPGESRHEPLGLPMESYQPWALPNGWNGQMYCPK 240 250 260 270 280 290 190 200 210 220 230 240 pF1KB8 EQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKR :: : .::... : . :: :: ..:::::::.::.: ::.:::::::.::::::::: CCDS54 EQAQPPHLWKSTLPDVVS-HPSDASSYRRGRKKRVPYTKVQLKELEREYATNKFITKDKR 300 310 320 330 340 350 250 260 270 280 pF1KB8 RKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP :.:::.:.:::::.::::::::::::::. :.:... CCDS54 RRISATTNLSERQVTIWFQNRRVKEKKVINKLKTTS 360 370 380 >>CCDS2264.2 HOXD13 gene_id:3239|Hs108|chr2 (343 aa) initn: 740 init1: 568 opt: 669 Z-score: 466.5 bits: 94.5 E(32554): 1.1e-19 Smith-Waterman score: 749; 47.8% identity (74.1% similar) in 247 aa overlap (57-283:101-343) 30 40 50 60 70 80 pF1KB8 LVAHSPLTSHPAAPTLMPAVNYAPLDLPGSAEPPKQCH-PCPGVPQGTSP-AP-VPYGY- : : :.: : :.. .. : :: . ::: CCDS22 ASGFAYPGTSERTGSSSSSSSSAVVAARPEAPPAKECPAPTPAAAAAAPPSAPALGYGYH 80 90 100 110 120 130 90 100 110 120 pF1KB8 FGGGYYSCRVS------RSSLKPCAQAATLAAYPAE----------TPTAGEEYPSRPTE ::.::::::.: ...:: .: .:...:.: . . ..: :.: : CCDS22 FGNGYYSCRMSHGVGLQQNALKSSPHA-SLGGFPVEKYMDVSGLASSSVPANEVPARAKE 140 150 160 170 180 130 140 150 160 170 180 pF1KB8 FAFYPGYPGTYQPMASYLDVSVVQTLGAPGEPRHDSLLPVDSYQSWALAGGWNSQMCCQG .:: :: . :: . .:.: .:.:.:. :::::.. . ...::::.::.:::::. : CCDS22 VSFYQGYTSPYQHVPGYID--MVSTFGS-GEPRHEAYISMEGYQSWTLANGWNSQVYCTK 190 200 210 220 230 240 190 200 210 220 230 240 pF1KB8 EQNPPGPFWKAAFADSSGQHPPDACAFRRGRKKRIPYSKGQLRELEREYAANKFITKDKR .: . :::..: . . . :: :..:::::::.::.: ::.::: ::: ::::.:::: CCDS22 DQPQGSHFWKSSFPGDVALNQPDMCVYRRGRKKRVPYTKLQLKELENEYAINKFINKDKR 250 260 270 280 290 300 250 260 270 280 pF1KB8 RKISAATSLSERQITIWFQNRRVKEKKVLAKVKNSATP :.:::::.:::::.::::::::::.::...:.:.... CCDS22 RRISAATNLSERQVTIWFQNRRVKDKKIVSKLKDTVS 310 320 330 340 284 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 16:34:32 2016 done: Fri Nov 4 16:34:33 2016 Total Scan time: 2.760 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]