FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8923, 270 aa
  1>>>pF1KB8923 270 - 270 aa - 270 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.9583+/-0.000855; mu= 1.3517+/- 0.052
 mean_var=248.8582+/-51.302, 0's: 0 Z-trim(115.7): 137 B-trim: 0 in 0/52
 Lambda= 0.081301
 statistics sampled from 16170 (16319) to 16170 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.501), width: 16
 Scan time: 2.850

The best scores are:                               opt  bits E(32554)
CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7   ( 270) 1854 229.4 2.1e-60
CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 ( 269) 1066 137.0 1.4e-32
CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12  ( 264)  529  74.0 1.2e-13
CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 ( 251)  527  73.7 1.4e-13
CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12  ( 222)  524  73.3 1.6e-13
CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7   ( 320)  519  72.9 3.2e-13
CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2   ( 255)  515  72.3 3.8e-13
CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 ( 224)  480  68.2 5.9e-12
CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12  ( 235)  467  66.7 1.8e-11
CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7   ( 233)  462  66.1 2.6e-11
CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 ( 153)  458  65.4 2.7e-11

>>CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7 (270 aa)
 initn: 1854 init1: 1854 opt: 1854 Z-score: 1198.0 bits: 229.4 E(32554): 2.1e-60
Smith-Waterman score: 1854; 100.0% identity (100.0% similar) in 270 aa overlap (1-270:1-270)

               10        20        30        40        50        60
pF1KB8 MSSYFVNSFCGRYPNGPDYQLHNYGDHSSVSEQFRDSASMHSGRYGYGYNGMDLSVGRSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MSSYFVNSFCGRYPNGPDYQLHNYGDHSSVSEQFRDSASMHSGRYGYGYNGMDLSVGRSG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 SGHFGSGERARSYAASASAAPAEPRYSQPATSTHSPQPDPLPCSAVAPSPGSDSHHGGKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SGHFGSGERARSYAASASAAPAEPRYSQPATSTHSPQPDPLPCSAVAPSPGSDSHHGGKN
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 SLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQASAQSEPSPAPPAQPQIYPWM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQASAQSEPSPAPPAQPQIYPWM
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 RKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 RKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIK
              190       200       210       220       230       240

              250       260       270
pF1KB8 IWFQNRRMKWKKDNKLKSMSMAAAGGAFRP
       ::::::::::::::::::::::::::::::
CCDS54 IWFQNRRMKWKKDNKLKSMSMAAAGGAFRP
              250       260       270

>>CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 (269 aa)
 initn: 951 init1: 560 opt: 1066 Z-score: 698.5 bits: 137.0 E(32554): 1.4e-32
Smith-Waterman score: 1066; 62.3% identity (78.3% similar) in 281 aa overlap (1-270:1-269)

               10        20        30        40        50        60
pF1KB8 MSSYFVNSFCGRYPNGPDYQLHNYGDHSSVSEQFRDSASMHSGRYGYGYNGMDLSVGRSG
       ::::::::: ::::::::::: :::. ::.: ..:: :.::.: :::.::::::::.::.
CCDS11 MSSYFVNSFSGRYPNGPDYQLLNYGSGSSLSGSYRDPAAMHTGSYGYNYNGMDLSVNRSS
               10        20        30        40        50        60

70 80 90 100 110 pF1KB8 --SGHFGS-GERARSYAASASAAPAEPRYSQPATSTHSPQPDPLPCSAVAPSPGSDSHHG :.:::. :: .:.. : :. :::. : :.: .:. :::. ..::: : CCDS11 ASSSHFGAVGESSRAFPAPAQ----EPRFRQAASSCSLSSPESLPCT------NGDSH-G 70 80 90 100 120 130 140 150 160 170 pF1KB8 GKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPA---SSEQASAQSEP----SPAP .: : :. : ...:.:. .. ..::. :.: . : : :: :: . :: CCDS11 AKPSASSPSDQATSASSSANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAP 110 120 130 140 150 160 180 190 200 210 220 pF1KB8 PAQ-PQIYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIA .: :::.::::::::::: .
::.::::::::::::::::::::::::::::::::::: CCDS11 EGQTPQIFPWMRKLHISHD-MTGPDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIA 170 180 190 200 210 220

     230       240       250       260       270
pF1KB8 HALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP
       :::::::::::::::::::::::::::::::.:.::.::.:
CCDS11 HALCLSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP
      230       240       250       260

>>CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 (264 aa)
 initn: 586 init1: 382 opt: 529 Z-score: 358.2 bits: 74.0 E(32554): 1.2e-13
Smith-Waterman score: 529; 49.2% identity (64.9% similar) in 185 aa overlap (96-267:54-231)

70 80 90 100 110 pF1KB8 SGERARSYAASASAAPAEPRYSQPATSTHSPQPDPLP--------CSAVAPSPGSDSHHG : : : : :... .::.. :: CCDS88 SQNSYIPEHSPEYYGRTRESGFQHHHQELYPPPPPRPSYPERQYSCTSLQ-GPGNSRGHG 30 40 50 60 70 80 120 130 140 150 160 170 pF1KB8 GKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQASAQSEPSPAPPAQPQIY ..: : . .. ::.. :: . : : ..:: : :: .: CCDS88 -----PAQAGHHHPEKSQSLCEPAPLSGASASPSPAPPACSQP-APDHPSSAASKQPIVY 90 100 110 120 130 180 190 200 210 220 230 pF1KB8 PWMRKLHIS--HDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLS :::.:.:.: . : .: : ::.:::::: :.::::::::.::::::::::::::.:::: CCDS88 PWMKKIHVSTVNPNYNGGEPKRSRTAYTRQQVLELEKEFHYNRYLTRRRRIEIAHSLCLS 140 150 160 170 180 190 240 250 260 270 pF1KB8 ERQIKIWFQNRRMKWKKDNKL---KSMSMAAAGGAFRP ::::::::::::::::::..: : : ::.: CCDS88 ERQIKIWFQNRRMKWKKDHRLPNTKVRSAPPAGAAPSTLSAATPGTSEDHSQSATPPEQQ 200 210 220 230 240 250 CCDS88 RAEDITRL 260

>>CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 (251 aa)
 initn: 514 init1: 396 opt: 527 Z-score: 357.2 bits: 73.7 E(32554): 1.4e-13
Smith-Waterman score: 536; 44.3% identity (61.0% similar) in 228 aa overlap (49-267:23-234)

20 30 40 50 60 70 pF1KB8 YQLHNYGDHSSVSEQFRDSASMHSGRYGYGYNGMDLSVGRSGSGHFGSGERARS-YAASA :. : . . :....:.: .: . : CCDS11 MAMSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRRESSFQPEA 10 20 30 40 50 80 90 100 110 120 130 pF1KB8 S----AAPAEPRYSQPATSTHSPQPDPLPCSAVAPSPGSDSHHGGKNSLSNSSGASADAG . :: . ::. : .: : : : : :: :: .
: :: CCDS11 GFGRRAACTVQRYA--ACRDPGPPPPPPPPPPPPPPPG----------LSPRAPAPPPAG 60 70 80 90 100 140 150 160 170 180 pF1KB8 STHISSREGVGTASGAEEDAPASSEQASAQSEPSPAPPA--QPQIYPWMRKLHIS--HDN . : : ..: :. .:::. : .: .::::::.:.: . : CCDS11 ALLPEP----GQRCEAVSSSPPPPPCAQNPLHPSPSHSACKEPVVYPWMRKVHVSTVNPN 110 120 130 140 150 190 200 210 220 230 240 pF1KB8 IGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIKIWFQNRRMK .: : ::.:::::: :.::::::::.:::::::::.::::::::::::::::::::::: CCDS11 YAGGEPKRSRTAYTRQQVLELEKEFHYNRYLTRRRRVEIAHALCLSERQIKIWFQNRRMK 160 170 180 190 200 210 250 260 270 pF1KB8 WKKDNKLKSMSMAAAGGAFRP ::::.:: . .. ..:.: CCDS11 WKKDHKLPNTKIRSGGAAGSAGGPPGRPNGGPRAL 220 230 240 250

>>CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12 (222 aa)
 initn: 523 init1: 410 opt: 524 Z-score: 356.0 bits: 73.3 E(32554): 1.6e-13
Smith-Waterman score: 625; 45.9% identity (66.4% similar) in 259 aa overlap (1-258:1-218)

10 20 30 40 50 60 pF1KB8 MSSYFVNSFCGRYPNGPDYQLHNYGDHSSVSEQFRDSASMHSGRYGYGYNGMDLSVGRSG :::: .::: . :: : :.... :...:.:: ....:: :: :.:::. CCDS88 MSSYVANSFYKQSPNIPAYNMQTCGNYGSASE-------VQASRYCYG--GLDLSITFPP 10 20 30 40 50 70 80 90 100 110 pF1KB8 SGHFGSGERARSYAASASAAPAEPRYSQPATSTHSPQPDPLPCSAVAP-SPGSDSHHGGK . .: .. ..::. : : .: : :. :.: : .:: .:: :..... CCDS88 PAPSNS-LHGVDMAANPRAHPDRPACSAAAAPGHAPGRD-----EAAPLNPGMYSQKAAR 60 70 80 90 100 120 130 140 150 160 170 pF1KB8 NSLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQASAQSEPSPAPPAQPQIYPW .: : . .:: .. :.. : .. :.: :::: ::::: CCDS88 PAL------------------EERAKSSGEIKEEQAQTGQPAGLSQP-PAPP---QIYPW 110 120 130 140 180 190 200 210 220 230 pF1KB8 MRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQI : :::.::.. .:::.::.::::::::::::::::::::::::::::.
:::.:::: CCDS88 MTKLHMSHET----DGKRSRTSYTRYQTLELEKEFHFNRYLTRRRRIEIANNLCLNERQI 150 160 170 180 190

     240       250       260       270
pF1KB8 KIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP
       ::::::::::::::.:.::
CCDS88 KIWFQNRRMKWKKDSKMKSKEAL
     200       210       220

>>CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 (320 aa)
 initn: 491 init1: 393 opt: 519 Z-score: 350.8 bits: 72.9 E(32554): 3.2e-13
Smith-Waterman score: 519; 46.3% identity (66.2% similar) in 201 aa overlap (73-267:90-287)

50 60 70 80 90 100 pF1KB8 GRYGYGYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPR-YSQPATSTHSPQPDPL : : ..: : : : :. . :::. CCDS54 LPHAGGGREPTASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQP 60 70 80 90 100 110 110 120 130 140 150 pF1KB8 PCSAVAPSPGSDSHHGGKNSLSN--SSGASADAGSTHISSREGV-GTASGAEEDAPASSE : .: .:. : . : . .: . : :. . . .. :. .:. .::: CCDS54 PAQAKGPAHGLHASHVLQPQLPPPLQPRAVPPAAPRRCEAAPATPGVPAGG--SAPACPL 120 130 140 150 160 170 160 170 180 190 200 210 pF1KB8 QASAQSEPSPAPPAQPQIYPWMRKLHISHDN--IGGPEGKRARTAYTRYQTLELEKEFHF . .: : .: .::::.:.:.: : .: : ::.:::::: :.::::::::: CCDS54 LLADKS-PLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHF 180 190 200 210 220 230 220 230 240 250 260 270 pF1KB8 NRYLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP ::::::::::::::.:::::::.::::::::::::::.:: . .: ....: CCDS54 NRYLTRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGK 240 250 260 270 280 290 CCDS54 AQTQSPHLHPHPHPSTSTPVPSSI 300 310 320

>>CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 (255 aa)
 initn: 442 init1: 414 opt: 515 Z-score: 349.5 bits: 72.3 E(32554): 3.8e-13
Smith-Waterman score: 515; 46.6% identity (65.7% similar) in 204 aa overlap (62-256:27-215)

40 50 60 70 80 90 pF1KB8 EQFRDSASMHSGRYGYGYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPRYSQPAT :..: . : :...:..: .: :. CCDS22 MVMSSYMVNSKYVDPKFPPCEEYLQGGYLGE-QGADYYGGGAQGADFQP----PGL 10 20 30 40 50 100 110 120 130 140 pF1KB8 STHSPQPD--PLPCSAVAPSPGSDSHHGGKNSLSNSSGASADAGSTHISSREGVGTASGA :.:: : .. .:.::: :... .. :. : ..
: : CCDS22 ---YPRPDFGEQPFGGSGPGPGSALPARGHGQEPGGPGG-------HYAAPGEPCPAPPA 60 70 80 90 100 150 160 170 180 190 200 pF1KB8 EEDAPASSEQASAQSEPSPAPPA----QPQI-YPWMRKLHIS--HDNIGGPEGKRARTAY :: . .: .::.:. : . :: . ::::.:.:.. . : : : ::.:::: CCDS22 PPPAPLPGARAYSQSDPKQPPSGTALKQPAVVYPWMKKVHVNSVNPNYTGGEPKRSRTAY 110 120 130 140 150 160 210 220 230 240 250 260 pF1KB8 TRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMA :: :.:::::::::::::::::::::::.::::::::::::::::::::::.:: CCDS22 TRQQVLELEKEFHFNRYLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNTKGR 170 180 190 200 210 220 270 pF1KB8 AAGGAFRP CCDS22 SSSSSSSSSCSSSVAPSQHLQPMAKDHHTDLTTL 230 240 250

>>CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 (224 aa)
 initn: 528 init1: 448 opt: 480 Z-score: 328.1 bits: 68.2 E(32554): 5.9e-12
Smith-Waterman score: 482; 46.0% identity (67.4% similar) in 187 aa overlap (95-264:30-215)

70 80 90 100 110 120 pF1KB8 GSGERARSYAASASAAPAEPRYSQPATSTHSPQPDPL---PCSAVAPSPGSDSHHGGKNS : ::: : . .:.::.:. . .. CCDS11 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPLRHYP-APYGPGPGQDKGFATSSY 10 20 30 40 50 130 140 150 160 170 pF1KB8 LSNSSG-----ASADAGSTHISSRE--GVGTASGAEEDAPASSE--QASAQSEPSPAPPA ..: : : : . :: .. . :::.:. : : ... .. : . CCDS11 YPPAGGGYGRAAPCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQDKSVFGET 60 70 80 90 100 110 180 190 200 210 220 pF1KB8 QPQ-----IYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIE . : .::::.... ... :: :.:.: .::::::::::::::.::::::::::: CCDS11 EEQKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRRIE 120 130 140 150 160 170 230 240 250 260 270 pF1KB8 IAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP :::::::.:::::::::::::::::..:: : :. .: CCDS11 IAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQAE 180 190 200 210 220

>>CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12 (235 aa)
 initn: 455 init1: 430 opt: 467 Z-score: 319.6 bits: 66.7 E(32554): 1.8e-11
Smith-Waterman score: 486; 47.4% identity (68.4% similar) in 196 aa overlap (75-267:38-213)

50 60 70 80 90 100 pF1KB8 YGYGYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPR-YSQPATSTHSPQPDPLPC ..
.:: :. : :: : .::: . . CCDS88 PSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTP---FYSPQENVVFS 10 20 30 40 50 60 110 120 130 140 150 160 pF1KB8 SAVAPSP-GSDSHHGGKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQASA :. .: ::.: . :. ::: . : .: .. .: :::: . CCDS88 SSRGPYDYGSNSFYQEKDMLSNCRQNTL-----------GHNTQTSIAQDF--SSEQ--G 70 80 90 100 170 180 190 200 210 220 pF1KB8 QSEPSPAPPAQPQIYPWMRKLHISHDNIG-GPEGKRARTAYTRYQTLELEKEFHFNRYLT .. :. :. ::::::.... ::...: : . .:.: :.:::::::::::::::::: CCDS88 RTAPQDQK-ASIQIYPWMQRMN-SHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLT 110 120 130 140 150 160 230 240 250 260 270 pF1KB8 RRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP ::::::::.::::.:::::::::::::::::...: : ...::: CCDS88 RRRRIEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREE 170 180 190 200 210 220 CCDS88 TEEEKQKE 230

>>CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7 (233 aa)
 initn: 492 init1: 444 opt: 462 Z-score: 316.4 bits: 66.1 E(32554): 2.6e-11
Smith-Waterman score: 462; 45.3% identity (66.1% similar) in 192 aa overlap (71-256:36-216)

50 60 70 80 90 pF1KB8 HSGRYGYGYNGMDLSVGRSGSGHFGSGERARSYAAS--ASAAPAEPRYSQPATSTHSPQP : . :: ::. : . :..: .: . CCDS54 VNPTFPGSLPSGQDSFLGQLPLYQAGYDALRPFPASYGASSLP-DKTYTSPCFYQQSNSV 10 20 30 40 50 60 100 110 120 130 140 150 pF1KB8 DPLPCSAVAPSPGSDSHHGGKNSLSNSSGASADAGSTHISSREGVGTA---SGAEEDAP- : :. .. :.. .. :. :::: .:: ....: : : .. : CCDS54 --LACNRASYEYGASCFYSDKDL----SGASP-SGS---GKQRGPGDYLHFSPEQQYKPD 70 80 90 100 110 160 170 180 190 200 210 pF1KB8 ASSEQASAQSEPSPAPPAQPQIYPWMRKLHISHDNIGGPEGKRARTAYTRYQTLELEKEF .:: :..: . . .::::.... .
: .:.:.: .::::::::::::: CCDS54 SSSGQGKALHDEGADRKYTSPVYPWMQRMNSCAGAVYGSHGRRGRQTYTRYQTLELEKEF 120 130 140 150 160 170 220 230 240 250 260 270 pF1KB8 HFNRYLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP :::::::::::::::.::::.:::::::::::::::::.::: CCDS54 HFNRYLTRRRRIEIANALCLTERQIKIWFQNRRMKWKKENKLINSTQPSGEDSEAKAGE 180 190 200 210 220 230 270 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 20:02:31 2016 done: Sat Nov 5 20:02:32 2016 Total Scan time: 2.850 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]