FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9701, 320 aa
  1>>>pF1KB9701 320 - 320 aa - 320 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 10.7634+/-0.00104; mu= -3.8367+/- 0.063
 mean_var=492.2584+/-101.129, 0's: 0 Z-trim(118.4): 105 B-trim: 186 in 1/51
 Lambda= 0.057807
 statistics sampled from 19244 (19353) to 19244 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.846), E-opt: 0.2 (0.594), width: 16
 Scan time: 3.300

The best scores are:                                  opt  bits E(32554)
CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7      ( 320) 2264 202.2 4.6e-52
CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17    ( 251)  756  76.3 2.8e-14
CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2      ( 255)  747  75.5 4.8e-14
CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12     ( 264)  705  72.1 5.6e-13

>>CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7 (320 aa)
 initn: 2264 init1: 2264 opt: 2264 Z-score: 1048.2 bits: 202.2 E(32554): 4.6e-52
Smith-Waterman score: 2264; 99.4% identity (99.4% similar) in 320 aa overlap (1-320:1-320)

               10        20        30        40        50        60
pF1KB9 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 PHAGGGREPPASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
       ::::::::: ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PHAGGGREPTASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 AQAKGPAHGLHASHVLQPQPPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLA
       ::::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::
CCDS54 AQAKGPAHGLHASHVLQPQLPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 DKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 TRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQTQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 TRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQTQ
              250       260       270       280       290       300

              310       320
pF1KB9 SPHLHPHPHPSTSTPVPSSI
       ::::::::::::::::::::
CCDS54 SPHLHPHPHPSTSTPVPSSI
              310       320

>>CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 (251 aa)
 initn: 904 init1: 691 opt: 756 Z-score: 369.7 bits: 76.3 E(32554): 2.8e-14
Smith-Waterman score: 868; 49.7% identity (63.9% similar) in 296 aa overlap (1-296:1-243)

10 20 30 40 50 60
pF1KB9 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
:.::::::::::..::::: :::.: : .: .:: : :. .. .:
CCDS11 MAMSSFLINSNYVDPKFPPCEEYSQ-SDYLPSDHSPGYYAGGQR------RESSFQ----
10 20 30 40

70 80 90 100 110 120
pF1KB9 PHAGGGREPPASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
:.:: ::. . . :.:. : : :: : ::
CCDS11 PEAGFGRRAACTVQRYAACRDPGPP------------------------PPPPPPPPPPP
50 60 70 80

130 140 150 160 170 180
pF1KB9 AQAKGPAHGLHASHVLQPQPPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLA
. .: : ::: :. : .::::. ..: : .. : :
CCDS11 PPGLSPR---------APAPPPA---GALLPEPGQRCEAVSSSPPPPPCAQNPLHP----
90 100 110 120

190 200 210 220 230 240
pF1KB9 DKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYL
:: :::::::::.:.:::.:::.: ::::::::::::::::::::::::.::::
CCDS11 --SPSHSACKEPVVYPWMRKVHVSTVNPNYAGGEPKRSRTAYTRQQVLELEKEFHYNRYL
130 140 150 160 170 180

250 260 270 280 290 300
pF1KB9 TRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQTQ
:::::.::::.:::::::.:::::::::::::::::::::.::...:....::::.
CCDS11 TRRRRVEIAHALCLSERQIKIWFQNRRMKWKKDHKLPNTKIRSGGAAGSAGGPPGRPNGG
190 200 210 220 230 240

310 320
pF1KB9 SPHLHPHPHPSTSTPVPSSI

CCDS11 PRAL
250

>>CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2 (255 aa)
 initn: 815 init1: 622 opt: 747 Z-score: 365.6 bits: 75.5 E(32554): 4.8e-14
Smith-Waterman score: 803; 49.4% identity (61.7% similar) in 308 aa overlap (1-306:1-243)

10 20 30 40 50 60
pF1KB9 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
:.:::...::.:..::::: ::: : .: : .:. : : :
CCDS22 MVMSSYMVNSKYVDPKFPPCEEYLQ-GGYLGEQGADYYGGGAQ-----------------
10 20 30 40

70 80 90 100 110 120
pF1KB9 PHAGGGREPPASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
:. .::. : :: : : :.: :..:: : :
CCDS22 ---GADFQPPGLY--PR-------------PDFGE----QPFG---GSGPG--PGSALP-
50 60 70

130 140 150 160 170
pF1KB9 AQAKGPAHGLHASHVLQPQPPPPLQPRAVPPAAPRRCEAAPATPGVPA-GGSAPACPLLL
:...: : ..: : : : : .:: :: ::. : . : : :
CCDS22 ARGHGQEPGGPGGHYAAPGEPCPAPP--APPPAP--------LPGARAYSQSDPKQP---
80 90 100 110 120

180 190 200 210 220 230
pF1KB9 ADKSPLGLKGKEP-VVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNR
: : :.: ::::::::.::..:::.:.:::::::::::::::::::::::::::
CCDS22 ----PSGTALKQPAVVYPWMKKVHVNSVNPNYTGGEPKRSRTAYTRQQVLELEKEFHFNR
130 140 150 160 170

240 250 260 270 280 290
pF1KB9 YLTRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASASAGPPGKAQ
::::::::::::::::::::.::::::::::::::::::::: :::.:.:.:. . :
CCDS22 YLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNTKGRSSSSSSSSSCSSSVAP
180 190 200 210 220 230

300 310 320
pF1KB9 TQSPHLHPHPHPSTSTPVPSSI
.: ::.:
CCDS22 SQ--HLQPMAKDHHTDLTTL
240 250

>>CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12 (264 aa)
 initn: 853 init1: 659 opt: 705 Z-score: 346.5 bits: 72.1 E(32554): 5.6e-13
Smith-Waterman score: 758; 46.0% identity (59.8% similar) in 311 aa overlap (1-306:1-252)

10 20 30 40 50 60
pF1KB9 MTMSSFLINSNYIEPKFPPFEEYAQHSGSGGADGGPGGGPGYQQPPAPPTQHLPLQQPQL
: :::.:..::::.::::: :::.:.: : : . .. .
CCDS88 MIMSSYLMDSNYIDPKFPPCEEYSQNS---------------YIPEHSPEYYGRTRESGF
10 20 30 40

70 80 90 100 110 120
pF1KB9 PHAGGGREPPASYYAPRTAREPAYPAAALYPAHGAADTAYPYGYRGGASPGRPPQPEQPP
: :: :: :.:: .. .. : . :: . :
CCDS88 QHHHQELYPPPP---PR----PSYPERQ----YSCTSLQGPGNSRG-----------HGP
50 60 70 80

130 140 150 160 170 180
pF1KB9 AQAKGPAHGLHASHVLQPQPPPPLQPRAVPPAAPRRCEAAPATPGVPAGGSAPACPLLLA
::: : : ... . .: : .: :.:. :: :::
CCDS88 AQA-GHHHPEKSQSLCEPAP----------------LSGASASPS-PA---PPACSQPAP
90 100 110 120

190 200 210 220 230 240
pF1KB9 DKSPLGLKGKEPVVYPWMKKIHVSAVNPSYNGGEPKRSRTAYTRQQVLELEKEFHFNRYL
:. : . .:.:.:::::::::::.:::.::::::::::::::::::::::::::.::::
CCDS88 DH-PSSAASKQPIVYPWMKKIHVSTVNPNYNGGEPKRSRTAYTRQQVLELEKEFHYNRYL
130 140 150 160 170 180

250 260 270 280 290
pF1KB9 TRRRRIEIAHTLCLSERQVKIWFQNRRMKWKKDHKLPNTKMRSSNSASA-----SAGPPG
::::::::::.:::::::.:::::::::::::::.:::::.::. :.: ::. ::
CCDS88 TRRRRIEIAHSLCLSERQIKIWFQNRRMKWKKDHRLPNTKVRSAPPAGAAPSTLSAATPG
190 200 210 220 230 240

300 310 320
pF1KB9 KAQTQSPHLHPHPHPSTSTPVPSSI
.. .: :
CCDS88 TSEDHSQSATPPEQQRAEDITRL
250 260

320 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 23:25:43 2016 done: Fri Nov 4 23:25:43 2016
 Total Scan time: 3.300 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]