FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8449, 135 aa 1>>>pF1KB8449 135 - 135 aa - 135 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.7550+/-0.000309; mu= 13.4735+/- 0.019 mean_var=51.1673+/-10.470, 0's: 0 Z-trim(115.5): 37 B-trim: 1076 in 1/52 Lambda= 0.179299 statistics sampled from 25914 (25951) to 25914 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.705), E-opt: 0.2 (0.304), width: 16 Scan time: 4.560 The best scores are: opt bits E(85289) NP_002296 (OMIM: 150570) galectin-1 [Homo sapiens] ( 135) 935 249.0 1.7e-66 NP_006489 (OMIM: 150571,608446) galectin-2 [Homo s ( 132) 391 108.3 3.7e-24 NP_002298 (OMIM: 600615) galectin-7 [Homo sapiens] ( 136) 244 70.3 1.1e-12 NP_001035972 (OMIM: 617139) galectin-7 [Homo sapie ( 136) 244 70.3 1.1e-12 NP_002297 (OMIM: 153619) galectin-3 isoform 1 [Hom ( 250) 236 68.4 7.6e-12 XP_011535061 (OMIM: 153619) PREDICTED: galectin-3 ( 264) 236 68.4 7.9e-12 XP_011523098 (OMIM: 601879) PREDICTED: galectin-9 ( 221) 220 64.2 1.2e-10 XP_006721958 (OMIM: 601879) PREDICTED: galectin-9 ( 234) 220 64.2 1.3e-10 NP_001317092 (OMIM: 601879) galectin-9 isoform 3 [ ( 246) 220 64.2 1.3e-10 XP_006721956 (OMIM: 601879) PREDICTED: galectin-9 ( 278) 220 64.2 1.5e-10 XP_011525275 (OMIM: 602518) PREDICTED: galectin-4 ( 310) 220 64.3 1.6e-10 XP_006721955 (OMIM: 601879) PREDICTED: galectin-9 ( 311) 220 64.3 1.6e-10 NP_002299 (OMIM: 601879) galectin-9 isoform short ( 323) 220 64.3 1.7e-10 NP_006140 (OMIM: 602518) galectin-4 [Homo sapiens] ( 323) 220 64.3 1.7e-10 XP_016880112 (OMIM: 601879) PREDICTED: galectin-9 ( 343) 220 64.3 1.7e-10 NP_033665 (OMIM: 601879) galectin-9 isoform long [ ( 355) 220 64.3 1.8e-10 NP_963837 (OMIM: 606099) galectin-8 isoform b [Hom ( 317) 205 60.4 2.4e-09 NP_963838 (OMIM: 606099) galectin-8 isoform b [Hom ( 317) 205 60.4 2.4e-09 NP_006490 (OMIM: 606099) galectin-8 isoform a [Hom ( 359) 205 60.4 2.7e-09 NP_963839 (OMIM: 606099) galectin-8 isoform a [Hom ( 359) 205 60.4 2.7e-09 XP_016856763 (OMIM: 606099) PREDICTED: galectin-8 ( 287) 187 55.7 5.6e-08 XP_011542490 (OMIM: 606099) PREDICTED: galectin-8 ( 329) 187 55.7 6.3e-08 XP_016856764 (OMIM: 606099) PREDICTED: galectin-8 ( 226) 180 53.9 1.6e-07 XP_011525276 (OMIM: 602518) PREDICTED: galectin-4 ( 192) 178 53.3 2e-07 NP_001136008 (OMIM: 606096) galectin-12 isoform 3 ( 327) 172 51.9 9.2e-07 NP_149092 (OMIM: 606096) galectin-12 isoform 2 [Ho ( 336) 172 51.9 9.4e-07 XP_016873947 (OMIM: 606096) PREDICTED: galectin-12 ( 252) 166 50.3 2.2e-06 NP_001136007 (OMIM: 606096) galectin-12 isoform 1 ( 337) 166 50.3 2.8e-06 NP_001136010 (OMIM: 606096) galectin-12 isoform 5 ( 266) 162 49.2 4.6e-06 NP_001136009 (OMIM: 606096) galectin-12 isoform 4 ( 275) 162 49.2 4.8e-06 >>NP_002296 (OMIM: 150570) galectin-1 [Homo sapiens] (135 aa) initn: 935 init1: 935 opt: 935 Z-score: 1315.0 bits: 249.0 E(85289): 1.7e-66 Smith-Waterman score: 935; 100.0% identity (100.0% similar) in 135 aa overlap (1-135:1-135) 10 20 30 40 50 60 pF1KB8 MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCLHFNPRFNAHGDANTIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCLHFNPRFNAHGDANTIV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 CNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNLEAINY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 CNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNLEAINY 70 80 90 100 110 120 130 pF1KB8 MAADGDFKIKCVAFD ::::::::::::::: NP_002 MAADGDFKIKCVAFD 130 >>NP_006489 (OMIM: 150571,608446) galectin-2 [Homo sapie (132 aa) initn: 236 init1: 236 opt: 391 Z-score: 554.6 bits: 108.3 E(85289): 3.7e-24 Smith-Waterman score: 391; 44.2% identity (73.6% similar) in 129 aa overlap (1-129:1-125) 10 20 30 40 50 60 pF1KB8 MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCLHFNPRFNAHGDANTIV :. : ..:...::: :.. : .: . .::.:::. ...: :::::::. .::: NP_006 MTGELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTDKLNLHFNPRFSE----STIV 10 20 30 40 50 70 80 90 100 110 120 pF1KB8 CNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNLEAINY ::: ::. :: :::: . :.::: .. .::.. .. ::::::.:. :::::. ..: NP_006 CNSLDGSNWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDGHELTFPNRLGHSHLSY 60 70 80 90 100 110 130 pF1KB8 MAADGDFKIKCVAFD ... : :.. NP_006 LSVRGGFNMSSFKLKE 120 130 >>NP_002298 (OMIM: 600615) galectin-7 [Homo sapiens] (136 aa) initn: 261 init1: 153 opt: 244 Z-score: 348.9 bits: 70.3 E(85289): 1.1e-12 Smith-Waterman score: 244; 35.5% identity (62.1% similar) in 124 aa overlap (12-132:14-133) 10 20 30 40 50 pF1KB8 MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNL--GKDS-NNLCLHFNPRFNAHGD ..:: ::.:: : :.:. : .:: :... .. ::::::. : NP_002 MSNVPHKSSLPEGIRPGTVLRIRGLVPPNASRFHVNLLCGEEQGSDAALHFNPRL----D 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 ANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNL .. .: :::. :.:: :.: ::: :. :: : .. .. . . :. .: .:: : NP_002 TSEVVFNSKEQGSWGREERGPGVPFQRGQPFEVLIIASDDGFKAVVGDAQYHHFRHRLPL 60 70 80 90 100 110 120 130 pF1KB8 EAINYMAADGDFKIKCVAFD . . . :: .. : NP_002 ARVRLVEVGGDVQLDSVRIF 120 130 >>NP_001035972 (OMIM: 617139) galectin-7 [Homo sapiens] (136 aa) initn: 261 init1: 153 opt: 244 Z-score: 348.9 bits: 70.3 E(85289): 1.1e-12 Smith-Waterman score: 244; 35.5% identity (62.1% similar) in 124 aa overlap (12-132:14-133) 10 20 30 40 50 pF1KB8 MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNL--GKDS-NNLCLHFNPRFNAHGD ..:: ::.:: : :.:. : .:: :... .. ::::::. : NP_001 MSNVPHKSSLPEGIRPGTVLRIRGLVPPNASRFHVNLLCGEEQGSDAALHFNPRL----D 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 ANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNL .. .: :::. :.:: :.: ::: :. :: : .. .. . . :. .: .:: : NP_001 TSEVVFNSKEQGSWGREERGPGVPFQRGQPFEVLIIASDDGFKAVVGDAQYHHFRHRLPL 60 70 80 90 100 110 120 130 pF1KB8 EAINYMAADGDFKIKCVAFD . . . :: .. : NP_001 ARVRLVEVGGDVQLDSVRIF 120 130 >>NP_002297 (OMIM: 153619) galectin-3 isoform 1 [Homo sa (250 aa) initn: 165 init1: 112 opt: 236 Z-score: 333.7 bits: 68.4 E(85289): 7.6e-12 Smith-Waterman score: 236; 29.6% identity (65.9% similar) in 135 aa overlap (5-134:115-247) 10 20 30 pF1KB8 MACGLVASNLNLKPGECLR----VRGEVAPDAKS .: :: : : : . : : :.:. NP_002 GPGAYPSSGQPSATGAYPATGPYGAPAGPLIVPYNLPLPGGVVPRMLITILGTVKPNANR 90 100 110 120 130 140 40 50 60 70 80 90 pF1KB8 FVLNLGKDSNNLCLHFNPRFNAHGDANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCI ..:.. . .:.. .::::::: ... .::::.: . :: :.:..::::. :. .. . NP_002 IALDFQR-GNDVAFHFNPRFN-ENNRRVIVCNTKLDNNWGREERQSVFPFESGKPFKIQV 150 160 170 180 190 200 100 110 120 130 pF1KB8 TFDQANLTVKLPDGYEFKFPNRLN-LEAINYMAADGDFKIKCVAFD . .. : . :.. ... .:.. :. :. .. .::. . ... NP_002 LVEPDHFKVAVNDAHLLQYNHRVKKLNEISKLGISGDIDLTSASYTMI 210 220 230 240 250 >>XP_011535061 (OMIM: 153619) PREDICTED: galectin-3 isof (264 aa) initn: 165 init1: 112 opt: 236 Z-score: 333.3 bits: 68.4 E(85289): 7.9e-12 Smith-Waterman score: 236; 29.6% identity (65.9% similar) in 135 aa overlap (5-134:129-261) 10 20 30 pF1KB8 MACGLVASNLNLKPGECLR----VRGEVAPDAKS .: :: : : : . : : :.:. XP_011 GPGAYPSSGQPSATGAYPATGPYGAPAGPLIVPYNLPLPGGVVPRMLITILGTVKPNANR 100 110 120 130 140 150 40 50 60 70 80 90 pF1KB8 FVLNLGKDSNNLCLHFNPRFNAHGDANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCI ..:.. . .:.. .::::::: ... .::::.: . :: :.:..::::. :. .. . XP_011 IALDFQR-GNDVAFHFNPRFN-ENNRRVIVCNTKLDNNWGREERQSVFPFESGKPFKIQV 160 170 180 190 200 210 100 110 120 130 pF1KB8 TFDQANLTVKLPDGYEFKFPNRLN-LEAINYMAADGDFKIKCVAFD . .. : . :.. ... .:.. :. :. .. .::. . ... XP_011 LVEPDHFKVAVNDAHLLQYNHRVKKLNEISKLGISGDIDLTSASYTMI 220 230 240 250 260 >>XP_011523098 (OMIM: 601879) PREDICTED: galectin-9 isof (221 aa) initn: 145 init1: 105 opt: 220 Z-score: 312.1 bits: 64.2 E(85289): 1.2e-10 Smith-Waterman score: 220; 27.6% identity (70.1% similar) in 127 aa overlap (12-134:25-147) 10 20 30 40 pF1KB8 MACGLVASNLNLKPGECLRVRGEVAPDAKS-FVLNL--GKDSNNLCL :. : . : : : .. . :..:. : ..:.. . XP_011 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSSSGTRFAVNFQTGFSGNDIAF 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB8 HFNPRFNAHGDANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDG ::::::. :.. .:::....:.:: :.:.. .::: : ..:. ..... : . .: XP_011 HFNPRFE---DGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKV-MVNG 70 80 90 100 110 110 120 130 pF1KB8 YEF-KFPNRLNLEAINYMAADGDFKIKCVAFD : .. .:. .. .. ....:. ... ..: XP_011 ILFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFSTVPFSQPVCFPPRPRG 120 130 140 150 160 170 XP_011 RRQKARTCLPPEATDVVQASGIPHHDDRGPVPHPLHPTTATVTFL 180 190 200 210 220 >>XP_006721958 (OMIM: 601879) PREDICTED: galectin-9 isof (234 aa) initn: 145 init1: 105 opt: 220 Z-score: 311.7 bits: 64.2 E(85289): 1.3e-10 Smith-Waterman score: 220; 27.6% identity (70.1% similar) in 127 aa overlap (12-134:25-147) 10 20 30 40 pF1KB8 MACGLVASNLNLKPGECLRVRGEVAPDAKS-FVLNL--GKDSNNLCL :. : . : : : .. . :..:. : ..:.. . XP_006 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSSSGTRFAVNFQTGFSGNDIAF 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB8 HFNPRFNAHGDANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDG ::::::. :.. .:::....:.:: :.:.. .::: : ..:. ..... : . .: XP_006 HFNPRFE---DGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKV-MVNG 70 80 90 100 110 110 120 130 pF1KB8 YEF-KFPNRLNLEAINYMAADGDFKIKCVAFD : .. .:. .. .. ....:. ... ..: XP_006 ILFVQYFHRVPFHRVDTISVNGSVQLSYISFQTQTVIHTVQSAPGQMFSTPAIPPMMYPH 120 130 140 150 160 170 XP_006 PAYPMPFITTILGGLYPSKSILLSGTVLPSAQRCGSCVKLTASRWPWMVSTCLNTTIA 180 190 200 210 220 230 >>NP_001317092 (OMIM: 601879) galectin-9 isoform 3 [Homo (246 aa) initn: 159 init1: 105 opt: 220 Z-score: 311.4 bits: 64.2 E(85289): 1.3e-10 Smith-Waterman score: 220; 27.6% identity (70.1% similar) in 127 aa overlap (12-134:25-147) 10 20 30 40 pF1KB8 MACGLVASNLNLKPGECLRVRGEVAPDAKS-FVLNL--GKDSNNLCL :. : . : : : .. . :..:. : ..:.. . NP_001 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSSSGTRFAVNFQTGFSGNDIAF 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB8 HFNPRFNAHGDANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDG ::::::. :.. .:::....:.:: :.:.. .::: : ..:. ..... : . .: NP_001 HFNPRFE---DGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKV-MVNG 70 80 90 100 110 110 120 130 pF1KB8 YEF-KFPNRLNLEAINYMAADGDFKIKCVAFD : .. .:. .. .. ....:. ... ..: NP_001 ILFVQYFHRVPFHRVDTISVNGSVQLSYISFQPPGVWPANPAPITQTVIHTVQSAPGQMF 120 130 140 150 160 170 NP_001 STPAIPPMMYPHPAYPMPFITTILGGLYPSKSILLSGTVLPSAQRCGSCVKLTASRWPWM 180 190 200 210 220 230 >>XP_006721956 (OMIM: 601879) PREDICTED: galectin-9 isof (278 aa) initn: 145 init1: 105 opt: 220 Z-score: 310.6 bits: 64.2 E(85289): 1.5e-10 Smith-Waterman score: 220; 27.6% identity (70.1% similar) in 127 aa overlap (12-134:25-147) 10 20 30 40 pF1KB8 MACGLVASNLNLKPGECLRVRGEVAPDAKS-FVLNL--GKDSNNLCL :. : . : : : .. . :..:. : ..:.. . XP_006 MAFSGSQAPYLSPAVPFSGTIQGGLQDGLQITVNGTVLSSSGTRFAVNFQTGFSGNDIAF 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB8 HFNPRFNAHGDANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDG ::::::. :.. .:::....:.:: :.:.. .::: : ..:. ..... : . .: XP_006 HFNPRFE---DGGYVVCNTRQNGSWGPEERKTHMPFQKGMPFDLCFLVQSSDFKV-MVNG 70 80 90 100 110 110 120 130 pF1KB8 YEF-KFPNRLNLEAINYMAADGDFKIKCVAFD : .. .:. .. .. ....:. ... ..: XP_006 ILFVQYFHRVPFHRVDTISVNGSVQLSYISFQNPRTVPVQPAFSTVPFSQPVCFPPRPRG 120 130 140 150 160 170 XP_006 RRQKPPGVWPANPAPITQTVIHTVQSAPGQMFSTPAIPPMMYPHPAYPMPFITTILGGLY 180 190 200 210 220 230 135 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 12:59:57 2016 done: Fri Nov 4 12:59:57 2016 Total Scan time: 4.560 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]