FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9714, 382 aa
1>>>pF1KB9714 382 - 382 aa - 382 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9322+/-0.000424; mu= 17.7423+/- 0.027
mean_var=237.0520+/-49.332, 0's: 0 Z-trim(119.1): 398 B-trim: 0 in 0/54
Lambda= 0.083301
statistics sampled from 32117 (32650) to 32117 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.738), E-opt: 0.2 (0.383), width: 16
Scan time: 9.310
The best scores are: opt bits E(85289)
NP_001167540 (OMIM: 600298) LIM homeobox transcrip ( 382) 2619 327.7 2.7e-89
NP_796372 (OMIM: 600298) LIM homeobox transcriptio ( 382) 2619 327.7 2.7e-89
XP_011507840 (OMIM: 600298) PREDICTED: LIM homeobo ( 302) 2007 254.0 3.3e-67
XP_011507842 (OMIM: 600298) PREDICTED: LIM homeobo ( 279) 1858 236.1 7.9e-62
NP_001167618 (OMIM: 161200,602575) LIM homeobox tr ( 402) 1795 228.8 1.8e-59
NP_002307 (OMIM: 161200,602575) LIM homeobox trans ( 395) 1722 220.0 7.8e-57
NP_001167617 (OMIM: 161200,602575) LIM homeobox tr ( 406) 1587 203.8 6e-52
NP_665804 (OMIM: 609481) insulin gene enhancer pro ( 359) 612 86.5 1.1e-16
XP_016877994 (OMIM: 609481) PREDICTED: insulin gen ( 534) 612 86.8 1.3e-16
NP_002193 (OMIM: 600366) insulin gene enhancer pro ( 349) 592 84.1 5.6e-16
NP_835258 (OMIM: 221750,600577) LIM/homeobox prote ( 397) 485 71.3 4.4e-12
XP_005263467 (OMIM: 221750,600577) PREDICTED: LIM/ ( 386) 483 71.1 5.1e-12
XP_016870657 (OMIM: 221750,600577) PREDICTED: LIM/ ( 373) 482 70.9 5.5e-12
NP_055379 (OMIM: 221750,600577) LIM/homeobox prote ( 402) 482 71.0 5.7e-12
NP_071758 (OMIM: 605992) LIM/homeobox protein Lhx5 ( 402) 469 69.4 1.7e-11
NP_203129 (OMIM: 262700,602146) LIM/homeobox prote ( 390) 466 69.0 2.1e-11
NP_005559 (OMIM: 601999) LIM/homeobox protein Lhx1 ( 406) 456 67.8 5e-11
XP_005245407 (OMIM: 606066) PREDICTED: LIM/homeobo ( 336) 405 61.6 3.2e-09
NP_001243043 (OMIM: 604425) LIM/homeobox protein L ( 346) 402 61.2 4.1e-09
NP_001001933 (OMIM: 604425) LIM/homeobox protein L ( 356) 402 61.3 4.2e-09
XP_016856805 (OMIM: 604425) PREDICTED: LIM/homeobo ( 363) 402 61.3 4.2e-09
XP_016856806 (OMIM: 604425) PREDICTED: LIM/homeobo ( 363) 402 61.3 4.2e-09
XP_006717386 (OMIM: 603759) PREDICTED: LIM/homeobo ( 314) 398 60.7 5.5e-09
XP_011541682 (OMIM: 600366) PREDICTED: insulin gen ( 285) 396 60.4 6.2e-09
XP_005251973 (OMIM: 608215) PREDICTED: LIM/homeobo ( 230) 392 59.7 7.8e-09
NP_001229263 (OMIM: 608215) LIM/homeobox protein L ( 363) 392 60.1 9.7e-09
NP_001229262 (OMIM: 608215) LIM/homeobox protein L ( 366) 392 60.1 9.8e-09
NP_954629 (OMIM: 608215) LIM/homeobox protein Lhx6 ( 377) 392 60.1 9.9e-09
XP_011516824 (OMIM: 608215) PREDICTED: LIM/homeobo ( 378) 392 60.1 9.9e-09
NP_055183 (OMIM: 608215) LIM/homeobox protein Lhx6 ( 392) 392 60.1 1e-08
XP_011516823 (OMIM: 608215) PREDICTED: LIM/homeobo ( 407) 392 60.2 1e-08
NP_001001395 (OMIM: 180386) LIM domain only protei ( 145) 378 57.7 2e-08
XP_011519065 (OMIM: 180386) PREDICTED: LIM domain ( 145) 378 57.7 2e-08
XP_006719173 (OMIM: 180386) PREDICTED: LIM domain ( 145) 378 57.7 2e-08
XP_011519064 (OMIM: 180386) PREDICTED: LIM domain ( 145) 378 57.7 2e-08
XP_006719174 (OMIM: 180386) PREDICTED: LIM domain ( 145) 378 57.7 2e-08
NP_001230538 (OMIM: 180386) LIM domain only protei ( 145) 378 57.7 2e-08
NP_061110 (OMIM: 180386) LIM domain only protein 3 ( 145) 378 57.7 2e-08
NP_001230539 (OMIM: 180386) LIM domain only protei ( 145) 378 57.7 2e-08
NP_001230540 (OMIM: 180386) LIM domain only protei ( 156) 378 57.8 2.1e-08
NP_001230541 (OMIM: 180386) LIM domain only protei ( 163) 378 57.8 2.1e-08
XP_011518400 (OMIM: 186921) PREDICTED: rhombotin-1 ( 145) 370 56.8 3.9e-08
XP_011518401 (OMIM: 186921) PREDICTED: rhombotin-1 ( 145) 370 56.8 3.9e-08
NP_001257357 (OMIM: 186921) rhombotin-1 isoform b ( 155) 370 56.8 4e-08
NP_002306 (OMIM: 186921) rhombotin-1 isoform a [Ho ( 156) 370 56.8 4e-08
XP_006718291 (OMIM: 186921) PREDICTED: rhombotin-1 ( 193) 370 57.0 4.5e-08
XP_005271348 (OMIM: 603129) PREDICTED: LIM domain ( 165) 363 56.0 7.4e-08
NP_006760 (OMIM: 603129) LIM domain transcription ( 165) 363 56.0 7.4e-08
XP_011508407 (OMIM: 262700,602146) PREDICTED: LIM/ ( 329) 319 51.2 4.1e-06
XP_011508408 (OMIM: 262700,602146) PREDICTED: LIM/ ( 329) 319 51.2 4.1e-06
>>NP_001167540 (OMIM: 600298) LIM homeobox transcription (382 aa)
initn: 2619 init1: 2619 opt: 2619 Z-score: 1724.1 bits: 327.7 E(85289): 2.7e-89
Smith-Waterman score: 2619; 100.0% identity (100.0% similar) in 382 aa overlap (1-382:1-382)
10 20 30 40 50 60
pF1KB9 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLNDSFWHEQCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLNDSFWHEQCV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 QCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSCF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 QCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSCF
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 CCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKSAHG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKSAHG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 AGKGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AGKGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQ
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 VWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTALPTPQQLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTALPTPQQLL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 AIEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCFLATSEAGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AIEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCFLATSEAGP
310 320 330 340 350 360
370 380
pF1KB9 LQSRVGNPIDHLYSMQNSYFTS
::::::::::::::::::::::
NP_001 LQSRVGNPIDHLYSMQNSYFTS
370 380
>>NP_796372 (OMIM: 600298) LIM homeobox transcription fa (382 aa)
initn: 2619 init1: 2619 opt: 2619 Z-score: 1724.1 bits: 327.7 E(85289): 2.7e-89
Smith-Waterman score: 2619; 100.0% identity (100.0% similar) in 382 aa overlap (1-382:1-382)
10 20 30 40 50 60
pF1KB9 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLNDSFWHEQCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_796 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLNDSFWHEQCV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 QCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSCF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_796 QCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSCF
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 CCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKSAHG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_796 CCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKSAHG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 AGKGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_796 AGKGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQ
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 VWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTALPTPQQLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_796 VWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTALPTPQQLL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB9 AIEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCFLATSEAGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_796 AIEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCFLATSEAGP
310 320 330 340 350 360
370 380
pF1KB9 LQSRVGNPIDHLYSMQNSYFTS
::::::::::::::::::::::
NP_796 LQSRVGNPIDHLYSMQNSYFTS
370 380
>>XP_011507840 (OMIM: 600298) PREDICTED: LIM homeobox tr (302 aa)
initn: 2007 init1: 2007 opt: 2007 Z-score: 1327.5 bits: 254.0 E(85289): 3.3e-67
Smith-Waterman score: 2007; 99.7% identity (100.0% similar) in 295 aa overlap (88-382:8-302)
60 70 80 90 100 110
pF1KB9 QCVQCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHL
.:::::::::::::::::::::::::::::
XP_011 MMFVLSNRLFAVKCGGCFEAIAPNEFVMRAQKSVYHL
10 20 30
120 130 140 150 160 170
pF1KB9 SCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 SCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKS
40 50 60 70 80 90
180 190 200 210 220 230
pF1KB9 AHGAGKGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AHGAGKGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVR
100 110 120 130 140 150
240 250 260 270 280 290
pF1KB9 VVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTALPTPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 VVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTALPTPQ
160 170 180 190 200 210
300 310 320 330 340 350
pF1KB9 QLLAIEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCFLATSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 QLLAIEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCFLATSE
220 230 240 250 260 270
360 370 380
pF1KB9 AGPLQSRVGNPIDHLYSMQNSYFTS
:::::::::::::::::::::::::
XP_011 AGPLQSRVGNPIDHLYSMQNSYFTS
280 290 300
>>XP_011507842 (OMIM: 600298) PREDICTED: LIM homeobox tr (279 aa)
initn: 1858 init1: 1858 opt: 1858 Z-score: 1231.0 bits: 236.1 E(85289): 7.9e-62
Smith-Waterman score: 1858; 100.0% identity (100.0% similar) in 272 aa overlap (1-272:1-272)
10 20 30 40 50 60
pF1KB9 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLNDSFWHEQCV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLNDSFWHEQCV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB9 QCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSCF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 QCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSCF
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB9 CCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKSAHG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 CCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKSAHG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB9 AGKGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AGKGTAEEGKDHKRPKRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQ
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB9 VWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTALPTPQQLL
::::::::::::::::::::::::::::::::
XP_011 VWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSDWCSHTC
250 260 270
310 320 330 340 350 360
pF1KB9 AIEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCFLATSEAGP
>>NP_001167618 (OMIM: 161200,602575) LIM homeobox transc (402 aa)
initn: 1835 init1: 1038 opt: 1795 Z-score: 1188.7 bits: 228.8 E(85289): 1.8e-59
Smith-Waterman score: 1795; 68.7% identity (85.5% similar) in 387 aa overlap (1-382:24-402)
10 20 30
pF1KB9 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEG
::::.:::: .. :... ::: .::::
NP_001 MDIATGPESLERCFPRGQTDCAKMLDGIKMEE--HALRPGPATLGVLLGSDCPHPAVCEG
10 20 30 40 50
40 50 60 70 80 90
pF1KB9 CQRVILDRFLLRLNDSFWHEQCVQCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGC
::: : ::::.:.:.: :::.:.:::.:.. : :.:..::.::::: ::..:::.::.::
NP_001 CQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGC
60 70 80 90 100 110
100 110 120 130 140 150
pF1KB9 FEAIAPNEFVMRAQKSVYHLSCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSL
.: :::.:::::: . ::::.:::::::::::.:::::::::::::::::::::..:::
NP_001 MEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSS
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB9 VSPAASDSGKSDDEESLCKSAHGAG---KGTAEEGKDHKRPKRPRTILTTQQRRAFKASF
::: ::: ::.::.. : :.: : ::....::: .:::::::::::::::::::::
NP_001 VSPDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASF
180 190 200 210 220 230
220 230 240 250 260 270
pF1KB9 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQ
::::::::::::::::::::::::::::::::::::::::::.::::. :::.:::..
NP_001 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQE-QQNSQRLGQEV
240 250 260 270 280 290
280 290 300 310 320 330
pF1KB9 TNGGGSAGMEGIMNPYTALPTPQQ-LLAIEQSVY-SSDPFRQGLTPPQMPGDHMHPYGAE
:. :::.: :: : ::: ..:.::: : :::::.:::::::::::::.::: .
NP_001 L----SSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGDHMNPYGND
300 310 320 330 340 350
340 350 360 370 380
pF1KB9 PLFHDLDSDDTSLSNLGDCFLATSEAGPLQSRVGNPIDHLYSMQNSYFTS
.:::.::: :::..:.::::..:..: ::.:::::::.:::::.:::.:
NP_001 SIFHDIDSD-TSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
360 370 380 390 400
>>NP_002307 (OMIM: 161200,602575) LIM homeobox transcrip (395 aa)
initn: 1748 init1: 935 opt: 1722 Z-score: 1141.4 bits: 220.0 E(85289): 7.8e-57
Smith-Waterman score: 1722; 67.2% identity (84.0% similar) in 387 aa overlap (1-382:24-395)
10 20 30
pF1KB9 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEG
::::.:::: .. :... ::: .::::
NP_002 MDIATGPESLERCFPRGQTDCAKMLDGIKMEE--HALRPGPATLGVLLGSDCPHPAVCEG
10 20 30 40 50
40 50 60 70 80 90
pF1KB9 CQRVILDRFLLRLNDSFWHEQCVQCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGC
::: : ::::.:.:.: :::.:.:::.:.. : :.:..::.::::: ::..:::.::.::
NP_002 CQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGC
60 70 80 90 100 110
100 110 120 130 140 150
pF1KB9 FEAIAPNEFVMRAQKSVYHLSCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSL
.: :::.:::::: . ::::.:::::::::::.:::::::::::::::::::::..:::
NP_002 MEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSS
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB9 VSPAASDSGKSDDEESLCKSAHGAG---KGTAEEGKDHKRPKRPRTILTTQQRRAFKASF
::: ::: ::.::.. : :.: : ::....::: .:::::::::::::::::::::
NP_002 VSPDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASF
180 190 200 210 220 230
220 230 240 250 260 270
pF1KB9 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQ
::::::::::::::::::::::::::::::::::::::::::.::::. :::.:::..
NP_002 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQE-QQNSQRLGQEV
240 250 260 270 280 290
280 290 300 310 320 330
pF1KB9 TNGGGSAGMEGIMNPYTALPTPQQ-LLAIEQSVY-SSDPFRQGLTPPQMPGDHMHPYGAE
:. :::.: :: : ::: ..:.::: : :::::.::::::::::. .
NP_002 L----SSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGN-------D
300 310 320 330 340
340 350 360 370 380
pF1KB9 PLFHDLDSDDTSLSNLGDCFLATSEAGPLQSRVGNPIDHLYSMQNSYFTS
.:::.::: :::..:.::::..:..: ::.:::::::.:::::.:::.:
NP_002 SIFHDIDSD-TSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
350 360 370 380 390
>>NP_001167617 (OMIM: 161200,602575) LIM homeobox transc (406 aa)
initn: 1766 init1: 857 opt: 1587 Z-score: 1053.6 bits: 203.8 E(85289): 6e-52
Smith-Waterman score: 1729; 66.2% identity (83.5% similar) in 394 aa overlap (1-382:24-406)
10 20 30
pF1KB9 MLDGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEG
::::.:::: .. :... ::: .::::
NP_001 MDIATGPESLERCFPRGQTDCAKMLDGIKMEE--HALRPGPATLGVLLGSDCPHPAVCEG
10 20 30 40 50
40 50 60 70 80 90
pF1KB9 CQRVILDRFLLRLNDSFWHEQCVQCASCKEPLETTCFYRDKKLYCKYDYEKLFAVKCGGC
::: : ::::.:.:.: :::.:.:::.:.. : :.:..::.::::: ::..:::.::.::
NP_001 CQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQLFAAKCSGC
60 70 80 90 100 110
100 110 120 130 140 150
pF1KB9 FEAIAPNEFVMRAQKSVYHLSCFCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSL
.: :::.:::::: . ::::.:::::::::::.:::::::::::::::::::::..:::
NP_001 MEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCKGDYEKEKDLLSS
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB9 VSPAASDSGKSDDEESLCKSAHGAG---KGTAEEGKDHKRPKRPRTILTTQQRRAFKASF
::: ::: ::.::.. : :.: : ::....::: .:::::::::::::::::::::
NP_001 VSPDESDSVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRPKRPRTILTTQQRRAFKASF
180 190 200 210 220 230
220 230 240 250 260 270
pF1KB9 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRQQQQQQDQQNTQRLSSAQ
::::::::::::::::::::::::::::::::::::::::::.::::. :::.:::....
NP_001 EVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLARRHQQQQE-QQNSQRLGQGE
240 250 260 270 280 290
280 290 300 310 320
pF1KB9 TNGGGSAG-------MEGIMNPYTALPTPQQ-LLAIEQSVY-SSDPFRQGLTPPQMPGDH
. : . : :::.: :: : ::: ..:.::: : :::::.::::::::::.
NP_001 PGPGQGLGQEVLSSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTPPQMPGN-
300 310 320 330 340 350
330 340 350 360 370 380
pF1KB9 MHPYGAEPLFHDLDSDDTSLSNLGDCFLATSEAGPLQSRVGNPIDHLYSMQNSYFTS
. .:::.::: :::..:.::::..:..: ::.:::::::.:::::.:::.:
NP_001 ------DSIFHDIDSD-TSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSYFAS
360 370 380 390 400
>>NP_665804 (OMIM: 609481) insulin gene enhancer protein (359 aa)
initn: 500 init1: 322 opt: 612 Z-score: 420.8 bits: 86.5 E(85289): 1.1e-16
Smith-Waterman score: 612; 38.7% identity (64.5% similar) in 256 aa overlap (33-278:25-278)
10 20 30 40 50 60
pF1KB9 DGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLN-DSFWHEQCVQ
..: :: : :.:.::.. : :: :..
NP_665 MVDIIFHYPFLGAMGDHSKKKPGTAMCVGCGSQIHDQFILRVSPDLEWHAACLK
10 20 30 40 50
70 80 90 100 110
pF1KB9 CASCKEPLETTC--FYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSC
:: :.. :. :: : :: : ::: :: .::..::. : ... ...::::. ::::. :
NP_665 CAECSQYLDETCTCFVRDGKTYCKRDYVRLFGIKCAKCQVGFSSSDLVMRARDSVYHIEC
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB9 FCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKSAH
: : :: ::: :::: :.: .:::..:. : . :: . : ..:
NP_665 FRCSVCSRQLLPGDEFSLREHELLCRADHGLLLERAAAGSPRS--PGPLPGARGLHLPDA
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB9 GAGKGTAEEGKDHKRPK---RPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSV
:.:. : . . ::. . : ::.:. .: ..... . .. .: ..: :. ::::
NP_665 GSGRQPALRPHVHKQTEKTTRVRTVLNEKQLHTLRTCYAANPRPDALMKEQLVEMTGLSP
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB9 RVVQVWFQNQRAKMKK----LARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIMNPYTA
::..:::::.: : :: . . ::::..:. . : :... .:
NP_665 RVIRVWFQNKRCKDKKKSILMKQLQQQQHSDKTSLQGLTGTPLVAGSPIRHENAVQGSAV
240 250 260 270 280 290
300 310 320 330 340 350
pF1KB9 LPTPQQLLAIEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTSLSNLGDCF
NP_665 EVQTYQPPWKALSEFALQSDLDQPAFQQLVSFSESGSLGNSSGSDVTSLSSQLPDTPNSM
300 310 320 330 340 350
>>XP_016877994 (OMIM: 609481) PREDICTED: insulin gene en (534 aa)
initn: 442 init1: 322 opt: 612 Z-score: 419.3 bits: 86.8 E(85289): 1.3e-16
Smith-Waterman score: 618; 35.9% identity (58.5% similar) in 323 aa overlap (33-322:25-345)
10 20 30 40 50 60
pF1KB9 DGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLN-DSFWHEQCVQ
..: :: : :.:.::.. : :: :..
XP_016 MVDIIFHYPFLGAMGDHSKKKPGTAMCVGCGSQIHDQFILRVSPDLEWHAACLK
10 20 30 40 50
70 80 90 100 110
pF1KB9 CASCKEPLETTC--FYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSC
:: :.. :. :: : :: : ::: :: .::..::. : ... ...::::. ::::. :
XP_016 CAECSQYLDETCTCFVRDGKTYCKRDYVRLFGIKCAKCQVGFSSSDLVMRARDSVYHIEC
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB9 FCCCVCERQLQKGDEFVLKEGQLLCKGDYEKERELLSLVSPAASDSGKSDDEESLCKSAH
: : :: ::: :::: :.: .:::..:. : . :: . : ..:
XP_016 FRCSVCSRQLLPGDEFSLREHELLCRADHGLLLERAAAGSPRS--PGPLPGARGLHLPDA
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB9 GAGKGTAEEGKDHKRPK---RPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSV
:.:. : . . ::. . : ::.:. .: ..... . .. .: ..: :. ::::
XP_016 GSGRQPALRPHVHKQTEKTTRVRTVLNEKQLHTLRTCYAANPRPDALMKEQLVEMTGLSP
180 190 200 210 220 230
240 250 260 270 280
pF1KB9 RVVQVWFQNQRAKMKK----LARRQQQQQQDQQNTQRLSSAQTNGG----------GSA-
::..:::::.: : :: . . ::::..:. . : :... .: :::
XP_016 RVIRVWFQNKRCKDKKKSILMKQLQQQQHSDKTSLQGLTGTPLVAGSPIRHENAVQGSAV
240 250 260 270 280 290
290 300 310 320
pF1KB9 GMEGIMNPYTAL-----------PTPQQLLAIEQSVYSS-DPFRQGLTPPQMPGDHMHPY
.. . :. :: :. ::: . . : : :: : : :
XP_016 EVQTYQPPWKALSEFALQSDLDQPAFQQLGLLSAGGRSPRDTFRLGARGPGAPARVPLHL
300 310 320 330 340 350
330 340 350 360 370 380
pF1KB9 GAEPLFHDLDSDDTSLSNLGDCFLATSEAGPLQSRVGNPIDHLYSMQNSYFTS
XP_016 PLPGVSAAPGLLLRVRLPRQLLRQRRDLPVLAAPGHPQQYGAESRGDVRGTPPCQPADLA
360 370 380 390 400 410
>>NP_002193 (OMIM: 600366) insulin gene enhancer protein (349 aa)
initn: 543 init1: 317 opt: 592 Z-score: 407.9 bits: 84.1 E(85289): 5.6e-16
Smith-Waterman score: 595; 35.0% identity (62.7% similar) in 300 aa overlap (33-314:15-310)
10 20 30 40 50 60
pF1KB9 DGLKMEENFQSAIDTSASFSSLLGRAVSPKSVCEGCQRVILDRFLLRLN-DSFWHEQCVQ
:.: :: : :...::.. : :: :..
NP_002 MGDMGDPPKKKRLISLCVGCGNQIHDQYILRVSPDLEWHAACLK
10 20 30 40
70 80 90 100 110
pF1KB9 CASCKEPLET--TCFYRDKKLYCKYDYEKLFAVKCGGCFEAIAPNEFVMRAQKSVYHLSC
:: :.. :. ::: :: : ::: :: .:...::. : ... :.:::::...:::. :
NP_002 CAECNQYLDESCTCFVRDGKTYCKRDYIRLYGIKCAKCSIGFSKNDFVMRARSKVYHIEC
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB9 FCCCVCERQLQKGDEFVLKEGQLLCKGDYEK-ERELLSLVSPAASDSGKSDDEESLCKSA
: : .: ::: ::::.:.: :.:..:.. :: :. .: . . : .:
NP_002 FRCVACSRQLIPGDEFALREDGLFCRADHDVVERASLGAGDPLSP----LHPARPLQMAA
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB9 HG-AGKGTAEEGKDHKRPK---RPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGL
. ... : . . ::.:. : ::.:. .: ..... . .. .: ..: :. :::
NP_002 EPISARQPALRPHVHKQPEKTTRVRTVLNEKQLHTLRTCYAANPRPDALMKEQLVEMTGL
170 180 190 200 210 220
240 250 260 270 280
pF1KB9 SVRVVQVWFQNQRAKMKK----LARRQQQQQQDQQNTQRLSSAQTNGGGSAGMEGIM--N
: ::..:::::.: : :: . . :::: .:. : : .... ... .: . :
NP_002 SPRVIRVWFQNKRCKDKKRSIMMKQLQQQQPNDKTNIQGMTGTPMVAASPERHDGGLQAN
230 240 250 260 270 280
290 300 310 320 330 340
pF1KB9 PYT--ALPTPQQLLA--IEQSVYSSDPFRQGLTPPQMPGDHMHPYGAEPLFHDLDSDDTS
: . : ..:. :: .. :.:
NP_002 PVEVQSYQPPWKVLSDFALQSDIDQPAFQQLVNFSEGGPGSNSTGSEVASMSSQLPDTPN
290 300 310 320 330 340
382 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 18:29:02 2016 done: Fri Nov 4 18:29:03 2016
Total Scan time: 9.310 Total Display time: 0.060
Function used was FASTA [36.3.4 Apr, 2011]