FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4220, 347 aa
1>>>pF1KB4220 347 - 347 aa - 347 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.4670+/-0.00028; mu= 11.9848+/- 0.018
mean_var=99.5127+/-19.535, 0's: 0 Z-trim(120.9): 21 B-trim: 0 in 0/52
Lambda= 0.128569
statistics sampled from 36845 (36866) to 36845 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.432), width: 16
Scan time: 8.840
The best scores are: opt bits E(85289)
NP_036410 (OMIM: 300332) integrin beta-1-binding p ( 347) 2464 466.7 3.3e-131
XP_016884894 (OMIM: 300332) PREDICTED: integrin be ( 246) 1684 322.0 8.8e-88
NP_001290206 (OMIM: 300332) integrin beta-1-bindin ( 224) 1579 302.5 5.9e-82
NP_001137545 (OMIM: 604353) cysteine and histidine ( 313) 851 167.5 3.5e-41
NP_036256 (OMIM: 604353) cysteine and histidine-ri ( 332) 757 150.1 6.5e-36
XP_016873030 (OMIM: 604353) PREDICTED: cysteine an ( 218) 625 125.5 1.1e-28
XP_016873029 (OMIM: 604353) PREDICTED: cysteine an ( 218) 625 125.5 1.1e-28
>>NP_036410 (OMIM: 300332) integrin beta-1-binding prote (347 aa)
initn: 2464 init1: 2464 opt: 2464 Z-score: 2476.6 bits: 466.7 E(85289): 3.3e-131
Smith-Waterman score: 2464; 100.0% identity (100.0% similar) in 347 aa overlap (1-347:1-347)
10 20 30 40 50 60
pF1KB4 MSLLCRNKGCGQHFDPNTNLPDSCCHHPGVPIFHDALKGWSCCRKRTVDFSEFLNIKGCT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 MSLLCRNKGCGQHFDPNTNLPDSCCHHPGVPIFHDALKGWSCCRKRTVDFSEFLNIKGCT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 MGPHCAEKLPEAPQPEGPATSSSLQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 MGPHCAEKLPEAPQPEGPATSSSLQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQ
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 ALEMALEQKELDQEPGAGLDSLIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 ALEMALEQKELDQEPGAGLDSLIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 MKSWSCCGIQTLDFGAFLAQPGCRVGRHDWGKQLPASCRHDWHQTDSLVVVTVYGQIPLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 MKSWSCCGIQTLDFGAFLAQPGCRVGRHDWGKQLPASCRHDWHQTDSLVVVTVYGQIPLP
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB4 AFNWVKASQTELHVHIVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 AFNWVKASQTELHVHIVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSW
250 260 270 280 290 300
310 320 330 340
pF1KB4 AQLEHPDALAKKARAGVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE
:::::::::::::::::::::::::::::::::::::::::::::::
NP_036 AQLEHPDALAKKARAGVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE
310 320 330 340
>>XP_016884894 (OMIM: 300332) PREDICTED: integrin beta-1 (246 aa)
initn: 1684 init1: 1684 opt: 1684 Z-score: 1697.0 bits: 322.0 E(85289): 8.8e-88
Smith-Waterman score: 1684; 99.6% identity (100.0% similar) in 242 aa overlap (106-347:5-246)
80 90 100 110 120 130
pF1KB4 EGPATSSSLQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQALEMALEQKELDQEP
.:::::::::::::::::::::::::::::
XP_016 MMHLRSELPLKLLPLNISQALEMALEQKELDQEP
10 20 30
140 150 160 170 180 190
pF1KB4 GAGLDSLIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GAGLDSLIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFG
40 50 60 70 80 90
200 210 220 230 240 250
pF1KB4 AFLAQPGCRVGRHDWGKQLPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHVH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AFLAQPGCRVGRHDWGKQLPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHVH
100 110 120 130 140 150
260 270 280 290 300 310
pF1KB4 IVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKARA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 IVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKARA
160 170 180 190 200 210
320 330 340
pF1KB4 GVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE
::::::::::::::::::::::::::::::::
XP_016 GVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE
220 230 240
>>NP_001290206 (OMIM: 300332) integrin beta-1-binding pr (224 aa)
initn: 1579 init1: 1579 opt: 1579 Z-score: 1592.3 bits: 302.5 E(85289): 5.9e-82
Smith-Waterman score: 1579; 100.0% identity (100.0% similar) in 224 aa overlap (124-347:1-224)
100 110 120 130 140 150
pF1KB4 PKSAETLRRERPKSELPLKLLPLNISQALEMALEQKELDQEPGAGLDSLIRTGSSCQNPG
::::::::::::::::::::::::::::::
NP_001 MALEQKELDQEPGAGLDSLIRTGSSCQNPG
10 20 30
160 170 180 190 200 210
pF1KB4 CDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFGAFLAQPGCRVGRHDWGKQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFGAFLAQPGCRVGRHDWGKQ
40 50 60 70 80 90
220 230 240 250 260 270
pF1KB4 LPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHVHIVFDGNRVFQAQMKLWGV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHVHIVFDGNRVFQAQMKLWGV
100 110 120 130 140 150
280 290 300 310 320 330
pF1KB4 INVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKARAGVVLEMDEEESDDSDDDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 INVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKARAGVVLEMDEEESDDSDDDL
160 170 180 190 200 210
340
pF1KB4 SWTEEEEEEEAMGE
::::::::::::::
NP_001 SWTEEEEEEEAMGE
220
>>NP_001137545 (OMIM: 604353) cysteine and histidine-ric (313 aa)
initn: 1071 init1: 320 opt: 851 Z-score: 860.4 bits: 167.5 E(85289): 3.5e-41
Smith-Waterman score: 896; 44.9% identity (68.1% similar) in 323 aa overlap (1-308:1-301)
10 20 30 40 50 60
pF1KB4 MSLLCRNKGCGQHFDPNTNLPDSCCHHPGVPIFHDALKGWSCCRKRTVDFSEFLNIKGCT
:.::: :.::::.:::.:: :.: .:::::.:::::: :::
NP_001 MALLCYNRGCGQRFDPETNSDDACTYHPGVPVFHDALK-------------------GCT
10 20 30 40
70 80 90 100 110
pF1KB4 MGPHCAEKLPEAPQPEGPATSSS-LQEQKPL---NVI--PKSAETLRRERPKSELPLKLL
: : .:: :: .:: .: .. : : :: ..: :: .:...: :. . :. :
NP_001 KGRHNSEKPPEPVKPEVKTTEKKELCELKPKFQEHIIQAPKPVEAIKR--PSPDEPMTNL
50 60 70 80 90
120 130 140 150 160 170
pF1KB4 PLNISQALEMALEQKEL---DQEPGAGLDS-LIRTGSSCQNPGCDAVYQGPESDATPCTY
:.:: .:..::.. .: ..: :. :. :.::.: ::. .::: :: :.:
NP_001 ELKISASLKQALDKLKLSSGNEENKKEEDNDEIKIGTSCKNGGCSKTYQGLESLEEVCVY
100 110 120 130 140 150
180 190 200 210 220
pF1KB4 HPGAPRFHEGMKSWSCCGIQTLDFGAFLAQPGCRVGRHDW-----GKQLPASCRHDWHQT
: :.: :::::: :::: .: ::..:::: :: :.: : ::.. . ::::::::
NP_001 HSGVPIFHEGMKYWSCCRRKTSDFNTFLAQEGCTKGKHMWTKKDAGKKV-VPCRHDWHQT
160 170 180 190 200 210
230 240 250 260 270 280
pF1KB4 DSLVVVTVYGQIPLPAFNWVKASQTELHVHIVFDGNRVFQAQMKLWGVINVEQSSVFLMP
. :...::.. :: .. :.:..: :.:::::.:.. :. ..::::::.:..: : .
NP_001 GGEVTISVYAKNSLPELSRVEANSTLLNVHIVFEGEKEFDQNVKLWGVIDVKRSYVTMTA
220 230 240 250 260 270
290 300 310 320 330 340
pF1KB4 SRVEISLVKADPGSWAQLEHPDALAKKARAGVVLEMDEEESDDSDDDLSWTEEEEEEEAM
...::.. ::.: .::.:: : :
NP_001 TKIEITMRKAEPMQWASLELPAAKKQEKQKDATTD
280 290 300 310
>>NP_036256 (OMIM: 604353) cysteine and histidine-rich d (332 aa)
initn: 1035 init1: 409 opt: 757 Z-score: 765.7 bits: 150.1 E(85289): 6.5e-36
Smith-Waterman score: 1055; 48.9% identity (73.7% similar) in 323 aa overlap (1-308:1-320)
10 20 30 40 50 60
pF1KB4 MSLLCRNKGCGQHFDPNTNLPDSCCHHPGVPIFHDALKGWSCCRKRTVDFSEFLNIKGCT
:.::: :.::::.:::.:: :.: .:::::.:::::::::::..::.:::.::.: :::
NP_036 MALLCYNRGCGQRFDPETNSDDACTYHPGVPVFHDALKGWSCCKRRTTDFSDFLSIVGCT
10 20 30 40 50 60
70 80 90 100 110
pF1KB4 MGPHCAEKLPEAPQPEGPATSSS-LQEQKPL---NVI--PKSAETLRRERPKSELPLKLL
: : .:: :: .:: .: .. : : :: ..: :: .:...: :. . :. :
NP_036 KGRHNSEKPPEPVKPEVKTTEKKELCELKPKFQEHIIQAPKPVEAIKR--PSPDEPMTNL
70 80 90 100 110
120 130 140 150 160 170
pF1KB4 PLNISQALEMALEQKEL---DQEPGAGLDS-LIRTGSSCQNPGCDAVYQGPESDATPCTY
:.:: .:..::.. .: ..: :. :. :.::.: ::. .::: :: :.:
NP_036 ELKISASLKQALDKLKLSSGNEENKKEEDNDEIKIGTSCKNGGCSKTYQGLESLEEVCVY
120 130 140 150 160 170
180 190 200 210 220
pF1KB4 HPGAPRFHEGMKSWSCCGIQTLDFGAFLAQPGCRVGRHDW-----GKQLPASCRHDWHQT
: :.: :::::: :::: .: ::..:::: :: :.: : ::.. . ::::::::
NP_036 HSGVPIFHEGMKYWSCCRRKTSDFNTFLAQEGCTKGKHMWTKKDAGKKV-VPCRHDWHQT
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB4 DSLVVVTVYGQIPLPAFNWVKASQTELHVHIVFDGNRVFQAQMKLWGVINVEQSSVFLMP
. :...::.. :: .. :.:..: :.:::::.:.. :. ..::::::.:..: : .
NP_036 GGEVTISVYAKNSLPELSRVEANSTLLNVHIVFEGEKEFDQNVKLWGVIDVKRSYVTMTA
240 250 260 270 280 290
290 300 310 320 330 340
pF1KB4 SRVEISLVKADPGSWAQLEHPDALAKKARAGVVLEMDEEESDDSDDDLSWTEEEEEEEAM
...::.. ::.: .::.:: : :
NP_036 TKIEITMRKAEPMQWASLELPAAKKQEKQKDATTD
300 310 320 330
>>XP_016873030 (OMIM: 604353) PREDICTED: cysteine and hi (218 aa)
initn: 855 init1: 320 opt: 625 Z-score: 636.1 bits: 125.5 E(85289): 1.1e-28
Smith-Waterman score: 625; 46.6% identity (73.0% similar) in 204 aa overlap (114-308:4-206)
90 100 110 120 130 140
pF1KB4 LQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQALEMALEQKEL---DQEPGAGLD
: :.:: .:..::.. .: ..: :
XP_016 MTNLELKISASLKQALDKLKLSSGNEENKKEED
10 20 30
150 160 170 180 190
pF1KB4 S-LIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFGAFLA
. :. :.::.: ::. .::: :: :.:: :.: :::::: :::: .: ::..:::
XP_016 NDEIKIGTSCKNGGCSKTYQGLESLEEVCVYHSGVPIFHEGMKYWSCCRRKTSDFNTFLA
40 50 60 70 80 90
200 210 220 230 240 250
pF1KB4 QPGCRVGRHDW-----GKQLPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHV
: :: :.: : ::.. . :::::::: . :...::.. :: .. :.:..: :.:
XP_016 QEGCTKGKHMWTKKDAGKKV-VPCRHDWHQTGGEVTISVYAKNSLPELSRVEANSTLLNV
100 110 120 130 140 150
260 270 280 290 300 310
pF1KB4 HIVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKAR
::::.:.. :. ..::::::.:..: : . ...::.. ::.: .::.:: : :
XP_016 HIVFEGEKEFDQNVKLWGVIDVKRSYVTMTATKIEITMRKAEPMQWASLELPAAKKQEKQ
160 170 180 190 200 210
320 330 340
pF1KB4 AGVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE
XP_016 KDATTD
>>XP_016873029 (OMIM: 604353) PREDICTED: cysteine and hi (218 aa)
initn: 855 init1: 320 opt: 625 Z-score: 636.1 bits: 125.5 E(85289): 1.1e-28
Smith-Waterman score: 625; 46.6% identity (73.0% similar) in 204 aa overlap (114-308:4-206)
90 100 110 120 130 140
pF1KB4 LQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQALEMALEQKEL---DQEPGAGLD
: :.:: .:..::.. .: ..: :
XP_016 MTNLELKISASLKQALDKLKLSSGNEENKKEED
10 20 30
150 160 170 180 190
pF1KB4 S-LIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFGAFLA
. :. :.::.: ::. .::: :: :.:: :.: :::::: :::: .: ::..:::
XP_016 NDEIKIGTSCKNGGCSKTYQGLESLEEVCVYHSGVPIFHEGMKYWSCCRRKTSDFNTFLA
40 50 60 70 80 90
200 210 220 230 240 250
pF1KB4 QPGCRVGRHDW-----GKQLPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHV
: :: :.: : ::.. . :::::::: . :...::.. :: .. :.:..: :.:
XP_016 QEGCTKGKHMWTKKDAGKKV-VPCRHDWHQTGGEVTISVYAKNSLPELSRVEANSTLLNV
100 110 120 130 140 150
260 270 280 290 300 310
pF1KB4 HIVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKAR
::::.:.. :. ..::::::.:..: : . ...::.. ::.: .::.:: : :
XP_016 HIVFEGEKEFDQNVKLWGVIDVKRSYVTMTATKIEITMRKAEPMQWASLELPAAKKQEKQ
160 170 180 190 200 210
320 330 340
pF1KB4 AGVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE
XP_016 KDATTD
347 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 05:42:54 2016 done: Sat Nov 5 05:42:55 2016
Total Scan time: 8.840 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]