FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB4220, 347 aa
  1>>>pF1KB4220 347 - 347 aa - 347 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.4670+/-0.00028; mu= 11.9848+/- 0.018
 mean_var=99.5127+/-19.535, 0's: 0 Z-trim(120.9): 21 B-trim: 0 in 0/52
 Lambda= 0.128569
 statistics sampled from 36845 (36866) to 36845 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.778), E-opt: 0.2 (0.432), width: 16
 Scan time: 8.840

The best scores are:                                       opt bits E(85289)
NP_036410 (OMIM: 300332) integrin beta-1-binding p ( 347) 2464 466.7 3.3e-131
XP_016884894 (OMIM: 300332) PREDICTED: integrin be ( 246) 1684 322.0 8.8e-88
NP_001290206 (OMIM: 300332) integrin beta-1-bindin ( 224) 1579 302.5 5.9e-82
NP_001137545 (OMIM: 604353) cysteine and histidine ( 313)  851 167.5 3.5e-41
NP_036256 (OMIM: 604353) cysteine and histidine-ri ( 332)  757 150.1 6.5e-36
XP_016873030 (OMIM: 604353) PREDICTED: cysteine an ( 218)  625 125.5 1.1e-28
XP_016873029 (OMIM: 604353) PREDICTED: cysteine an ( 218)  625 125.5 1.1e-28

>>NP_036410 (OMIM: 300332) integrin beta-1-binding prote (347 aa)
 initn: 2464 init1: 2464 opt: 2464 Z-score: 2476.6 bits: 466.7 E(85289): 3.3e-131
Smith-Waterman score: 2464; 100.0% identity (100.0% similar) in 347 aa overlap (1-347:1-347)

               10        20        30        40        50        60
pF1KB4 MSLLCRNKGCGQHFDPNTNLPDSCCHHPGVPIFHDALKGWSCCRKRTVDFSEFLNIKGCT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 MSLLCRNKGCGQHFDPNTNLPDSCCHHPGVPIFHDALKGWSCCRKRTVDFSEFLNIKGCT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB4 MGPHCAEKLPEAPQPEGPATSSSLQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 MGPHCAEKLPEAPQPEGPATSSSLQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB4 ALEMALEQKELDQEPGAGLDSLIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 ALEMALEQKELDQEPGAGLDSLIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB4 MKSWSCCGIQTLDFGAFLAQPGCRVGRHDWGKQLPASCRHDWHQTDSLVVVTVYGQIPLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 MKSWSCCGIQTLDFGAFLAQPGCRVGRHDWGKQLPASCRHDWHQTDSLVVVTVYGQIPLP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB4 AFNWVKASQTELHVHIVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_036 AFNWVKASQTELHVHIVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSW
              250       260       270       280       290       300

              310       320       330       340
pF1KB4 AQLEHPDALAKKARAGVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE
       :::::::::::::::::::::::::::::::::::::::::::::::
NP_036 AQLEHPDALAKKARAGVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE
              310       320       330       340

>>XP_016884894 (OMIM: 300332) PREDICTED: integrin beta-1 (246 aa)
 initn: 1684 init1: 1684 opt: 1684 Z-score: 1697.0 bits: 322.0 E(85289): 8.8e-88
Smith-Waterman score: 1684; 99.6% identity (100.0% similar) in 242 aa overlap (106-347:5-246)

          80        90       100       110       120       130
pF1KB4 EGPATSSSLQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQALEMALEQKELDQEP
                                     .:::::::::::::::::::::::::::::
XP_016                           MMHLRSELPLKLLPLNISQALEMALEQKELDQEP
                                         10        20        30

         140       150       160       170       180       190
pF1KB4 GAGLDSLIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GAGLDSLIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFG
           40        50        60        70        80        90

         200       210       220       230       240       250
pF1KB4 AFLAQPGCRVGRHDWGKQLPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHVH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AFLAQPGCRVGRHDWGKQLPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHVH
          100       110       120       130       140       150

         260       270       280       290       300       310
pF1KB4 IVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKARA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 
IVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKARA 160 170 180 190 200 210 320 330 340 pF1KB4 GVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE :::::::::::::::::::::::::::::::: XP_016 GVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE 220 230 240 >>NP_001290206 (OMIM: 300332) integrin beta-1-binding pr (224 aa) initn: 1579 init1: 1579 opt: 1579 Z-score: 1592.3 bits: 302.5 E(85289): 5.9e-82 Smith-Waterman score: 1579; 100.0% identity (100.0% similar) in 224 aa overlap (124-347:1-224) 100 110 120 130 140 150 pF1KB4 PKSAETLRRERPKSELPLKLLPLNISQALEMALEQKELDQEPGAGLDSLIRTGSSCQNPG :::::::::::::::::::::::::::::: NP_001 MALEQKELDQEPGAGLDSLIRTGSSCQNPG 10 20 30 160 170 180 190 200 210 pF1KB4 CDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFGAFLAQPGCRVGRHDWGKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 CDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFGAFLAQPGCRVGRHDWGKQ 40 50 60 70 80 90 220 230 240 250 260 270 pF1KB4 LPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHVHIVFDGNRVFQAQMKLWGV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHVHIVFDGNRVFQAQMKLWGV 100 110 120 130 140 150 280 290 300 310 320 330 pF1KB4 INVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKARAGVVLEMDEEESDDSDDDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 INVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKARAGVVLEMDEEESDDSDDDL 160 170 180 190 200 210 340 pF1KB4 SWTEEEEEEEAMGE :::::::::::::: NP_001 SWTEEEEEEEAMGE 220 >>NP_001137545 (OMIM: 604353) cysteine and histidine-ric (313 aa) initn: 1071 init1: 320 opt: 851 Z-score: 860.4 bits: 167.5 E(85289): 3.5e-41 Smith-Waterman score: 896; 44.9% identity (68.1% similar) in 323 aa overlap (1-308:1-301) 10 20 30 40 50 60 pF1KB4 MSLLCRNKGCGQHFDPNTNLPDSCCHHPGVPIFHDALKGWSCCRKRTVDFSEFLNIKGCT :.::: :.::::.:::.:: :.: .:::::.:::::: ::: NP_001 MALLCYNRGCGQRFDPETNSDDACTYHPGVPVFHDALK-------------------GCT 10 20 30 40 70 80 90 100 110 pF1KB4 MGPHCAEKLPEAPQPEGPATSSS-LQEQKPL---NVI--PKSAETLRRERPKSELPLKLL : : .:: :: .:: .: .. : : :: ..: :: .:...: :. 
. :. : NP_001 KGRHNSEKPPEPVKPEVKTTEKKELCELKPKFQEHIIQAPKPVEAIKR--PSPDEPMTNL 50 60 70 80 90 120 130 140 150 160 170 pF1KB4 PLNISQALEMALEQKEL---DQEPGAGLDS-LIRTGSSCQNPGCDAVYQGPESDATPCTY :.:: .:..::.. .: ..: :. :. :.::.: ::. .::: :: :.: NP_001 ELKISASLKQALDKLKLSSGNEENKKEEDNDEIKIGTSCKNGGCSKTYQGLESLEEVCVY 100 110 120 130 140 150 180 190 200 210 220 pF1KB4 HPGAPRFHEGMKSWSCCGIQTLDFGAFLAQPGCRVGRHDW-----GKQLPASCRHDWHQT : :.: :::::: :::: .: ::..:::: :: :.: : ::.. . :::::::: NP_001 HSGVPIFHEGMKYWSCCRRKTSDFNTFLAQEGCTKGKHMWTKKDAGKKV-VPCRHDWHQT 160 170 180 190 200 210 230 240 250 260 270 280 pF1KB4 DSLVVVTVYGQIPLPAFNWVKASQTELHVHIVFDGNRVFQAQMKLWGVINVEQSSVFLMP . :...::.. :: .. :.:..: :.:::::.:.. :. ..::::::.:..: : . NP_001 GGEVTISVYAKNSLPELSRVEANSTLLNVHIVFEGEKEFDQNVKLWGVIDVKRSYVTMTA 220 230 240 250 260 270 290 300 310 320 330 340 pF1KB4 SRVEISLVKADPGSWAQLEHPDALAKKARAGVVLEMDEEESDDSDDDLSWTEEEEEEEAM ...::.. ::.: .::.:: : : NP_001 TKIEITMRKAEPMQWASLELPAAKKQEKQKDATTD 280 290 300 310 >>NP_036256 (OMIM: 604353) cysteine and histidine-rich d (332 aa) initn: 1035 init1: 409 opt: 757 Z-score: 765.7 bits: 150.1 E(85289): 6.5e-36 Smith-Waterman score: 1055; 48.9% identity (73.7% similar) in 323 aa overlap (1-308:1-320) 10 20 30 40 50 60 pF1KB4 MSLLCRNKGCGQHFDPNTNLPDSCCHHPGVPIFHDALKGWSCCRKRTVDFSEFLNIKGCT :.::: :.::::.:::.:: :.: .:::::.:::::::::::..::.:::.::.: ::: NP_036 MALLCYNRGCGQRFDPETNSDDACTYHPGVPVFHDALKGWSCCKRRTTDFSDFLSIVGCT 10 20 30 40 50 60 70 80 90 100 110 pF1KB4 MGPHCAEKLPEAPQPEGPATSSS-LQEQKPL---NVI--PKSAETLRRERPKSELPLKLL : : .:: :: .:: .: .. : : :: ..: :: .:...: :. . :. : NP_036 KGRHNSEKPPEPVKPEVKTTEKKELCELKPKFQEHIIQAPKPVEAIKR--PSPDEPMTNL 70 80 90 100 110 120 130 140 150 160 170 pF1KB4 PLNISQALEMALEQKEL---DQEPGAGLDS-LIRTGSSCQNPGCDAVYQGPESDATPCTY :.:: .:..::.. .: ..: :. :. :.::.: ::. 
.::: :: :.: NP_036 ELKISASLKQALDKLKLSSGNEENKKEEDNDEIKIGTSCKNGGCSKTYQGLESLEEVCVY 120 130 140 150 160 170 180 190 200 210 220 pF1KB4 HPGAPRFHEGMKSWSCCGIQTLDFGAFLAQPGCRVGRHDW-----GKQLPASCRHDWHQT : :.: :::::: :::: .: ::..:::: :: :.: : ::.. . :::::::: NP_036 HSGVPIFHEGMKYWSCCRRKTSDFNTFLAQEGCTKGKHMWTKKDAGKKV-VPCRHDWHQT 180 190 200 210 220 230 230 240 250 260 270 280 pF1KB4 DSLVVVTVYGQIPLPAFNWVKASQTELHVHIVFDGNRVFQAQMKLWGVINVEQSSVFLMP . :...::.. :: .. :.:..: :.:::::.:.. :. ..::::::.:..: : . NP_036 GGEVTISVYAKNSLPELSRVEANSTLLNVHIVFEGEKEFDQNVKLWGVIDVKRSYVTMTA 240 250 260 270 280 290 290 300 310 320 330 340 pF1KB4 SRVEISLVKADPGSWAQLEHPDALAKKARAGVVLEMDEEESDDSDDDLSWTEEEEEEEAM ...::.. ::.: .::.:: : : NP_036 TKIEITMRKAEPMQWASLELPAAKKQEKQKDATTD 300 310 320 330 >>XP_016873030 (OMIM: 604353) PREDICTED: cysteine and hi (218 aa) initn: 855 init1: 320 opt: 625 Z-score: 636.1 bits: 125.5 E(85289): 1.1e-28 Smith-Waterman score: 625; 46.6% identity (73.0% similar) in 204 aa overlap (114-308:4-206) 90 100 110 120 130 140 pF1KB4 LQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQALEMALEQKEL---DQEPGAGLD : :.:: .:..::.. .: ..: : XP_016 MTNLELKISASLKQALDKLKLSSGNEENKKEED 10 20 30 150 160 170 180 190 pF1KB4 S-LIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFGAFLA . :. :.::.: ::. .::: :: :.:: :.: :::::: :::: .: ::..::: XP_016 NDEIKIGTSCKNGGCSKTYQGLESLEEVCVYHSGVPIFHEGMKYWSCCRRKTSDFNTFLA 40 50 60 70 80 90 200 210 220 230 240 250 pF1KB4 QPGCRVGRHDW-----GKQLPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHV : :: :.: : ::.. . :::::::: . :...::.. :: .. :.:..: :.: XP_016 QEGCTKGKHMWTKKDAGKKV-VPCRHDWHQTGGEVTISVYAKNSLPELSRVEANSTLLNV 100 110 120 130 140 150 260 270 280 290 300 310 pF1KB4 HIVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKAR ::::.:.. :. ..::::::.:..: : . ...::.. 
::.: .::.:: : : XP_016 HIVFEGEKEFDQNVKLWGVIDVKRSYVTMTATKIEITMRKAEPMQWASLELPAAKKQEKQ 160 170 180 190 200 210 320 330 340 pF1KB4 AGVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE XP_016 KDATTD >>XP_016873029 (OMIM: 604353) PREDICTED: cysteine and hi (218 aa) initn: 855 init1: 320 opt: 625 Z-score: 636.1 bits: 125.5 E(85289): 1.1e-28 Smith-Waterman score: 625; 46.6% identity (73.0% similar) in 204 aa overlap (114-308:4-206) 90 100 110 120 130 140 pF1KB4 LQEQKPLNVIPKSAETLRRERPKSELPLKLLPLNISQALEMALEQKEL---DQEPGAGLD : :.:: .:..::.. .: ..: : XP_016 MTNLELKISASLKQALDKLKLSSGNEENKKEED 10 20 30 150 160 170 180 190 pF1KB4 S-LIRTGSSCQNPGCDAVYQGPESDATPCTYHPGAPRFHEGMKSWSCCGIQTLDFGAFLA . :. :.::.: ::. .::: :: :.:: :.: :::::: :::: .: ::..::: XP_016 NDEIKIGTSCKNGGCSKTYQGLESLEEVCVYHSGVPIFHEGMKYWSCCRRKTSDFNTFLA 40 50 60 70 80 90 200 210 220 230 240 250 pF1KB4 QPGCRVGRHDW-----GKQLPASCRHDWHQTDSLVVVTVYGQIPLPAFNWVKASQTELHV : :: :.: : ::.. . :::::::: . :...::.. :: .. :.:..: :.: XP_016 QEGCTKGKHMWTKKDAGKKV-VPCRHDWHQTGGEVTISVYAKNSLPELSRVEANSTLLNV 100 110 120 130 140 150 260 270 280 290 300 310 pF1KB4 HIVFDGNRVFQAQMKLWGVINVEQSSVFLMPSRVEISLVKADPGSWAQLEHPDALAKKAR ::::.:.. :. ..::::::.:..: : . ...::.. ::.: .::.:: : : XP_016 HIVFEGEKEFDQNVKLWGVIDVKRSYVTMTATKIEITMRKAEPMQWASLELPAAKKQEKQ 160 170 180 190 200 210 320 330 340 pF1KB4 AGVVLEMDEEESDDSDDDLSWTEEEEEEEAMGE XP_016 KDATTD 347 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 05:42:54 2016 done: Sat Nov 5 05:42:55 2016 Total Scan time: 8.840 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]
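
Note on reading the summary rows programmatically: each row under "The best scores are:" in this report carries an accession, an OMIM id, a truncated description, the library sequence length in parentheses, and then the opt score, bit score and E() value. The following is a minimal sketch of pulling those fields out of one row. The names `Hit`, `SUMMARY_ROW` and `parse_summary_row` are hypothetical helpers introduced here for illustration, and the pattern assumes the "(OMIM: ...)" row layout seen in this particular report rather than FASTA output in general.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'best scores' table (hypothetical container)."""
    accession: str
    omim_id: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row layout, taken from the rows in this report:
#   ACCESSION (OMIM: ID) description ( LEN) OPT BITS EVALUE
SUMMARY_ROW = re.compile(
    r"^(?P<acc>\S+)\s+\(OMIM:\s*(?P<omim>\d+)\)\s+(?P<desc>.*?)\s*"
    r"\(\s*(?P<len>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<e>\S+)\s*$"
)


def parse_summary_row(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None if the line is not one."""
    m = SUMMARY_ROW.match(line.strip())
    if m is None:
        return None
    return Hit(
        accession=m.group("acc"),
        omim_id=m.group("omim"),
        description=m.group("desc"),
        length=int(m.group("len")),
        opt=int(m.group("opt")),
        bits=float(m.group("bits")),
        evalue=float(m.group("e")),
    )


if __name__ == "__main__":
    row = ("NP_001137545 (OMIM: 604353) cysteine and histidine "
           "( 313) 851 167.5 3.5e-41")
    print(parse_summary_row(row))
```

Lines that are not summary rows (for example "Scan time: 8.840") simply yield None, so the function can be mapped over every line of a report and the None results filtered out.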