FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1596, 354 aa 1>>>pF1KE1596 354 - 354 aa - 354 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.2075+/-0.000358; mu= -9.8633+/- 0.023 mean_var=286.0718+/-58.598, 0's: 0 Z-trim(122.9): 13 B-trim: 11 in 1/59 Lambda= 0.075829 statistics sampled from 41789 (41803) to 41789 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.769), E-opt: 0.2 (0.49), width: 16 Scan time: 9.700 The best scores are: opt bits E(85289) NP_001242 (OMIM: 153634) macrosialin isoform A pre ( 354) 2345 269.2 9.7e-72 NP_001035148 (OMIM: 153634) macrosialin isoform B ( 327) 2072 239.3 8.9e-63 XP_006713649 (OMIM: 605883) PREDICTED: lysosome-as ( 394) 377 53.9 6.9e-07 XP_005247417 (OMIM: 605883) PREDICTED: lysosome-as ( 418) 377 54.0 7.2e-07 XP_011535796 (OMIM: 153330) PREDICTED: lysosome-as ( 398) 356 51.7 3.4e-06 NP_005552 (OMIM: 153330) lysosome-associated membr ( 417) 356 51.7 3.5e-06 NP_054701 (OMIM: 300257,309060) lysosome-associate ( 410) 311 46.7 0.00011 >>NP_001242 (OMIM: 153634) macrosialin isoform A precurs (354 aa) initn: 2345 init1: 2345 opt: 2345 Z-score: 1409.0 bits: 269.2 E(85289): 9.7e-72 Smith-Waterman score: 2345; 100.0% identity (100.0% similar) in 354 aa overlap (1-354:1-354) 10 20 30 40 50 60 pF1KE1 MRLAVLFSGALLGLLAAQGTGNDCPHKKSATLLPSFTVTPTVTESTGTTSHRTTKSHKTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MRLAVLFSGALLGLLAAQGTGNDCPHKKSATLLPSFTVTPTVTESTGTTSHRTTKSHKTT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 THRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTATSQGPSTATHSPATTSHGNAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 THRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTATSQGPSTATHSPATTSHGNAT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 VHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSKETIGDYTWTNGSQPCVHLQAQIQIRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSKETIGDYTWTNGSQPCVHLQAQIQIRV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 MYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHLLLSFPYGHLSFGFMQDLQQKVVYLSY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHLLLSFPYGHLSFGFMQDLQQKVVYLSY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 MAVEYNVSFPHAAQWTFSAQNASLRDLQAPLGQSFSCSNSSIILSPAVHLDLLSLRLQAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAVEYNVSFPHAAQWTFSAQNASLRDLQAPLGQSFSCSNSSIILSPAVHLDLLSLRLQAA 250 260 270 280 290 300 310 320 330 340 350 pF1KE1 QLPHTGVFGQSFSCPSDRSILLPLIIGLILLGLLALVLIAFCIIRRRPSAYQAL :::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 QLPHTGVFGQSFSCPSDRSILLPLIIGLILLGLLALVLIAFCIIRRRPSAYQAL 310 320 330 340 350 >>NP_001035148 (OMIM: 153634) macrosialin isoform B prec (327 aa) initn: 2072 init1: 2072 opt: 2072 Z-score: 1248.1 bits: 239.3 E(85289): 8.9e-63 Smith-Waterman score: 2100; 92.4% identity (92.4% similar) in 354 aa overlap (1-354:1-327) 10 20 30 40 50 60 pF1KE1 MRLAVLFSGALLGLLAAQGTGNDCPHKKSATLLPSFTVTPTVTESTGTTSHRTTKSHKTT :::::::::::::::: ::::::::::::::::: NP_001 MRLAVLFSGALLGLLA---------------------------ESTGTTSHRTTKSHKTT 10 20 30 70 80 90 100 110 120 pF1KE1 THRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTATSQGPSTATHSPATTSHGNAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 THRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTATSQGPSTATHSPATTSHGNAT 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE1 VHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSKETIGDYTWTNGSQPCVHLQAQIQIRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSKETIGDYTWTNGSQPCVHLQAQIQIRV 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE1 MYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHLLLSFPYGHLSFGFMQDLQQKVVYLSY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHLLLSFPYGHLSFGFMQDLQQKVVYLSY 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE1 MAVEYNVSFPHAAQWTFSAQNASLRDLQAPLGQSFSCSNSSIILSPAVHLDLLSLRLQAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAVEYNVSFPHAAQWTFSAQNASLRDLQAPLGQSFSCSNSSIILSPAVHLDLLSLRLQAA 220 230 240 250 260 270 310 320 330 340 350 pF1KE1 QLPHTGVFGQSFSCPSDRSILLPLIIGLILLGLLALVLIAFCIIRRRPSAYQAL :::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 QLPHTGVFGQSFSCPSDRSILLPLIIGLILLGLLALVLIAFCIIRRRPSAYQAL 280 290 300 310 320 >>XP_006713649 (OMIM: 605883) PREDICTED: lysosome-associ (394 aa) initn: 188 init1: 86 opt: 377 Z-score: 244.7 bits: 53.9 E(85289): 6.9e-07 Smith-Waterman score: 395; 29.8% identity (60.7% similar) in 359 aa overlap (15-349:40-388) 10 20 30 40 pF1KE1 MRLAVLFSGALLGLLAAQGTGNDCPHKKSATL-LPSFTVTPTVT :::. . . .::. .: :.::..: XP_006 RDYSQPTAAATVQDIKKPVQQPAKQAPHQTLAARFMDGHITFQTAATVKIP--TTTPATT 10 20 30 40 50 60 50 60 70 80 90 pF1KE1 ESTGTTSH-----RTTKSHKTTTHRTTTTGTTSHGPTTATHN--PTTT--SHGNVTVHPT ..:.::: ::.. ...: . . .. ::. : .. :: : .: . : : XP_006 KNTATTSPITYTLVTTQATPNNSHTAPPVTEVTVGPSLAPYSLPPTITPPAHTTGTSSST 70 80 90 100 110 120 100 110 120 130 140 pF1KE1 -SNSTATSQGPSTATHSPATTS---H----GNATVHPTS--NSTATSPGFTSSAHPEPPP :..:... ::. : ::: : : :. :.:: ..::.. . : .: : XP_006 VSHTTGNTTQPSNQTTLPATLSIALHKSTTGQKPVQPTHAPGTTAAAHNTTRTAAPASTV 130 140 150 160 170 180 150 160 170 180 190 200 pF1KE1 PSPS--PSPTSKETIGDYTWTNGSQPCVHLQAQIQIRVMYTTQGGGEAWGISVLNPNKTK :.:. :.:.: .: : : :::. :.. . ::. :. . . ... .:: :. XP_006 PGPTLAPQPSSVKT-GIYQVLNGSRLCIKAEMGIQLIVQDKESVFSPRRYFNI-DPNATQ 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE1 VQGSCEGAHPHLLLSFPYGHLSFGFMQDLQQKVVYLSYMAVEYNVSFPHAAQWTFSAQNA ..:.: . .:::.: : ... : .: .. :.: ... .:: :.. ... . XP_006 ASGNCGTRKSNLLLNFQGGFVNLTFTKD--EESYYISEVGAYLTVSDPETI---YQGIKH 250 260 270 280 290 300 270 280 290 300 310 320 pF1KE1 SLRDLQAPLGQSFSC-SNSSIILSPAVHLDLLSLRLQAAQLPHTGVFGQSFSCPSDRSIL .. .:. .:.::.: :..:. :: ... ...::: .. ::.. : :::. XP_006 AVVMFQTAVGHSFKCVSEQSLQLSAHLQVKTTDVQLQAFDFEDDH-FGNADECFSDRNRR 310 320 330 340 350 330 340 350 pF1KE1 -LPLIIGLILLGLLALVLIAFCIIRRRPSAYQAL .:. .:: . :::...: : . :.::: XP_006 EIPVAMGLSITGLLVILLTACLVARKRPSRGYERM 360 370 380 390 >>XP_005247417 (OMIM: 605883) PREDICTED: lysosome-associ (418 aa) initn: 188 init1: 86 opt: 377 Z-score: 244.3 bits: 54.0 E(85289): 7.2e-07 Smith-Waterman score: 395; 29.8% identity (60.7% similar) in 359 aa overlap (15-349:64-412) 10 20 30 40 pF1KE1 MRLAVLFSGALLGLLAAQGTGNDCPHKKSATL-LPSFTVTPTVT :::. . . .::. .: :.::..: XP_005 RDYSQPTAAATVQDIKKPVQQPAKQAPHQTLAARFMDGHITFQTAATVKIP--TTTPATT 40 50 60 70 80 90 50 60 70 80 90 pF1KE1 ESTGTTSH-----RTTKSHKTTTHRTTTTGTTSHGPTTATHN--PTTT--SHGNVTVHPT ..:.::: ::.. ...: . . .. ::. : .. :: : .: . : : XP_005 KNTATTSPITYTLVTTQATPNNSHTAPPVTEVTVGPSLAPYSLPPTITPPAHTTGTSSST 100 110 120 130 140 150 100 110 120 130 140 pF1KE1 -SNSTATSQGPSTATHSPATTS---H----GNATVHPTS--NSTATSPGFTSSAHPEPPP :..:... ::. : ::: : : :. :.:: ..::.. . : .: : XP_005 VSHTTGNTTQPSNQTTLPATLSIALHKSTTGQKPVQPTHAPGTTAAAHNTTRTAAPASTV 160 170 180 190 200 210 150 160 170 180 190 200 pF1KE1 PSPS--PSPTSKETIGDYTWTNGSQPCVHLQAQIQIRVMYTTQGGGEAWGISVLNPNKTK :.:. :.:.: .: : : :::. :.. . ::. :. . . ... .:: :. XP_005 PGPTLAPQPSSVKT-GIYQVLNGSRLCIKAEMGIQLIVQDKESVFSPRRYFNI-DPNATQ 220 230 240 250 260 210 220 230 240 250 260 pF1KE1 VQGSCEGAHPHLLLSFPYGHLSFGFMQDLQQKVVYLSYMAVEYNVSFPHAAQWTFSAQNA ..:.: . .:::.: : ... : .: .. :.: ... .:: :.. ... . XP_005 ASGNCGTRKSNLLLNFQGGFVNLTFTKD--EESYYISEVGAYLTVSDPETI---YQGIKH 270 280 290 300 310 320 270 280 290 300 310 320 pF1KE1 SLRDLQAPLGQSFSC-SNSSIILSPAVHLDLLSLRLQAAQLPHTGVFGQSFSCPSDRSIL .. .:. .:.::.: :..:. :: ... ...::: .. ::.. : :::. XP_005 AVVMFQTAVGHSFKCVSEQSLQLSAHLQVKTTDVQLQAFDFEDDH-FGNADECFSDRNRR 330 340 350 360 370 380 330 340 350 pF1KE1 -LPLIIGLILLGLLALVLIAFCIIRRRPSAYQAL .:. .:: . :::...: : . :.::: XP_005 EIPVAMGLSITGLLVILLTACLVARKRPSRGYERM 390 400 410 >>XP_011535796 (OMIM: 153330) PREDICTED: lysosome-associ (398 aa) initn: 212 init1: 134 opt: 356 Z-score: 232.2 bits: 51.7 E(85289): 3.4e-06 Smith-Waterman score: 359; 28.5% identity (59.2% similar) in 267 aa overlap (91-354:142-398) 70 80 90 100 110 120 pF1KE1 THRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTATSQGPSTATHSPATTSHGNAT :: .. ... .. : : .. :.:.. XP_011 ASSKEIKTVESITDIRADIDKKYRCVSGTQVHMNNVTVTLHDATIQAYLSNSSFSRGETR 120 130 140 150 160 170 130 140 150 160 170 180 pF1KE1 VHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSKETIGDYTWTNGSQPCVHLQAQIQIRV . : .:.: : :: ::::: : : .. :. .. . :. . .:. . XP_011 CEQDRPSPTTAP-------PAPPSPSPSPVPKSP-SVDKYNVSGTNGTCLLASMGLQLNL 180 190 200 210 220 190 200 210 220 230 240 pF1KE1 MYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHLLLSFPYGHLSFGFMQDLQQKVVYLSY : . . . . .:::::...::: ::: : : . :. .. . . XP_011 TYERKDNTTVTRLLNINPNKTSASGSC-GAHLVTLELHSEGTTVLLFQFGMNASSSRFFL 230 240 250 260 270 280 250 260 270 280 290 pF1KE1 MAVEYNVSFPHAAQWTFSAQNASLRDLQAPLGQSFSCS-NSSIILSPAVHLDLLSLRLQA .... :. .: : . .:.: :.::: ::: .:.:..:. . . .. : ...... .:: XP_011 QGIQLNTILPDARDPAFKAANGSLRALQATVGNSYKCNAEEHVRVTKAFSVNIFKVWVQA 290 300 310 320 330 340 300 310 320 330 340 350 pF1KE1 AQLPHTGVFGQSFSCPSDR-SILLPLIIGLILLGLLALVLIAFCIIRRRPSA-YQAL .. . : ::. : :. :.:.:. .: : ::. .::::. . :.: : ::.. XP_011 FKV-EGGQFGSVEECLLDENSMLIPIAVGGALAGLVLIVLIAYLVGRKRSHAGYQTI 350 360 370 380 390 >>NP_005552 (OMIM: 153330) lysosome-associated membrane (417 aa) initn: 212 init1: 134 opt: 356 Z-score: 231.9 bits: 51.7 E(85289): 3.5e-06 Smith-Waterman score: 359; 28.5% identity (59.2% similar) in 267 aa overlap (91-354:161-417) 70 80 90 100 110 120 pF1KE1 THRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTATSQGPSTATHSPATTSHGNAT :: .. ... .. : : .. :.:.. NP_005 ASSKEIKTVESITDIRADIDKKYRCVSGTQVHMNNVTVTLHDATIQAYLSNSSFSRGETR 140 150 160 170 180 190 130 140 150 160 170 180 pF1KE1 VHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSKETIGDYTWTNGSQPCVHLQAQIQIRV . : .:.: : :: ::::: : : .. :. .. . :. . .:. . NP_005 CEQDRPSPTTAP-------PAPPSPSPSPVPKSP-SVDKYNVSGTNGTCLLASMGLQLNL 200 210 220 230 240 190 200 210 220 230 240 pF1KE1 MYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHLLLSFPYGHLSFGFMQDLQQKVVYLSY : . . . . .:::::...::: ::: : : . :. .. . . NP_005 TYERKDNTTVTRLLNINPNKTSASGSC-GAHLVTLELHSEGTTVLLFQFGMNASSSRFFL 250 260 270 280 290 300 250 260 270 280 290 pF1KE1 MAVEYNVSFPHAAQWTFSAQNASLRDLQAPLGQSFSCS-NSSIILSPAVHLDLLSLRLQA .... :. .: : . .:.: :.::: ::: .:.:..:. . . .. : ...... .:: NP_005 QGIQLNTILPDARDPAFKAANGSLRALQATVGNSYKCNAEEHVRVTKAFSVNIFKVWVQA 310 320 330 340 350 360 300 310 320 330 340 350 pF1KE1 AQLPHTGVFGQSFSCPSDR-SILLPLIIGLILLGLLALVLIAFCIIRRRPSA-YQAL .. . : ::. : :. :.:.:. .: : ::. .::::. . :.: : ::.. NP_005 FKV-EGGQFGSVEECLLDENSMLIPIAVGGALAGLVLIVLIAYLVGRKRSHAGYQTI 370 380 390 400 410 >>NP_054701 (OMIM: 300257,309060) lysosome-associated me (410 aa) initn: 231 init1: 98 opt: 311 Z-score: 205.4 bits: 46.7 E(85289): 0.00011 Smith-Waterman score: 311; 25.6% identity (55.2% similar) in 355 aa overlap (13-354:70-410) 10 20 30 pF1KE1 MRLAVLFSGALLGLLAAQGT--GNDCPHKKSATLL-PSFTVT : .. .:. :.: : :. . :.:. NP_054 TCLYAKWQMNFTVRYETTNKTYKTVTISDHGTVTYNGSICGDDQNGPKIAVQFGPGFSWI 40 50 60 70 80 90 40 50 60 70 80 90 pF1KE1 PTVTESTGTTSHRTTKSHKTTTHRTTTTGTTSHGPTTATHNPTTTSHGNVTVHPTSNSTA . :....: : ... .: :: . ..: :. . . : . :: . NP_054 ANFTKAASTYSIDSVSFSYNTGDNTTFPDAEDKGILTVDELLAIRIPLNDLFR--CNSLS 100 110 120 130 140 150 100 110 120 130 140 150 pF1KE1 TSQGPSTATHS-----PATTSHGNATVHPTSNSTATSPGFTSSAHPEPPPPSPSPSPTSK : . ... : : ...:..... . . . . : : :. .:.: : NP_054 TLEKNDVVQHYWDVLVQAFVQNGTVSTNEFLCDKDKTSTVAPTIHTTVPSPTTTPTPKEK 160 170 180 190 200 210 160 170 180 190 200 210 pF1KE1 ETIGDYTWTNGSQPCVHLQAQIQIRVMYTTQGGGEAWGISVLNPNKTKVQGSCEGAHPHL : :. .::.. :. .:. . :: .. .. .::: :. :::. .: : NP_054 PEAGTYSVNNGNDTCLLATMGLQLNI---TQD--KVASVININPNTTHSTGSCR-SHTAL 220 230 240 250 260 270 220 230 240 250 260 270 pF1KE1 LL--SFPYGHLSFGFMQDLQQKVVYLSYMAVEYNVSFPHAAQWTFSAQNASLRDLQAPLG : : .:.: : ... ::. : :.:. . .:: : .: .:::: NP_054 LRLNSSTIKYLDFVFAVKNENRF-YLK----EVNISMYLVNGSVFSIANNNLSYWDAPLG 280 290 300 310 320 280 290 300 310 320 330 pF1KE1 QSFSCSN-SSIILSPAVHLDLLSLRLQAAQLPHTGVFGQSFSCP-SDRSILLPLIIGLIL .:. :.. ... .: : ... ..::.: .. . : .. . : .: .::.:.:.: : NP_054 SSYMCNKEQTVSVSGAFQINTFDLRVQPFNVTQ-GKYSTAQECSLDDDTILIPIIVGAGL 330 340 350 360 370 380 340 350 pF1KE1 LGLLALVLIAFCIIRRRPSA-YQAL ::. ...::. : ::. : ::.: NP_054 SGLIIVIVIAYVIGRRKSYAGYQTL 390 400 410 354 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 19:22:12 2016 done: Sat Nov 5 19:22:13 2016 Total Scan time: 9.700 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]