FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9794, 217 aa
  1>>>pF1KB9794 217 - 217 aa - 217 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.2363+/-0.000286; mu= 9.9185+/- 0.018
 mean_var=88.5101+/-17.522, 0's: 0 Z-trim(119.8): 7 B-trim: 0 in 0/54
 Lambda= 0.136326
 statistics sampled from 34156 (34164) to 34156 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.401), width: 16
 Scan time: 7.120

The best scores are:                                       opt bits E(85289)
NP_060836 (OMIM: 605695) biogenesis of lysosome-re ( 217) 1450 294.3   1e-79
XP_011527189 (OMIM: 607471) PREDICTED: breast carc ( 203)  408  89.4 4.7e-18
NP_942094 (OMIM: 607471) breast carcinoma-amplifie ( 203)  408  89.4 4.7e-18
XP_011527188 (OMIM: 607471) PREDICTED: breast carc ( 214)  330  74.0 2.1e-13
NP_060313 (OMIM: 607471) breast carcinoma-amplifie ( 211)  327  73.4 3.1e-13
XP_016883421 (OMIM: 607471) PREDICTED: breast carc ( 139)  199  48.2 8.1e-06
NP_001010974 (OMIM: 607471) breast carcinoma-ampli ( 158)  199  48.2 9.1e-06

>>NP_060836 (OMIM: 605695) biogenesis of lysosome-relate (217 aa)
 initn: 1450 init1: 1450 opt: 1450 Z-score: 1552.2 bits: 294.3 E(85289): 1e-79
Smith-Waterman score: 1450; 100.0% identity (100.0% similar) in 217 aa overlap (1-217:1-217)

               10        20        30        40        50        60
pF1KB9 MEGSFSDGGALPEGLAEEAEPQGAAWSGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_060 MEGSFSDGGALPEGLAEEAEPQGAAWSGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 LRRAAAGYAACLLPGAGARPEVEALDASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_060 LRRAAAGYAACLLPGAGARPEVEALDASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 
IHAKAAEMRRIYSRIDRLEAFVRMVGGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_060 IHAKAAEMRRIYSRIDRLEAFVRMVGGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVP 130 140 150 160 170 180 190 200 210 pF1KB9 SLFSKSAPSRPQQAGYEAPVLFRTEDYFPCCSERPQL ::::::::::::::::::::::::::::::::::::: NP_060 SLFSKSAPSRPQQAGYEAPVLFRTEDYFPCCSERPQL 190 200 210 >>XP_011527189 (OMIM: 607471) PREDICTED: breast carcinom (203 aa) initn: 384 init1: 315 opt: 408 Z-score: 445.0 bits: 89.4 E(85289): 4.7e-18 Smith-Waterman score: 408; 42.9% identity (74.0% similar) in 154 aa overlap (57-209:37-186) 30 40 50 60 70 80 pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL : : .: .: : : : :: :.. . XP_011 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV . ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...:::::::.:: XP_011 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEAFVKMV 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE : .:: .: .: .:: . :.::.:... : . ..::. .:: :. : . :: :.:.::: XP_011 GHHVAFLEADVLQAERDHGAFPQALRRWLGSAGLPSFRNKS-PA-PVPVTYELPTLYRTE 130 140 150 160 170 180 210 pF1KB9 DYFPCCSERPQL :::: XP_011 DYFPVDAGEAQHHPRTCPRPL 190 200 >>NP_942094 (OMIM: 607471) breast carcinoma-amplified se (203 aa) initn: 384 init1: 315 opt: 408 Z-score: 445.0 bits: 89.4 E(85289): 4.7e-18 Smith-Waterman score: 408; 42.9% identity (74.0% similar) in 154 aa overlap (57-209:37-186) 30 40 50 60 70 80 pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL : : .: .: : : : :: :.. . NP_942 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV . ..: .: :..:: .. :..:.:.:... 
:..: ..:: .::: ::...:::::::.:: NP_942 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEAFVKMV 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE : .:: .: .: .:: . :.::.:... : . ..::. .:: :. : . :: :.:.::: NP_942 GHHVAFLEADVLQAERDHGAFPQALRRWLGSAGLPSFRNKS-PA-PVPVTYELPTLYRTE 130 140 150 160 170 180 210 pF1KB9 DYFPCCSERPQL :::: NP_942 DYFPVDAGEAQHHPRTCPRPL 190 200 >>XP_011527188 (OMIM: 607471) PREDICTED: breast carcinom (214 aa) initn: 311 init1: 311 opt: 330 Z-score: 361.8 bits: 74.0 E(85289): 2.1e-13 Smith-Waterman score: 330; 39.3% identity (72.6% similar) in 135 aa overlap (57-190:37-169) 30 40 50 60 70 80 pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL : : .: .: : : : :: :.. . XP_011 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV . ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...:::::::.:: XP_011 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEAFVKMV 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE : .:: .: .: .:: . :.::.:... : . ..::. . . :: XP_011 GHHVAFLEADVLQAERDHGAFPQALRRWLGSAGLPSFRNPESCSRVGPVSCLCHGDESSD 130 140 150 160 170 180 210 pF1KB9 DYFPCCSERPQL XP_011 AAGVFWAVFGPPQAALTCPTWFIPRPPGTE 190 200 210 >>NP_060313 (OMIM: 607471) breast carcinoma-amplified se (211 aa) initn: 293 init1: 272 opt: 327 Z-score: 358.7 bits: 73.4 E(85289): 3.1e-13 Smith-Waterman score: 327; 38.8% identity (71.2% similar) in 139 aa overlap (57-190:37-173) 30 40 50 60 70 80 pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL : : .: .: : : : :: :.. . NP_060 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV . ..: .: :..:: .. :..:.:.:... 
:..: ..:: .::: ::...:::::::.:: NP_060 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEAFVKMV 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSL----FSKSAPSRPQQAGYEAPVL : .:: .: .: .:: . :.::.:... : . ..::. : . :.: NP_060 GHHVAFLEADVLQAERDHGAFPQALRRWLGSAGLPSFRNVECSGTIPARCNLRLPGSSDS 130 140 150 160 170 180 210 pF1KB9 FRTEDYFPCCSERPQL NP_060 PASASQVAGITEVTCTGARDVRAAHTV 190 200 210 >>XP_016883421 (OMIM: 607471) PREDICTED: breast carcinom (139 aa) initn: 193 init1: 180 opt: 199 Z-score: 225.4 bits: 48.2 E(85289): 8.1e-06 Smith-Waterman score: 199; 38.1% identity (71.4% similar) in 84 aa overlap (57-139:37-118) 30 40 50 60 70 80 pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL : : .: .: : : : :: :.. . XP_016 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV . ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...:::: XP_016 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEILALPA 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE XP_016 FLSSWPLPPPSKPTE 130 >>NP_001010974 (OMIM: 607471) breast carcinoma-amplified (158 aa) initn: 249 init1: 180 opt: 199 Z-score: 224.6 bits: 48.2 E(85289): 9.1e-06 Smith-Waterman score: 199; 38.1% identity (71.4% similar) in 84 aa overlap (57-139:37-118) 30 40 50 60 70 80 pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL : : .: .: : : : :: :.. . NP_001 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV 10 20 30 40 50 60 90 100 110 120 130 140 pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV . ..: .: :..:: .. :..:.:.:... 
:..: ..:: .::: ::...:::: NP_001 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEKSPAPV 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE NP_001 PVTYELPTLYRTEDYFPVDAGEAQHHPRTCPRPL 130 140 150 217 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 18:59:04 2016 done: Fri Nov 4 18:59:05 2016 Total Scan time: 7.120 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]