FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9794, 217 aa
1>>>pF1KB9794 217 - 217 aa - 217 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.2363+/-0.000286; mu= 9.9185+/- 0.018
mean_var=88.5101+/-17.522, 0's: 0 Z-trim(119.8): 7 B-trim: 0 in 0/54
Lambda= 0.136326
statistics sampled from 34156 (34164) to 34156 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.401), width: 16
Scan time: 7.120
The best scores are:                                       opt  bits  E(85289)
NP_060836 (OMIM: 605695) biogenesis of lysosome-re ( 217) 1450 294.3   1e-79
XP_011527189 (OMIM: 607471) PREDICTED: breast carc ( 203)  408  89.4 4.7e-18
NP_942094 (OMIM: 607471) breast carcinoma-amplifie ( 203)  408  89.4 4.7e-18
XP_011527188 (OMIM: 607471) PREDICTED: breast carc ( 214)  330  74.0 2.1e-13
NP_060313 (OMIM: 607471) breast carcinoma-amplifie ( 211)  327  73.4 3.1e-13
XP_016883421 (OMIM: 607471) PREDICTED: breast carc ( 139)  199  48.2 8.1e-06
NP_001010974 (OMIM: 607471) breast carcinoma-ampli ( 158)  199  48.2 9.1e-06
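
The rows of the best-scores table above share one layout: accession, truncated description, library sequence length in parentheses, then the opt, bits and E() columns. A minimal Python sketch of reading such rows follows. The function name `parse_best_scores` and the dictionary keys are hypothetical, chosen for illustration, and the pattern assumes the column order shown in this report rather than every FASTA output mode.

```python
import re

# One summary row: accession, free-text description, library sequence
# length in parentheses, then the opt, bits and E() columns.
# Whitespace is matched loosely so collapsed or padded columns both work.
ROW = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>[\d.eE+-]+)\s*$"
)


def parse_best_scores(lines):
    """Return one dict per row that matches the summary layout."""
    hits = []
    for line in lines:
        m = ROW.match(line.strip())
        if m is None:
            continue  # header or unrelated line
        hits.append({
            "acc": m.group("acc"),
            "desc": m.group("desc"),
            "length": int(m.group("length")),
            "opt": int(m.group("opt")),
            "bits": float(m.group("bits")),
            "evalue": float(m.group("evalue")),
        })
    return hits


sample = [
    "The best scores are: opt bits E(85289)",
    "NP_060836 (OMIM: 605695) biogenesis of lysosome-re ( 217) 1450 294.3 1e-79",
    "XP_011527189 (OMIM: 607471) PREDICTED: breast carc ( 203) 408 89.4 4.7e-18",
]
hits = parse_best_scores(sample)
```

The header line is skipped because its only parenthesised number, in `E(85289)`, is not followed by the three numeric columns.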
>>NP_060836 (OMIM: 605695) biogenesis of lysosome-relate (217 aa)
initn: 1450 init1: 1450 opt: 1450 Z-score: 1552.2 bits: 294.3 E(85289): 1e-79
Smith-Waterman score: 1450; 100.0% identity (100.0% similar) in 217 aa overlap (1-217:1-217)
               10        20        30        40        50        60
pF1KB9 MEGSFSDGGALPEGLAEEAEPQGAAWSGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_060 MEGSFSDGGALPEGLAEEAEPQGAAWSGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 LRRAAAGYAACLLPGAGARPEVEALDASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_060 LRRAAAGYAACLLPGAGARPEVEALDASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 IHAKAAEMRRIYSRIDRLEAFVRMVGGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_060 IHAKAAEMRRIYSRIDRLEAFVRMVGGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVP
              130       140       150       160       170       180

              190       200       210
pF1KB9 SLFSKSAPSRPQQAGYEAPVLFRTEDYFPCCSERPQL
       :::::::::::::::::::::::::::::::::::::
NP_060 SLFSKSAPSRPQQAGYEAPVLFRTEDYFPCCSERPQL
              190       200       210
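
Each alignment record in this report carries a "Smith-Waterman score" line giving the score, percent identity, percent similarity, overlap length and the aligned coordinate ranges (query range, then library range). The sketch below, in Python, pulls those fields out of one such line. The name `parse_sw_line` and the returned keys are hypothetical. The residue spans it derives are plain coordinate differences and so exclude gap columns, which is presumably why they can be shorter than the reported overlap length.

```python
import re

# Layout of the per-hit summary line as it appears in this report.
SW_LINE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*"
    r"(?P<ident>[\d.]+)% identity \((?P<sim>[\d.]+)% similar\) in "
    r"(?P<overlap>\d+) aa overlap "
    r"\((?P<qs>\d+)-(?P<qe>\d+):(?P<ls>\d+)-(?P<le>\d+)\)"
)


def parse_sw_line(line):
    """Parse one 'Smith-Waterman score' line; return None if it does not match."""
    m = SW_LINE.search(line)
    if m is None:
        return None
    qs, qe = int(m.group("qs")), int(m.group("qe"))
    ls, le = int(m.group("ls")), int(m.group("le"))
    return {
        "score": int(m.group("score")),
        "identity_pct": float(m.group("ident")),
        "similar_pct": float(m.group("sim")),
        "overlap": int(m.group("overlap")),
        "query_range": (qs, qe),
        "library_range": (ls, le),
        # Residues covered on each sequence, not counting gap columns.
        "query_span": qe - qs + 1,
        "library_span": le - ls + 1,
    }


rec = parse_sw_line(
    "Smith-Waterman score: 408; 42.9% identity (74.0% similar) "
    "in 154 aa overlap (57-209:37-186)"
)
```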
>>XP_011527189 (OMIM: 607471) PREDICTED: breast carcinom (203 aa)
initn: 384 init1: 315 opt: 408 Z-score: 445.0 bits: 89.4 E(85289): 4.7e-18
Smith-Waterman score: 408; 42.9% identity (74.0% similar) in 154 aa overlap (57-209:37-186)
30 40 50 60 70 80
pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL
: : .: .: : : : :: :.. .
XP_011 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV
10 20 30 40 50 60
90 100 110 120 130 140
pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV
. ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...:::::::.::
XP_011 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEAFVKMV
70 80 90 100 110 120
150 160 170 180 190 200
pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE
: .:: .: .: .:: . :.::.:... : . ..::. .:: :. : . :: :.:.:::
XP_011 GHHVAFLEADVLQAERDHGAFPQALRRWLGSAGLPSFRNKS-PA-PVPVTYELPTLYRTE
130 140 150 160 170 180
210
pF1KB9 DYFPCCSERPQL
::::
XP_011 DYFPVDAGEAQHHPRTCPRPL
190 200
>>NP_942094 (OMIM: 607471) breast carcinoma-amplified se (203 aa)
initn: 384 init1: 315 opt: 408 Z-score: 445.0 bits: 89.4 E(85289): 4.7e-18
Smith-Waterman score: 408; 42.9% identity (74.0% similar) in 154 aa overlap (57-209:37-186)
30 40 50 60 70 80
pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL
: : .: .: : : : :: :.. .
NP_942 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV
10 20 30 40 50 60
90 100 110 120 130 140
pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV
. ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...:::::::.::
NP_942 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEAFVKMV
70 80 90 100 110 120
150 160 170 180 190 200
pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE
: .:: .: .: .:: . :.::.:... : . ..::. .:: :. : . :: :.:.:::
NP_942 GHHVAFLEADVLQAERDHGAFPQALRRWLGSAGLPSFRNKS-PA-PVPVTYELPTLYRTE
130 140 150 160 170 180
210
pF1KB9 DYFPCCSERPQL
::::
NP_942 DYFPVDAGEAQHHPRTCPRPL
190 200
>>XP_011527188 (OMIM: 607471) PREDICTED: breast carcinom (214 aa)
initn: 311 init1: 311 opt: 330 Z-score: 361.8 bits: 74.0 E(85289): 2.1e-13
Smith-Waterman score: 330; 39.3% identity (72.6% similar) in 135 aa overlap (57-190:37-169)
30 40 50 60 70 80
pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL
: : .: .: : : : :: :.. .
XP_011 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV
10 20 30 40 50 60
90 100 110 120 130 140
pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV
. ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...:::::::.::
XP_011 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEAFVKMV
70 80 90 100 110 120
150 160 170 180 190 200
pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE
: .:: .: .: .:: . :.::.:... : . ..::. . . ::
XP_011 GHHVAFLEADVLQAERDHGAFPQALRRWLGSAGLPSFRNPESCSRVGPVSCLCHGDESSD
130 140 150 160 170 180
210
pF1KB9 DYFPCCSERPQL
XP_011 AAGVFWAVFGPPQAALTCPTWFIPRPPGTE
190 200 210
>>NP_060313 (OMIM: 607471) breast carcinoma-amplified se (211 aa)
initn: 293 init1: 272 opt: 327 Z-score: 358.7 bits: 73.4 E(85289): 3.1e-13
Smith-Waterman score: 327; 38.8% identity (71.2% similar) in 139 aa overlap (57-190:37-173)
30 40 50 60 70 80
pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL
: : .: .: : : : :: :.. .
NP_060 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV
10 20 30 40 50 60
90 100 110 120 130 140
pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV
. ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...:::::::.::
NP_060 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEAFVKMV
70 80 90 100 110 120
150 160 170 180 190 200
pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSL----FSKSAPSRPQQAGYEAPVL
: .:: .: .: .:: . :.::.:... : . ..::. : . :.:
NP_060 GHHVAFLEADVLQAERDHGAFPQALRRWLGSAGLPSFRNVECSGTIPARCNLRLPGSSDS
130 140 150 160 170 180
210
pF1KB9 FRTEDYFPCCSERPQL
NP_060 PASASQVAGITEVTCTGARDVRAAHTV
190 200 210
>>XP_016883421 (OMIM: 607471) PREDICTED: breast carcinom (139 aa)
initn: 193 init1: 180 opt: 199 Z-score: 225.4 bits: 48.2 E(85289): 8.1e-06
Smith-Waterman score: 199; 38.1% identity (71.4% similar) in 84 aa overlap (57-139:37-118)
30 40 50 60 70 80
pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL
: : .: .: : : : :: :.. .
XP_016 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV
10 20 30 40 50 60
90 100 110 120 130 140
pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV
. ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...::::
XP_016 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEILALPA
70 80 90 100 110 120
150 160 170 180 190 200
pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE
XP_016 FLSSWPLPPPSKPTE
130
>>NP_001010974 (OMIM: 607471) breast carcinoma-amplified (158 aa)
initn: 249 init1: 180 opt: 199 Z-score: 224.6 bits: 48.2 E(85289): 9.1e-06
Smith-Waterman score: 199; 38.1% identity (71.4% similar) in 84 aa overlap (57-139:37-118)
30 40 50 60 70 80
pF1KB9 SGDSGTVSQSHSSASGPWEDEGAEDGAPGRDLPL-LRRAAAGYAACLLPGAGARPEVEAL
: : .: .: : : : :: :.. .
NP_001 GAPRPGRNHGLPGSLRQPDPVALLMLLVDADQPEPMRSGARELALFLTPEPGA--EAKEV
10 20 30 40 50 60
90 100 110 120 130 140
pF1KB9 DASLEDLLTRVDEFVGMLDMLRGDSSHVVSEGVPRIHAKAAEMRRIYSRIDRLEAFVRMV
. ..: .: :..:: .. :..:.:.:... :..: ..:: .::: ::...::::
NP_001 EETIEGMLLRLEEFCSLADLIRSDTSQILEENIPVLKAKLTEMRGIYAKVDRLEKSPAPV
70 80 90 100 110 120
150 160 170 180 190 200
pF1KB9 GGRVARMEEQVTKAEAELGTFPRAFKKLLHTMNVPSLFSKSAPSRPQQAGYEAPVLFRTE
NP_001 PVTYELPTLYRTEDYFPVDAGEAQHHPRTCPRPL
130 140 150
217 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 18:59:04 2016 done: Fri Nov 4 18:59:05 2016
Total Scan time: 7.120 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]
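
As a rough consistency check on the table of best scores, the reported E(85289) values in this run appear to track the product of the number of library sequences, the query length, the library sequence length and 2 raised to the negative bit score. The Python sketch below evaluates that product for three of the hits. It is an observation about these particular numbers, under the assumption that this approximation applies, and not a description of how the program derives E() internally; the function name `approx_evalue` is hypothetical.

```python
def approx_evalue(bits, query_len, lib_len, n_seqs):
    """Approximate E-value as n_seqs * query_len * lib_len * 2**(-bits)."""
    return n_seqs * query_len * lib_len * 2.0 ** (-bits)


N_SEQS = 85289   # library sequences, from the report header
QUERY_LEN = 217  # query length in aa

# (bits, library sequence length, reported E-value) copied from the report.
reported = [
    (294.3, 217, 1e-79),
    (89.4, 203, 4.7e-18),
    (48.2, 139, 8.1e-06),
]

# Ratio of the approximation to the reported value for each hit;
# values near 1 indicate agreement within the rounding of the bit score.
ratios = [
    approx_evalue(bits, QUERY_LEN, lib_len, N_SEQS) / e_reported
    for bits, lib_len, e_reported in reported
]
```

Because the bit scores are printed to one decimal place, agreement to within a few percent is about the best this check can show.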