FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8453, 321 aa
1>>>pF1KB8453 321 - 321 aa - 321 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.8350+/-0.000294; mu= 18.8099+/- 0.018
mean_var=66.2081+/-13.060, 0's: 0 Z-trim(117.9): 23 B-trim: 0 in 0/54
Lambda= 0.157623
statistics sampled from 30223 (30244) to 30223 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.728), E-opt: 0.2 (0.355), width: 16
Scan time: 6.630
The best scores are: opt bits E(85289)
NP_002396 (OMIM: 602577) beta-1,3-N-acetylglucosam ( 321) 2237 517.1 2e-146
NP_001159815 (OMIM: 602577) beta-1,3-N-acetylgluco ( 307) 1541 358.8 8.4e-99
NP_001035257 (OMIM: 602576,609813) beta-1,3-N-acet ( 379) 1241 290.6 3.4e-78
NP_002908 (OMIM: 602578) beta-1,3-N-acetylglucosam ( 331) 1129 265.1 1.4e-70
NP_001035258 (OMIM: 602576,609813) beta-1,3-N-acet ( 361) 1115 262.0 1.4e-69
NP_001159827 (OMIM: 602576,609813) beta-1,3-N-acet ( 308) 1079 253.7 3.6e-67
NP_002295 (OMIM: 602576,609813) beta-1,3-N-acetylg ( 250) 1072 252.0 9.2e-67
XP_011521889 (OMIM: 602578) PREDICTED: beta-1,3-N- ( 205) 833 197.6 1.8e-50
>>NP_002396 (OMIM: 602577) beta-1,3-N-acetylglucosaminyl (321 aa)
initn: 2237 init1: 2237 opt: 2237 Z-score: 2749.9 bits: 517.1 E(85289): 2e-146
Smith-Waterman score: 2237; 100.0% identity (100.0% similar) in 321 aa overlap (1-321:1-321)
               10        20        30        40        50        60
pF1KB8 MQCRLPRGLAGALLTLLCMGLLCLRYHLNLSPQRVQGTPELSQPNPGPPKLQLHDVFIAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MQCRLPRGLAGALLTLLCMGLLCLRYHLNLSPQRVQGTPELSQPNPGPPKLQLHDVFIAV
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB8 KTTRAFHRLRLELLLDTWVSRTREQTFVFTDSPDKGLQERLGSHLVVTNCSAEHSHPALS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 KTTRAFHRLRLELLLDTWVSRTREQTFVFTDSPDKGLQERLGSHLVVTNCSAEHSHPALS
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB8 CKMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLNRPIHASE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 CKMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLNRPIHASE
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB8 PQPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDDCTMGYIIE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 PQPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDDCTMGYIIE
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB8 CKLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPFSPEEDPSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 CKLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPFSPEEDPSR
              250       260       270       280       290       300
              310       320
pF1KB8 FRSLHCLLYPDTPWCPQLGAR
       :::::::::::::::::::::
NP_002 FRSLHCLLYPDTPWCPQLGAR
              310       320
>>NP_001159815 (OMIM: 602577) beta-1,3-N-acetylglucosami (307 aa)
initn: 2109 init1: 1541 opt: 1541 Z-score: 1894.8 bits: 358.8 E(85289): 8.4e-99
Smith-Waterman score: 2081; 95.0% identity (95.3% similar) in 321 aa overlap (1-321:1-307)
               10        20        30        40        50        60
pF1KB8 MQCRLPRGLAGALLTLLCMGLLCLRYHLNLSPQRVQGTPELSQPNPGPPKLQLHDVFIAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MQCRLPRGLAGALLTLLCMGLLCLRYHLNLSPQRVQGTPELSQPNPGPPKLQLHDVFIAV
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB8 KTTRAFHRLRLELLLDTWVSRTREQTFVFTDSPDKGLQERLGSHLVVTNCSAEHSHPALS
       :::::::::::::::::::::::::.             :  ::::::::::::::::::
NP_001 KTTRAFHRLRLELLLDTWVSRTREQV------------TR--SHLVVTNCSAEHSHPALS
               70        80                      90       100
              130       140       150       160       170       180
pF1KB8 CKMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLNRPIHASE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CKMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLNRPIHASE
        110       120       130       140       150       160
              190       200       210       220       230       240
pF1KB8 PQPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDDCTMGYIIE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 PQPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDDCTMGYIIE
        170       180       190       200       210       220
              250       260       270       280       290       300
pF1KB8 CKLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPFSPEEDPSR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 CKLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPFSPEEDPSR
        230       240       250       260       270       280
              310       320
pF1KB8 FRSLHCLLYPDTPWCPQLGAR
       :::::::::::::::::::::
NP_001 FRSLHCLLYPDTPWCPQLGAR
        290       300
>>NP_001035257 (OMIM: 602576,609813) beta-1,3-N-acetylgl (379 aa)
initn: 1240 init1: 1014 opt: 1241 Z-score: 1524.8 bits: 290.6 E(85289): 3.4e-78
Smith-Waterman score: 1241; 62.6% identity (82.2% similar) in 286 aa overlap (32-317:91-375)
10 20 30 40 50 60
pF1KB8 QCRLPRGLAGALLTLLCMGLLCLRYHLNLSPQRVQGTPELSQPNPGPPKLQLHDVFIAVK
: . : ..: : : .:::::::
NP_001 AAAPGALVRDVHSLSEYFSLLTRARRDAGPPPGAAPRPADGHPRPLAEPLAPRDVFIAVK
70 80 90 100 110 120
70 80 90 100 110 120
pF1KB8 TTRAFHRLRLELLLDTWVSRTREQTFVFTDSPDKGLQERLGSHLVVTNCSAEHSHPALSC
::. ::: ::.:::.::.:: .:.::.:::. :..: .. :. .:.::::: ::. ::::
NP_001 TTKKFHRARLDLLLETWISRHKEMTFIFTDGEDEALARHTGN-VVITNCSAAHSRQALSC
130 140 150 160 170
130 140 150 160 170 180
pF1KB8 KMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLNRPIHASEP
:::.:.: :. :: .:::::::::::: ::::.:: ..: .::::::.:::.:::.: :
NP_001 KMAVEYDRFIESGRKWFCHVDDDNYVNLRALLRLLASYPHTRDVYVGKPSLDRPIQAMER
180 190 200 210 220 230
190 200 210 220 230 240
pF1KB8 QPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDDCTMGYIIEC
.:..: :.::::::::::::.: :::::.:::::..::.:. ::::::::.:::.:
NP_001 VSENKVRPVHFWFATGGAGFCISRGLALKMSPWASGGHFMNTAERIRLPDDCTIGYIVEA
240 250 260 270 280 290
250 260 270 280 290 300
pF1KB8 KLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPFSPEEDPSRF
:: : : :::::::.:: . :..: ::::::::.::.: :.....:::: : :::::
NP_001 LLGVPLIRSGLFHSHLENLQQVPTSELHEQVTLSYGMFENKRNAVHVKGPFSVEADPSRF
300 310 320 330 340 350
310 320
pF1KB8 RSLHCLLYPDTPWCPQLGAR
::.:: :::::::::.
NP_001 RSIHCHLYPDTPWCPRTAIF
360 370
>>NP_002908 (OMIM: 602578) beta-1,3-N-acetylglucosaminyl (331 aa)
initn: 1102 init1: 568 opt: 1129 Z-score: 1388.0 bits: 265.1 E(85289): 1.4e-70
Smith-Waterman score: 1129; 55.2% identity (76.0% similar) in 317 aa overlap (3-317:9-322)
10 20 30 40 50
pF1KB8 MQCRLPRGLAGALLTLLCMGLLCLRYHLNLSPQRVQGTPELSQPN-PGPPKLQL
:: .::.:: .:: :: : .: :. . . :. :. :.:.
NP_002 MSRARGALCRACLALAAALAALL---LLPLPLPRAPAPARTPAPAPRAPPSRPAAPSLRP
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 HDVFIAVKTTRAFHRLRLELLLDTWVSRTREQTFVFTDSPDKGLQERLGSHLVVTNCSAE
:::::::::: : ::.::: ::.::.:.:::.:::. : :. . :.... :::::
NP_002 DDVFIAVKTTRKNHGPRLRLLLRTWISRARQQTFIFTDGDDPELELQGGDRVINTNCSAV
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB8 HSHPALSCKMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLN
... :: :::..:.: :. :: .:::::::::::: :.::.:: .: ..:::.:::::.
NP_002 RTRQALCCKMSVEYDKFIESGRKWFCHVDDDNYVNARSLLHLLSSFSPSQDVYLGRPSLD
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB8 RPIHASEPQPHNRT-RLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDD
.::.:.: .:: :.:::::::::::..: :::::.:::: . ::.:. .:::::
NP_002 HPIEATERVQGGRTVTTVKFWFATGGAGFCLSRGLALKMSPWASLGSFMSTAEQVRLPDD
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB8 CTMGYIIECKLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPF
::.:::.: ::.:: :::::::::.:: : : .:::::.: :. ::... : :
NP_002 CTVGYIVEGLLGARLLHSPLFHSHLENLQRLPPDTLLQQVTLSHGGPENPHNVVNVAGGF
240 250 260 270 280 290
300 310 320
pF1KB8 SPEEDPSRFRSLHCLLYPDTPWCPQLGAR
: ..::.::.:.:::::::: :::.
NP_002 SLHQDPTRFKSIHCLLYPDTDWCPRQKQGAPTSR
300 310 320 330
>>NP_001035258 (OMIM: 602576,609813) beta-1,3-N-acetylgl (361 aa)
initn: 1114 init1: 888 opt: 1115 Z-score: 1370.3 bits: 262.0 E(85289): 1.4e-69
Smith-Waterman score: 1115; 61.1% identity (81.5% similar) in 270 aa overlap (32-301:91-359)
10 20 30 40 50 60
pF1KB8 QCRLPRGLAGALLTLLCMGLLCLRYHLNLSPQRVQGTPELSQPNPGPPKLQLHDVFIAVK
: . : ..: : : .:::::::
NP_001 AAAPGALVRDVHSLSEYFSLLTRARRDAGPPPGAAPRPADGHPRPLAEPLAPRDVFIAVK
70 80 90 100 110 120
70 80 90 100 110 120
pF1KB8 TTRAFHRLRLELLLDTWVSRTREQTFVFTDSPDKGLQERLGSHLVVTNCSAEHSHPALSC
::. ::: ::.:::.::.:: .:.::.:::. :..: .. :. .:.::::: ::. ::::
NP_001 TTKKFHRARLDLLLETWISRHKEMTFIFTDGEDEALARHTGN-VVITNCSAAHSRQALSC
130 140 150 160 170
130 140 150 160 170 180
pF1KB8 KMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLNRPIHASEP
:::.:.: :. :: .:::::::::::: ::::.:: ..: .::::::.:::.:::.: :
NP_001 KMAVEYDRFIESGRKWFCHVDDDNYVNLRALLRLLASYPHTRDVYVGKPSLDRPIQAMER
180 190 200 210 220 230
190 200 210 220 230 240
pF1KB8 QPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDDCTMGYIIEC
.:..: :.::::::::::::.: :::::.:::::..::.:. ::::::::.:::.:
NP_001 VSENKVRPVHFWFATGGAGFCISRGLALKMSPWASGGHFMNTAERIRLPDDCTIGYIVEA
240 250 260 270 280 290
250 260 270 280 290 300
pF1KB8 KLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPFSPEEDPSRF
:: : : :::::::.:: . :..: ::::::::.::.: :.....:::: : ::::.
NP_001 LLGVPLIRSGLFHSHLENLQQVPTSELHEQVTLSYGMFENKRNAVHVKGPFSVEADPSRW
300 310 320 330 340 350
310 320
pF1KB8 RSLHCLLYPDTPWCPQLGAR
NP_001 GN
360
>>NP_001159827 (OMIM: 602576,609813) beta-1,3-N-acetylgl (308 aa)
initn: 1071 init1: 1014 opt: 1079 Z-score: 1327.0 bits: 253.7 E(85289): 3.6e-67
Smith-Waterman score: 1079; 65.4% identity (85.5% similar) in 234 aa overlap (84-317:72-304)
60 70 80 90 100 110
pF1KB8 HDVFIAVKTTRAFHRLRLELLLDTWVSRTREQTFVFTDSPDKGLQERLGSHLVVTNCSAE
.:::.:::. :..: .. :. .:.:::::
NP_001 TDRWTDGWMDGWMDEWSPTPALRSYGGGLSQQTFIFTDGEDEALARHTGN-VVITNCSAA
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB8 HSHPALSCKMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLN
::. :::::::.:.: :. :: .:::::::::::: ::::.:: ..: .::::::.:::.
NP_001 HSRQALSCKMAVEYDRFIESGRKWFCHVDDDNYVNLRALLRLLASYPHTRDVYVGKPSLD
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB8 RPIHASEPQPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDDC
:::.: : .:..: :.::::::::::::.: :::::.:::::..::.:. :::::::
NP_001 RPIQAMERVSENKVRPVHFWFATGGAGFCISRGLALKMSPWASGGHFMNTAERIRLPDDC
170 180 190 200 210 220
240 250 260 270 280 290
pF1KB8 TMGYIIECKLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPFS
:.:::.: :: : : :::::::.:: . :..: ::::::::.::.: :.....::::
NP_001 TIGYIVEALLGVPLIRSGLFHSHLENLQQVPTSELHEQVTLSYGMFENKRNAVHVKGPFS
230 240 250 260 270 280
300 310 320
pF1KB8 PEEDPSRFRSLHCLLYPDTPWCPQLGAR
: :::::::.:: :::::::::.
NP_001 VEADPSRFRSIHCHLYPDTPWCPRTAIF
290 300
>>NP_002295 (OMIM: 602576,609813) beta-1,3-N-acetylgluco (250 aa)
initn: 1064 init1: 1014 opt: 1072 Z-score: 1319.6 bits: 252.0 E(85289): 9.2e-67
Smith-Waterman score: 1072; 65.2% identity (85.4% similar) in 233 aa overlap (85-317:15-246)
60 70 80 90 100 110
pF1KB8 DVFIAVKTTRAFHRLRLELLLDTWVSRTREQTFVFTDSPDKGLQERLGSHLVVTNCSAEH
.::.:::. :..: .. :. .:.::::: :
NP_002 MTPGRCCLAADIQVETFIFTDGEDEALARHTGN-VVITNCSAAH
10 20 30 40
120 130 140 150 160 170
pF1KB8 SHPALSCKMAAEFDTFLASGLRWFCHVDDDNYVNPRALLQLLRAFPLARDVYVGRPSLNR
:. :::::::.:.: :. :: .:::::::::::: ::::.:: ..: .::::::.:::.:
NP_002 SRQALSCKMAVEYDRFIESGRKWFCHVDDDNYVNLRALLRLLASYPHTRDVYVGKPSLDR
50 60 70 80 90 100
180 190 200 210 220 230
pF1KB8 PIHASEPQPHNRTRLVQFWFATGGAGFCINRKLALKMAPWASGSRFMDTSALIRLPDDCT
::.: : .:..: :.::::::::::::.: :::::.:::::..::.:. ::::::::
NP_002 PIQAMERVSENKVRPVHFWFATGGAGFCISRGLALKMSPWASGGHFMNTAERIRLPDDCT
110 120 130 140 150 160
240 250 260 270 280 290
pF1KB8 MGYIIECKLGGRLQPSPLFHSHLETLQLLRTAQLPEQVTLSYGVFEGKLNVIKLQGPFSP
.:::.: :: : : :::::::.:: . :..: ::::::::.::.: :.....::::
NP_002 IGYIVEALLGVPLIRSGLFHSHLENLQQVPTSELHEQVTLSYGMFENKRNAVHVKGPFSV
170 180 190 200 210 220
300 310 320
pF1KB8 EEDPSRFRSLHCLLYPDTPWCPQLGAR
: :::::::.:: :::::::::.
NP_002 EADPSRFRSIHCHLYPDTPWCPRTAIF
230 240 250
>>XP_011521889 (OMIM: 602578) PREDICTED: beta-1,3-N-acet (205 aa)
initn: 826 init1: 568 opt: 833 Z-score: 1027.1 bits: 197.6 E(85289): 1.8e-50
Smith-Waterman score: 833; 60.7% identity (81.1% similar) in 196 aa overlap (123-317:1-196)
100 110 120 130 140 150
pF1KB8 PDKGLQERLGSHLVVTNCSAEHSHPALSCKMAAEFDTFLASGLRWFCHVDDDNYVNPRAL
:..:.: :. :: .:::::::::::: :.:
XP_011 MSVEYDKFIESGRKWFCHVDDDNYVNARSL
10 20 30
160 170 180 190 200 210
pF1KB8 LQLLRAFPLARDVYVGRPSLNRPIHASEPQPHNRT-RLVQFWFATGGAGFCINRKLALKM
:.:: .: ..:::.:::::..::.:.: .:: :.:::::::::::..: :::::
XP_011 LHLLSSFSPSQDVYLGRPSLDHPIEATERVQGGRTVTTVKFWFATGGAGFCLSRGLALKM
40 50 60 70 80 90
220 230 240 250 260 270
pF1KB8 APWASGSRFMDTSALIRLPDDCTMGYIIECKLGGRLQPSPLFHSHLETLQLLRTAQLPEQ
.:::: . ::.:. .:::::::.:::.: ::.:: :::::::::.:: : : .:
XP_011 SPWASLGSFMSTAEQVRLPDDCTVGYIVEGLLGARLLHSPLFHSHLENLQRLPPDTLLQQ
100 110 120 130 140 150
280 290 300 310 320
pF1KB8 VTLSYGVFEGKLNVIKLQGPFSPEEDPSRFRSLHCLLYPDTPWCPQLGAR
::::.: :. ::... : :: ..::.::.:.:::::::: :::.
XP_011 VTLSHGGPENPHNVVNVAGGFSLHQDPTRFKSIHCLLYPDTDWCPRQKQGAPTSR
160 170 180 190 200
321 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 13:01:38 2016 done: Fri Nov 4 13:01:39 2016
Total Scan time: 6.630 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]