FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0014, 487 aa
1>>>pF1KE0014 487 - 487 aa - 487 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.9303+/-0.00129; mu= 19.7812+/- 0.077
mean_var=62.4075+/-12.976, 0's: 0 Z-trim(99.6): 36 B-trim: 269 in 1/46
Lambda= 0.162351
statistics sampled from 5760 (5782) to 5760 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.52), E-opt: 0.2 (0.178), width: 16
Scan time: 2.900
The best scores are: opt bits E(32554)
CCDS8572.1 LPCAT3 gene_id:10162|Hs108|chr12 ( 487) 3270 775.3 0
CCDS1660.1 MBOAT2 gene_id:129642|Hs108|chr2 ( 520) 249 67.7 3.3e-11
CCDS34346.1 MBOAT1 gene_id:154141|Hs108|chr6 ( 495) 244 66.6 7.2e-11
>>CCDS8572.1 LPCAT3 gene_id:10162|Hs108|chr12 (487 aa)
initn: 3270 init1: 3270 opt: 3270 Z-score: 4139.1 bits: 775.3 E(32554): 0
Smith-Waterman score: 3270; 100.0% identity (100.0% similar) in 487 aa overlap (1-487:1-487)
10 20 30 40 50 60
pF1KE0 MASSAEGDEGTVVALAGVLQSGFQELSLNKLATSLGASEQALRLIISIFLGYPFALFYRH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 MASSAEGDEGTVVALAGVLQSGFQELSLNKLATSLGASEQALRLIISIFLGYPFALFYRH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 YLFYKETYLIHLFHTFTGLSIAYFNFGNQLYHSLLCIVLQFLILRLMGRTITAVLTTFCF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 YLFYKETYLIHLFHTFTGLSIAYFNFGNQLYHSLLCIVLQFLILRLMGRTITAVLTTFCF
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 QMAYLLAGYYYTATGNYDIKWTMPHCVLTLKLIGLAVDYFDGGKDQNSLSSEQQKYAIRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 QMAYLLAGYYYTATGNYDIKWTMPHCVLTLKLIGLAVDYFDGGKDQNSLSSEQQKYAIRG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 VPSLLEVAGFSYFYGAFLVGPQFSMNHYMKLVQGELIDIPGKIPNSIIPALKRLSLGLFY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 VPSLLEVAGFSYFYGAFLVGPQFSMNHYMKLVQGELIDIPGKIPNSIIPALKRLSLGLFY
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 LVGYTLLSPHITEDYLLTEDYDNHPFWFRCMYMLIWGKFVLYKYVTCWLVTEGVCILTGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 LVGYTLLSPHITEDYLLTEDYDNHPFWFRCMYMLIWGKFVLYKYVTCWLVTEGVCILTGL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 GFNGFEEKGKAKWDACANMKVWLFETNPRFTGTIASFNINTNAWVARYIFKRLKFLGNKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 GFNGFEEKGKAKWDACANMKVWLFETNPRFTGTIASFNINTNAWVARYIFKRLKFLGNKE
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE0 LSQGLSLLFLALWHGLHSGYLVCFQMEFLIVIVERQAARLIQESPTLSKLAAITVLQPFY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 LSQGLSLLFLALWHGLHSGYLVCFQMEFLIVIVERQAARLIQESPTLSKLAAITVLQPFY
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE0 YLVQQTIHWLFMGYSMTAFCLFTWDKWLKVYKSIYFLGHIFFLSLLFILPYIHKAMVPRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS85 YLVQQTIHWLFMGYSMTAFCLFTWDKWLKVYKSIYFLGHIFFLSLLFILPYIHKAMVPRK
430 440 450 460 470 480
pF1KE0 EKLKKME
:::::::
CCDS85 EKLKKME
>>CCDS1660.1 MBOAT2 gene_id:129642|Hs108|chr2 (520 aa)
initn: 320 init1: 177 opt: 249 Z-score: 314.6 bits: 67.7 E(32554): 3.3e-11
Smith-Waterman score: 498; 24.7% identity (55.0% similar) in 469 aa overlap (28-470:12-464)
10 20 30 40 50 60
pF1KE0 MASSAEGDEGTVVALAGVLQSGFQELSLNKLATSLGASEQALRLIISIFLGYPFALFYRH
:. :.... . . ... ... :...:
CCDS16 MATTSTTGSTLLQPLSNAVQLPIDQVNFVVCQLFALLAAIWFRT
10 20 30 40
70 80 90 100 110
pF1KE0 YLFYKET--YLIHLFHTFTGLSIAYFNFGNQLYHSLLCIVLQFLILRLMGRTITAVLTTF
:: ..: .. :. :. :: .: : :: : :. ... :. ..: . . ..
CCDS16 YLHSSKTSSFIRHVVATLLGLYLALFCFGWYALHFLVQSGISYCIMIIIG---VENMHNY
50 60 70 80 90 100
120 130 140 150 160 170
pF1KE0 CFQMA--YL----LAGYYYTATGNYDIKWTMPHCVLTLKLIGLAVDYFDGG--KDQNSLS
:: .: :: .. : :.:. .. : ..: :. .:: . :: ::.. :.
CCDS16 CFVFALGYLTVCQVTRVYIFDYGQYSADFSGPMMIITQKITSLACEIHDGMFRKDEE-LT
110 120 130 140 150 160
180 190 200 210 220
pF1KE0 SEQQKYAIRGVPSLLEVAGFSYFYGAFLVGPQFSMNHYMKLVQGELIDIP-----GK---
: :. :.: .::::: ... . ..:.:: :.. :. ...:. : ::
CCDS16 SSQRDLAVRRMPSLLEYLSYNCNFMGILAGPLCSYKDYITFIEGRSYHITQSGENGKEET
170 180 190 200 210 220
230 240 250 260 270
pF1KE0 -------IPNSIIPALKRLSLGLFYLVGYTLLSPHITEDYLLTEDYDNHPFW-FRCMYML
::. . . : : :: : :. . . .: . : .. : . .:.
CCDS16 QYERTEPSPNTAV-VQKLLVCGLSLLFHLTICTT-LPVEYNIDEHFQATASWPTKIIYLY
230 240 250 260 270
280 290 300 310 320 330
pF1KE0 IWGKFVLYKYVTCWLVTEGVCILTGLGFNGFEEKGKAKWDACANMKVWLFETNPRFTGTI
: . :: : ..... .:.:: :..:.: :.:: .:... .: . : .
CCDS16 ISLLAARPKYYFAWTLADAINNAAGFGFRGYDENGAARWDLISNLRIQQIEMSTSFKMFL
280 290 300 310 320 330
340 350 360 370 380 390
pF1KE0 ASFNINTNAWVARYIFKRLKFLGNKELSQGLSLLFLALWHGLHSGYLVCFQMEFLIVIVE
..::.: :. : ..: .: . . .... :.:::.. :: . :: ..
CCDS16 DNWNIQTALWLKRVCYERTSFSPTIQ-----TFILSAIWHGVYPGYY----LTFLTGVLM
340 350 360 370 380
400 410 420 430 440 450
pF1KE0 RQAARLIQESPTLSKLAAITVLQPFYYLVQQTIHWLFMGYSMTAFCLFTWDKWLKVYKSI
::: .... . . :. :: .. . . ..:... : :.. : :.:
CCDS16 TLAARAMRNN-FRHYFIEPSQLKLFYDVITWIVTQVAISYTVVPFVLLSIKPSLTFYSSW
390 400 410 420 430 440
460 470 480
pF1KE0 YFLGHIFFLSLLFILPYIHKAMVPRKEKLKKME
:. ::. . .:..::
CCDS16 YYCLHILGILVLLLLPVKKTQRRKNTHENIQLSQSKKFDEGENSLGQNSFSTTNNVCNQN
450 460 470 480 490 500
>>CCDS34346.1 MBOAT1 gene_id:154141|Hs108|chr6 (495 aa)
initn: 329 init1: 154 opt: 244 Z-score: 308.6 bits: 66.6 E(32554): 7.2e-11
Smith-Waterman score: 460; 23.6% identity (54.8% similar) in 478 aa overlap (28-482:20-484)
10 20 30 40 50 60
pF1KE0 MASSAEGDEGTVVALAGVLQSGFQELSLNKLATSLGASEQALRLIISIFLGYPFALFYRH
:. :. :: . . ... ... :...:
CCDS34 MAAEPQPSSLSYRTTGSTYLHPLSELLGIPLDQVNFVVCQLVALFAAFWFRI
10 20 30 40 50
70 80 90 100 110
pF1KE0 YLFYKETY--LIHLFHTFTGLSIAYFNFGNQLYHSLLCIVLQFLILRLMGRTITAVLT-T
:: : . : :. :. .. : :: : .. ... . :. . ... . .
CCDS34 YLRPGTTSSDVRHAVATIFGIYFVIFCFGWYSVHLFVLVLMCYAIM--VTASVSNIHRYS
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE0 FCFQMAYL----LAGYYYTATGNYDIKWTMPHCVLTLKLIGLAVDYFDG-GKDQNSLSSE
: :.:: .. : : .. : ..: :. :: . :: :. ..::.:
CCDS34 FFVAMGYLTICHISRIYIFHYGILTTDFSGPLMIVTQKITTLAFQVHDGLGRRAEDLSAE
120 130 140 150 160 170
180 190 200 210 220
pF1KE0 QQKYAIRGVPSLLEVAGFSYFYGAFLVGPQFSMNHYMKLVQGE-----LIDIPGKI----
:.. ::. ::.:: .. . . ..:: ... :. ...:. :... :
CCDS34 QHRLAIKVKPSFLEYLSYLLNFMSVIAGPCNNFKDYIAFIEGKHIHMKLLEVNWKRKGFH
180 190 200 210 220 230
230 240 250 260 270
pF1KE0 ----PNSIIPALKRLSLGLFYLVGYTLLSPHITEDYLLTEDYDNHPFWF--RCMYMLIWG
:. ....:.. : :. . :. . :. .:. : : : :. .
CCDS34 SLPEPSPTGAVIHKLGITLVSLLLFLTLTKTFPVTCLV-DDWFVHKASFPARLCYLYVVM
240 250 260 270 280
280 290 300 310 320 330
pF1KE0 KFVLYKYVTCWLVTEGVCILTGLGFNGFEEKGKAKWDACANMKVWLFETNPRFTGTIASF
. :: : ....: .:.::.: ...:. :: .:...: .:: : . ..
CCDS34 QASKPKYYFAWTLADAVNNAAGFGFSGVDKNGNFCWDLLSNLNIWKIETATSFKMYLENW
290 300 310 320 330 340
340 350 360 370 380 390
pF1KE0 NINTNAWVARYIFKRLKFLGNKELSQGLSLLFLALWHGLHSGYLVCFQMEFLIVIVERQA
::.: .:. ..:. . . :.... :::::.. :: : .:.... : :
CCDS34 NIQTATWLKCVCYQRVPWYPTV-----LTFILSALWHGVYPGYYFTFLTGILVTLAAR-A
350 360 370 380 390 400
400 410 420 430 440 450
pF1KE0 ARLIQESPTLSKLAAITVLQPFYYLVQQTIHWLFMGYSMTAFCLFTWDKWLKVYKSIYFL
.: . ::. : .: . . : : : ..:... : ... . ...:::.::
CCDS34 VRNNYRHYFLSSRALKAVYDAGTWAVTQ----LAVSYTVAPFVMLAVEPTISLYKSMYFY
410 420 430 440 450
460 470 480
pF1KE0 GHIFFLSLLFILPYIHKAMVPRKEKLKKME
::. : ....::. .: . :. .
CCDS34 LHIISLLIILFLPMKPQAHTQRRPQTLNSINKRKTD
460 470 480 490
487 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 08:23:09 2016 done: Fri Nov 4 08:23:10 2016
Total Scan time: 2.900 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]