FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2345, 359 aa
1>>>pF1KE2345 359 - 359 aa - 359 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3196+/-0.000692; mu= 16.5529+/- 0.042
mean_var=70.7303+/-13.971, 0's: 0 Z-trim(110.3): 17 B-trim: 0 in 0/51
Lambda= 0.152501
statistics sampled from 11481 (11492) to 11481 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.724), E-opt: 0.2 (0.353), width: 16
Scan time: 2.290
The best scores are: opt bits E(32554)
CCDS12152.1 FUT6 gene_id:2528|Hs108|chr19 ( 359) 2588 578.2 3.8e-165
CCDS12153.1 FUT3 gene_id:2525|Hs108|chr19 ( 361) 2141 479.8 1.5e-135
CCDS12154.1 FUT5 gene_id:2527|Hs108|chr19 ( 374) 2022 453.7 1.2e-127
CCDS7022.1 FUT7 gene_id:2529|Hs108|chr9 ( 342) 935 214.5 1.1e-55
CCDS5033.1 FUT9 gene_id:10690|Hs108|chr6 ( 359) 871 200.4 2e-51
CCDS8301.1 FUT4 gene_id:2526|Hs108|chr11 ( 530) 850 195.9 6.6e-50
>>CCDS12152.1 FUT6 gene_id:2528|Hs108|chr19 (359 aa)
initn: 2588 init1: 2588 opt: 2588 Z-score: 3078.5 bits: 578.2 E(32554): 3.8e-165
Smith-Waterman score: 2588; 100.0% identity (100.0% similar) in 359 aa overlap (1-359:1-359)
10 20 30 40 50 60
pF1KE2 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 IPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 IPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 PRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAH
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 PPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMMETLSRY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMMETLSRY
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 KFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 KFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKD
250 260 270 280 290 300
310 320 330 340 350
pF1KE2 LARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEESRYQTRGIAAWFT
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEESRYQTRGIAAWFT
310 320 330 340 350
>>CCDS12153.1 FUT3 gene_id:2525|Hs108|chr19 (361 aa)
initn: 2095 init1: 1863 opt: 2141 Z-score: 2546.9 bits: 479.8 E(32554): 1.5e-135
Smith-Waterman score: 2141; 84.0% identity (91.2% similar) in 363 aa overlap (1-359:1-361)
10 20 30 40 50
pF1KE2 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPT---VYPNGSRFPDSTGTP
::::: ::::: :: ::..::::::.:::::::::::.:: : :.:: :. ::
CCDS12 MDPLGAAKPQWPWRRCLAALLFQLLVAVCFFSYLRVSRDDATGSPRAPSGSSRQDT--TP
10 20 30 40 50
60 70 80 90 100 110
pF1KE2 AHSIPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPS
.. :::: ::::. :.:: ::::::::::::.:::::::::::: ::::: ..: ::.
CCDS12 TRPTLLILLRTWPFHIPVALSRCSEMVPGTADCHITADRKVYPQADMVIVHHWDIMSNPK
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE2 AQLPRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQ
..:: ::: ::::::::..: : .: .:.:.: :::::::::::::::::::::::::::
CCDS12 SRLPPSPRPQGQRWIWFNLEPPPNCQHLEALDRYFNLTMSYRSDSDIFTPYGWLEPWSGQ
120 130 140 150 160 170
180 190 200 210 220 230
pF1KE2 PAHPPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMMETL
:::::::::::::::::::::: :.:::::::::::::::::::::::::::.:::::::
CCDS12 PAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHKPLPKGTMMETL
180 190 200 210 220 230
240 250 260 270 280 290
pF1KE2 SRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQS
240 250 260 270 280 290
300 310 320 330 340 350
pF1KE2 PKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEESRYQT-RGIAA
::::::::::::::::::::::::::::::::::::: :::::::::.:::::: :.:::
CCDS12 PKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALDFCKACWKLQQESRYQTVRSIAA
300 310 320 330 340 350
pF1KE2 WFT
:::
CCDS12 WFT
360
>>CCDS12154.1 FUT5 gene_id:2527|Hs108|chr19 (374 aa)
initn: 1984 init1: 1984 opt: 2022 Z-score: 2405.2 bits: 453.7 E(32554): 1.2e-127
Smith-Waterman score: 2232; 85.3% identity (90.9% similar) in 374 aa overlap (1-359:1-374)
10 20 30 40
pF1KE2 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDD------PTVY--------PN
::::::::::: :: ::. ::::::.:::::::::::.:: : .. ::
CCDS12 MDPLGPAKPQWLWRRCLAGLLFQLLVAVCFFSYLRVSRDDATGSPRPGLMAVEPVTGAPN
10 20 30 40 50 60
50 60 70 80 90 100
pF1KE2 GSRFPDSTGTPAHSIPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVI
::: :: .:::: :::::::::: :.:::::::::::.:::::::: .:::::::::
CCDS12 GSRCQDSMATPAHPTLLILLWTWPFNTPVALPRCSEMVPGAADCNITADSSVYPQADAVI
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE2 VHHREVMYNPSAQLPRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFT
::: ..::::::.:: : ::::::::::::::.: .:.:.::::::::::::::::::
CCDS12 VHHWDIMYNPSANLPPPTRPQGQRWIWFSMESPSNCRHLEALDGYFNLTMSYRSDSDIFT
130 140 150 160 170 180
170 180 190 200 210 220
pF1KE2 PYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHK
::::::::::::::::::::::::::::::::: :.::::::::::::::::::::::::
CCDS12 PYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHK
190 200 210 220 230 240
230 240 250 260 270 280
pF1KE2 PLPQGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPP
:::.::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PLPKGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPP
250 260 270 280 290 300
290 300 310 320 330 340
pF1KE2 DAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEE
:::::::::::::::::::::::::::::::::.::::::::::::::::::::::::.:
CCDS12 DAFIHVDDFQSPKDLARYLQELDKDHARYLSYFHWRETLRPRSFSWALAFCKACWKLQQE
310 320 330 340 350 360
350
pF1KE2 SRYQT-RGIAAWFT
::::: :.::::::
CCDS12 SRYQTVRSIAAWFT
370
>>CCDS7022.1 FUT7 gene_id:2529|Hs108|chr9 (342 aa)
initn: 635 init1: 347 opt: 935 Z-score: 1113.3 bits: 214.5 E(32554): 1.1e-55
Smith-Waterman score: 935; 47.6% identity (70.6% similar) in 309 aa overlap (55-358:39-340)
30 40 50 60 70 80
pF1KE2 LMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHSIPL-ILLWTWPF-NKPIALPRCSE
:::: . . ::.: ::: ..: :: .
CCDS70 TRRLRGLGVLAGVALLAALWLLWLLGSAPRGTPAPQPTITILVWHWPFTDQPPELPSDTC
10 20 30 40 50 60
90 100 110 120 130 140
pF1KE2 MVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQLPRSPRRQGQRWIWFSMESPSHC
: : :...:.:.. .::::. ::::.. : .:: . : .:: :.: :::::::
CCDS70 TRYGIARCHLSANRSLLASADAVVFHHRELQTRRS-HLPLAQRPRGQPWVWASMESPSHT
70 80 90 100 110 120
150 160 170 180 190 200
pF1KE2 WQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWGPN
:. . : :: ..::: :::::.::: ::: : :. :: : ::....::.:::.
CCDS70 HGLSHLRGIFNWVLSYRRDSDIFVPYGRLEPHWG-PS-PP--LPAKSRVAAWVVSNFQER
130 140 150 160 170 180
210 220 230 240 250 260
pF1KE2 SARVRYYQSLQAHLKVDVYGRSH-KPLPQGTMMETLSRYKFYLAFENSLHPDYITEKLWR
. :.: :..: ::.:::.::.. .:: . .. :...:.:::.:::: : ::::::.::
CCDS70 QLRARLYRQLAPHLRVDVFGRANGRPLCASCLVPTVAQYRFYLSFENSQHRDYITEKFWR
190 200 210 220 230 240
270 280 290 300 310 320
pF1KE2 NALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRW
::: : .::::::: :..:: :.: :::.::::: : ..:: .: ... .:: .: :
CCDS70 NALVAGTVPVVLGPPRATYEAFVPADAFVHVDDFGSARELAAFLTGMNE--SRYQRFFAW
250 260 270 280 290 300
330 340 350
pF1KE2 RETLRPRSFS-WALAFCKACWKLQEESRYQT-RGIAAWFT
:. :: : :. : :: : . . : :. . . .::
CCDS70 RDRLRVRLFTDWRERFCAICDRYPHLPRSQVYEDLEGWFQA
310 320 330 340
>>CCDS5033.1 FUT9 gene_id:10690|Hs108|chr6 (359 aa)
initn: 772 init1: 353 opt: 871 Z-score: 1036.9 bits: 200.4 E(32554): 2e-51
Smith-Waterman score: 871; 43.1% identity (73.0% similar) in 311 aa overlap (53-358:55-357)
30 40 50 60 70 80
pF1KE2 QLLMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHSIPLILLWTWPFNKPIALPRCSE
:: : . ::.:.:::.. . : :.
CCDS50 CLLIYIKPTNSWIFSPMESASSVLKMKNFFSTKTDYFNETTILVWVWPFGQTFDLTSCQA
30 40 50 60 70 80
90 100 110 120 130 140
pF1KE2 MVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQLPRSPRRQGQRWIWFSMESPSHC
: . :..:.::..: .. ::..:::.. .. . .::.. : :.:::...:::.:
CCDS50 MF-NIQGCHLTTDRSLYNKSHAVLIHHRDISWDLT-NLPQQARPPFQKWIWMNLESPTHT
90 100 110 120 130 140
150 160 170 180 190 200
pF1KE2 WQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWGPN
: .... ::::..:: :::: .:::.: : .: ... .: .:: :.::::.:.
CCDS50 PQKSGIEHLFNLTLTYRRDSDIQVPYGFLTV-STNPF--VFEVPSKEKLVCWVVSNWNPE
150 160 170 180 190
210 220 230 240 250 260
pF1KE2 SARVRYYQSLQAHLKVDVYGRSH-KPLPQGTMMETLSRYKFYLAFENSLHPDYITEKLWR
:::.::. :. ... .::.. . . . ... :.: ::::.::::.: :::::::.
CCDS50 HARVKYYNELSKSIEIHTYGQAFGEYVNDKNLIPTISTCKFYLSFENSIHKDYITEKLY-
200 210 220 230 240 250
270 280 290 300 310 320
pF1KE2 NALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRW
::. : .::::::::: ::: ..: :.::::.:..::..::.::.:.::.. ::::: :
CCDS50 NAFLAGSVPVVLGPSRENYENYIPADSFIHVEDYNSPSELAKYLKEVDKNNKLYLSYFNW
260 270 280 290 300 310
330 340 350
pF1KE2 RETLR---PRSFSWALAFCKACWKLQEESRYQTRG-IAAWFT
:. . :: : : : :: .......:.. : . ::
CCDS50 RKDFTVNLPR-F-WESHACLACDHVKRHQEYKSVGNLEKWFWN
320 330 340 350
>>CCDS8301.1 FUT4 gene_id:2526|Hs108|chr11 (530 aa)
initn: 843 init1: 401 opt: 850 Z-score: 1009.4 bits: 195.9 E(32554): 6.6e-50
Smith-Waterman score: 898; 43.5% identity (64.9% similar) in 359 aa overlap (47-358:173-528)
20 30 40 50 60 70
pF1KE2 LTTLLFQLLMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHSIPL-ILLWTWPF----
:. : ..:. : :. .::: ::
CCDS83 WRRGRGLPWTVCVLAAAGLTCTALITYACWGQLPPLPWASPTPSRPVGVLLWWEPFGGRD
150 160 170 180 190 200
80 90 100 110 120
pF1KE2 NKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQLP----------
. : : : .. . . : . .:: : .:.::. :::... .: :
CCDS83 SAPRPPPDC-RLRFNISGCRLLTDRASYGEAQAVLFHHRDLVKGPPDWPPPWGIQAHTAE
210 220 230 240 250 260
130 140 150
pF1KE2 ---------------------RSPRRQGQRWIWFSMESPSHCWQLKAM-DGYFNLTMSYR
::: ::::.:...::::: :... .. :: :.:::
CCDS83 EVDLRVLDYEEAAAAAEALATSSPRPPGQRWVWMNFESPSHSPGLRSLASNLFNWTLSYR
270 280 290 300 310 320
160 170 180 190 200 210
pF1KE2 SDSDIFTPYGWLEPWSGQPAHPPLNL----SAKTELVAWAVSNWGPNSARVRYYQSLQAH
.:::.:.:::.: : : .:. :: .: : : ::::.::.: .::::::..:. :
CCDS83 ADSDVFVPYGYLYPRS-HPGDPPSGLAPPLSRKQGLVAWVVSHWDERQARVRYYHQLSQH
330 340 350 360 370 380
220 230 240 250 260 270
pF1KE2 LKVDVYGRSH--KPLPQGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVL
. :::.::. .:.:. ...:..::::::::::: : :::::::::::: : ::::::
CCDS83 VTVDVFGRGGPGQPVPEIGLLHTVARYKFYLAFENSQHLDYITEKLWRNALLAGAVPVVL
390 400 410 420 430 440
280 290 300 310 320 330
pF1KE2 GPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRET--LRPRSFS
::.:.:::::.: :::::::: : ..:: :: ::.. : : ::.::.. .. ::
CCDS83 GPDRANYERFVPRGAFIHVDDFPSASSLASYLLFLDRNPAVYRRYFHWRRSYAVHITSF-
450 460 470 480 490
340 350
pF1KE2 WALAFCKACWKLQEES-RYQT-RGIAAWFT
: .:..: .:. . : .. :..:.::
CCDS83 WDEPWCRVCQAVQRAGDRPKSIRNLASWFER
500 510 520 530
359 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 13:31:24 2016 done: Sun Nov 6 13:31:24 2016
Total Scan time: 2.290 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]