FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2345, 359 aa
  1>>>pF1KE2345 359 - 359 aa - 359 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3196+/-0.000692; mu= 16.5529+/- 0.042
 mean_var=70.7303+/-13.971, 0's: 0 Z-trim(110.3): 17 B-trim: 0 in 0/51
 Lambda= 0.152501
 statistics sampled from 11481 (11492) to 11481 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.724), E-opt: 0.2 (0.353), width: 16
 Scan time: 2.290

The best scores are:                                 opt   bits  E(32554)
CCDS12152.1 FUT6 gene_id:2528|Hs108|chr19   ( 359)  2588  578.2  3.8e-165
CCDS12153.1 FUT3 gene_id:2525|Hs108|chr19   ( 361)  2141  479.8  1.5e-135
CCDS12154.1 FUT5 gene_id:2527|Hs108|chr19   ( 374)  2022  453.7  1.2e-127
CCDS7022.1 FUT7 gene_id:2529|Hs108|chr9     ( 342)   935  214.5  1.1e-55
CCDS5033.1 FUT9 gene_id:10690|Hs108|chr6    ( 359)   871  200.4  2e-51
CCDS8301.1 FUT4 gene_id:2526|Hs108|chr11    ( 530)   850  195.9  6.6e-50

>>CCDS12152.1 FUT6 gene_id:2528|Hs108|chr19 (359 aa)
 initn: 2588 init1: 2588 opt: 2588 Z-score: 3078.5 bits: 578.2 E(32554): 3.8e-165
Smith-Waterman score: 2588; 100.0% identity (100.0% similar) in 359 aa overlap (1-359:1-359)

10 20 30 40 50 60
pF1KE2 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHS
10 20 30 40 50 60

70 80 90 100 110 120
pF1KE2 IPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 IPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQL
70 80 90 100 110 120

130 140 150 160 170 180
pF1KE2 PRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAH
130 140 150 160 170 180

190 200 210 220 230 240
pF1KE2 PPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMMETLSRY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMMETLSRY
190 200 210 220 230 240

250 260 270 280 290 300
pF1KE2 KFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 KFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKD
250 260 270 280 290 300

310 320 330 340 350
pF1KE2 LARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEESRYQTRGIAAWFT
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEESRYQTRGIAAWFT
310 320 330 340 350

>>CCDS12153.1 FUT3 gene_id:2525|Hs108|chr19 (361 aa)
 initn: 2095 init1: 1863 opt: 2141 Z-score: 2546.9 bits: 479.8 E(32554): 1.5e-135
Smith-Waterman score: 2141; 84.0% identity (91.2% similar) in 363 aa overlap (1-359:1-361)

10 20 30 40 50
pF1KE2 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPT---VYPNGSRFPDSTGTP
::::: ::::: :: ::..::::::.:::::::::::.:: : :.:: :. ::
CCDS12 MDPLGAAKPQWPWRRCLAALLFQLLVAVCFFSYLRVSRDDATGSPRAPSGSSRQDT--TP
10 20 30 40 50

60 70 80 90 100 110
pF1KE2 AHSIPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPS
.. :::: ::::. :.:: ::::::::::::.:::::::::::: ::::: ..: ::.
CCDS12 TRPTLLILLRTWPFHIPVALSRCSEMVPGTADCHITADRKVYPQADMVIVHHWDIMSNPK
60 70 80 90 100 110

120 130 140 150 160 170
pF1KE2 AQLPRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQ
..:: ::: ::::::::..: : .: .:.:.: :::::::::::::::::::::::::::
CCDS12 SRLPPSPRPQGQRWIWFNLEPPPNCQHLEALDRYFNLTMSYRSDSDIFTPYGWLEPWSGQ
120 130 140 150 160 170

180 190 200 210 220 230
pF1KE2 PAHPPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMMETL
:::::::::::::::::::::: :.:::::::::::::::::::::::::::.:::::::
CCDS12 PAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHKPLPKGTMMETL
180 190 200 210 220 230

240 250 260 270 280 290
pF1KE2 SRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQS
240 250 260 270 280 290

300 310 320 330 340 350
pF1KE2 PKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEESRYQT-RGIAA
::::::::::::::::::::::::::::::::::::: :::::::::.:::::: :.:::
CCDS12 PKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALDFCKACWKLQQESRYQTVRSIAA
300 310 320 330 340 350

pF1KE2 WFT
:::
CCDS12 WFT
360

>>CCDS12154.1 FUT5 gene_id:2527|Hs108|chr19 (374 aa)
 initn: 1984 init1: 1984 opt: 2022 Z-score: 2405.2 bits: 453.7 E(32554): 1.2e-127
Smith-Waterman score: 2232; 85.3% identity (90.9% similar) in 374 aa overlap (1-359:1-374)

10 20 30 40
pF1KE2 MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDD------PTVY--------PN
::::::::::: :: ::. ::::::.:::::::::::.:: : .. ::
CCDS12 MDPLGPAKPQWLWRRCLAGLLFQLLVAVCFFSYLRVSRDDATGSPRPGLMAVEPVTGAPN
10 20 30 40 50 60

50 60 70 80 90 100
pF1KE2 GSRFPDSTGTPAHSIPLILLWTWPFNKPIALPRCSEMVPGTADCNITADRKVYPQADAVI
::: :: .:::: :::::::::: :.:::::::::::.:::::::: .:::::::::
CCDS12 GSRCQDSMATPAHPTLLILLWTWPFNTPVALPRCSEMVPGAADCNITADSSVYPQADAVI
70 80 90 100 110 120

110 120 130 140 150 160
pF1KE2 VHHREVMYNPSAQLPRSPRRQGQRWIWFSMESPSHCWQLKAMDGYFNLTMSYRSDSDIFT
::: ..::::::.:: : ::::::::::::::.: .:.:.::::::::::::::::::
CCDS12 VHHWDIMYNPSANLPPPTRPQGQRWIWFSMESPSNCRHLEALDGYFNLTMSYRSDSDIFT
130 140 150 160 170 180

170 180 190 200 210 220
pF1KE2 PYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHK
::::::::::::::::::::::::::::::::: :.::::::::::::::::::::::::
CCDS12 PYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWKPDSARVRYYQSLQAHLKVDVYGRSHK
190 200 210 220 230 240

230 240 250 260 270 280
pF1KE2 PLPQGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPP
:::.::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 PLPKGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVLGPSRSNYERFLPP
250 260 270 280 290 300

290 300 310 320 330 340
pF1KE2 DAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRETLRPRSFSWALAFCKACWKLQEE
:::::::::::::::::::::::::::::::::.::::::::::::::::::::::::.:
CCDS12 DAFIHVDDFQSPKDLARYLQELDKDHARYLSYFHWRETLRPRSFSWALAFCKACWKLQQE
310 320 330 340 350 360

350
pF1KE2 SRYQT-RGIAAWFT
::::: :.::::::
CCDS12 SRYQTVRSIAAWFT
370

>>CCDS7022.1 FUT7 gene_id:2529|Hs108|chr9 (342 aa)
 initn: 635 init1: 347 opt: 935 Z-score: 1113.3 bits: 214.5 E(32554): 1.1e-55
Smith-Waterman score: 935; 47.6% identity (70.6% similar) in 309 aa overlap (55-358:39-340)

30 40 50 60 70 80
pF1KE2 LMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHSIPL-ILLWTWPF-NKPIALPRCSE
:::: . . ::.: ::: ..: :: .
CCDS70 TRRLRGLGVLAGVALLAALWLLWLLGSAPRGTPAPQPTITILVWHWPFTDQPPELPSDTC
10 20 30 40 50 60

90 100 110 120 130 140
pF1KE2 MVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQLPRSPRRQGQRWIWFSMESPSHC
: : :...:.:.. .::::. ::::.. : .:: . : .:: :.: :::::::
CCDS70 TRYGIARCHLSANRSLLASADAVVFHHRELQTRRS-HLPLAQRPRGQPWVWASMESPSHT
70 80 90 100 110 120

150 160 170 180 190 200
pF1KE2 WQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWGPN
:. . : :: ..::: :::::.::: ::: : :. :: : ::....::.:::.
CCDS70 HGLSHLRGIFNWVLSYRRDSDIFVPYGRLEPHWG-PS-PP--LPAKSRVAAWVVSNFQER
130 140 150 160 170 180

210 220 230 240 250 260
pF1KE2 SARVRYYQSLQAHLKVDVYGRSH-KPLPQGTMMETLSRYKFYLAFENSLHPDYITEKLWR
. :.: :..: ::.:::.::.. .:: . .. :...:.:::.:::: : ::::::.::
CCDS70 QLRARLYRQLAPHLRVDVFGRANGRPLCASCLVPTVAQYRFYLSFENSQHRDYITEKFWR
190 200 210 220 230 240

270 280 290 300 310 320
pF1KE2 NALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRW
::: : .::::::: :..:: :.: :::.::::: : ..:: .: ... .:: .: :
CCDS70 NALVAGTVPVVLGPPRATYEAFVPADAFVHVDDFGSARELAAFLTGMNE--SRYQRFFAW
250 260 270 280 290 300

330 340 350
pF1KE2 RETLRPRSFS-WALAFCKACWKLQEESRYQT-RGIAAWFT
:. :: : :. : :: : . . : :. . . .::
CCDS70 RDRLRVRLFTDWRERFCAICDRYPHLPRSQVYEDLEGWFQA
310 320 330 340

>>CCDS5033.1 FUT9 gene_id:10690|Hs108|chr6 (359 aa)
 initn: 772 init1: 353 opt: 871 Z-score: 1036.9 bits: 200.4 E(32554): 2e-51
Smith-Waterman score: 871; 43.1% identity (73.0% similar) in 311 aa overlap (53-358:55-357)

30 40 50 60 70 80
pF1KE2 QLLMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHSIPLILLWTWPFNKPIALPRCSE
:: : . ::.:.:::.. . : :.
CCDS50 CLLIYIKPTNSWIFSPMESASSVLKMKNFFSTKTDYFNETTILVWVWPFGQTFDLTSCQA
30 40 50 60 70 80

90 100 110 120 130 140
pF1KE2 MVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQLPRSPRRQGQRWIWFSMESPSHC
: . :..:.::..: .. ::..:::.. .. . .::.. : :.:::...:::.:
CCDS50 MF-NIQGCHLTTDRSLYNKSHAVLIHHRDISWDLT-NLPQQARPPFQKWIWMNLESPTHT
90 100 110 120 130 140

150 160 170 180 190 200
pF1KE2 WQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAHPPLNLSAKTELVAWAVSNWGPN
: .... ::::..:: :::: .:::.: : .: ... .: .:: :.::::.:.
CCDS50 PQKSGIEHLFNLTLTYRRDSDIQVPYGFLTV-STNPF--VFEVPSKEKLVCWVVSNWNPE
150 160 170 180 190

210 220 230 240 250 260
pF1KE2 SARVRYYQSLQAHLKVDVYGRSH-KPLPQGTMMETLSRYKFYLAFENSLHPDYITEKLWR
:::.::. :. ... .::.. . . . ... :.: ::::.::::.: :::::::.
CCDS50 HARVKYYNELSKSIEIHTYGQAFGEYVNDKNLIPTISTCKFYLSFENSIHKDYITEKLY-
200 210 220 230 240 250

270 280 290 300 310 320
pF1KE2 NALEAWAVPVVLGPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRW
::. : .::::::::: ::: ..: :.::::.:..::..::.::.:.::.. ::::: :
CCDS50 NAFLAGSVPVVLGPSRENYENYIPADSFIHVEDYNSPSELAKYLKEVDKNNKLYLSYFNW
260 270 280 290 300 310

330 340 350
pF1KE2 RETLR---PRSFSWALAFCKACWKLQEESRYQTRG-IAAWFT
:. . :: : : : :: .......:.. : . ::
CCDS50 RKDFTVNLPR-F-WESHACLACDHVKRHQEYKSVGNLEKWFWN
320 330 340 350

>>CCDS8301.1 FUT4 gene_id:2526|Hs108|chr11 (530 aa)
 initn: 843 init1: 401 opt: 850 Z-score: 1009.4 bits: 195.9 E(32554): 6.6e-50
Smith-Waterman score: 898; 43.5% identity (64.9% similar) in 359 aa overlap (47-358:173-528)

20 30 40 50 60 70
pF1KE2 LTTLLFQLLMAVCFFSYLRVSQDDPTVYPNGSRFPDSTGTPAHSIPL-ILLWTWPF----
:. : ..:. : :. .::: ::
CCDS83 WRRGRGLPWTVCVLAAAGLTCTALITYACWGQLPPLPWASPTPSRPVGVLLWWEPFGGRD
150 160 170 180 190 200

80 90 100 110 120
pF1KE2 NKPIALPRCSEMVPGTADCNITADRKVYPQADAVIVHHREVMYNPSAQLP----------
. : : : .. . . : . .:: : .:.::. :::... .: :
CCDS83 SAPRPPPDC-RLRFNISGCRLLTDRASYGEAQAVLFHHRDLVKGPPDWPPPWGIQAHTAE
210 220 230 240 250 260

130 140 150
pF1KE2 ---------------------RSPRRQGQRWIWFSMESPSHCWQLKAM-DGYFNLTMSYR
::: ::::.:...::::: :... .. :: :.:::
CCDS83 EVDLRVLDYEEAAAAAEALATSSPRPPGQRWVWMNFESPSHSPGLRSLASNLFNWTLSYR
270 280 290 300 310 320

160 170 180 190 200 210
pF1KE2 SDSDIFTPYGWLEPWSGQPAHPPLNL----SAKTELVAWAVSNWGPNSARVRYYQSLQAH
.:::.:.:::.: : : .:. :: .: : : ::::.::.: .::::::..:. :
CCDS83 ADSDVFVPYGYLYPRS-HPGDPPSGLAPPLSRKQGLVAWVVSHWDERQARVRYYHQLSQH
330 340 350 360 370 380

220 230 240 250 260 270
pF1KE2 LKVDVYGRSH--KPLPQGTMMETLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPVVL
. :::.::. .:.:. ...:..::::::::::: : :::::::::::: : ::::::
CCDS83 VTVDVFGRGGPGQPVPEIGLLHTVARYKFYLAFENSQHLDYITEKLWRNALLAGAVPVVL
390 400 410 420 430 440

280 290 300 310 320 330
pF1KE2 GPSRSNYERFLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRET--LRPRSFS
::.:.:::::.: :::::::: : ..:: :: ::.. : : ::.::.. .. ::
CCDS83 GPDRANYERFVPRGAFIHVDDFPSASSLASYLLFLDRNPAVYRRYFHWRRSYAVHITSF-
450 460 470 480 490

340 350
pF1KE2 WALAFCKACWKLQEES-RYQT-RGIAAWFT
: .:..: .:. . : .. :..:.::
CCDS83 WDEPWCRVCQAVQRAGDRPKSIRNLASWFER
500 510 520 530

359 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 13:31:24 2016 done: Sun Nov 6 13:31:24 2016
 Total Scan time: 2.290 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]