FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0056, 472 aa
  1>>>pF1KE0056 472 - 472 aa - 472 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.4767+/-0.000791; mu= 13.2442+/- 0.048
 mean_var=133.0932+/-27.761, 0's: 0 Z-trim(112.3): 175 B-trim: 112 in 1/52
 Lambda= 0.111172
 statistics sampled from 12853 (13083) to 12853 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.74), E-opt: 0.2 (0.402), width: 16
 Scan time: 3.530

The best scores are:                                  opt  bits E(32554)
CCDS12429.1 WDR88 gene_id:126248|Hs108|chr19  ( 472) 3302 541.0  1e-153
CCDS340.1 SNRNP40 gene_id:9410|Hs108|chr1     ( 357)  436  81.2   2e-15
CCDS2470.1 DAW1 gene_id:164781|Hs108|chr2     ( 415)  392  74.2 2.9e-13
CCDS31869.1 POC1B gene_id:282809|Hs108|chr12  ( 478)  344  66.6 6.7e-11

>>CCDS12429.1 WDR88 gene_id:126248|Hs108|chr19 (472 aa)
 initn: 3302 init1: 3302 opt: 3302 Z-score: 2873.3 bits: 541.0 E(32554): 1e-153
Smith-Waterman score: 3302; 100.0% identity (100.0% similar) in 472 aa overlap (1-472:1-472)

               10        20        30        40        50        60
pF1KE0 MASPPRCSPTAHDRECKLPPPSAPASEYCPGKLSWGTMARALGRFKLSIPHTHLLATLDP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 MASPPRCSPTAHDRECKLPPPSAPASEYCPGKLSWGTMARALGRFKLSIPHTHLLATLDP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 LALDREPPPHLLPEKHQVPEKLIWGDQDPLSKIPFKILSGHEHAVSTCHFCVDDTKLLSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 LALDREPPPHLLPEKHQVPEKLIWGDQDPLSKIPFKILSGHEHAVSTCHFCVDDTKLLSG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 SYDCTVKLWDPVDGSVVRDFEHRPKAPVVECSITGDSSRVIAASYDKTVRAWDLETGKLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 SYDCTVKLWDPVDGSVVRDFEHRPKAPVVECSITGDSSRVIAASYDKTVRAWDLETGKLL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 WKVRYDTFIVSCKFSPDGKYVVSGFDVDHGICIMDAENITTVSVIKDHHTRSITSCCFDP
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 WKVRYDTFIVSCKFSPDGKYVVSGFDVDHGICIMDAENITTVSVIKDHHTRSITSCCFDP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 DSQRVASVSLDRCIKIWDVTSQATLLTITKAHSNAISNCCFTFSGHFLCTSSWDKNLKIW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 DSQRVASVSLDRCIKIWDVTSQATLLTITKAHSNAISNCCFTFSGHFLCTSSWDKNLKIW 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 NVHTGEFRNCGACVTLMQGHEGSVSSCHFARDSSFLISGGFDRTVAIWDVAEGYRKLSLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 NVHTGEFRNCGACVTLMQGHEGSVSSCHFARDSSFLISGGFDRTVAIWDVAEGYRKLSLK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE0 GHNDWVMDVAISNNKKWILSASKDRTMRLWNIEEIDEIPLVIKYKKAVGLKLKQCERCDR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 GHNDWVMDVAISNNKKWILSASKDRTMRLWNIEEIDEIPLVIKYKKAVGLKLKQCERCDR 370 380 390 400 410 420 430 440 450 460 470 pF1KE0 PFSIFKSDTSSEMFTQCVFCRIDTRGLPADTSSSSSSSERENSPPPRGSKDD :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PFSIFKSDTSSEMFTQCVFCRIDTRGLPADTSSSSSSSERENSPPPRGSKDD 430 440 450 460 470 >>CCDS340.1 SNRNP40 gene_id:9410|Hs108|chr1 (357 aa) initn: 302 init1: 166 opt: 436 Z-score: 390.6 bits: 81.2 E(32554): 2e-15 Smith-Waterman score: 436; 26.3% identity (59.7% similar) in 308 aa overlap (92-393:56-357) 70 80 90 100 110 120 pF1KE0 ALDREPPPHLLPEKHQVPEKLIWGDQDPLSKIPFKILSGHEHAVSTCHFCVDDTKLLSGS . :. .::::: : :.: . . : :.. CCDS34 LGAGSGPGAGQQQATPGALLQAGPPRCSSLQAPIMLLSGHEGEVYCCKFHPNGSTLASAG 30 40 50 60 70 80 130 140 150 160 170 180 pF1KE0 YDCTVKLWDPVDGSVVRDFEHRPKAPVVECSITGDSSRVIAASYDKTVRAWDLETGKLLW .: . ::. . .. :.: . :.: ...:: :::: .:: :::. . CCDS34 FDRLILLWNVYGDCDNYATLKGHSGAVMELHYNTDGSMLFSASTDKTVAVWDSETGERVK 90 100 110 120 130 140 190 200 210 220 230 pF1KE0 KVR-YDTFIVSCKFSPDG-KYVVSGFDVDHGICIMDAENITTVSVIKDHHTRSITSCCFD ... . .:. :: . : . : .: : : . . : .. ........ : .. . :. 
CCDS34 RLKGHTSFVNSCYPARRGPQLVCTGSD-DGTVKLWDIRKKAAIQTFQN--TYQVLAVTFN 150 160 170 180 190 200 240 250 260 270 280 290 pF1KE0 PDSQRVASVSLDRCIKIWDVTSQATLLTITKAHSNAISNCCFTFSGHFLCTSSWDKNLKI :... : ..: ::.::. : : ..:...... .. : .: ... :..... CCDS34 DTSDQIISGGIDNDIKVWDLR-QNKLTYTMRGHADSVTGLSLSSEGSYLLSNAMDNTVRV 210 220 230 240 250 260 300 310 320 330 340 350 pF1KE0 WNVHTGEFRNCGACVTLMQGH----EGSVSSCHFARDSSFLISGGFDRTVAIWDVAEGYR :.:. : :: ..::. : .. : .. :.: . .:. :: : .::.. CCDS34 WDVRP--FAPKERCVKIFQGNVHNFEKNLLRCSWSPDGSKIAAGSADRFVYVWDTTSRRI 270 280 290 300 310 360 370 380 390 400 410 pF1KE0 KLSLKGHNDWVMDVAISNNKKWILSASKDRTMRLWNIEEIDEIPLVIKYKKAVGLKLKQC .: :: . .::. .. :.:::.:. . . .:. CCDS34 LYKLPGHAGSINEVAFHPDEPIIISASSDKRLYMGEIQ 320 330 340 350 420 430 440 450 460 470 pF1KE0 ERCDRPFSIFKSDTSSEMFTQCVFCRIDTRGLPADTSSSSSSSERENSPPPRGSKDD >>CCDS2470.1 DAW1 gene_id:164781|Hs108|chr2 (415 aa) initn: 359 init1: 175 opt: 392 Z-score: 351.6 bits: 74.2 E(32554): 2.9e-13 Smith-Waterman score: 511; 26.9% identity (60.2% similar) in 394 aa overlap (5-390:37-414) 10 20 30 pF1KE0 MASPPRCSPTAHDRECKLPPPSAPASEYCPGKLS : . .: .: . : ::. :: CCDS24 LLRYYPPGIMLEYEKHGELKTKSIDLLDLGPSTDVSALVEEIQKAEPLLTASRTEQVKLL 10 20 30 40 50 60 40 50 60 70 80 pF1KE0 WGTMARALGR------FKLSIPHTHLLATLDPLALDREPPPHLLPEKHQVPEKLIWGDQD . . ::. . ... ..:.: : .::.. .. ... :: : : CCDS24 IQRLQEKLGQNSNHTFYLFKVLKAHILP-LTNVALNKSGS-CFITGSYDRTCKL-W---D 70 80 90 100 110 120 90 100 110 120 130 140 pF1KE0 PLSKIPFKILSGHEHAVSTCHFCVD-DTKLLSGSYDCTVKLWDPVDGSVVRDFEHRPKAP : .. : ::...: . : :. .::.: : :::. :. . :. . : CCDS24 TASGEELNTLEGHRNVVYAIAFNNPYGDKIATGSFDKTCKLWSVETGKCYHTFRGHT-AE 130 140 150 160 170 150 160 170 180 190 200 pF1KE0 VVECSITGDSSRVIAASYDKTVRAWDLETGKLLWKVR-YDTFIVSCKFSPDGKYVVSGFD .: :.. .:. : ..:.: :.. ::...:. .. .: ... :.: .:. .: ...: . CCDS24 IVCLSFNPQSTLVATGSMDTTAKLWDIQNGEEVYTLRGHSAEIISLSFNTSGDRIITG-S 180 190 200 210 220 230 210 220 230 240 250 260 pF1KE0 VDHGICIMDAENITTVSVIKDHHTRSITSCCFDPDSQRVASVSLDRCIKIWDVTSQATLL :: . . ::.. :... 
: .. :.: :. : . . . :.:. :.::.:. . CCDS24 FDHTVVVWDADTGRKVNILIGHCAE-ISSASFNWDCSLILTGSMDKTCKLWDATNGKCVA 240 250 260 270 280 290 270 280 290 300 310 320 pF1KE0 TITKAHSNAISNCCFTFSGHFLCTSSWDKNLKIWNVHTGEFRNCGACVTLMQGHEGSVSS :.: .:.. : . :: ..:... :.: : . .:... : . :.. ..:::: .:. CCDS24 TLT-GHDDEILDSCFDYTGKLIATASADGTARIFSAATRK------CIAKLEGHEGEISK 300 310 320 330 340 350 330 340 350 360 370 380 pF1KE0 CHFARDSSFLISGGFDRTVAIWDVAEGYRKLSLKGHNDWVMDVAISNNKKWILSASKDRT : ... :..:. :.:. :::. : :.::.: ... :.. . . ....::: : CCDS24 ISFNPQGNHLLTGSSDKTARIWDAQTGQCLQVLEGHTDEIFSCAFNYKGNIVITGSKDNT 360 370 380 390 400 410 390 400 410 420 430 440 pF1KE0 MRLWNIEEIDEIPLVIKYKKAVGLKLKQCERCDRPFSIFKSDTSSEMFTQCVFCRIDTRG :.: CCDS24 CRIWR >>CCDS31869.1 POC1B gene_id:282809|Hs108|chr12 (478 aa) initn: 169 init1: 100 opt: 344 Z-score: 309.2 bits: 66.6 E(32554): 6.7e-11 Smith-Waterman score: 406; 28.9% identity (58.1% similar) in 301 aa overlap (62-361:23-308) 40 50 60 70 80 90 pF1KE0 KLSWGTMARALGRFKLSIPHTHLLATLDPLALDREPPPHLLPEKHQVPEKLIWGDQDPLS .:: : . : ..: . : . CCDS31 MASATEDPVLERYFKGHKAAITSLDLSPNGKQLATASWDTFLMLW-NFKPHA 10 20 30 40 50 100 110 120 130 140 150 pF1KE0 KIPFKILSGHEHAVSTCHFCVDDTKLLSGSYDCTVKLWDPVDGSVVRDFEHRPKAPVVEC . .. . ::. .:.. .: . : :.: : ::.:: : . .:. . ::: CCDS31 RA-YRYV-GHKDVVTSVQFSPHGNLLASASRDRTVRLWIPDKRGKFSEFKAHT-APVRSV 60 70 80 90 100 160 170 180 190 200 210 pF1KE0 SITGDSSRVIAASYDKTVRAWDLETGKLLWKVRYDTFIVSC-KFSPDGKYVVSGFDVDHG ....:.. . .:: ::....:.. ..:... : : : ::::::. .:: . :. CCDS31 DFSADGQFLATASEDKSIKVWSMYRQRFLYSLYRHTHWVRCAKFSPDGRLIVSCSE-DKT 110 120 130 140 150 160 220 230 240 250 260 270 pF1KE0 ICIMDAENITTVSVIKDHHTRSITSCCFDPDSQRVASVSLDRCIKIWDVTSQATLLTITK : : :. : :. ..: . . :.:.. .::.. :. .:.::: . :: . CCDS31 IKIWDTTNKQCVNNFSDS-VGFANFVDFNPSGTCIASAGSDQTVKVWDVRVNK-LLQHYQ 170 180 190 200 210 220 280 290 300 310 320 330 pF1KE0 AHSNAISNCCFTFSGHFLCTSSWDKNLKIWNVHTGEFRNCGACVTLMQGHEGSVSSCHFA .::.... : ::..: :.: : .::: .. :.. . .::: : : . :. 
CCDS31 VHSGGVNCISFHPSGNYLITASSDGTLKILDLLEGRL------IYTLQGHTGPVFTVSFS 230 240 250 260 270 340 350 360 370 380 390 pF1KE0 RDSSFLISGGFDRTVAIWDVAEGYRKLSLKGHNDWVMDVAISNNKKWILSASKDRTMRLW . . .. ::: : : .: . .. .: :: CCDS31 KGGELFASGGADTQVLLWRT--NFDELHCKGLTKRNLKRLHFDSPPHLLDIYPRTPHPHE 280 290 300 310 320 330 400 410 420 430 440 450 pF1KE0 NIEEIDEIPLVIKYKKAVGLKLKQCERCDRPFSIFKSDTSSEMFTQCVFCRIDTRGLPAD CCDS31 EKVETVEINPKLEVIDLQISTPPVMDILSFDSTTTTETSGRTLPDKGEEACGYFLNPSLM 340 350 360 370 380 390

472 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 05:49:23 2016 done: Fri Nov 4 05:49:23 2016
 Total Scan time: 3.530 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]