FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB3766, 504 aa 1>>>pF1KB3766 504 - 504 aa - 504 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.4943+/-0.0011; mu= 8.1153+/- 0.065 mean_var=138.7456+/-27.523, 0's: 0 Z-trim(107.1): 69 B-trim: 0 in 0/51 Lambda= 0.108884 statistics sampled from 9314 (9381) to 9314 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.644), E-opt: 0.2 (0.288), width: 16 Scan time: 2.550 The best scores are: opt bits E(32554) CCDS9479.1 DNAJC3 gene_id:5611|Hs108|chr13 ( 504) 3318 533.2 2.6e-151 CCDS45677.1 DNAJC7 gene_id:7266|Hs108|chr17 ( 494) 638 112.2 1.4e-24 CCDS45678.1 DNAJC7 gene_id:7266|Hs108|chr17 ( 438) 609 107.6 2.9e-23 >>CCDS9479.1 DNAJC3 gene_id:5611|Hs108|chr13 (504 aa) initn: 3318 init1: 3318 opt: 3318 Z-score: 2830.1 bits: 533.2 E(32554): 2.6e-151 Smith-Waterman score: 3318; 100.0% identity (100.0% similar) in 504 aa overlap (1-504:1-504) 10 20 30 40 50 60 pF1KB3 MVAPGSVTSRLGSVFPFLLVLVDLQYEGAECGVNADVEKHLELGKKLLAAGQLADALSQF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 MVAPGSVTSRLGSVFPFLLVLVDLQYEGAECGVNADVEKHLELGKKLLAAGQLADALSQF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB3 HAAVDGDPDNYIAYYRRATVFLAMGKSKAALPDLTKVIQLKMDFTAARLQRGHLLLKQGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 HAAVDGDPDNYIAYYRRATVFLAMGKSKAALPDLTKVIQLKMDFTAARLQRGHLLLKQGK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB3 LDEAEDDFKKVLKSNPSENEEKEAQSQLIKSDEMQRLRSQALNAFGSGDYTAAIAFLDKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 LDEAEDDFKKVLKSNPSENEEKEAQSQLIKSDEMQRLRSQALNAFGSGDYTAAIAFLDKI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB3 LEVCVWDAELRELRAECFIKEGEPRKAISDLKAASKLKNDNTEAFYKISTLYYQLGDHEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 LEVCVWDAELRELRAECFIKEGEPRKAISDLKAASKLKNDNTEAFYKISTLYYQLGDHEL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 SLSEVRECLKLDQDHKRCFAHYKQVKKLNKLIESAEELIRDGRYTDATSKYESVMKTEPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 SLSEVRECLKLDQDHKRCFAHYKQVKKLNKLIESAEELIRDGRYTDATSKYESVMKTEPS 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 IAEYTVRSKERICHCFSKDEKPVEAIRVCSEVLQMEPDNVNALKDRAEAYLIEEMYDEAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 IAEYTVRSKERICHCFSKDEKPVEAIRVCSEVLQMEPDNVNALKDRAEAYLIEEMYDEAI 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 QDYETAQEHNENDQQIREGLEKAQRLLKQSQKRDYYKILGVKRNAKKQEIIKAYRKLALQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 QDYETAQEHNENDQQIREGLEKAQRLLKQSQKRDYYKILGVKRNAKKQEIIKAYRKLALQ 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB3 WHPDNFQNEEEKKKAEKKFIDIAAAKEVLSDPEMRKKFDDGEDPLDAESQQGGGGNPFHR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS94 WHPDNFQNEEEKKKAEKKFIDIAAAKEVLSDPEMRKKFDDGEDPLDAESQQGGGGNPFHR 430 440 450 460 470 480 490 500 pF1KB3 SWNSWQGFNPFSSGGPFRFKFHFN :::::::::::::::::::::::: CCDS94 SWNSWQGFNPFSSGGPFRFKFHFN 490 500 >>CCDS45677.1 DNAJC7 gene_id:7266|Hs108|chr17 (494 aa) initn: 323 init1: 187 opt: 638 Z-score: 555.0 bits: 112.2 E(32554): 1.4e-24 Smith-Waterman score: 638; 28.2% identity (62.8% similar) in 479 aa overlap (36-503:27-493) 10 20 30 40 50 60 pF1KB3 SVTSRLGSVFPFLLVLVDLQYEGAECGVNADVEKHLELGKKLLAAGQLADALSQFHAAVD ..: : :. : . .: . . :.: CCDS45 MAAAAECDVVMAATEPELLDDQEAKREAETFKEQGNAYYAKKDYNEAYNYYTKAID 10 20 30 40 50 70 80 90 100 110 120 pF1KB3 GDPDNYIAYYRRATVFLAMGKSKAALPDLTKVIQLKMDFTAARLQRGHLLLKQGKLDEAE : : : ::.... .:. . :: : . ..: .:. ..:..:. :. :. : CCDS45 MCPKNASYYGNRAATLMMLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMAAC 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB3 DDFKKVLKSNPSENEEKEAQSQLIKSDEMQRLRSQALNAFGSGDYTAAIAFLDKILEVCV .:...:. .... .::... ... ... .. : . : . :. .. .:. :: CCDS45 RSFQRALEL---DHKNAQAQQEFKNANAVMEYEKIAETDFEKRDFRKVVFCMDRALEFAP 120 130 140 150 160 170 190 200 210 220 230 240 pF1KB3 WDAELRELRAECFIKEGEPRKAISDLKAASKLKNDNTEAFY-KISTLYYQLGDHELSLSE ... :.:::. :. .: : . .. . :..:.: . :::. : ... CCDS45 ACHRFKILKAECLAMLGRYPEAQSVASDILRMDSTNADALYVRGLCLYYE-DCIEKAVQF 180 190 200 210 220 230 250 260 270 280 290 300 pF1KB3 VRECLKLDQDHKR-CFAHYKQVKKLNKLIESAEELIRDGRYTDATSKYESVMKTEPSIAE . :.. ::.. :.: ...: :. :.... ...: : : : .. .:. CCDS45 FVQALRMAPDHEKACIAC-RNAKALKAKKEDGNKAFKEGNYKLAYELYTEALGIDPN--- 240 250 260 270 280 310 320 330 340 350 pF1KB3 YTVRSKERI-CH---CFSKDEKPVEAIRVCSEVLQMEPDNVNALKDRAEAYLIEEMYDEA ..... .. :. :: .: .::. :....... ..: ::. :. :.:.:: CCDS45 -NIKTNAKLYCNRGTVNSKLRKLDDAIEDCTNAVKLDDTYIKAYLRRAQCYMDTEQYEEA 290 300 310 320 330 340 360 370 380 390 400 410 pF1KB3 IQDYETAQEHNENDQQIREGLEKAQRLLKQSQKRDYYKILGVKRNAKKQEIIKAYRKLAL ..::: . . .:. .. .. :..:: ::.:...:::::::: .::...:: ::::: :: CCDS45 VRDYEKVYQ-TEKTKEHKQLLKNAQLELKKSKRKDYYKILGVDKNASEDEIKKAYRKRAL 350 360 370 380 390 400 420 430 440 450 460 470 pF1KB3 QWHPDNFQ--NEEEKKKAEKKFIDIAAAKEVLSDPEMRKKFDDGEDPLDAESQQGGGGNP . ::: . . : .:. :::: ... : .::::. . ..:.:.: :: :... : .: CCDS45 MHHPDRHSGASAEVQKEEEKKFKEVGEAFTILSDPKKKTRYDSGQD-LDEEGMNMGDFDP 410 420 430 440 450 460 480 490 500 pF1KB3 ---FHRSWNSWQGFNPFSSGGPFRFKFHFN :. ... ::. : ..:: : :.: CCDS45 NNIFKAFFGGPGGFS-FEASGPGNFFFQFG 470 480 490 >>CCDS45678.1 DNAJC7 gene_id:7266|Hs108|chr17 (438 aa) initn: 323 init1: 187 opt: 609 Z-score: 531.1 bits: 107.6 E(32554): 2.9e-23 Smith-Waterman score: 609; 28.8% identity (64.1% similar) in 448 aa overlap (68-503:3-437) 40 50 60 70 80 90 pF1KB3 EKHLELGKKLLAAGQLADALSQFHAAVDGDPDNYIAYYRRATVFLAMGKSKAALPDLTKV : : : ::.... .:. . :: : . CCDS45 MCPKNASYYGNRAATLMMLGRFREALGDAQQS 10 20 30 100 110 120 130 140 150 pF1KB3 IQLKMDFTAARLQRGHLLLKQGKLDEAEDDFKKVLKSNPSENEEKEAQSQLIKSDEMQRL ..: .:. ..:..:. :. :. : .:...:. .... .::... ... ... CCDS45 VRLDDSFVRGHLREGKCHLSLGNAMAACRSFQRALEL---DHKNAQAQQEFKNANAVMEY 40 50 60 70 80 160 170 180 190 200 210 pF1KB3 RSQALNAFGSGDYTAAIAFLDKILEVCVWDAELRELRAECFIKEGEPRKAISDLKAASKL .. : . : . :. .. .:. :: ... :.:::. :. .: : . .. CCDS45 EKIAETDFEKRDFRKVVFCMDRALEFAPACHRFKILKAECLAMLGRYPEAQSVASDILRM 90 100 110 120 130 140 220 230 240 250 260 270 pF1KB3 KNDNTEAFY-KISTLYYQLGDH-ELSLSEVRECLKLDQDHKR-CFAHYKQVKKLNKLIES . :..:.: . :::. : : ... . :.. ::.. :.: ...: :. :. CCDS45 DSTNADALYVRGLCLYYE--DCIEKAVQFFVQALRMAPDHEKACIAC-RNAKALKAKKED 150 160 170 180 190 200 280 290 300 310 320 330 pF1KB3 AEELIRDGRYTDATSKYESVMKTEPSIAEYTVRSKERI-CH---CFSKDEKPVEAIRVCS ... ...: : : : .. .:. ..... .. :. :: .: .::. :. CCDS45 GNKAFKEGNYKLAYELYTEALGIDPN----NIKTNAKLYCNRGTVNSKLRKLDDAIEDCT 210 220 230 240 250 260 340 350 360 370 380 390 pF1KB3 EVLQMEPDNVNALKDRAEAYLIEEMYDEAIQDYETAQEHNENDQQIREGLEKAQRLLKQS ...... ..: ::. :. :.:.::..::: . . .:. .. .. :..:: ::.: CCDS45 NAVKLDDTYIKAYLRRAQCYMDTEQYEEAVRDYEKVYQ-TEKTKEHKQLLKNAQLELKKS 270 280 290 300 310 320 400 410 420 430 440 pF1KB3 QKRDYYKILGVKRNAKKQEIIKAYRKLALQWHPDNFQ--NEEEKKKAEKKFIDIAAAKEV ...:::::::: .::...:: ::::: ::. ::: . . : .:. :::: ... : . CCDS45 KRKDYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAEVQKEEEKKFKEVGEAFTI 330 340 350 360 370 380 450 460 470 480 490 500 pF1KB3 LSDPEMRKKFDDGEDPLDAESQQGGGGNP---FHRSWNSWQGFNPFSSGGPFRFKFHFN ::::. . ..:.:.: :: :... : .: :. ... ::. : ..:: : :.: CCDS45 LSDPKKKTRYDSGQD-LDEEGMNMGDFDPNNIFKAFFGGPGGFS-FEASGPGNFFFQFG 390 400 410 420 430 504 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 13:55:26 2016 done: Thu Nov 3 13:55:26 2016 Total Scan time: 2.550 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]