FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0031, 494 aa 1>>>pF1KE0031 494 - 494 aa - 494 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.8515+/-0.00114; mu= 11.1140+/- 0.068 mean_var=110.7554+/-22.090, 0's: 0 Z-trim(105.3): 109 B-trim: 438 in 1/49 Lambda= 0.121869 statistics sampled from 8253 (8368) to 8253 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.613), E-opt: 0.2 (0.257), width: 16 Scan time: 2.840 The best scores are: opt bits E(32554) CCDS45677.1 DNAJC7 gene_id:7266|Hs108|chr17 ( 494) 3291 589.9 2.1e-168 CCDS45678.1 DNAJC7 gene_id:7266|Hs108|chr17 ( 438) 2930 526.4 2.5e-149 CCDS9479.1 DNAJC3 gene_id:5611|Hs108|chr13 ( 504) 635 122.9 8.1e-28 >>CCDS45677.1 DNAJC7 gene_id:7266|Hs108|chr17 (494 aa) initn: 3291 init1: 3291 opt: 3291 Z-score: 3136.8 bits: 589.9 E(32554): 2.1e-168 Smith-Waterman score: 3291; 99.8% identity (100.0% similar) in 494 aa overlap (1-494:1-494) 10 20 30 40 50 60 pF1KE0 MAAAAECDVVMAATEPELLDDQEAKREAETFKEQGNAYYAKKDYNEAYNYYTKAIDMCPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MAAAAECDVVMAATEPELLDDQEAKREAETFKEQGNAYYAKKDYNEAYNYYTKAIDMCPK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 NASYYGNRAATLMMLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMAACRSFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 NASYYGNRAATLMMLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMAACRSFQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 RALELDHKNAQAQQEFKNANAVMEYEKIAETDFEKRDFRKVVFCMDRALEFAPACHRFKI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RALELDHKNAQAQQEFKNANAVMEYEKIAETDFEKRDFRKVVFCMDRALEFAPACHRFKI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 LKAECLAMLGRYPEAQSVASDILRMDSTNADALYVRGLCLYYEDCIEKAVQFFVQALRMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 LKAECLAMLGRYPEAQSVASDILRMDSTNADALYVRGLCLYYEDCIEKAVQFFVQALRMA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 PDHEKACIACRNAKALKAKKEDGNKAFKEGNYKLAYELYTEALGIDPNNIKTNAKLYCNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 PDHEKACIACRNAKALKAKKEDGNKAFKEGNYKLAYELYTEALGIDPNNIKTNAKLYCNR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE0 GTVNSKLRKLDDAIEDCTNAVKLDDTYIKAYLRRAQCYMDTEQYEEAVRDYEKVYQTEKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GTVNSKLRKLDDAIEDCTNAVKLDDTYIKAYLRRAQCYMDTEQYEEAVRDYEKVYQTEKT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE0 KEHKQLLKNAQLELRKSKRKDYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAEV ::::::::::::::.::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KEHKQLLKNAQLELKKSKRKDYYKILGVDKNASEDEIKKAYRKRALMHHPDRHSGASAEV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE0 QKEEEKKFKEVGEAFTILSDPKKKTRYDSGQDLDEEGMNMGDFDPNNIFKAFFGGPGGFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 QKEEEKKFKEVGEAFTILSDPKKKTRYDSGQDLDEEGMNMGDFDPNNIFKAFFGGPGGFS 430 440 450 460 470 480 490 pF1KE0 FEASGPGNFFFQFG :::::::::::::: CCDS45 FEASGPGNFFFQFG 490 >>CCDS45678.1 DNAJC7 gene_id:7266|Hs108|chr17 (438 aa) initn: 2930 init1: 2930 opt: 2930 Z-score: 2794.5 bits: 526.4 E(32554): 2.5e-149 Smith-Waterman score: 2930; 99.8% identity (100.0% similar) in 438 aa overlap (57-494:1-438) 30 40 50 60 70 80 pF1KE0 EAETFKEQGNAYYAKKDYNEAYNYYTKAIDMCPKNASYYGNRAATLMMLGRFREALGDAQ :::::::::::::::::::::::::::::: CCDS45 MCPKNASYYGNRAATLMMLGRFREALGDAQ 10 20 30 90 100 110 120 130 140 pF1KE0 QSVRLDDSFVRGHLREGKCHLSLGNAMAACRSFQRALELDHKNAQAQQEFKNANAVMEYE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 QSVRLDDSFVRGHLREGKCHLSLGNAMAACRSFQRALELDHKNAQAQQEFKNANAVMEYE 40 50 60 70 80 90 150 160 170 180 190 200 pF1KE0 KIAETDFEKRDFRKVVFCMDRALEFAPACHRFKILKAECLAMLGRYPEAQSVASDILRMD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 KIAETDFEKRDFRKVVFCMDRALEFAPACHRFKILKAECLAMLGRYPEAQSVASDILRMD 100 110 120 130 140 150 210 220 230 240 250 260 pF1KE0 STNADALYVRGLCLYYEDCIEKAVQFFVQALRMAPDHEKACIACRNAKALKAKKEDGNKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 STNADALYVRGLCLYYEDCIEKAVQFFVQALRMAPDHEKACIACRNAKALKAKKEDGNKA 160 170 180 190 200 210 270 280 290 300 310 320 pF1KE0 FKEGNYKLAYELYTEALGIDPNNIKTNAKLYCNRGTVNSKLRKLDDAIEDCTNAVKLDDT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 FKEGNYKLAYELYTEALGIDPNNIKTNAKLYCNRGTVNSKLRKLDDAIEDCTNAVKLDDT 220 230 240 250 260 270 330 340 350 360 370 380 pF1KE0 YIKAYLRRAQCYMDTEQYEEAVRDYEKVYQTEKTKEHKQLLKNAQLELRKSKRKDYYKIL ::::::::::::::::::::::::::::::::::::::::::::::::.::::::::::: CCDS45 YIKAYLRRAQCYMDTEQYEEAVRDYEKVYQTEKTKEHKQLLKNAQLELKKSKRKDYYKIL 280 290 300 310 320 330 390 400 410 420 430 440 pF1KE0 GVDKNASEDEIKKAYRKRALMHHPDRHSGASAEVQKEEEKKFKEVGEAFTILSDPKKKTR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 GVDKNASEDEIKKAYRKRALMHHPDRHSGASAEVQKEEEKKFKEVGEAFTILSDPKKKTR 340 350 360 370 380 390 450 460 470 480 490 pF1KE0 YDSGQDLDEEGMNMGDFDPNNIFKAFFGGPGGFSFEASGPGNFFFQFG :::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 YDSGQDLDEEGMNMGDFDPNNIFKAFFGGPGGFSFEASGPGNFFFQFG 400 410 420 430 >>CCDS9479.1 DNAJC3 gene_id:5611|Hs108|chr13 (504 aa) initn: 320 init1: 184 opt: 635 Z-score: 612.9 bits: 122.9 E(32554): 8.1e-28 Smith-Waterman score: 635; 28.3% identity (62.7% similar) in 480 aa overlap (27-493:36-503) 10 20 30 40 50 pF1KE0 MAAAAECDVVMAATEPELLDDQEAKREAETFKEQGNAYYAKKDYNEAYNYYTKAID ..: : :. : . .: . . :.: CCDS94 SVTSRLGSVFPFLLVLVDLQYEGAECGVNADVEKHLELGKKLLAAGQLADALSQFHAAVD 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE0 MCPKNASYYGNRAATLMMLGRFREALGDAQQSVRLDDSFVRGHLREGKCHLSLGNAMAAC : : : ::.... .:. . :: : . ..: .:. ..:..:. :. :. : CCDS94 GDPDNYIAYYRRATVFLAMGKSKAALPDLTKVIQLKMDFTAARLQRGHLLLKQGKLDEAE 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE0 RSFQRALELD---HKNAQAQQEFKNANAVMEYEKIAETDFEKRDFRKVVFCMDRALEFAP .:...:. . ... .::... ... ... .. : . : . :. .. .:. :: CCDS94 DDFKKVLKSNPSENEEKEAQSQLIKSDEMQRLRSQALNAFGSGDYTAAIAFLDKILEVCV 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE0 ACHRFKILKAECLAMLGRYPEAQSVASDILRMDSTNADALYVRGLCLYYE--DCIEKAVQ ... :.:::. :. .: : . .. . :..:.: . :::. : : ... CCDS94 WDAELRELRAECFIKEGEPRKAISDLKAASKLKNDNTEAFY-KISTLYYQLGDH-ELSLS 190 200 210 220 230 240 240 250 260 270 280 pF1KE0 FFVQALRMAPDHEKACIAC-RNAKALKAKKEDGNKAFKEGNYKLAYELYTEALGIDPN-- . :.. :: : :.: ...: :. :.... ...: : : : .. .:. CCDS94 EVRECLKLDQDH-KRCFAHYKQVKKLNKLIESAEELIRDGRYTDATSKYESVMKTEPSIA 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE0 --NIKTNAKLYCNRGTVNSKLRKLDDAIEDCTNAVKLDDTYIKAYLRRAQCYMDTEQYEE ..... .. :. :: .: .::. :....... ..: ::. :. :.:.: CCDS94 EYTVRSKERI-CH---CFSKDEKPVEAIRVCSEVLQMEPDNVNALKDRAEAYLIEEMYDE 310 320 330 340 350 350 360 370 380 390 400 pF1KE0 AVRDYEKVYQ-TEKTKEHKQLLKNAQLELRKSKRKDYYKILGVDKNASEDEIKKAYRKRA :..::: . . .:. .. .. :..:: :..:...:::::::: .::...:: ::::: : CCDS94 AIQDYETAQEHNENDQQIREGLEKAQRLLKQSQKRDYYKILGVKRNAKKQEIIKAYRKLA 360 370 380 390 400 410 410 420 430 440 450 460 pF1KE0 LMHHPDRHSGASAEVQKEEEKKFKEVGEAFTILSDPKKKTRYDSGQD-LDEEGMNMGDFD :. ::: . . : .:. :::: ... : .::::. . ..:.:.: :: :... : . CCDS94 LQWHPDNFQ--NEEEKKKAEKKFIDIAAAKEVLSDPEMRKKFDDGEDPLDAESQQGGGGN 420 430 440 450 460 470 470 480 490 pF1KE0 PNNIFKAFFGGPGGFS-FEASGPGNFFFQFG : :. ... ::. : ..:: : :.: CCDS94 P---FHRSWNSWQGFNPFSSGGPFRFKFHFN 480 490 500 494 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 21:52:12 2016 done: Sat Nov 5 21:52:13 2016 Total Scan time: 2.840 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]