FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1037, 387 aa
  1>>>pF1KE1037 387 - 387 aa - 387 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2422+/-0.000916; mu= 16.5871+/- 0.055
 mean_var=58.6471+/-11.512, 0's: 0 Z-trim(104.6): 14 B-trim: 0 in 0/50
 Lambda= 0.167475
 statistics sampled from 7995 (7998) to 7995 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.62), E-opt: 0.2 (0.246), width: 16
 Scan time: 2.260

The best scores are:                                 opt  bits E(32554)
CCDS7862.1 BBOX1 gene_id:8424|Hs108|chr11    ( 387) 2656 650.2 8.9e-187
CCDS14768.1 TMLHE gene_id:55217|Hs108|chrX   ( 421)  586 150.1 3.5e-36
CCDS55547.1 TMLHE gene_id:55217|Hs108|chrX   ( 376)  431 112.6 5.9e-25

>>CCDS7862.1 BBOX1 gene_id:8424|Hs108|chr11 (387 aa)
 initn: 2656 init1: 2656 opt: 2656 Z-score: 3466.7 bits: 650.2 E(32554): 8.9e-187
Smith-Waterman score: 2656; 100.0% identity (100.0% similar) in 387 aa overlap (1-387:1-387)

               10        20        30        40        50        60
pF1KE1 MACTIQKAEALDGAHLMQILWYDEEESLYPAVWLRDNCPCSDCYLDSAKARKLLVEALDV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 MACTIQKAEALDGAHLMQILWYDEEESLYPAVWLRDNCPCSDCYLDSAKARKLLVEALDV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 NIGIKGLIFDRKKVYITWPDEHYSEFQADWLKKRCFSKQARAKLQRELFFPECQYWGSEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 NIGIKGLIFDRKKVYITWPDEHYSEFQADWLKKRCFSKQARAKLQRELFFPECQYWGSEL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 QLPTLDFEDVLRYDEHAYKWLSTLKKVGIVRLTGASDKPGEVSKLGKRMGFLYLTFYGHT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 QLPTLDFEDVLRYDEHAYKWLSTLKKVGIVRLTGASDKPGEVSKLGKRMGFLYLTFYGHT
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 WQVQDKIDANNVAYTTGKLSFHTDYPALHHPPGVQLLHCIKQTVTGGDSEIVDGFNVCQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 WQVQDKIDANNVAYTTGKLSFHTDYPALHHPPGVQLLHCIKQTVTGGDSEIVDGFNVCQK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE1 LKKNNPQAFQILSSTFVDFTDIGVDYCDFSVQSKHKIIELDDKGQVVRINFNNATRDTIF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 LKKNNPQAFQILSSTFVDFTDIGVDYCDFSVQSKHKIIELDDKGQVVRINFNNATRDTIF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE1 DVPVERVQPFYAALKEFVDLMNSKESKFTFKMNPGDVITFDNWRLLHGRRSYEAGTEISR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 DVPVERVQPFYAALKEFVDLMNSKESKFTFKMNPGDVITFDNWRLLHGRRSYEAGTEISR
              310       320       330       340       350       360

              370       380
pF1KE1 HLEGAYADWDVVMSRLRILRQRVENGN
       :::::::::::::::::::::::::::
CCDS78 HLEGAYADWDVVMSRLRILRQRVENGN
              370       380

>>CCDS14768.1 TMLHE gene_id:55217|Hs108|chrX (421 aa)
 initn: 530 init1: 223 opt: 586 Z-score: 763.1 bits: 150.1 E(32554): 3.5e-36
Smith-Waterman score: 586; 28.3% identity (62.3% similar) in 353 aa overlap (29-379:71-417)

                 10        20        30        40        50
pF1KE1   MACTIQKAEALDGAHLMQILWYDEEESLYPAVWLRDNCPCSDCYLDSAKARKLLVEAL
                                     .  :::::.:  ..:: .... :.: . ..
CCDS14 WHHTASKSLTCAWQQHEDHFELKYANTVMRFDYVWLRDHCRSASCYNSKTHQRSLDTASV
               50        60        70        80        90       100

       60        70        80        90       100       110
pF1KE1 DVNIGIKGLIFDRKKVYITWPDEHYSEFQADWLKKRCFSKQARAKLQRELFFPECQYWGS
       :. :  : . .:.  ...:::: : .... .:: :  .  : .  .: ....    :  .
CCDS14 DLCIKPKTIRLDETTLFFTWPDGHVTKYDLNWLVKNSYEGQKQKVIQPRILWNAEIY--Q
              110       120       130       140       150

      120       130       140       150       160       170
pF1KE1 ELQLPTLDFEDVLRYDEHAYKWLSTLKKVGIVRLTGASDKPGEVSKLGKRMGFLYLTFYG
       . :.:..: .. :. .:   :.:...   ::. . ..     .. ::..:....  :.::
CCDS14 QAQVPSVDCQSFLETNEGLKKFLQNFLLYGIAFVENVPPTQEHTEKLAERISLIRETIYG
      160       170       180       190       200       210

      180       190       200       210       220       230
pF1KE1 HTWQVQDKIDANNVAYTTGKLSFHTDYPALHHPPGVQLLHCIKQTVTGGDSEIVDGFNVC
       . :   . .. ...:::   :. :::   ...: :.:..::.:.  ::: . .:::: .
CCDS14 RMWYFTSDFSRGDTAYTKLALDRHTDTTYFQEPCGIQVFHCLKHEGTGGRTLLVDGFYAA
      220       230       240       250       260       270

      240       250       260        270       280        290
pF1KE1 QKLKKNNPQAFQILSSTFVDFTDI-GVDYCDFSVQSKHKIIELDD-KGQVVRINFNNATR
       ... .. :. :..::.. .    :  :  :   . .   ....   . ..  : .::  :
CCDS14 EQVLQKAPEEFELLSKVPLKHEYIEDVGECHNHMIGIGPVLNIYPWNKELYLIRYNNYDR
      280       290       300       310       320       330

          300       310       320       330       340       350
pF1KE1 DTIFDVPVERVQPFYAALKEFVDLMNSKESKFTFKMNPGDVITFDNWRLLHGRRSYEAGT
        .:  :: . :. .:.: . ..  .   :..:  :..:: :. .::::.::::. . .
CCDS14 AVINTVPYDVVHRWYTAHRTLTIELRRPENEFWVKLKPGRVLFIDNWRVLHGRECFTG--
      340       350       360       370       380       390

          360       370       380
pF1KE1 EISRHLEGAYADWDVVMSRLRILRQRVENGN
          :.: : :   : :..  :.:
CCDS14 --YRQLCGCYLTRDDVLNTARLLGLQA
            400       410       420

>>CCDS55547.1 TMLHE gene_id:55217|Hs108|chrX (376 aa)
 initn: 392 init1: 223 opt: 431 Z-score: 561.5 bits: 112.6 E(32554): 5.9e-25
Smith-Waterman score: 431; 28.9% identity (66.2% similar) in 225 aa overlap (29-253:71-293)

                 10        20        30        40        50
pF1KE1   MACTIQKAEALDGAHLMQILWYDEEESLYPAVWLRDNCPCSDCYLDSAKARKLLVEAL
                                     .  :::::.:  ..:: .... :.: . ..
CCDS55 WHHTASKSLTCAWQQHEDHFELKYANTVMRFDYVWLRDHCRSASCYNSKTHQRSLDTASV
               50        60        70        80        90       100

       60        70        80        90       100       110
pF1KE1 DVNIGIKGLIFDRKKVYITWPDEHYSEFQADWLKKRCFSKQARAKLQRELFFPECQYWGS
       :. :  : . .:.  ...:::: : .... .:: :  .  : .  .: ....    :  .
CCDS55 DLCIKPKTIRLDETTLFFTWPDGHVTKYDLNWLVKNSYEGQKQKVIQPRILWNAEIY--Q
              110       120       130       140       150

      120       130       140       150       160       170
pF1KE1 ELQLPTLDFEDVLRYDEHAYKWLSTLKKVGIVRLTGASDKPGEVSKLGKRMGFLYLTFYG
       . :.:..: .. :. .:   :.:...   ::. . ..     .. ::..:....  :.::
CCDS55 QAQVPSVDCQSFLETNEGLKKFLQNFLLYGIAFVENVPPTQEHTEKLAERISLIRETIYG
      160       170       180       190       200       210

      180       190       200       210       220       230
pF1KE1 HTWQVQDKIDANNVAYTTGKLSFHTDYPALHHPPGVQLLHCIKQTVTGGDSEIVDGFNVC
       . :   . .. ...:::   :. :::   ...: :.:..::.:.  ::: . .:::: .
CCDS55 RMWYFTSDFSRGDTAYTKLALDRHTDTTYFQEPCGIQVFHCLKHEGTGGRTLLVDGFYAA
      220       230       240       250       260       270

      240       250       260       270       280       290
pF1KE1 QKLKKNNPQAFQILSSTFVDFTDIGVDYCDFSVQSKHKIIELDDKGQVVRINFNNATRDT
       ... .. :. :..::
CCDS55 EQVLQKAPEEFELLSKVPLKHEYIEDVGECHNHMIGIGPVLNIYPWNKELYLIRLFKEKQ
      280       290       300       310       320       330

387 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 07:41:13 2016 done: Sat Nov 5 07:41:13 2016
 Total Scan time: 2.260 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]