FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9684, 164 aa 1>>>pF1KB9684 164 - 164 aa - 164 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3775+/-0.000655; mu= 12.0412+/- 0.039 mean_var=67.1663+/-12.824, 0's: 0 Z-trim(110.9): 77 B-trim: 5 in 2/52 Lambda= 0.156494 statistics sampled from 11889 (11967) to 11889 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.368), width: 16 Scan time: 1.930 The best scores are: opt bits E(32554) CCDS14649.1 ZNF449 gene_id:203523|Hs108|chrX ( 518) 815 192.3 3.5e-49 CCDS66921.1 ZSCAN32 gene_id:54925|Hs108|chr16 ( 697) 482 117.2 1.9e-26 CCDS10095.2 ZSCAN29 gene_id:146050|Hs108|chr15 ( 852) 471 114.8 1.3e-25 CCDS42427.1 ZSCAN30 gene_id:100101467|Hs108|chr18 ( 494) 441 107.9 8.7e-24 CCDS10495.1 ZNF213 gene_id:7760|Hs108|chr16 ( 459) 437 107.0 1.5e-23 CCDS82246.1 ZNF396 gene_id:252884|Hs108|chr18 ( 210) 431 105.4 2e-23 CCDS5681.1 ZSCAN21 gene_id:7589|Hs108|chr7 ( 473) 435 106.5 2.1e-23 CCDS11913.1 ZNF396 gene_id:252884|Hs108|chr18 ( 333) 431 105.6 3e-23 CCDS82247.1 ZNF396 gene_id:252884|Hs108|chr18 ( 335) 431 105.6 3e-23 CCDS5667.1 ZKSCAN5 gene_id:23660|Hs108|chr7 ( 839) 433 106.2 4.8e-23 CCDS41300.1 ZSCAN20 gene_id:7579|Hs108|chr1 (1043) 430 105.6 9.1e-23 CCDS34355.1 ZBED9 gene_id:114821|Hs108|chr6 (1325) 427 105.0 1.8e-22 CCDS32410.1 ZKSCAN2 gene_id:342357|Hs108|chr16 ( 967) 423 104.0 2.6e-22 CCDS47393.1 ZSCAN23 gene_id:222696|Hs108|chr6 ( 389) 412 101.3 6.7e-22 CCDS4646.1 ZSCAN9 gene_id:7746|Hs108|chr6 ( 394) 404 99.5 2.4e-21 CCDS32814.1 ZNF397 gene_id:84307|Hs108|chr18 ( 275) 402 99.0 2.4e-21 CCDS56407.1 ZSCAN9 gene_id:7746|Hs108|chr6 ( 445) 404 99.5 2.6e-21 CCDS45852.1 ZNF397 gene_id:84307|Hs108|chr18 ( 534) 402 99.1 4.2e-21 CCDS35105.1 ZNF483 gene_id:158399|Hs108|chr9 ( 256) 398 98.0 4.2e-21 CCDS76814.1 ZNF500 gene_id:26048|Hs108|chr16 ( 441) 400 98.6 4.8e-21 CCDS32383.1 ZNF500 gene_id:26048|Hs108|chr16 ( 480) 400 98.6 5.2e-21 CCDS77175.1 ZNF24 gene_id:7572|Hs108|chr18 ( 193) 395 97.3 5.3e-21 CCDS11912.1 ZNF24 gene_id:7572|Hs108|chr18 ( 368) 395 97.4 9.1e-21 CCDS35106.1 ZNF483 gene_id:158399|Hs108|chr9 ( 744) 398 98.3 1e-20 CCDS4644.1 ZSCAN16 gene_id:80345|Hs108|chr6 ( 348) 392 96.8 1.4e-20 CCDS82043.1 ZNF232 gene_id:7775|Hs108|chr17 ( 435) 390 96.4 2.3e-20 CCDS11068.1 ZNF232 gene_id:7775|Hs108|chr17 ( 444) 390 96.4 2.3e-20 CCDS4647.1 ZKSCAN4 gene_id:387032|Hs108|chr6 ( 545) 391 96.6 2.4e-20 CCDS4650.1 ZKSCAN3 gene_id:80317|Hs108|chr6 ( 538) 389 96.2 3.2e-20 CCDS4649.1 ZSCAN31 gene_id:64288|Hs108|chr6 ( 406) 385 95.2 4.7e-20 CCDS2714.1 ZKSCAN7 gene_id:55888|Hs108|chr3 ( 276) 381 94.2 6.4e-20 CCDS32380.1 ZNF174 gene_id:7727|Hs108|chr16 ( 234) 378 93.5 8.9e-20 CCDS4643.1 ZNF165 gene_id:7718|Hs108|chr6 ( 485) 381 94.3 1e-19 CCDS10504.1 ZNF174 gene_id:7727|Hs108|chr16 ( 407) 378 93.6 1.4e-19 CCDS2715.1 ZKSCAN7 gene_id:55888|Hs108|chr3 ( 754) 381 94.4 1.5e-19 CCDS5666.1 ZNF394 gene_id:84124|Hs108|chr7 ( 561) 378 93.7 1.9e-19 CCDS69349.1 ZKSCAN1 gene_id:7586|Hs108|chr7 ( 527) 377 93.5 2.1e-19 CCDS34698.1 ZKSCAN1 gene_id:7586|Hs108|chr7 ( 563) 377 93.5 2.2e-19 CCDS5671.2 ZSCAN25 gene_id:221785|Hs108|chr7 ( 544) 376 93.2 2.5e-19 CCDS7775.1 ZNF215 gene_id:7762|Hs108|chr11 ( 517) 375 93.0 2.8e-19 CCDS10499.1 ZNF263 gene_id:10127|Hs108|chr16 ( 683) 370 91.9 7.7e-19 CCDS8443.1 ZNF202 gene_id:7753|Hs108|chr11 ( 648) 369 91.7 8.6e-19 CCDS4645.1 ZKSCAN8 gene_id:7745|Hs108|chr6 ( 578) 367 91.2 1.1e-18 CCDS74476.1 ZNF274 gene_id:10782|Hs108|chr19 ( 548) 361 89.9 2.6e-18 CCDS74475.1 ZNF274 gene_id:10782|Hs108|chr19 ( 611) 361 89.9 2.9e-18 CCDS74474.1 ZNF274 gene_id:10782|Hs108|chr19 ( 621) 361 89.9 2.9e-18 CCDS74473.1 ZNF274 gene_id:10782|Hs108|chr19 ( 653) 361 89.9 3e-18 CCDS1631.1 ZNF496 gene_id:84838|Hs108|chr1 ( 587) 360 89.7 3.2e-18 CCDS33743.1 ZNF197 gene_id:10168|Hs108|chr3 ( 267) 355 88.3 3.6e-18 CCDS12975.1 ZSCAN22 gene_id:342945|Hs108|chr19 ( 491) 356 88.7 5.2e-18 >>CCDS14649.1 ZNF449 gene_id:203523|Hs108|chrX (518 aa) initn: 833 init1: 815 opt: 815 Z-score: 996.5 bits: 192.3 E(32554): 3.5e-49 Smith-Waterman score: 815; 100.0% identity (100.0% similar) in 119 aa overlap (1-119:1-119) 10 20 30 40 50 60 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQFQYREAAGPHEAFNKLWELCCQWLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQFQYREAAGPHEAFNKLWELCCQWLK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 PKMRSKEQILELLVLEQFLTILPTEIETWVREHCPENRERVVSLIEDLQRELEIPEQQVR ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 PKMRSKEQILELLVLEQFLTILPTEIETWVREHCPENRERVVSLIEDLQRELEIPEQQVD 70 80 90 100 110 120 130 140 150 160 pF1KB9 KECGIFVIGPKEVAEAAGTRPHICLYVIPQPQDFSRVLFLFLFC CCDS14 MHDMLLEELAPVGTAHIPPTMHLESPALQVMGPAQEAPVAEAWIPQAGPPELNYGATGEC 130 140 150 160 170 180 >>CCDS66921.1 ZSCAN32 gene_id:54925|Hs108|chr16 (697 aa) initn: 473 init1: 453 opt: 482 Z-score: 588.3 bits: 117.2 E(32554): 1.9e-26 Smith-Waterman score: 482; 60.3% identity (78.6% similar) in 126 aa overlap (15-140:22-144) 10 20 30 40 50 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQFQYREAAGPHEAFNKLWE : :..: . : :. ::::::: :.:..::::::.:::: CCDS66 MMAAVKSTEAHPSSNKDPTQGQKSALQGNSPDSEASRQRFRQFCYQEVTGPHEAFSKLWE 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 LCCQWLKPKMRSKEQILELLVLEQFLTILPTEIETWVREHCPENRERVVSLIEDLQRELE ::::::.:: .:::.::::::::::::::: ::.:::::. ::: :..:.:.::.:: CCDS66 LCCQWLRPKTHSKEEILELLVLEQFLTILPEEIQTWVREQHPENGEEAVALVEDVQRA-- 70 80 90 100 110 120 130 140 150 160 pF1KB9 IPEQQVRKECGIFVIGPKEVAEAAGTRPHICLYVIPQPQDFSRVLFLFLFC : ::: . . ::.: ..:: CCDS66 -PGQQVLDSEKDLKVLMKEMAPLGATRESLRSQWKQEVQPEEPTFKGSQSSHQRPGEQSE 120 130 140 150 160 170 >>CCDS10095.2 ZSCAN29 gene_id:146050|Hs108|chr15 (852 aa) initn: 497 init1: 467 opt: 471 Z-score: 573.5 bits: 114.8 E(32554): 1.3e-25 Smith-Waterman score: 473; 51.0% identity (75.9% similar) in 145 aa overlap (17-152:5-143) 10 20 30 40 50 60 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQFQYREAAGPHEAFNKLWELCCQWLK :...: :. :.::::::.:.:.:.:::.:::..::::::.::. CCDS10 MMAKSALRENGTNSETFRQRFRRFHYQEVAGPREAFSQLWELCCRWLR 10 20 30 40 70 80 90 100 110 pF1KB9 PKMRSKEQILELLVLEQFLTILPTEIETWVREHCPENRERVVSLIEDLQRE--------- :..:.::::.::::::::::.:: ::..::.:.:::: :..:.:.:::.:: CCDS10 PEVRTKEQIVELLVLEQFLTVLPGEIQNWVQEQCPENGEEAVTLVEDLEREPGRPRSSVT 50 60 70 80 90 100 120 130 140 150 160 pF1KB9 LEIPEQQVRKECGIFVIGPKEVAEAAGTRPHICLYVIPQPQDFSRVLFLFLFC . . :.:: : . :: : ..: . : :::. CCDS10 VSVKGQEVRLEK---MTPPKSSQELLSVRQE---SVEPQPRGVPKKERARSPDLGPQEQM 110 120 130 140 150 160 CCDS10 NPKEKLKPFQRSGLPFPKSGVVSRLEQGEPWIPDLLGSKEKELPSGSHIGDRRVHADLLP 170 180 190 200 210 220 >>CCDS42427.1 ZSCAN30 gene_id:100101467|Hs108|chr18 (494 aa) initn: 454 init1: 435 opt: 441 Z-score: 540.5 bits: 107.9 E(32554): 8.7e-24 Smith-Waterman score: 441; 58.5% identity (87.7% similar) in 106 aa overlap (13-118:31-136) 10 20 30 40 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQFQYREAA :.: .:: . :::::.::::.: ... CCDS42 MSGEATVLAYHAPEEQEGLLVVKVEEENYVLDQDFGLQENPWSQEVFRQKFRQFSYSDST 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB9 GPHEAFNKLWELCCQWLKPKMRSKEQILELLVLEQFLTILPTEIETWVREHCPENRERVV ::.::...: :::::::.:...:::::::::.:::::.::: :...:.::: ::: :..: CCDS42 GPREALSRLRELCCQWLRPEVHSKEQILELLMLEQFLAILPEELQAWLREHRPENGEEAV 70 80 90 100 110 120 110 120 130 140 150 160 pF1KB9 SLIEDLQRELEIPEQQVRKECGIFVIGPKEVAEAAGTRPHICLYVIPQPQDFSRVLFLFL ...:.:..::: :.:: CCDS42 TMLEELEKELEEPRQQDTTHGQEMFWQEMTSTGALKSLSLNSPVQPLENQCKTETQESQA 130 140 150 160 170 180 >>CCDS10495.1 ZNF213 gene_id:7760|Hs108|chr16 (459 aa) initn: 409 init1: 409 opt: 437 Z-score: 536.1 bits: 107.0 E(32554): 1.5e-23 Smith-Waterman score: 442; 47.7% identity (70.9% similar) in 151 aa overlap (11-152:26-170) 10 20 30 40 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQFQYREAAGPH .: .: :. .: : :. ::::::: : .. ::: CCDS10 MAAPLEAQDQAPGEGEGLLIVKVEDSSWEQESAQHEDGRDSEACRQRFRQFCYGDVHGPH 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB9 EAFNKLWELCCQWLKPKMRSKEQILELLVLEQFLTILPTEIETWVREHCPENRERVVSLI :::..::::::.::.:..:.:::::::::::::::.:: ::. ::::. : . :..:.:. CCDS10 EAFSQLWELCCRWLRPELRTKEQILELLVLEQFLTVLPGEIQGWVREQHPGSGEEAVALV 70 80 90 100 110 120 110 120 130 140 150 pF1KB9 EDLQRE------LEIPEQQVRKEC---GIFVIGPKEVAEAAGTRPHICLYVIPQPQDFSR ::::.. ..: .... : : . :: .. .: :: . :: : CCDS10 EDLQKQPVKAWRQDVPSEEAEPEAAGRGSQATGPPPTV-GARRRPSV-----PQEQHSHS 130 140 150 160 170 160 pF1KB9 VLFLFLFC CCDS10 AQPPALLKEGRPGETTDTCFVSGVHGPVALGDIPFYFSREEWGTLDPAQRDLFWDIKREN 180 190 200 210 220 230 >>CCDS82246.1 ZNF396 gene_id:252884|Hs108|chr18 (210 aa) initn: 456 init1: 410 opt: 431 Z-score: 533.9 bits: 105.4 E(32554): 2e-23 Smith-Waterman score: 431; 48.5% identity (78.8% similar) in 132 aa overlap (7-135:34-160) 10 20 30 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQF : ..::. .: .. :.:::.:::: CCDS82 KLGKSSSLLTQTSEECNGILTEKMEEEEQTCDPDSSLHWSSSYSP-----ETFRQQFRQF 10 20 30 40 50 40 50 60 70 80 90 pF1KB9 QYREAAGPHEAFNKLWELCCQWLKPKMRSKEQILELLVLEQFLTILPTEIETWVREHCPE :... :::::...::::: ::.:....::::::::::::::.::: :...::..: :: CCDS82 GYQDSPGPHEALSRLWELCHLWLRPEVHTKEQILELLVLEQFLAILPKELQAWVQKHHPE 60 70 80 90 100 110 100 110 120 130 140 150 pF1KB9 NRERVVSLIEDLQRELEIPEQQV---RKECGIFVIGPKEVAEAAGTRPHICLYVIPQPQD : :..:...::..:::. :.: ::. ..:.:..: CCDS82 NGEETVTMLEDVERELDGPKQIFFGRRKDMIAEKLAPSEITEELPSSQLMPVKKQLQGAS 120 130 140 150 160 170 160 pF1KB9 FSRVLFLFLFC CCDS82 WELQSLRPHGEGAGFSSCWKVWGPSAHHTHLS 180 190 200 210 >>CCDS5681.1 ZSCAN21 gene_id:7589|Hs108|chr7 (473 aa) initn: 452 init1: 433 opt: 435 Z-score: 533.5 bits: 106.5 E(32554): 2.1e-23 Smith-Waterman score: 435; 60.6% identity (88.9% similar) in 99 aa overlap (21-119:36-134) 10 20 30 40 50 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQFQYREAAGPHEAFNK .: . :.:::::::: :... ::.::... CCDS56 LGMAPVLGPRPPQEQVGPLMVKVEEKEEKGKYLPSLEMFRQRFRQFGYHDTPGPREALSQ 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 LWELCCQWLKPKMRSKEQILELLVLEQFLTILPTEIETWVREHCPENRERVVSLIEDLQR : :::.::.:....:::::::::::::::::: :...::.:::::. :..:.:.:::.: CCDS56 LRVLCCEWLRPEIHTKEQILELLVLEQFLTILPQELQAWVQEHCPESAEEAVTLLEDLER 70 80 90 100 110 120 120 130 140 150 160 pF1KB9 ELEIPEQQVRKECGIFVIGPKEVAEAAGTRPHICLYVIPQPQDFSRVLFLFLFC ::. : .:: CCDS56 ELDEPGHQVSTPPNEQKPVWEKISSSGTAKESPSSMQPQPLETSHKYESWGPLYIQESGE 130 140 150 160 170 180 >>CCDS11913.1 ZNF396 gene_id:252884|Hs108|chr18 (333 aa) initn: 430 init1: 410 opt: 431 Z-score: 530.9 bits: 105.6 E(32554): 3e-23 Smith-Waterman score: 431; 48.5% identity (78.8% similar) in 132 aa overlap (7-135:34-160) 10 20 30 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQF : ..::. .: .. :.:::.:::: CCDS11 KLGKSSSLLTQTSEECNGILTEKMEEEEQTCDPDSSLHWSSSYSP-----ETFRQQFRQF 10 20 30 40 50 40 50 60 70 80 90 pF1KB9 QYREAAGPHEAFNKLWELCCQWLKPKMRSKEQILELLVLEQFLTILPTEIETWVREHCPE :... :::::...::::: ::.:....::::::::::::::.::: :...::..: :: CCDS11 GYQDSPGPHEALSRLWELCHLWLRPEVHTKEQILELLVLEQFLAILPKELQAWVQKHHPE 60 70 80 90 100 110 100 110 120 130 140 150 pF1KB9 NRERVVSLIEDLQRELEIPEQQV---RKECGIFVIGPKEVAEAAGTRPHICLYVIPQPQD : :..:...::..:::. :.: ::. ..:.:..: CCDS11 NGEETVTMLEDVERELDGPKQIFFGRRKDMIAEKLAPSEITEELPSSQLMPVKKQLQGAS 120 130 140 150 160 170 160 pF1KB9 FSRVLFLFLFC CCDS11 WELQSLRPHDEDIKTTNVKSASRQKTSLGIELHCNVSNILHMNGSQSSTYRGTYEQDGRF 180 190 200 210 220 230 >>CCDS82247.1 ZNF396 gene_id:252884|Hs108|chr18 (335 aa) initn: 430 init1: 410 opt: 431 Z-score: 530.9 bits: 105.6 E(32554): 3e-23 Smith-Waterman score: 431; 48.5% identity (78.8% similar) in 132 aa overlap (7-135:34-160) 10 20 30 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDCEVFRQRFRQF : ..::. .: .. :.:::.:::: CCDS82 KLGKSSSLLTQTSEECNGILTEKMEEEEQTCDPDSSLHWSSSYSP-----ETFRQQFRQF 10 20 30 40 50 40 50 60 70 80 90 pF1KB9 QYREAAGPHEAFNKLWELCCQWLKPKMRSKEQILELLVLEQFLTILPTEIETWVREHCPE :... :::::...::::: ::.:....::::::::::::::.::: :...::..: :: CCDS82 GYQDSPGPHEALSRLWELCHLWLRPEVHTKEQILELLVLEQFLAILPKELQAWVQKHHPE 60 70 80 90 100 110 100 110 120 130 140 150 pF1KB9 NRERVVSLIEDLQRELEIPEQQV---RKECGIFVIGPKEVAEAAGTRPHICLYVIPQPQD : :..:...::..:::. :.: ::. ..:.:..: CCDS82 NGEETVTMLEDVERELDGPKQIFFGRRKDMIAEKLAPSEITEELPSSQLMPVKKQLQGAS 120 130 140 150 160 170 160 pF1KB9 FSRVLFLFLFC CCDS82 WELQSLRPHDEDIKTTNVKSASRQKTSLGIELHCNVSNILHMNGSQSSTYRGTYEQDGRF 180 190 200 210 220 230 >>CCDS5667.1 ZKSCAN5 gene_id:23660|Hs108|chr7 (839 aa) initn: 441 init1: 418 opt: 433 Z-score: 527.3 bits: 106.2 E(32554): 4.8e-23 Smith-Waterman score: 433; 51.1% identity (75.2% similar) in 137 aa overlap (20-152:39-172) 10 20 30 40 pF1KB9 MAVALGCAIQASLNQGSVFQEYDTDC-EVFRQRFRQFQYREAAGPHEAF :::. :.: ::::.:::.::.::.::. CCDS56 VIDLDPPAETSQEQEDLFIVKVEEEDCTWMQEYNPPTFETFYQRFRHFQYHEASGPREAL 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB9 NKLWELCCQWLKPKMRSKEQILELLVLEQFLTILPTEIETWVREHCPENRERVVSLIEDL ..: :::.::.:....:::::::::::::::::: :.. ::::: ::. :..:..::.. CCDS56 SQLRVLCCEWLRPELHTKEQILELLVLEQFLTILPEEFQPWVREHHPESGEEAVAVIENI 70 80 90 100 110 120 110 120 130 140 150 160 pF1KB9 QRELEIPEQQVRKECGIF---VIGPKEVAEAAGTRPHICLYVIPQPQDFSRVLFLFLFC ::::: .::. .. . : : :. . .: : : ::. CCDS56 QRELEERRQQIVACPDVLPRKMATPGAVQESCSPHP---LTVDTQPEQAPQKPRLLEENA 130 140 150 160 170 180 CCDS56 LPVLQVPSLPLKDSQELTASLLSTGSQKLVKIEEVADVAVSFILEEWGHLDQSQKSLYRD 190 200 210 220 230 240 164 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 18:18:47 2016 done: Fri Nov 4 18:18:47 2016 Total Scan time: 1.930 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]