FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA1993, 504 aa
  1>>>pF1KSDA1993 504 - 504 aa - 504 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.6299+/-0.00151; mu= -3.8160+/- 0.086
 mean_var=315.5695+/-72.543, 0's: 0 Z-trim(107.6): 824 B-trim: 432 in 1/52
 Lambda= 0.072198
 statistics sampled from 8724 (9702) to 8724 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.658), E-opt: 0.2 (0.298), width: 16
 Scan time: 3.220

The best scores are:                                 opt  bits E(32554)
CCDS48023.1 ZBTB34 gene_id:403341|Hs108|chr9 ( 500) 3347 363.3 3.5e-100
CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1  ( 503) 1254 145.3 1.5e-34
CCDS1312.1 ZBTB37 gene_id:84614|Hs108|chr1   ( 361)  679  85.3 1.3e-16
CCDS6867.1 ZBTB43 gene_id:23099|Hs108|chr9   ( 467)  630  80.3 5.2e-15

>>CCDS48023.1 ZBTB34 gene_id:403341|Hs108|chr9 (500 aa)
 initn: 3347 init1: 3347 opt: 3347 Z-score: 1912.0 bits: 363.3 E(32554): 3.5e-100
Smith-Waterman score: 3347; 100.0% identity (100.0% similar) in 500 aa overlap (5-504:1-500)

               10        20        30        40        50        60
pF1KSD MSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY
           ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48     MDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY
                   10        20        30        40        50

               70        80        90       100       110       120
pF1KSD FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID
         60        70        80        90       100       110

              130       140       150       160       170       180
pF1KSD KCTQILESIHSKISVGDVDSVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGRQPTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 KCTQILESIHSKISVGDVDSVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGRQPTA
        120       130       140       150       160       170

              190       200       210       220       230       240
pF1KSD SSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEYEIQIEGDHEQGDLLVRESQITEVKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 SSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEYEIQIEGDHEQGDLLVRESQITEVKV
        180       190       200       210       220       230

              250       260       270       280       290       300
pF1KSD KMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSYSQAASQPTNVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 KMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSYSQAASQPTNVS
        240       250       260       270       280       290

              310       320       330       340       350       360
pF1KSD EAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSDSEAMMNNPGYESSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 EAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSDSEAMMNNPGYESSP
        300       310       320       330       340       350

              370       380       390       400       410       420
pF1KSD RERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVCKFCGKKYTRKDQLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 RERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVCKFCGKKYTRKDQLE
        360       370       380       390       400       410

              430       440       450       460       470       480
pF1KSD YHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRSRIESPERTDVYVEQKLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS48 YHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRSRIESPERTDVYVEQKLE
        420       430       440       450       460       470

              490       500
pF1KSD NDASASEMGLDSRMEIHTVSDAPD
       ::::::::::::::::::::::::
CCDS48 NDASASEMGLDSRMEIHTVSDAPD
        480       490       500

>>CCDS44278.1 ZBTB37 gene_id:84614|Hs108|chr1 (503 aa)
 initn: 1225 init1: 648 opt: 1254 Z-score: 733.8 bits: 145.3 E(32554): 1.5e-34
Smith-Waterman score: 1292; 45.6% identity (69.7% similar) in 498 aa overlap (5-487:1-485)

               10        20        30        40        50        60
pF1KSD MSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY
           :.... ::...:..:..:::.::.::.::.::::.:..::: :::::.::::::::
CCDS44     MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPY
                   10        20        30        40        50

               70        80        90       100       110       120
pF1KSD FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID
       :::: .:. :: .::::::::.:::::::::::::. 
::: :..:.:::::::::: .:: CCDS44 FRDHMSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIID 60 70 80 90 100 110 130 140 150 160 170 pF1KSD KCTQILESIHSKISVGDVD-----SVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQG :::::::.:: ::.:..:. . : :. :::. . . . .: . .: .: CCDS44 KCTQILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KSD RQPTASSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEY----EIQIE-GDHEQGDLLV : .: :.: . : .. .. : . :: : . . . .: : ..: CCDS44 -QVSAVLDIRELSPPEESTSPQIIEPS-SDVESREPILRINRAGQWYVETGVADRGGRSD 180 190 200 210 220 230 240 250 260 270 280 pF1KSD RESQIT-EVKVKMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSY : .. :..: :. .. ... :.:: .: : . ..... ..::: :. :. CCDS44 DEVRVLGAVHIKTENLEEWLGPENQPSGEDGSSAEEVTA---MVIDTTGHGSVGQENYTL 240 250 260 270 280 290 290 300 310 320 330 340 pF1KSD SQAASQ---PTNVSEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSD ...... ::. :: .. ::: :.. . :::.. . .. .. :. : CCDS44 GSSGAKVARPTS-SE----VDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVEES- 300 310 320 330 340 350 360 370 380 390 400 pF1KSD SEAMMNNPGYESSPRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVC :::. :: ::. . .:. :: :: ::::.:::::::::::::::::::::::: CCDS44 --AMMGVSGYVEYLREQEVSERWFRYNPRLTCIYCAKSFNQKGSLDRHMRLHMGITPFVC 350 360 370 380 390 400 410 420 430 440 450 460 pF1KSD KFCGKKYTRKDQLEYHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVRS-RI ..:::::::::::::::: :: .:::.:..::: ::::. ::::.:::::: ... . CCDS44 RMCGKKYTRKDQLEYHIRKHTGNKPFHCHVCGKSFPFQAILNQHFRKNHPGCIPLEGPHS 410 420 430 440 450 460 470 480 490 500 pF1KSD ESPERTDVYVEQKLENDASASEMGLDSRMEIHTVSDAPD ::: : . : :.. : : CCDS44 ISPETTVTSRGQAEEESPSQEETVAPGEAVQGSVSTTGPD 470 480 490 500 >>CCDS1312.1 ZBTB37 gene_id:84614|Hs108|chr1 (361 aa) initn: 670 init1: 648 opt: 679 Z-score: 411.9 bits: 85.3 E(32554): 1.3e-16 Smith-Waterman score: 717; 40.9% identity (68.5% similar) in 337 aa overlap (5-327:1-327) 10 20 30 40 50 60 pF1KSD MSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY :.... 
::...:..:..:::.::.::.::.::::.:..::: :::::.:::::::: CCDS13 MEKGGNIQLEIPDFSNSVLSHLNQLRMQGRLCDIVVNVQGQAFRAHKVVLAASSPY 10 20 30 40 50 70 80 90 100 110 120 pF1KSD FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID :::: .:. :: .::::::::.:::::::::::::. ::: :..:.:::::::::: .:: CCDS13 FRDHMSLNEMSTVSISVIKNPTVFEQLLSFCYTGRICLQLADIISYLTAASFLQMQHIID 60 70 80 90 100 110 130 140 150 160 170 pF1KSD KCTQILESIHSKISVGDVD-----SVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQG :::::::.:: ::.:..:. . : :. :::. . . . .: . .: .: CCDS13 KCTQILEGIHFKINVAEVEAELSQTRTKHQERPPESHRVTPNLNRSLSPRHNTPKGNRRG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KSD RQPTASSDLRMETTPSKALRSRLQEEGHSDRGSSGSVSEY----EIQIE-GDHEQGDLLV : .: :.: . : .. .. : . :: : . . . .: : ..: CCDS13 -QVSAVLDIRELSPPEESTSPQIIEPS-SDVESREPILRINRAGQWYVETGVADRGGRSD 180 190 200 210 220 230 240 250 260 270 280 pF1KSD RESQIT-EVKVKMEKSDRPSCSDSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVLQHAYSY : .. :..: :. .. ... :.:: .: : . ..... ..::: :. :. CCDS13 DEVRVLGAVHIKTENLEEWLGPENQPSGEDGSSAEEVTA---MVIDTTGHGSVGQENYTL 240 250 260 270 280 290 290 300 310 320 330 340 pF1KSD SQAASQ---PTNVSEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQGSD ...... ::. :: .. ::: :.. . :::.. CCDS13 GSSGAKVARPTS-SE----VDRFSPSGSVVPLTERHRARSESPGRMDEPKQPSSQVWSCG 300 310 320 330 340 350 360 370 380 390 400 pF1KSD SEAMMNNPGYESSPRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGITPFVC CCDS13 FRTALVVGGIATVYE 350 360 >>CCDS6867.1 ZBTB43 gene_id:23099|Hs108|chr9 (467 aa) initn: 754 init1: 406 opt: 630 Z-score: 382.9 bits: 80.3 E(32554): 5.2e-15 Smith-Waterman score: 631; 30.4% identity (57.6% similar) in 467 aa overlap (3-451:1-447) 10 20 30 40 50 60 pF1KSD MSVEMDSSSFIQFDVPEYSSTVLSQLNELRLQGKLCDIIVHIQGQPFRAHKAVLAASSPY .: ..:: . . :..:::.:..::. : ::.:::. . .::. :::::::::::::: CCDS68 MEPGTNSF-RVEFPDFSSTILQKLNQQRQQGQLCDVSIVVQGHIFRAHKAVLAASSPY 10 20 30 40 50 70 80 90 100 110 120 pF1KSD FRDHSALSTMSGLSISVIKNPNVFEQLLSFCYTGRMSLQLKDVVSFLTAASFLQMQCVID : :. :.. . . . :: :::..: ::::. . 
..::.::::::::: :.: CCDS68 FCDQVLLKNSRRIVLPDVMNPRVFENILLSSYTGRLVMPAPEIVSYLTAASFLQMWHVVD 60 70 80 90 100 110 130 140 150 160 170 pF1KSD KCTQILESIHSKISVGDVDSVTVGAEENPESRNGVKDSSFFANPVEISPPYCSQGR---- :::..::. . . .. . . : ::. .: ... . . : .. : CCDS68 KCTEVLEG-NPTVLCQKLNHGSDHQSPSSSSYNGLVESFELGSGGHTDFPKAQELRDGEN 120 130 140 150 160 170 180 190 200 210 220 pF1KSD -----QPTASSDL-RMETTPSKAL--RSRLQEEGHSDRGSSGSVSEYEIQIEGDHEQGDL . ::.: . : ::.. ..::. : :. : :. . :. : . CCDS68 EEESTKDELSSQLTEHEYLPSNSSTEHDRLSTEMASQDGEEGASDSAEF-----HYTRPM 180 190 200 210 220 230 230 240 250 260 270 280 pF1KSD LVRESQITE---VKVKMEKSDRPSCS--DSSSLGDDGYHTEMVDGEQVVAVNVGSYGSVL . : ... ..:: :. .. .: : . :. :: .. :. . : : CCDS68 YSKPSIMAHKRWIHVKPERLEQ-ACEGMDVHATYDEHQVTESINTVQTEHT-VQPSGVEE 240 250 260 270 280 290 300 310 320 330 340 pF1KSD QHAYSYSQAASQPTNVSEAFGSLSNSSPSRSMLSCFRGGRARQKRALSVHLHSDLQGLVQ . . ... .. . .. . . . : . : : :. .. : . .:. CCDS68 DFHIGEKKVEAEFDEQADESNYDEQVDFYGSSMEEFSGERSDG----NLIGHRQEAALAA 290 300 310 320 330 340 350 360 370 380 390 400 pF1KSD G-SDSEAMMNNPGYESSPRERSARGHWYPYNERLICIYCGKSFNQKGSLDRHMRLHMGIT : :.. :... :.: :: . :: : :::::..:.. :::: .:.:. CCDS68 GYSENIEMVTGIKEEASHLGFSATDKLYP------C-QCGKSFTHKSQRDRHMSMHLGLR 350 360 370 380 390 410 420 430 440 450 460 pF1KSD PFVCKFCGKKYTRKDQLEYHIRGHTDDKPFRCEICGKCFPFQGTLNQHLRKNHPGVAEVR :. : ::::. : .: :.. :: ::..:.::.: : .. ....:. CCDS68 PYGCGVCGKKFKMKHHLVGHMKIHTGIKPYECNICAKRFMWRDSFHRHVTSCTKSYEAAK 400 410 420 430 440 450 470 480 490 500 pF1KSD SRIESPERTDVYVEQKLENDASASEMGLDSRMEIHTVSDAPD CCDS68 AEQNTTEAN 460 504 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 07:56:28 2016 done: Thu Nov 3 07:56:28 2016 Total Scan time: 3.220 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]