FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB3465, 345 aa
  1>>>pF1KB3465 345 - 345 aa - 345 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 12.6023+/-0.00107; mu= -17.0451+/- 0.063
 mean_var=424.8386+/-95.198, 0's: 0 Z-trim(115.5): 619 B-trim: 911 in 1/50
 Lambda= 0.062225
 statistics sampled from 15286 (16060) to 15286 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.801), E-opt: 0.2 (0.493), width: 16
 Scan time: 3.310

The best scores are:                                opt  bits E(32554)
CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4    ( 345) 2404 229.4 3.3e-60
CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13  ( 402)  668  73.6 3.1e-13
CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX   ( 359)  665  73.3 3.4e-13
CCDS66562.1 KLF5 gene_id:688|Hs108|chr13    ( 366)  624  69.7 4.4e-12
CCDS9448.1 KLF5 gene_id:688|Hs108|chr13     ( 457)  624  69.7 5.2e-12
CCDS7060.1 KLF6 gene_id:1316|Hs108|chr10    ( 283)  576  65.3 7.1e-11
CCDS6770.2 KLF4 gene_id:9314|Hs108|chr9     ( 479)  582  66.0 7.4e-11

>>CCDS3444.1 KLF3 gene_id:51274|Hs108|chr4 (345 aa)
 initn: 2404 init1: 2404 opt: 2404 Z-score: 1194.4 bits: 229.4 E(32554): 3.3e-60
Smith-Waterman score: 2404; 100.0% identity (100.0% similar) in 345 aa overlap (1-345:1-345)

               10        20        30        40        50        60
pF1KB3 MLMFDPVPVKQEAMDPVSVSYPSNYMESMKPNKYGVIYSTPLPEKFFQTPEGLSHGIQME
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 MLMFDPVPVKQEAMDPVSVSYPSNYMESMKPNKYGVIYSTPLPEKFFQTPEGLSHGIQME
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB3 PVDLTVNKRSSPPSAGNSPSSLKFPSSHRRASPGLSMPSSSPPIKKYSPPSPGVQPFGVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PVDLTVNKRSSPPSAGNSPSSLKFPSSHRRASPGLSMPSSSPPIKKYSPPSPGVQPFGVP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB3 LSMPPVMAAALSRHGIRSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 LSMPPVMAAALSRHGIRSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSM
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB3 QVPVIESYEKPISQKKIKIEPGIEPQRTDYYPEEMSPPLMNSVSPPQALLQENHPSVIVQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 QVPVIESYEKPISQKKIKIEPGIEPQRTDYYPEEMSPPLMNSVSPPQALLQENHPSVIVQ
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB3 PGKRPLPVESPDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 PGKRPLPVESPDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWK
              250       260       270       280       290       300

              310       320       330       340
pF1KB3 FARSDELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV
       :::::::::::::::::::::::::::::::::::::::::::::
CCDS34 FARSDELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV
              310       320       330       340

>>CCDS9449.1 KLF12 gene_id:11278|Hs108|chr13 (402 aa) initn: 786 init1: 645 opt: 668 Z-score: 351.2 bits: 73.6 E(32554): 3.1e-13 Smith-Waterman score: 779; 40.8% identity (63.6% similar) in 385 aa overlap (9-345:36-402) 10 20 30 pF1KB3 MLMFDPVPVKQEAMDPVSVSYPSNYMESMKPNKYGVIY ...: .: .::. ::.. : . . CCDS94 KRKTIKNINTFENRMLMLDGMPAVRVKTELLESEQGSPNVHNYPD--MEAV-PLLLNNVK 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB3 STPLPEKFFQTPEGLSHGIQMEPVDLTVNK-RSSPPSAGNSPSSL----KFPSSHRRASP . : :: ... . : :::::..:: :.:: ....:: :. . ::: .: CCDS94 GEP-PEDSLSVDH---FQTQTEPVDLSINKARTSPTAVSSSPVSMTASASSPSSTSTSSS 70 80 90 100 110 100 110 120 130 140 150 pF1KB3 GLSMPSSSPP-IKKYSPPSPGVQPFGVPLSMPPVMAAALSRHGIRSPGILPVIQPVV-VQ . : .::: : . : : . .. :. :..:.: :. . .: .:.:: . CCDS94 SSSRLASSPTVITSVSSASSS----STVLTPGPLVASA---SGVGGQQFLHIIHPVPPSS 120 130 140 150 160 170 160 170 180 190 200 pF1KB3 PVPFMYT--SHLQQ--------PLMVSLSEEMENSSSSMQVPVIESYEKPISQKKIKIEP :. .. . ::... :.. . . : .... ::..:. . .. : ...: CCDS94 PMNLQSNKLSHVHRIPVVVQSVPVVYTAVRSPGNVNNTIVVPLLEDGR---GHGKAQMDP 180 190 200 210 220 210 220 230 240 pF1KB3 -GIEPQ--RTDYYPEEMSPPLMNSVSPP--QAL-----LQENHPSVIVQP-GKR------ :. :. ..: ... ..::.
:: .:: ::: . . :.: CCDS94 RGLSPRQSKSDSDDDDLPNVTLDSVNETGSTALSIARAVQEVHPSPVSRVRGNRMNNQKF 230 240 250 260 270 280 250 260 270 280 290 pF1KB3 -----PLPVES---------PDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPY :. .:: ::. :::::::::..::::::::::::::::::::::::: CCDS94 PCSISPFSIESTRRQRRSESPDS-RKRRIHRCDFEGCNKVYTKSSHLKAHRRTHTGEKPY 290 300 310 320 330 340 300 310 320 330 340 pF1KB3 KCTWEGCTWKFARSDELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV ::::::::::::::::::::.:::::.:::.: ::::::::::::::::.::::: CCDS94 KCTWEGCTWKFARSDELTRHYRKHTGVKPFKCADCDRSFSRSDHLALHRRRHMLV 350 360 370 380 390 400 >>CCDS14373.1 KLF8 gene_id:11279|Hs108|chrX (359 aa) initn: 692 init1: 586 opt: 665 Z-score: 350.4 bits: 73.3 E(32554): 3.4e-13 Smith-Waterman score: 696; 38.9% identity (59.5% similar) in 368 aa overlap (1-342:22-356) 10 20 30 pF1KB3 MLMFDPVPVKQEAMDPVSVSYPSNYMESM--------KP : .: : .. . :: . : ::. .: CCDS14 MVDMDKLINNLEVQLNSEGGSMQVFKQVTASVRNRDPPEIEYRSNMTSPTLLDANPMENP 10 20 30 40 50 60 40 50 60 70 pF1KB3 NKYGVIYSTPLPEKFFQTPEGLSHGIQMEPVDLTVNKRSSP-------------PSAGNS .. : : ::... . .: :.:::::. .: ..: :. .: CCDS14 ALFNDIKIEP-PEELLASDFSLP---QVEPVDLSFHKPKAPLQPASMLQAPIRPPKPQSS 70 80 90 100 110 80 90 100 110 120 130 pF1KB3 PSSLKFPSSHRRASPGLSMPSS-SPPIKKYSPPSPGVQP-FGVPLSMPPVMAAALSRHGI :..: .: : . ..:. .: : : : : . : ..: : .. : CCDS14 PQTLVVSTSTSDMSTSANIPTVLTPGSVLTSSQSTGSQQILHVIHTIPSVSLP--NKMG- 120 130 140 150 160 170 140 150 160 170 180 190 pF1KB3 RSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYEKPISQKK :. . ::::: .:..::. : ... ... ::.: . : . . CCDS14 ---GLKTI--PVVVQSLPMVYTT---LP--------ADGGPAAITVPLIGGDGK--NAGS 180 190 200 210 200 210 220 230 240 250 pF1KB3 IKIEP-GIEPQRTDYYPEEMSPPLMNS-VSPPQALLQENHPSVIVQ-PGKRPLPVESPDT .:..: .. : . :: . .: .. :.: :: :....: :. :: : CCDS14 VKVDPTSMSPLEIPSDSEESTIESGSSALQSLQGLQQE--PAAMAQMQGE-----ESLDL 220 230 240 250 260 260 270 280 290 300 310 pF1KB3 QRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRK .: ::::.::. 
::.:::::::::::::: :::::::::::.::.::::::::::::::: CCDS14 KR-RRIHQCDFAGCSKVYTKSSHLKAHRRIHTGEKPYKCTWDGCSWKFARSDELTRHFRK 270 280 290 300 310 320 320 330 340 pF1KB3 HTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV :::::::.: ::.:::::::::.:::.:: CCDS14 HTGIKPFRCTDCNRSFSRSDHLSLHRRRHDTM 330 340 350 >>CCDS66562.1 KLF5 gene_id:688|Hs108|chr13 (366 aa) initn: 615 init1: 563 opt: 624 Z-score: 330.4 bits: 69.7 E(32554): 4.4e-12 Smith-Waterman score: 641; 35.3% identity (60.3% similar) in 368 aa overlap (6-342:11-364) 10 20 30 40 pF1KB3 MLMFDPVPV--KQEAMDPVSVSYPSNYMESMKPNKYGVIYSTPLPE------KFF :::. ... . :.: .... . . :.. ... ::. .. CCDS66 MEKYLTPQLPPVPIIPEHKKYRRDSASVVDQFFTDTEGLPYSINMNVFLPDITHLRTGLY 10 20 30 40 50 60 50 60 70 80 90 pF1KB3 QTPEGLSHGIQMEPVDLTVNKR--SSPPSAGNS--PSSLKFPSSHRRASPGLS------- .. . :. ::: . .. ..:: : .. : .. :::. :.: .. CCDS66 KSQRPCVTHIKTEPVAIFSHQSETTAPPPAPTQALPEFTSIFSSHQTAAPEVNNIFIKQE 70 80 90 100 110 120 100 110 120 130 140 150 pF1KB3 MPSSSPPIKKYSPPSPG--VQPFGVP-LSMPPV--MAAALSRHGIRSPGILPVIQPVVVQ .:. : .. : . : : ...: :.:: ..::.. .. . . .. . .. CCDS66 LPT--PDLHLSVPTQQGHLYQLLNTPDLDMPSSTNQTAAMDTLNVSMSAAMAGLN-THTS 130 140 150 160 170 160 170 180 190 200 210 pF1KB3 PVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYEKPISQKKIKIEPGIEPQRTDYY :: ....: : . : :. .: .: : .. ::: :.: . CCDS66 AVPQTAVKQFQG--MPPCTYTMP----SQFLPQQATYFPPSPPSS---EPG-SPDRQAEM 180 190 200 210 220 220 230 240 250 260 pF1KB3 PEEMSPP--LMNSVSPPQALLQENHPSVIVQPGKRPLPVE-----SPDTQRKRRIHRCDY ....:: ... :. . : :... .. ::. .:: . 
::::: ::: CCDS66 LQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQPVRYNRRSNPDLE-KRRIHYCDY 230 240 250 260 270 280 270 280 290 300 310 320 pF1KB3 DGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRKHTGIKPFQCPD ::.:::::::::::: :::::::::::::::: :.::::::::::.::::: ::::: CCDS66 PGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARSDELTRHYRKHTGAKPFQCGV 290 300 310 320 330 340 330 340 pF1KB3 CDRSFSRSDHLALHRKRHMLV :.:::::::::::: ::: CCDS66 CNRSFSRSDHLALHMKRHQN 350 360 >>CCDS9448.1 KLF5 gene_id:688|Hs108|chr13 (457 aa) initn: 615 init1: 563 opt: 624 Z-score: 329.1 bits: 69.7 E(32554): 5.2e-12 Smith-Waterman score: 641; 35.3% identity (60.3% similar) in 368 aa overlap (6-342:102-455) 10 20 30 pF1KB3 MLMFDPVPV--KQEAMDPVSVSYPSNYMESMKPNK :::. ... . :.: .... . . CCDS94 QPPATGPRLPPEDLVQTRCEMEKYLTPQLPPVPIIPEHKKYRRDSASVVDQFFTDTEGLP 80 90 100 110 120 130 40 50 60 70 80 pF1KB3 YGVIYSTPLPE------KFFQTPEGLSHGIQMEPVDLTVNKR--SSPPSAGNS--PSSLK :.. ... ::. .... . :. ::: . .. ..:: : .. : . CCDS94 YSINMNVFLPDITHLRTGLYKSQRPCVTHIKTEPVAIFSHQSETTAPPPAPTQALPEFTS 140 150 160 170 180 190 90 100 110 120 130 pF1KB3 FPSSHRRASPGLS-------MPSSSPPIKKYSPPSPG--VQPFGVP-LSMPPV--MAAAL . :::. :.: .. .:. : .. : . : : ...: :.:: ..::. CCDS94 IFSSHQTAAPEVNNIFIKQELPT--PDLHLSVPTQQGHLYQLLNTPDLDMPSSTNQTAAM 200 210 220 230 240 140 150 160 170 180 190 pF1KB3 SRHGIRSPGILPVIQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYEKP . .. . . .. . .. :: ....: : . : :. .: .: : CCDS94 DTLNVSMSAAMAGLN-THTSAVPQTAVKQFQG--MPPCTYTMP----SQFLPQQATYFPP 250 260 270 280 290 300 200 210 220 230 240 pF1KB3 ISQKKIKIEPGIEPQRTDYYPEEMSPP--LMNSVSPPQALLQENHPSVIVQPGKRPLPVE .. ::: :.: . ....:: ... :. . : :... .. ::. CCDS94 SPPSS---EPG-SPDRQAEMLQNLTPPPSYAATIASKLAIHNPNLPTTLPVNSQNIQPVR 310 320 330 340 350 250 260 270 280 290 300 pF1KB3 -----SPDTQRKRRIHRCDYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARS .:: . 
::::: ::: ::.:::::::::::: :::::::::::::::: :.:::: CCDS94 YNRRSNPDLE-KRRIHYCDYPGCTKVYTKSSHLKAHLRTHTGEKPYKCTWEGCDWRFARS 360 370 380 390 400 410 310 320 330 340 pF1KB3 DELTRHFRKHTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV ::::::.::::: ::::: :.:::::::::::: ::: CCDS94 DELTRHYRKHTGAKPFQCGVCNRSFSRSDHLALHMKRHQN 420 430 440 450 >>CCDS7060.1 KLF6 gene_id:1316|Hs108|chr10 (283 aa) initn: 747 init1: 557 opt: 576 Z-score: 308.7 bits: 65.3 E(32554): 7.1e-11 Smith-Waterman score: 576; 72.1% identity (83.7% similar) in 104 aa overlap (240-343:182-283) 210 220 230 240 250 260 pF1KB3 YYPEEMSPPLMNSVSPPQALLQENHPSVIVQPGKRPLPVESPDTQRKRRIHRCDYDGCNK .:: . ::: .::.::: ..:: : CCDS70 PELSREPSQLWGCVPGELPSPGKVRSGTSGKPGDKGNGDASPDG--RRRVHRCHFNGCRK 160 170 180 190 200 270 280 290 300 310 320 pF1KB3 VYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRKHTGIKPFQCPDCDRSF :::::::::::.:::::::::.:.:::: :.:::::::::::::::: :::.: ::: : CCDS70 VYTKSSHLKAHQRTHTGEKPYRCSWEGCEWRFARSDELTRHFRKHTGAKPFKCSHCDRCF 210 220 230 240 250 260 330 340 pF1KB3 SRSDHLALHRKRHMLV ::::::::: :::. CCDS70 SRSDHLALHMKRHL 270 280 >>CCDS6770.2 KLF4 gene_id:9314|Hs108|chr9 (479 aa) initn: 547 init1: 525 opt: 582 Z-score: 308.4 bits: 66.0 E(32554): 7.4e-11 Smith-Waterman score: 591; 39.0% identity (59.0% similar) in 290 aa overlap (58-342:207-478) 30 40 50 60 70 80 pF1KB3 SMKPNKYGVIYSTPLPEKFFQTPEGLSHGIQMEPVDLTVNKRSSPPSAGNSPSSLKFPSS ...:: . .. .::..: :: . CCDS67 SAPPPTAPFNLADINDVSPSGGFVAELLRPELDPVYIP-PQQPQPPGGGLMG---KFVLK 180 190 200 210 220 230 90 100 110 120 130 140 pF1KB3 HRRASPGLSMPSSSPPIKKYSPPSP-GVQPFGV-PLSM-PPVMAAALSRHGIRSPGILPV ..:: . .:: . . : :: : .: : : . :: ...... : : . CCDS67 ASLSAPGSEY--GSPSVISVSKGSPDGSHPVVVAPYNGGPPRTCPKIKQEAVSSCTHLGA 240 250 260 270 280 290 150 160 170 180 190 200 pF1KB3 IQPVVVQPVPFMYTSHLQQPLMVSLSEEMENSSSSMQVPVIESYEKPISQKKIKIEPGIE :. : ..: . :: .: : .. . . : . . . . ::.. 
CCDS67 GPPLSNGHRP---AAH-DFPLGRQLP-----SRTTPTLGLEEVLSSRDCHPALPLPPGFH 300 310 320 330 340 210 220 230 240 250 260 pF1KB3 PQRTDYYPEEMSPPLMNSVSPPQALLQENHPSVIVQPGKRPLPVESPDTQRKRRI--HRC :. :: . : :. :: :: : .: ..: : .. . ..: : : CCDS67 HPHPGPNYPSFL-PDQMQPQVPPLHY-QELMPPGSCMP-EEPKPKRGRRSWPRKRTATHTC 350 360 370 380 390 270 280 290 300 310 320 pF1KB3 DYDGCNKVYTKSSHLKAHRRTHTGEKPYKCTWEGCTWKFARSDELTRHFRKHTGIKPFQC :: ::.:.:::::::::: :::::::::.: :.:: ::::::::::::.::::: .:::: CCDS67 DYAGCGKTYTKSSHLKAHLRTHTGEKPYHCDWDGCGWKFARSDELTRHYRKHTGHRPFQC 400 410 420 430 440 450 330 340 pF1KB3 PDCDRSFSRSDHLALHRKRHMLV :::.:::::::::: ::: CCDS67 QKCDRAFSRSDHLALHMKRHF 460 470

345 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 13:19:20 2016 done: Thu Nov 3 13:19:21 2016
 Total Scan time: 3.310 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]