FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9574, 380 aa
  1>>>pF1KB9574 380 - 380 aa - 380 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.4890+/-0.000843; mu= 8.4443+/- 0.051
 mean_var=150.2520+/-30.492, 0's: 0 Z-trim(112.8): 47 B-trim: 543 in 1/52
 Lambda= 0.104632
 statistics sampled from 13504 (13545) to 13504 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.759), E-opt: 0.2 (0.416), width: 16
 Scan time: 2.750

The best scores are:                                      opt  bits E(32554)
CCDS9841.1 FOS gene_id:2353|Hs108|chr14           ( 380) 2507 389.7 2.3e-108
CCDS1766.1 FOSL2 gene_id:2355|Hs108|chr2          ( 326)  630 106.3    4e-23
CCDS12664.1 FOSB gene_id:2354|Hs108|chr19         ( 338)  484  84.3  1.8e-16
CCDS8121.1 FOSL1 gene_id:8061|Hs108|chr11         ( 271)  468  81.8    8e-16

>>CCDS9841.1 FOS gene_id:2353|Hs108|chr14 (380 aa)
 initn: 2507 init1: 2507 opt: 2507 Z-score: 2058.9 bits: 389.7 E(32554): 2.3e-108
Smith-Waterman score: 2507; 100.0% identity (100.0% similar) in 380 aa overlap (1-380:1-380)

               10        20        30        40        50        60
pF1KB9 MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVNAQDFCTDLAVSSANF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVNAQDFCTDLAVSSANF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 IPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPSAGAYSRAGVVKTMTGGRA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 IPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPSAGAYSRAGVVKTMTGGRA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 QSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRELTDTLQAETDQLEDEKSALQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 QSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRELTDTLQAETDQLEDEKSALQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 TEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSVASLDLTGGLPEVATPESEEAFT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 TEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSVASLDLTGGLPEVATPESEEAFT
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 LPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPASSRPSGSETARSVPDMDLSGSFYA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 LPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPASSRPSGSETARSVPDMDLSGSFYA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 ADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCTAYTSSFVFTYPEADSFPSCAAAHRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 ADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCTAYTSSFVFTYPEADSFPSCAAAHRK
              310       320       330       340       350       360

              370       380
pF1KB9 GSSSNEPSSDSLSSPTLLAL
       ::::::::::::::::::::
CCDS98 GSSSNEPSSDSLSSPTLLAL
              370       380

>>CCDS1766.1 FOSL2 gene_id:2355|Hs108|chr2 (326 aa)
 initn: 703 init1: 560 opt: 630 Z-score: 528.6 bits: 106.3 E(32554): 4e-23
Smith-Waterman score: 817; 43.4% identity (65.4% similar) in 396 aa overlap (2-380:1-326)

               10        20        30        40        50        60
pF1KB9 MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVNAQDFCTDLAVSSANF
        :.. . .... .::: ::.:::       :  :.:.:: :.  . : : .:.  :.. :
CCDS17  MYQDYPGNFD-TSSRGSSGSPA-------H--AESYSSGGG--GQQKFRVDMPGSGSAF
                10         20                 30          40

               70        80        90        100           110
pF1KB9 IPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFG-VPA----PSAGAYSRAGVVKTM
       :::..::.:: ::::.:::....:..    :. ::.. .:.    :.  :  : ::.::.
CCDS17 IPTINAITTSQDLQWMVQPTVITSMSNPYPRS-HPYSPLPGLASVPGHMALPRPGVIKTI
        50        60        70         80        90       100

         120       130       140       150       160       170
pF1KB9 TGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRELTDTLQAETDQLEDE
           . ..::: . :::::::::::::::::::.:::::::::::::. :::::..::.:
CCDS17 ----GTTVGRRRRDEQLSPEEEEKRRIRRERNKLAAAKCRNRRRELTEKLQAETEELEEE
            110       120       130       140       150       160

         180       190       200       210       220       230
pF1KB9 KSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSVASLDLTGGLPEVATPES
       ::.:: :::.: ::::::::.:.:: :.:::      :::                .:
CCDS17 KSGLQKEIAELQKEKEKLEFMLVAHGPVCKIS-----PEERR--------------SP--
            170       180       190                          200

         240       250           260       270       280        290
pF1KB9 EEAFTLPLLNDPEPKPSVEPVKS----ISSMELKTEPFDDFLFPASSRPSGSETARSV-P
                    : :...:..:    .... .: ::...   :.::  . ... :::
CCDS17 -------------PAPGLQPMRSGGGSVGAVVVKQEPLEE-DSPSSSSAGLDKAQRSVIK
                          210       220        230       240

              300       310       320       330       340
pF1KB9 DMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCTAYTSSFVFTYP----
        ....:.::.   ::::.      :.          ::: ::. :  ::..:::::
CCDS17 PISIAGGFYGE--EPLHT------PI----------VVTSTPAVTPGTSNLVFTYPSVLE
       250         260                       270       280

         350         360       370       380
pF1KB9 -EADSFPS--CAAAHRKGSSSNEPSSDSLSSPTLLAL
        :. . ::  :. :::..:::.. :::::.:::::::
CCDS17 QESPASPSESCSKAHRRSSSSGDQSSDSLNSPTLLAL
     290       300       310       320

>>CCDS12664.1 FOSB gene_id:2354|Hs108|chr19 (338 aa)
 initn: 711 init1: 440 opt: 484 Z-score: 409.2 bits: 84.3 E(32554): 1.8e-16
Smith-Waterman score: 762; 42.8% identity (64.4% similar) in 404 aa overlap (2-380:1-338)

               10        20        30        40        50        60
pF1KB9 MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVNAQDFCTDLAVSSANF
        ::..: .::. :.::::: ::...  : : : .:::.:  . . .:. :. :.   ..:
CCDS12  MFQAFPGDYD-SGSRCSS-SPSAE--SQYLSSVDSFGSPPTAAASQE-CAGLGEMPGSF
                10          20          30        40         50

               70        80             90         100
pF1KB9 IPTVTAISTSPDLQWLVQPALVSSVAPSQ-----TRAP--HPFGVPA-----PSAGAYSR
       .::::::.:: ::::::::.:.::.: ::     .. :   :. .:.     :. ..::
CCDS12 VPTVTAITTSQDLQWLVQPTLISSMAQSQGQPLASQPPVVDPYDMPGTSYSTPGMSGYSS
           60        70        80        90       100       110

      110          120                130       140       150
pF1KB9 AGVVKT---MTGGRAQSIG---------RRGKVEQLSPEEEEKRRIRRERNKMAAAKCRN
       .:.  .    :.: ... :         :: . : :.::::::::.::::::.:::::::
CCDS12 GGASGSGGPSTSGTTSGPGPARPARARPRRPREETLTPEEEEKRRVRRERNKLAAAKCRN
          120       130       140       150       160       170

        160       170       180       190       200       210
pF1KB9 RRRELTDTLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEM
       ::::::: :::::::::.::. :..:::.: ::::.:::.:.::.:.:::: . : :
CCDS12 RRRELTDRLQAETDQLEEEKAELESEIAELQKEKERLEFVLVAHKPGCKIPYEEG-PGPG
          180       190       200       210       220        230

        220        230       240       250       260       270
pF1KB9 SVASL-DLTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFP
        .: . :: :     ..: .:..:.  ::  : : :                ::
CCDS12 PLAEVRDLPG-----SAPAKEDGFSW-LLPPPPPPPL---------------PF------
           240            250        260

         280       290       300       310       320       330
pF1KB9 ASSRPSGSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCT
               .:....:  .:..:...      ::    .:      .:.  :::.  ::
CCDS12 --------QTSQDAPP-NLTASLFT------HSEVQVLG------DPF--PVVN--PS--
                270        280             290

         340       350       360       370       380
pF1KB9 AYTSSFVFTYPEADSFPSCAAAHRKGSSSNEPSSDSLSSPTLLAL
        ::::::.: ::...:   :.:.:  :.:..:: : :.::.::::
CCDS12 -YTSSFVLTCPEVSAF---AGAQRT-SGSDQPS-DPLNSPSLLAL
      300       310          320         330

>>CCDS8121.1 FOSL1 gene_id:8061|Hs108|chr11 (271 aa)
 initn: 677 init1: 422 opt: 468 Z-score: 397.5 bits: 81.8 E(32554): 8e-16
Smith-Waterman score: 603; 41.7% identity (57.7% similar) in 324 aa overlap (59-380:36-271)

       30        40        50        60        70        80
pF1KB9 YYHSPADSFSSMGSPVNAQDFCTDLAVSSANFIPTVTAISTSPDLQWLVQPALVSSVAPS
                                     ...:.....: : .:::.::: ...   ::
CCDS81 GEPGPSSGNGGGYGGPAQPPAAAQAAQQKFHLVPSINTMSGSQELQWMVQPHFLG---PS
          10        20        30        40        50        60

       90       100       110       120       130       140
pF1KB9 QTRAPHPFGVPAPSAGAYSRAGVVKTMTGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNK
       .   :.:.  :  :     : ::....  :   .. ::   ::.::::::.::.::::::
CCDS81 SY--PRPLTYPQYSP-PQPRPGVIRAL--GPPPGV-RRRPCEQISPEEEERRRVRRERNK
               70         80          90        100       110

      150       160       170       180       190       200
pF1KB9 MAAAKCRNRRRELTDTLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPD
       .:::::::::.:::: ::::::.::::::.:: :: .: :.::.::..: :::: ::::.
CCDS81 LAAAKCRNRRKELTDFLQAETDKLEDEKSGLQREIEELQKQKERLELVLEAHRPICKIPE
        120       130       140       150       160       170

      210       220       230       240       250       260
pF1KB9 DLGFPEEMSVASLDLTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEP
                 :.   ::.   ...:
CCDS81 G---------AKEGDTGSTSGTSSP-----------------------------------
                 180       190

      270       280       290       300       310       320
pF1KB9 FDDFLFPASSRPSGSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVV
             ::  ::        :: ..::         :        ::.  : : : ::..
CCDS81 ------PAPCRP--------VPCISLS---------P--------GPV-LEPEALHTPTL
                          200                         210       220

      330       340       350        360        370       380
pF1KB9 TCTPSCTAYTSSFVFTYPEADSFPS-CAAAHRKGSSSN-EPSSDSLSSPTLLAL
         ::: : .: :.:::::   : :  ::.::::.:::. .:::: :.:::::::
CCDS81 MTTPSLTPFTPSLVFTYP---STPEPCASAHRKSSSSSGDPSSDPLGSPTLLAL
              230          240       250       260       270

380 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 00:15:31 2016 done: Fri Nov 4 00:15:32 2016
 Total Scan time: 2.750 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]