FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0666, 357 aa
  1>>>pF1KE0666 357 - 357 aa - 357 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.7393+/-0.000965; mu= 3.6532+/- 0.058
 mean_var=315.5497+/-64.557, 0's: 0 Z-trim(114.8): 39 B-trim: 12 in 1/51
 Lambda= 0.072201
 statistics sampled from 15342 (15375) to 15342 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.781), E-opt: 0.2 (0.472), width: 16
 Scan time: 2.230

The best scores are:                                opt  bits E(32554)
CCDS41870.1 PSPC1 gene_id:55269|Hs108|chr13 ( 523) 2336 256.7 3.3e-68
CCDS14410.1 NONO gene_id:4841|Hs108|chrX    ( 471) 1655 185.7   7e-47
CCDS388.1 SFPQ gene_id:6421|Hs108|chr1      ( 707) 1638 184.1 3.1e-46
CCDS55445.1 NONO gene_id:4841|Hs108|chrX    ( 382) 1427 161.8 8.6e-40

>>CCDS41870.1 PSPC1 gene_id:55269|Hs108|chr13 (523 aa)
 initn: 2478 init1: 2336 opt: 2336 Z-score: 1338.0 bits: 256.7 E(32554): 3.3e-68
Smith-Waterman score: 2336; 99.2% identity (99.7% similar) in 353 aa overlap (1-353:37-389)

                                             10        20        30
pF1KE0                               MALALAGEPAPPAPAPPEDHPDEEMGFTID
                                     ::::::::::::::::::::::::::::::
CCDS41 LKQVRIEKNPARLRALESAVGESEPAAAAAMALALAGEPAPPAPAPPEDHPDEEMGFTID
         10        20        30        40        50        60

               40        50        60        70        80        90
pF1KE0 IKSFLKPGEKTYTQRCRLFVGNLPTDITEEDFKRLFERYGEPSEVFINRDRGFGFIRLES
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 IKSFLKPGEKTYTQRCRLFVGNLPTDITEEDFKRLFERYGEPSEVFINRDRGFGFIRLES
         70        80        90       100       110       120

              100       110       120       130       140       150
pF1KE0 RTLAEIAKAELDGTILKSRPLRIRFATHGAALTVKNLSPVVSNELLEQAFSQFGPVEKAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 RTLAEIAKAELDGTILKSRPLRIRFATHGAALTVKNLSPVVSNELLEQAFSQFGPVEKAV
        130       140       150       160       170       180

              160       170       180       190       200       210
pF1KE0 VVVDDRGRATGKGFVEFAAKPPARKALERCGDGAFLLTTTPRPVIVEPMEQFDDEDGLPE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 VVVDDRGRATGKGFVEFAAKPPARKALERCGDGAFLLTTTPRPVIVEPMEQFDDEDGLPE
        190       200       210       220       230       240

              220       230       240       250       260       270
pF1KE0 KLMQKTQQYHKEREQPPRFAQPGTFEFEYASRWKALDEMEKQQREQVDRNIREAKEKLEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 KLMQKTQQYHKEREQPPRFAQPGTFEFEYASRWKALDEMEKQQREQVDRNIREAKEKLEA
        250       260       270       280       290       300

              280       290       300       310       320       330
pF1KE0 EMEAARHEHQLMLMRQDLMRRQEELRRLEELRNQELQKRKQIQLRHEEEHRRREEEMIRH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EMEAARHEHQLMLMRQDLMRRQEELRRLEELRNQELQKRKQIQLRHEEEHRRREEEMIRH
        310       320       330       340       350       360

              340       350
pF1KE0 REQEELRRQQEGFKPNYMENGDKRKCG
       :::::::::::::::::::: ..
CCDS41 REQEELRRQQEGFKPNYMENREQEMRMGDMGPRGAINMGDAFSPAPAGNQGPPPMMGMNM
        370       380       390       400       410       420

>>CCDS14410.1 NONO gene_id:4841|Hs108|chrX (471 aa)
 initn: 1971 init1: 1594 opt: 1655 Z-score: 955.2 bits: 185.7 E(32554): 7e-47
Smith-Waterman score: 1655; 71.1% identity (90.9% similar) in 339 aa overlap (9-347:37-374)

                                     10        20        30
pF1KE0                       MALALAGEPAPPAPAPPEDHPDEEMGFTIDIKSFLKPG
                                     : :: ::  ..  ... :.:::.:.: :::
CCDS14 FNLEKQNHTPRKHHQHHHQQQHHQQQQQQPPPPPIPANGQQASSQNEGLTIDLKNFRKPG
         10        20        30        40        50        60

       40        50        60        70        80        90
pF1KE0 EKTYTQRCRLFVGNLPTDITEEDFKRLFERYGEPSEVFINRDRGFGFIRLESRTLAEIAK
       :::.::: :::::::: :::::....:::.::. .::::..:.::::::::.::::::::
CCDS14 EKTFTQRSRLFVGNLPPDITEEEMRKLFEKYGKAGEVFIHKDKGFGFIRLETRTLAEIAK
         70        80        90       100       110       120

      100       110       120       130       140       150
pF1KE0 AELDGTILKSRPLRIRFATHGAALTVKNLSPVVSNELLEQAFSQFGPVEKAVVVVDDRGR
       .:::.  :... ::.::: :.:.:::.::   :::::::.::: :: ::.:::.::::::
CCDS14 VELDNMPLRGKQLRVRFACHSASLTVRNLPQYVSNELLEEAFSVFGQVERAVVIVDDRGR
        130       140       150       160       170       180

      160       170       180       190       200       210
pF1KE0 ATGKGFVEFAAKPPARKALERCGDGAFLLTTTPRPVIVEPMEQFDDEDGLPEKLMQKTQQ
        .:::.:::..:: :::::.::..:.::::: :::: ::::.:.:::.::::::. :.::
CCDS14 PSGKGIVEFSGKPAARKALDRCSEGSFLLTTFPRPVTVEPMDQLDDEEGLPEKLVIKNQQ
        190       200       210       220       230       240

      220       230       240       250       260       270
pF1KE0 YHKEREQPPRFAQPGTFEFEYASRWKALDEMEKQQREQVDRNIREAKEKLEAEMEAARHE
       .::::::::::::::.::.::: ::::: ::::::..::::::.::.:::: ::::::::
CCDS14 FHKEREQPPRFAQPGSFEYEYAMRWKALIEMEKQQQDQVDRNIKEAREKLEMEMEAARHE
        250       260       270       280       290       300

      280       290       300       310       320       330
pF1KE0 HQLMLMRQDLMRRQEELRRLEELRNQELQKRKQIQLRHEEEHRRREEEMIRHREQEELRR
       ::.::::::::::::::::.:::.:::.:::::..::.:::.::::::: :....: .::
CCDS14 HQVMLMRQDLMRRQEELRRMEELHNQEVQKRKQLELRQEEERRRREEEM-RRQQEEMMRR
        310       320       330       340       350        360

      340       350
pF1KE0 QQEGFKPNYMENGDKRKCG
       :::::: ..
CCDS14 QQEGFKGTFPDAREQEIRMGQMAMGGAMGINNRGAMPPAPVPAGTPAPPGPATMMPDGTL
         370       380       390       400       410       420

>>CCDS388.1 SFPQ gene_id:6421|Hs108|chr1 (707 aa)
 initn: 1776 init1: 1582 opt: 1638 Z-score: 943.6 bits: 184.1 E(32554): 3.1e-46
Smith-Waterman score: 1638; 71.7% identity (91.4% similar) in 336 aa overlap (9-341:259-594)

                                     10         20        30
pF1KE0                       MALALAGEPAPPAPAP-PEDHPDEEMGFTIDIKSFLKP
                                     : : .:.   :.. ..  ::  ... . .:
CCDS38 GHPKPPHRGGGEPRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRP
      230       240       250       260       270       280

        40        50        60        70        80        90
pF1KE0 GEKTYTQRCRLFVGNLPTDITEEDFKRLFERYGEPSEVFINRDRGFGFIRLESRTLAEIA
       :::::::::::::::::.::::..::::: .::::.:::::. .:::::.::::.:::::
CCDS38 GEKTYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIA
      290       300       310       320       330       340

       100       110       120       130       140       150
pF1KE0 KAELDGTILKSRPLRIRFATHGAALTVKNLSPVVSNELLEQAFSQFGPVEKAVVVVDDRG
       ::::: : ...: ::.:::::.:::.:.:::: :::::::.:::::::.:.:::.:::::
CCDS38 KAELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDRG
      350       360       370       380       390       400

       160       170       180       190       200       210
pF1KE0 RATGKGFVEFAAKPPARKALERCGDGAFLLTTTPRPVIVEPMEQFDDEDGLPEKLMQKTQ
       :.::::.::::.:: ::::.:::..:.::::::::::::::.::.:::::::::: ::.
CCDS38 RSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQKNP
      410       420       430       440       450       460

       220       230       240       250       260       270
pF1KE0 QYHKEREQPPRFAQPGTFEFEYASRWKALDEMEKQQREQVDRNIREAKEKLEAEMEAARH
       .:.:::: :::::: ::::.::..:::.::::::::::::..:...::.:::.::: : :
CCDS38 MYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEMEDAYH
      470       480       490       500       510       520

       280       290       300       310       320        330
pF1KE0 EHQLMLMRQDLMRRQEELRRLEELRNQELQKRKQIQLRHEEEHRRREEEM-IRHREQEE-
       :::  :.:::::::::::::.:::.:::.::::..:::.:::.::::::: ::.::.::
CCDS38 EHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQREMEEQ
      530       540       550       560       570       580

         340       350
pF1KE0 LRRQQEGFKPNYMENGDKRKCG
       .:::.:
CCDS38 MRRQREESYSRMGYMDPRERDMRMGGGGAMNMGDPYGSGGQKFPPLGGGGGIGYEANPGV
      590       600       610       620       630       640

>>CCDS55445.1 NONO gene_id:4841|Hs108|chrX (382 aa)
 initn: 1746 init1: 1369 opt: 1427 Z-score: 827.9 bits: 161.8 E(32554): 8.6e-40
Smith-Waterman score: 1427; 73.0% identity (92.6% similar) in 285 aa overlap (63-347:2-285)

             40        50        60        70        80        90
pF1KE0 SFLKPGEKTYTQRCRLFVGNLPTDITEEDFKRLFERYGEPSEVFINRDRGFGFIRLESRT
                                     ..:::.::. .::::..:.::::::::.::
CCDS55                              MRKLFEKYGKAGEVFIHKDKGFGFIRLETRT
                                            10        20        30

            100       110       120       130       140       150
pF1KE0 LAEIAKAELDGTILKSRPLRIRFATHGAALTVKNLSPVVSNELLEQAFSQFGPVEKAVVV
       ::::::.:::.  :... ::.::: :.:.:::.::   :::::::.::: :: ::.:::.
CCDS55 LAEIAKVELDNMPLRGKQLRVRFACHSASLTVRNLPQYVSNELLEEAFSVFGQVERAVVI
              40        50        60        70        80        90

            160       170       180       190       200       210
pF1KE0 VDDRGRATGKGFVEFAAKPPARKALERCGDGAFLLTTTPRPVIVEPMEQFDDEDGLPEKL
       :::::: .:::.:::..:: :::::.::..:.::::: :::: ::::.:.:::.::::::
CCDS55 VDDRGRPSGKGIVEFSGKPAARKALDRCSEGSFLLTTFPRPVTVEPMDQLDDEEGLPEKL
             100       110       120       130       140       150

            220       230       240       250       260       270
pF1KE0 MQKTQQYHKEREQPPRFAQPGTFEFEYASRWKALDEMEKQQREQVDRNIREAKEKLEAEM
       . :.::.::::::::::::::.::.::: ::::: ::::::..::::::.::.:::: ::
CCDS55 VIKNQQFHKEREQPPRFAQPGSFEYEYAMRWKALIEMEKQQQDQVDRNIKEAREKLEMEM
             160       170       180       190       200       210

            280       290       300       310       320       330
pF1KE0 EAARHEHQLMLMRQDLMRRQEELRRLEELRNQELQKRKQIQLRHEEEHRRREEEMIRHRE
       ::::::::.::::::::::::::::.:::.:::.:::::..::.:::.::::::: :...
CCDS55 EAARHEHQVMLMRQDLMRRQEELRRMEELHNQEVQKRKQLELRQEEERRRREEEM-RRQQ
             220       230       240       250       260        270

            340       350
pF1KE0 QEELRRQQEGFKPNYMENGDKRKCG
       .: .:::::::: ..
CCDS55 EEMMRRQQEGFKGTFPDAREQEIRMGQMAMGGAMGINNRGAMPPAPVPAGTPAPPGPATM
              280       290       300       310       320       330

357 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 13:29:20 2016 done: Sat Nov 5 13:29:21 2016
 Total Scan time: 2.230 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]