FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7689, 349 aa
1>>>pF1KB7689 349 - 349 aa - 349 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.6441+/-0.000812; mu= 10.7239+/- 0.049
mean_var=98.0125+/-19.326, 0's: 0 Z-trim(110.1): 24 B-trim: 0 in 0/53
Lambda= 0.129549
statistics sampled from 11332 (11349) to 11332 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.718), E-opt: 0.2 (0.349), width: 16
Scan time: 2.840
The best scores are:                               opt  bits E(32554)
CCDS3835.1 IRF2 gene_id:3660|Hs108|chr4    ( 349) 2346 448.4   4e-126
CCDS4155.1 IRF1 gene_id:3659|Hs108|chr5    ( 325)  722 144.9  8.9e-35
CCDS4469.1 IRF4 gene_id:3662|Hs108|chr6    ( 451)  406  85.9  7.1e-17
CCDS10956.1 IRF8 gene_id:3394|Hs108|chr16  ( 426)  398  84.4  1.9e-16
CCDS43645.1 IRF5 gene_id:3663|Hs108|chr7   ( 514)  398  84.4  2.2e-16
CCDS56512.1 IRF5 gene_id:3663|Hs108|chr7   ( 412)  393  83.5  3.5e-16
CCDS1492.1 IRF6 gene_id:3664|Hs108|chr1    ( 467)  392  83.3  4.5e-16
CCDS5808.1 IRF5 gene_id:3663|Hs108|chr7    ( 498)  386  82.2    1e-15
CCDS9615.1 IRF9 gene_id:10379|Hs108|chr14  ( 393)  346  74.7  1.5e-13
>>CCDS3835.1 IRF2 gene_id:3660|Hs108|chr4 (349 aa)
initn: 2346 init1: 2346 opt: 2346 Z-score: 2377.8 bits: 448.4 E(32554): 4e-126
Smith-Waterman score: 2346; 100.0% identity (100.0% similar) in 349 aa overlap (1-349:1-349)
               10        20        30        40        50        60
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFRNWAI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFRNWAI
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB7 HTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLPLSERPSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 HTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLPLSERPSK
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB7 KGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTSTIKNEVDSTVNIIVVGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 KGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTSTIKNEVDSTVNIIVVGQ
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KB7 SHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSELYPLQISPVSSYAESETTDSVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSELYPLQISPVSSYAESETTDSVP
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KB7 SDEESAEGRPHWRKRNIEGKQYLSNMGTRGSYLLPGMASFVTSNKPDLQVTIKEESNPVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SDEESAEGRPHWRKRNIEGKQYLSNMGTRGSYLLPGMASFVTSNKPDLQVTIKEESNPVP
              250       260       270       280       290       300
              310       320       330       340
pF1KB7 YNSSWPPFQDLPLSSSMTPASSSSRPDRETRASVIKKTSDITQARVKSC
       :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 YNSSWPPFQDLPLSSSMTPASSSSRPDRETRASVIKKTSDITQARVKSC
              310       320       330       340
>>CCDS4155.1 IRF1 gene_id:3659|Hs108|chr5 (325 aa)
initn: 718 init1: 675 opt: 722 Z-score: 737.9 bits: 144.9 E(32554): 8.9e-35
Smith-Waterman score: 760; 46.7% identity (67.6% similar) in 272 aa overlap (1-265:1-259)
10 20 30 40 50 60
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFRNWAI
::. ::::::::: ::::: :::: :.:::. :::::: :::.::::..::: :::.:::
CCDS41 MPITRMRMRPWLEMQINSNQIPGLIWINKEEMIFQIPWKHAAKHGWDINKDACLFRSWAI
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 HTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLPLSERPSK
:::... : .:::::::::::::::::::::::::.: .::..: ::::::: . ..
CCDS41 HTGRYKAGEKEPDPKTWKANFRCAMNSLPDIEEVKDQSRNKGSSAVRVYRMLPPLTKNQR
70 80 90 100 110 120
130 140 150 160 170
pF1KB7 KGKKPKTEKEDKVKHIKQEPVESSLG-LSNGVSD--LSPEYAVLTSTIKNEVDSTVNIIV
: .: :. .. : : .. .:: .:.:.:. : ... : : . .. .
CCDS41 KERKSKSSRDAKSKAKRKSCGDSSPDTFSDGLSSSTLPDDHSSYT------VPGYMQDLE
130 140 150 160 170
180 190 200 210 220 230
pF1KB7 VGQSHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSELYPLQISPVSSYAESETTD
: :. : . . .. :: :::. : : :.:: .:.::. : .:. : .
CCDS41 VEQA-LTPALSPCAVSSTLPDWHIPVEVV------PDSTSDLYNFQVSPMPSTSEATTDE
180 190 200 210 220
240 250 260 270 280 290
pF1KB7 S----VPSDEESAEGRPHWRKRNIEGKQYLSNMGTRGSYLLPGMASFVTSNKPDLQVTIK
. .: : . . .:. :..:: :: :
CCDS41 DEEGKLPEDIMKLLEQSEWQPTNVDGKGYLLNEPGVQPTSVYGDFSCKEEPEIDSPGGDI
230 240 250 260 270 280
300 310 320 330 340
pF1KB7 EESNPVPYNSSWPPFQDLPLSSSMTPASSSSRPDRETRASVIKKTSDITQARVKSC
CCDS41 GLSLQRVFTDLKNMDATWLDSLLTPVRLPSIQAIPCAP
290 300 310 320
>>CCDS4469.1 IRF4 gene_id:3662|Hs108|chr6 (451 aa)
initn: 398 init1: 398 opt: 406 Z-score: 416.5 bits: 85.9 E(32554): 7.1e-17
Smith-Waterman score: 406; 43.3% identity (73.2% similar) in 127 aa overlap (7-133:23-146)
10 20 30 40
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARH
..: :: .::.:. ::: : :.::.::.::: ::...
CCDS44 MNLEGGGRGGEFGMSAVSCGNGKLRQWLIDQIDSGKYPGLVWENEEKSIFRIPWKHAGKQ
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB7 GWDVEKDAPLFRNWAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNN
.. :.:: ::. ::. :: . :.::::: :::. .:::.:. :.::. ..: ..
CCDS44 DYNREEDAALFKAWALFKGKFREGIDKPDPPTWKTRLRCALNKSNDFEELVERSQLDISD
70 80 90 100 110 120
110 120 130 140 150 160
pF1KB7 AFRVYRMLPLSERPSKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTST
..:::..: . .::: : : .. ..
CCDS44 PYKVYRIVP---EGAKKGAKQLTLEDPQMSMSHPYTMTTPYPSLPAQQVHNYMMPPLDRS
130 140 150 160 170
>>CCDS10956.1 IRF8 gene_id:3394|Hs108|chr16 (426 aa)
initn: 403 init1: 221 opt: 398 Z-score: 408.8 bits: 84.4 E(32554): 1.9e-16
Smith-Waterman score: 398; 46.6% identity (72.4% similar) in 116 aa overlap (7-122:9-123)
10 20 30 40 50
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFRNW
:.: :: :::.:. ::: : :.::..:.::: ::... .. : :: .:. :
CCDS10 MCDRNGGRRLRQWLIEQIDSSMYPGLIWENEEKSMFRIPWKHAGKQDYNQEVDASIFKAW
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 AIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLPLSERP
:. :: . : :: .: :::. .:::.:. ::.::: :.: .. ..:::..: :.
CCDS10 AVFKGKFKEG-DKAEPATWKTRLRCALNKSPDFEEVTDRSQLDISEPYKVYRIVPEEEQK
70 80 90 100 110
120 130 140 150 160 170
pF1KB7 SKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTSTIKNEVDSTVNIIVV
: :
CCDS10 CKLGVATAGCVNEVTEMECGRSEIDELIKEPSVDDYMGMIKRSPSPPEACRSQLLPDWWA
120 130 140 150 160 170
>>CCDS43645.1 IRF5 gene_id:3663|Hs108|chr7 (514 aa)
initn: 371 init1: 337 opt: 398 Z-score: 407.5 bits: 84.4 E(32554): 2.2e-16
Smith-Waterman score: 398; 35.3% identity (65.3% similar) in 173 aa overlap (2-166:11-180)
10 20 30 40 50
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKD
: .:.:..::: :.:: :::.:.: :::.: ::: ::.::: . . :
CCDS43 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 APLFRNWAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRM
.:. :: .:::. :::. :: ::::.:::.:. :.. . : . ...:..
CCDS43 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEV
70 80 90 100 110 120
120 130 140 150 160
pF1KB7 L-----PLSERPSKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSD---LSPEYAVLTS
: . .: . . :.:.. ..... . ::.:...:.. ..: :..:
CCDS43 CSNGPAPTDSQPPEDYSFGAGEEEEEEEELQR--MLPSLSLTDAVQSGPHMTP-YSLLKE
130 140 150 160 170
170 180 190 200 210 220
pF1KB7 TIKNEVDSTVNIIVVGQSHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSELYPLQ
.:
CCDS43 DVKWPPTLQPPTLRPPTLQPPTLQPPVVLGPPAPDPSPLAPPPGNPAGFRELLSEVLEPG
180 190 200 210 220 230
>>CCDS56512.1 IRF5 gene_id:3663|Hs108|chr7 (412 aa)
initn: 381 init1: 381 opt: 393 Z-score: 403.9 bits: 83.5 E(32554): 3.5e-16
Smith-Waterman score: 393; 31.9% identity (60.2% similar) in 226 aa overlap (2-219:11-231)
10 20 30 40 50
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKD
: .:.:..::: :.:: :::.:.: :::.: ::: ::.::: . . :
CCDS56 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 APLFRNWAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRM
.:. :: .:::. :::. :: ::::.:::.:. :.. . : . ...:..
CCDS56 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEV
70 80 90 100 110 120
120 130 140 150 160
pF1KB7 L-----PLSERPSKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTSTIK
: . .: . . :.:.. ..... . ::.:. :.:: .. .
CCDS56 CSNGPAPTDSQPPEDYSFGAGEEEEEEEELQR--MLPSLSLT--VTDLEIKFQYRGRPPR
130 140 150 160 170
170 180 190 200 210 220
pF1KB7 NEVDSTVNIIVVGQSHLDSNIENQEIVTNPPDICQVVEVTTE---SDEQPVSMSELYPLQ
. :. . . :.:... :. :. .: .. :: . : ::.: ..:
CCDS56 ALTISNPHGCRLFYSQLEATQEQVELF-GPISLEQVRFPSPEDIPSDKQRFYTNQLLDVL
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB7 ISPVSSYAESETTDSVPSDEESAEGRPHWRKRNIEGKQYLSNMGTRGSYLLPGMASFVTS
CCDS56 DRGLILQLQGQDLYAIRLCQCKVFWSGPCASAHDSCPNPIQREVKTKLFSLEHFLNELIL
240 250 260 270 280 290
>>CCDS1492.1 IRF6 gene_id:3664|Hs108|chr1 (467 aa)
initn: 408 init1: 370 opt: 392 Z-score: 402.1 bits: 83.3 E(32554): 4.5e-16
Smith-Waterman score: 392; 38.3% identity (68.8% similar) in 141 aa overlap (5-139:7-147)
10 20 30 40 50
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFRNW
:.:..::: :..:. ::: ::....: ::::: ::.::. . :.. .:. :
CCDS14 MALHPRRVRLKPWLVAQVDSGLYPGLIWLHRDSKRFQIPWKHATRHSPQQEEENTIFKAW
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 AIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRM--LPLSE
:..:::.: ::: ::: :::..:::.:. ... . : . . : ..:.. .: .
CCDS14 AVETGKYQEGVDDPDPAKWKAQLRCALNKSREFNLMYDGTKEVPMNPVKIYQVCDIPQPQ
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB7 ----RPSKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTSTIKNEVDST
:.. :. : ::.. : . .:
CCDS14 GSIINPGSTGSAPWDEKDNDVDEEDEEDELDQSQHHVPIQDTFPFLNINGSPMAPASVGN
130 140 150 160 170 180
>>CCDS5808.1 IRF5 gene_id:3663|Hs108|chr7 (498 aa)
initn: 371 init1: 337 opt: 386 Z-score: 395.6 bits: 82.2 E(32554): 1e-15
Smith-Waterman score: 386; 36.8% identity (65.2% similar) in 155 aa overlap (2-151:11-163)
10 20 30 40 50
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKD
: .:.:..::: :.:: :::.:.: :::.: ::: ::.::: . . :
CCDS58 MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGD
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 APLFRNWAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRM
.:. :: .:::. :::. :: ::::.:::.:. :.. . : . ...:..
CCDS58 NTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEV
70 80 90 100 110 120
120 130 140 150 160
pF1KB7 L-----PLSERPSKKGKKPKTEKEDKVKHIKQEPVESSLGLSNGVSDLSPEYAVLTSTIK
: . .: . . :.:.. ..... . ::.:.. :
CCDS58 CSNGPAPTDSQPPEDYSFGAGEEEEEEEELQR--MLPSLSLTEDVKWPPTLQPPTLRPPT
130 140 150 160 170
170 180 190 200 210 220
pF1KB7 NEVDSTVNIIVVGQSHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSELYPLQISP
CCDS58 LQPPTLQPPVVLGPPAPDPSPLAPPPGNPAGFRELLSEVLEPGPLPASLPPAGEQLLPDL
180 190 200 210 220 230
>>CCDS9615.1 IRF9 gene_id:10379|Hs108|chr14 (393 aa)
initn: 276 init1: 181 opt: 346 Z-score: 356.8 bits: 74.7 E(32554): 1.5e-13
Smith-Waterman score: 348; 28.9% identity (62.4% similar) in 218 aa overlap (7-221:11-213)
10 20 30 40 50
pF1KB7 MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFR
..: :. ::..:. .::. : . : .:.::: ::... . ..:: .:.
CCDS96 MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFK
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB7 NWAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLPLSE
::: ::.. : : : .::. .:::.:. ...:: ... . ..::..::
CCDS96 AWAIFKGKYKEG-DTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLP---
70 80 90 100 110
120 130 140 150 160 170
pF1KB7 RPSKKGKKPKTEK-EDKVKH--IKQEPVESSLGLSNGVSDLSPEYAVLTSTIKNEVDSTV
:. . .: :.: .: .: ...: : ...: ::: .:: ....:: ...
CCDS96 -PGIVSGQPGTQKVPSKRQHSSVSSERKEEEDAMQN--CTLSP--SVLQDSLNNEEEGAS
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB7 NIIVVGQSHLDSNIENQEIVTNPPDICQVVEVTTESDEQPVSMSELYPLQISPVSSYAES
. : : : . .. .: .. ...:. ..:.. :. : :
CCDS96 G----GAVHSDIGSSSSSSSPEPQEVTDTTEAPFQGDQR--SLEFLLPPEPDYSLLLTFI
180 190 200 210 220
240 250 260 270 280 290
pF1KB7 ETTDSVPSDEESAEGRPHWRKRNIEGKQYLSNMGTRGSYLLPGMASFVTSNKPDLQVTIK
CCDS96 YNGRVVGEAQVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNP
230 240 250 260 270 280
349 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 21:35:15 2016 done: Fri Nov 4 21:35:15 2016
Total Scan time: 2.840 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]