FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8904, 237 aa
1>>>pF1KB8904 237 - 237 aa - 237 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.2337+/-0.000708; mu= 7.5967+/- 0.043
mean_var=144.8985+/-29.649, 0's: 0 Z-trim(115.5): 45 B-trim: 586 in 2/52
Lambda= 0.106547
statistics sampled from 16054 (16100) to 16054 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.815), E-opt: 0.2 (0.495), width: 16
Scan time: 2.780
The best scores are: opt bits E(32554)
CCDS13278.1 TGIF2 gene_id:60436|Hs108|chr20 ( 237) 1581 253.4 9.6e-68
CCDS11833.1 TGIF1 gene_id:7050|Hs108|chr18 ( 272) 533 92.4 3.3e-19
CCDS11832.1 TGIF1 gene_id:7050|Hs108|chr18 ( 286) 533 92.4 3.4e-19
CCDS11835.1 TGIF1 gene_id:7050|Hs108|chr18 ( 252) 532 92.2 3.5e-19
CCDS11834.1 TGIF1 gene_id:7050|Hs108|chr18 ( 401) 533 92.5 4.5e-19
CCDS58772.1 C20orf24 gene_id:100527943|Hs108|chr20 ( 155) 418 74.5 4.5e-14
CCDS14459.1 TGIF2LX gene_id:90316|Hs108|chrX ( 241) 416 74.3 7.8e-14
CCDS14775.1 TGIF2LY gene_id:90655|Hs108|chrY ( 185) 369 67.0 9.6e-12
>>CCDS13278.1 TGIF2 gene_id:60436|Hs108|chr20 (237 aa)
initn: 1581 init1: 1581 opt: 1581 Z-score: 1329.7 bits: 253.4 E(32554): 9.6e-68
Smith-Waterman score: 1581; 100.0% identity (100.0% similar) in 237 aa overlap (1-237:1-237)
10 20 30 40 50 60
pF1KB8 MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPSEQEKLSLSGQTNL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPSEQEKLSLSGQTNL
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 SVLQICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDVALPRGSSPSVLAVSVPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 SVLQICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDVALPRGSSPSVLAVSVPA
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 PTNVLSLSVCSMPLHSGQGEKPAAPFPRGELESPKPLVTPGSTLTLLTRAEAGSPTGGLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 PTNVLSLSVCSMPLHSGQGEKPAAPFPRGELESPKPLVTPGSTLTLLTRAEAGSPTGGLF
130 140 150 160 170 180
190 200 210 220 230
pF1KB8 NTPPPTPPEQDKEDFSSFQLLVEVALQRAAEMELQKQQDPSLPLLHTPIPLVSENPQ
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 NTPPPTPPEQDKEDFSSFQLLVEVALQRAAEMELQKQQDPSLPLLHTPIPLVSENPQ
190 200 210 220 230
>>CCDS11833.1 TGIF1 gene_id:7050|Hs108|chr18 (272 aa)
initn: 762 init1: 515 opt: 533 Z-score: 458.3 bits: 92.4 E(32554): 3.3e-19
Smith-Waterman score: 661; 49.6% identity (64.7% similar) in 252 aa overlap (3-217:19-269)
10 20 30 40
pF1KB8 MSDS-DLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYN
:: :. : . . .:::.::::::::::.::::::: ::::
CCDS11 MKGKKGIVAASGSETEDEDSMDIPLDLSSSAGSGKRRRRGNLPKESVQILRDWLYEHRYN
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB8 AYPSEQEKLSLSGQTNLSVLQICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDV
:::::::: :: ::.::.::.::::::::::::::::::::::::::::::::.: :..
CCDS11 AYPSEQEKALLSQQTHLSTLQVCNWFINARRRLLPDMLRKDGKDPNQFTISRRGAKISET
70 80 90 100 110 120
110 120 130 140
pF1KB8 A-----------LPR-----------GSSPSVLAVSVPAPTNVLS-LSVCSMPLHSGQGE
. .: : .:.. : :.. : :. :. :.
CCDS11 SSVESVMGIKNFMPALEETPFHSCTAGPNPTLGRPLSPKPSSPGSVLARPSVICHTTVTA
130 140 150 160 170 180
150 160 170 180
pF1KB8 KPAAPFPR------GELESPKPLVTPGSTLTLLTRAEAGSPTG-------GLFNTPPPTP
.:: :. . . ... . : : : : .: ::::::::::
CCDS11 LKDVPFSLCQSVGVGQNTDIQQIAAKNFTDTSLMYPEDTCKSGPSTNTQSGLFNTPPPTP
190 200 210 220 230 240
190 200 210 220 230
pF1KB8 PEQDKEDFSSFQLLVEVALQRAAEMELQKQQDPSLPLLHTPIPLVSENPQ
:. .. :::.:::::.:::.:::::::: .
CCDS11 PDLNQ-DFSGFQLLVDVALKRAAEMELQAKLTA
250 260 270
>>CCDS11832.1 TGIF1 gene_id:7050|Hs108|chr18 (286 aa)
initn: 762 init1: 515 opt: 533 Z-score: 458.0 bits: 92.4 E(32554): 3.4e-19
Smith-Waterman score: 661; 49.6% identity (64.7% similar) in 252 aa overlap (3-217:33-283)
10 20 30
pF1KB8 MSDS-DLGEDEGLLSLAGKRKRRGNLPKESVK
:: :. : . . .:::.::::::::::.
CCDS11 CSGKSCALARSSLTSSQGIVAASGSETEDEDSMDIPLDLSSSAGSGKRRRRGNLPKESVQ
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB8 ILRDWLYLHRYNAYPSEQEKLSLSGQTNLSVLQICNWFINARRRLLPDMLRKDGKDPNQF
::::::: :::::::::::: :: ::.::.::.::::::::::::::::::::::::::
CCDS11 ILRDWLYEHRYNAYPSEQEKALLSQQTHLSTLQVCNWFINARRRLLPDMLRKDGKDPNQF
70 80 90 100 110 120
100 110 120
pF1KB8 TISRRGGKASDVA-----------LPR-----------GSSPSVLAVSVPAPTNVLS-LS
::::::.: :... .: : .:.. : :.. : :.
CCDS11 TISRRGAKISETSSVESVMGIKNFMPALEETPFHSCTAGPNPTLGRPLSPKPSSPGSVLA
130 140 150 160 170 180
130 140 150 160 170
pF1KB8 VCSMPLHSGQGEKPAAPFPR------GELESPKPLVTPGSTLTLLTRAEAGSPTG-----
:. :. .:: :. . . ... . : : : : .:
CCDS11 RPSVICHTTVTALKDVPFSLCQSVGVGQNTDIQQIAAKNFTDTSLMYPEDTCKSGPSTNT
190 200 210 220 230 240
180 190 200 210 220 230
pF1KB8 --GLFNTPPPTPPEQDKEDFSSFQLLVEVALQRAAEMELQKQQDPSLPLLHTPIPLVSEN
:::::::::::. .. :::.:::::.:::.:::::::: .
CCDS11 QSGLFNTPPPTPPDLNQ-DFSGFQLLVDVALKRAAEMELQAKLTA
250 260 270 280
pF1KB8 PQ
>>CCDS11835.1 TGIF1 gene_id:7050|Hs108|chr18 (252 aa)
initn: 762 init1: 515 opt: 532 Z-score: 457.9 bits: 92.2 E(32554): 3.5e-19
Smith-Waterman score: 660; 49.8% identity (64.7% similar) in 249 aa overlap (5-217:6-249)
10 20 30 40 50
pF1KB8 MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPSEQEKLSLSGQTN
::. . : .:::.::::::::::.::::::: :::::::::::: :: ::.
CCDS11 MDIPLDLSSSAG----SGKRRRRGNLPKESVQILRDWLYEHRYNAYPSEQEKALLSQQTH
10 20 30 40 50
60 70 80 90 100
pF1KB8 LSVLQICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDVA-----------LPR-
::.::.::::::::::::::::::::::::::::::::.: :... .:
CCDS11 LSTLQVCNWFINARRRLLPDMLRKDGKDPNQFTISRRGAKISETSSVESVMGIKNFMPAL
60 70 80 90 100 110
110 120 130 140 150
pF1KB8 ----------GSSPSVLAVSVPAPTNVLS-LSVCSMPLHSGQGEKPAAPFPR------GE
: .:.. : :.. : :. :. :. .:: :.
CCDS11 EETPFHSCTAGPNPTLGRPLSPKPSSPGSVLARPSVICHTTVTALKDVPFSLCQSVGVGQ
120 130 140 150 160 170
160 170 180 190 200
pF1KB8 LESPKPLVTPGSTLTLLTRAEAGSPTG-------GLFNTPPPTPPEQDKEDFSSFQLLVE
. . ... . : : : : .: :::::::::::. .. :::.:::::.
CCDS11 NTDIQQIAAKNFTDTSLMYPEDTCKSGPSTNTQSGLFNTPPPTPPDLNQ-DFSGFQLLVD
180 190 200 210 220 230
210 220 230
pF1KB8 VALQRAAEMELQKQQDPSLPLLHTPIPLVSENPQ
:::.:::::::: .
CCDS11 VALKRAAEMELQAKLTA
240 250
>>CCDS11834.1 TGIF1 gene_id:7050|Hs108|chr18 (401 aa)
initn: 725 init1: 515 opt: 533 Z-score: 455.9 bits: 92.5 E(32554): 4.5e-19
Smith-Waterman score: 661; 49.6% identity (64.7% similar) in 252 aa overlap (3-217:148-398)
10 20 30
pF1KB8 MSDS-DLGEDEGLLSLAGKRKRRGNLPKESVK
:: :. : . . .:::.::::::::::.
CCDS11 QGAQGPAPRRRLLETMKGIVAASGSETEDEDSMDIPLDLSSSAGSGKRRRRGNLPKESVQ
120 130 140 150 160 170
40 50 60 70 80 90
pF1KB8 ILRDWLYLHRYNAYPSEQEKLSLSGQTNLSVLQICNWFINARRRLLPDMLRKDGKDPNQF
::::::: :::::::::::: :: ::.::.::.::::::::::::::::::::::::::
CCDS11 ILRDWLYEHRYNAYPSEQEKALLSQQTHLSTLQVCNWFINARRRLLPDMLRKDGKDPNQF
180 190 200 210 220 230
100 110 120
pF1KB8 TISRRGGKASDVA-----------LPR-----------GSSPSVLAVSVPAPTNVLS-LS
::::::.: :... .: : .:.. : :.. : :.
CCDS11 TISRRGAKISETSSVESVMGIKNFMPALEETPFHSCTAGPNPTLGRPLSPKPSSPGSVLA
240 250 260 270 280 290
130 140 150 160 170
pF1KB8 VCSMPLHSGQGEKPAAPFPR------GELESPKPLVTPGSTLTLLTRAEAGSPTG-----
:. :. .:: :. . . ... . : : : : .:
CCDS11 RPSVICHTTVTALKDVPFSLCQSVGVGQNTDIQQIAAKNFTDTSLMYPEDTCKSGPSTNT
300 310 320 330 340 350
180 190 200 210 220 230
pF1KB8 --GLFNTPPPTPPEQDKEDFSSFQLLVEVALQRAAEMELQKQQDPSLPLLHTPIPLVSEN
:::::::::::. .. :::.:::::.:::.:::::::: .
CCDS11 QSGLFNTPPPTPPDLNQ-DFSGFQLLVDVALKRAAEMELQAKLTA
360 370 380 390 400
pF1KB8 PQ
>>CCDS58772.1 C20orf24 gene_id:100527943|Hs108|chr20 (155 aa)
initn: 415 init1: 415 opt: 418 Z-score: 366.1 bits: 74.5 E(32554): 4.5e-14
Smith-Waterman score: 418; 89.2% identity (90.5% similar) in 74 aa overlap (1-69:1-74)
10 20 30 40 50 60
pF1KB8 MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPSEQEKLSLSGQTNL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPSEQEKLSLSGQTNL
10 20 30 40 50 60
70 80 90 100 110
pF1KB8 SVLQ-----ICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDVALPRGSSPSVLA
:::: . ::
CCDS58 SVLQDEFLDVIYWFRQIIAVVLGVIWGVLPLRGFLGIAGFCLINAGVLYLYFSNYLQIDE
70 80 90 100 110 120
>>CCDS14459.1 TGIF2LX gene_id:90316|Hs108|chrX (241 aa)
initn: 498 init1: 358 opt: 416 Z-score: 361.8 bits: 74.3 E(32554): 7.8e-14
Smith-Waterman score: 531; 44.7% identity (68.3% similar) in 208 aa overlap (18-221:50-240)
10 20 30 40
pF1KB8 MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPS
:.::.:::: :::::::::.: ::..::::
CCDS14 PAKTQSPAQDTSIMSRNNADTGRVLALPEHKKKRKGNLPAESVKILRDWMYKHRFKAYPS
20 30 40 50 60 70
50 60 70 80 90 100
pF1KB8 EQEKLSLSGQTNLSVLQICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDVALPR
:.:: :: .::::.::: :::::::::.:::::.. .:: :... :: . .. .
CCDS14 EEEKQMLSEKTNLSLLQISNWFINARRRILPDMLQQRRNDP---IIGHKTGKDAHATHLQ
80 90 100 110 120 130
110 120 130 140 150 160
pF1KB8 GSSPSVLAVSVPA-PTNVLSLSVCSMPLHSGQGEKPAAPFPRGEL---ESPKPLVTPGST
.. :: : : :. : :: :: : :.:.:.. ..: : .:..
CCDS14 STEASVPAKSGPSGPDNVQSL--------------PLWPLPKGQMSREKQPDPESAPSQK
140 150 160 170 180
170 180 190 200 210 220
pF1KB8 LTLLTRAEAGSPTGGLFNTPPPTPPEQDKEDFSSFQLLVEVALQRAAEMELQKQQDPSLP
:: ... . .. . : ... ::::: :::..:.:::::.::.:.:.:.
CCDS14 LTGIAQPKKKVKVSVTSPSSPELVSPEEHADFSSFLLLVDAAVQRAAELELEKKQEPNP
190 200 210 220 230 240
230
pF1KB8 LLHTPIPLVSENPQ
>>CCDS14775.1 TGIF2LY gene_id:90655|Hs108|chrY (185 aa)
initn: 352 init1: 352 opt: 369 Z-score: 324.4 bits: 67.0 E(32554): 9.6e-12
Smith-Waterman score: 369; 52.2% identity (75.2% similar) in 113 aa overlap (18-130:50-159)
10 20 30 40
pF1KB8 MSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYLHRYNAYPS
:.::.:::: :::::::::.: ::..::::
CCDS14 PAKTQSPAQDTSIMSRNNADTGRVLALPEHKKKRKGNLPAESVKILRDWMYKHRFKAYPS
20 30 40 50 60 70
50 60 70 80 90 100
pF1KB8 EQEKLSLSGQTNLSVLQICNWFINARRRLLPDMLRKDGKDPNQFTISRRGGKASDVALPR
:.:: :: .::::.:.: :::::::::.:::::.. .:: :... :: . .. .
CCDS14 EEEKQMLSEKTNLSLLRISNWFINARRRILPDMLQQRRNDP---IIGHKTGKDAHATHLQ
80 90 100 110 120 130
110 120 130 140 150 160
pF1KB8 GSSPSVLAVSVPAPTNVLSLSVCSMPLHSGQGEKPAAPFPRGELESPKPLVTPGSTLTLL
.. :: : : :. .. . :
CCDS14 STEASVPAKSGPVVQTMYKACPCGPCQRARCQERSNQIRSRPLARSSPE
140 150 160 170 180
237 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:25:39 2016 done: Fri Nov 4 16:25:39 2016
Total Scan time: 2.780 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]