FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5229, 293 aa
1>>>pF1KB5229 293 - 293 aa - 293 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.0289+/-0.000916; mu= 16.2519+/- 0.055
mean_var=59.8304+/-12.232, 0's: 0 Z-trim(104.7): 24 B-trim: 0 in 0/51
Lambda= 0.165811
statistics sampled from 8001 (8014) to 8001 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.62), E-opt: 0.2 (0.246), width: 16
Scan time: 2.200
The best scores are:                                opt  bits E(32554)
CCDS3809.1 MSMO1 gene_id:6307|Hs108|chr4    ( 293) 2096 509.9 8.8e-145
CCDS43280.1 MSMO1 gene_id:6307|Hs108|chr4   ( 162) 1166 287.3 5e-78
CCDS7400.1 CH25H gene_id:9023|Hs108|chr10   ( 272)  381  99.6 2.6e-21
CCDS43390.1 FAXDC2 gene_id:10826|Hs108|chr5 ( 333)  355  93.5 2.3e-19
CCDS8435.1 SC5D gene_id:6309|Hs108|chr11    ( 299)  279  75.3 6.3e-14
>>CCDS3809.1 MSMO1 gene_id:6307|Hs108|chr4 (293 aa)
initn: 2096 init1: 2096 opt: 2096 Z-score: 2712.7 bits: 509.9 E(32554): 8.8e-145
Smith-Waterman score: 2096; 100.0% identity (100.0% similar) in 293 aa overlap (1-293:1-293)
10 20 30 40 50 60
pF1KB5 MATNESVSIFSSASLAVEYVDSLLPENPLQEPFKNAWNYMLNNYTKFQIATWGSLIVHEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 MATNESVSIFSSASLAVEYVDSLLPENPLQEPFKNAWNYMLNNYTKFQIATWGSLIVHEA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 LYFLFCLPGFLFQFIPYMKKYKIQKDKPETWENQWKCFKVLLFNHFCIQLPLICGTYYFT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 LYFLFCLPGFLFQFIPYMKKYKIQKDKPETWENQWKCFKVLLFNHFCIQLPLICGTYYFT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 EYFNIPYDWERMPRWYFLLARCFGCAVIEDTWHYFLHRLLHHKRIYKYIHKVHHEFQAPF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 EYFNIPYDWERMPRWYFLLARCFGCAVIEDTWHYFLHRLLHHKRIYKYIHKVHHEFQAPF
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 GMEAEYAHPLETLILGTGFFIGIVLLCDHVILLWAWVTIRLLETIDVHSGYDIPLNPLNL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 GMEAEYAHPLETLILGTGFFIGIVLLCDHVILLWAWVTIRLLETIDVHSGYDIPLNPLNL
190 200 210 220 230 240
250 260 270 280 290
pF1KB5 IPFYAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDSQYNAYNEKRKKFEKKTE
:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 IPFYAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDSQYNAYNEKRKKFEKKTE
250 260 270 280 290
>>CCDS43280.1 MSMO1 gene_id:6307|Hs108|chr4 (162 aa)
initn: 1166 init1: 1166 opt: 1166 Z-score: 1514.3 bits: 287.3 E(32554): 5e-78
Smith-Waterman score: 1166; 100.0% identity (100.0% similar) in 162 aa overlap (132-293:1-162)
110 120 130 140 150 160
pF1KB5 LFNHFCIQLPLICGTYYFTEYFNIPYDWERMPRWYFLLARCFGCAVIEDTWHYFLHRLLH
::::::::::::::::::::::::::::::
CCDS43 MPRWYFLLARCFGCAVIEDTWHYFLHRLLH
10 20 30
170 180 190 200 210 220
pF1KB5 HKRIYKYIHKVHHEFQAPFGMEAEYAHPLETLILGTGFFIGIVLLCDHVILLWAWVTIRL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 HKRIYKYIHKVHHEFQAPFGMEAEYAHPLETLILGTGFFIGIVLLCDHVILLWAWVTIRL
40 50 60 70 80 90
230 240 250 260 270 280
pF1KB5 LETIDVHSGYDIPLNPLNLIPFYAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDSQYNAY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LETIDVHSGYDIPLNPLNLIPFYAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDSQYNAY
100 110 120 130 140 150
290
pF1KB5 NEKRKKFEKKTE
::::::::::::
CCDS43 NEKRKKFEKKTE
160
>>CCDS7400.1 CH25H gene_id:9023|Hs108|chr10 (272 aa)
initn: 273 init1: 147 opt: 381 Z-score: 496.0 bits: 99.6 E(32554): 2.6e-21
Smith-Waterman score: 381; 28.0% identity (59.3% similar) in 246 aa overlap (37-274:24-263)
10 20 30 40 50 60
pF1KB5 VSIFSSASLAVEYVDSLLPENPLQEPFKNAWNYMLNNYTKFQIATWGSLIVHEALYFLFC
:.. : .. . . . .: . : ::
CCDS74 MSCHNCSDPQVLCSSGQLFLQPLWDH-LRSWEALLQSPFFPVIFSITTYVGFC
10 20 30 40 50
70 80 90 100 110 120
pF1KB5 LP----GFLFQFIPYMKKYKIQKDKPETWENQWKCFKVLLFNHFCIQLPLICGTYYFTEY
:: .: ...: ...:::. : . .. :. :..: . .:. . .
CCDS74 LPFVVLDILCSWVPALRRYKIHPDFSPSAQQLLPCLGQTLYQHVMFVFPVTLLHWARSPA
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB5 FNIPYDWERMPRWYFLLARCFGCAVIEDTWHYFLHRLLHHKR--IYKYIHKVHHEFQAPF
. .:.. :. .:: . . : .. : ..:. .::::: .:. .:::::. .. :
CCDS74 L-LPHE---APELLLLLHHILFCLLLFDM-EFFVWHLLHHKVPWLYRTFHKVHHQNSSSF
120 130 140 150 160
190 200 210 220 230 240
pF1KB5 GMEAEYAHPLETLILGTGFFIGIVLLCDHVILLWAWVTIRLLETIDVHSGYDIPLNPLNL
.. ..: : . :: .....:: : . .. .. . ... ::::..: . :
CCDS74 ALATQYMSVWELFSLGFFDMMNVTLLGCHPLTTLTFHVVNIWLSVEDHSGYNFPWSTHRL
170 180 190 200 210 220
250 260 270 280 290
pF1KB5 IPF--YAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDSQYNAYNEKRKKFEKKTE
.:: :.: :::.:: .: :.: :: ::.:.::
CCDS74 VPFGWYGGVVHHDLHHSHFNCNFAPYFTHWDKILGTLRTASVPAR
230 240 250 260 270
>>CCDS43390.1 FAXDC2 gene_id:10826|Hs108|chr5 (333 aa)
initn: 257 init1: 179 opt: 355 Z-score: 461.1 bits: 93.5 E(32554): 2.3e-19
Smith-Waterman score: 359; 27.1% identity (58.0% similar) in 269 aa overlap (33-290:60-311)
10 20 30 40 50
pF1KB5 TNESVSIFSSASLAVEYVDSLLPENPLQEPFKNAWNYMLNNYTKFQIATWGSLIV---HE
.. :. .: : :. : ... .
CCDS43 ILGSGLLSFVAFWNSVTWHLQRFWGASGYFWQAQWERLL---TTFEGKEWILFFIGAIQV
30 40 50 60 70 80
60 70 80 90 100 110
pF1KB5 ALYFLFCLPGFLFQF----IP-YMKKYKIQ--KDKPETWENQWKCFKVLLFNHFCIQLPL
:.. . :.:. : ....:.:: :..: . . ....:::. :..:.
CCDS43 PCLFFWSFNGLLLVVDTTGKPNFISRYRIQVGKNEPVDPVKLRQSIRTVLFNQCMISFPM
90 100 110 120 130 140
120 130 140 150 160 170
pF1KB5 ICGTYYFTEYFNIPYDWERMPRWYFLLARCFGCAVIEDTWHYFLHRLLHHKRIYKYIHKV
. : : ... : : .: ....: . ..::.. :. :::::: .:: :::
CCDS43 VVFLYPFLKWWRDPCRRE-LPTFHWFLLELAIFTLIEEVLFYYSHRLLHHPTFYKKIHKK
150 160 170 180 190 200
180 190 200 210 220 230
pF1KB5 HHEFQAPFGMEAEYAHPLETLILGT-GFFIGIVLLCDHVILLWAWVTIRLLETIDVHSGY
:::. ::.:. . ::::.: . . ..: ... .:. . : .. :. : : ::
CCDS43 HHEWTAPIGVISLYAHPIEHAVSNMLPVIVGPLVMGSHLSSITMWFSLALIITTISHCGY
210 220 230 240 250 260
240 250 260 270 280 290
pF1KB5 DIPLNPLNLIPFYAGSRHHDFHHMNFIGNYASTFTWWDRIFGTDSQYNAYNEKRKKFEKK
.:. : : . ::.::..: :. .. :.. :::... .. : .:.
CCDS43 HLPFLPS---PEF-----HDYHHLKFNQCYG-VLGVLDHLHGTDTMF----KQTKAYERH
270 280 290 300 310
pF1KB5 TE
CCDS43 VLLLGFTPLSESIPDSPKRME
320 330
>>CCDS8435.1 SC5D gene_id:6309|Hs108|chr11 (299 aa)
initn: 268 init1: 152 opt: 279 Z-score: 363.6 bits: 75.3 E(32554): 6.3e-14
Smith-Waterman score: 279; 29.5% identity (55.1% similar) in 234 aa overlap (61-284:45-263)
40 50 60 70 80
pF1KB5 EPFKNAWNYMLNNYTKFQIATWGSLIVHEALYFLFCLP-GFLFQFIPYMKKYKIQKDKPE
::: :: .. : : . :. :.
CCDS84 PYVYPATWPEDDIFRQAISLLIVTNVGAYILYF-FCATLSYYFVFDHALMKH------PQ
20 30 40 50 60
90 100 110 120 130 140
pF1KB5 TWENQWKCFKVLLFNHFCIQ-LP---LICGTYYFTE---YFNIPYDWERMPRWYF-LLAR
.:: : .: .: :: .. . .. : : .. : ..: : :..
CCDS84 FLKNQ-----VRREIKFTVQALPWISILTVALFLLEIRGYSKLHDDLGEFPYGLFELVVS
70 80 90 100 110 120
150 160 170 180 190 200
pF1KB5 CFGCAVIEDTWHYFLHRLLHHKRIYKYIHKVHHEFQAPFGMEAEYAHPLETLILGTGFFI
.. . : . :..:: :::. .:: .:: :: .. : . .. ::.. .. . . :
CCDS84 IISFLFFTDMFIYWIHRGLHHRLVYKRLHKPHHIWKIPTPFASHAFHPIDGFLQSLPYHI
130 140 150 160 170 180
210 220 230 240 250 260
pF1KB5 GIVLLCDH-VILLWAWVTIRLLETIDVHSGYDIPLNPLNLIPFYAGSRHHDFHHMNFIGN
.. : :. : .. . . ::..:.: :. . : : :: :: :: ::: : :
CCDS84 YPFIFPLHKVVYLSLYILVNIW-TISIHDG-DFRV-PQILQPFINGSAHHTDHHMFFDYN
190 200 210 220 230
270 280 290
pF1KB5 YASTFTWWDRIFGTDSQYNAYNEKRKKFEKKTE
:.. :: :::: :. .. .... :
CCDS84 YGQYFTLWDRIGGSFKNPSSFEGKGPLSYVKEMTEGKRSSHSGNGCKNEKLFNGEFTKTE
240 250 260 270 280 290
293 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 06:30:02 2016 done: Sat Nov 5 06:30:02 2016
Total Scan time: 2.200 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]