FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA0194, 1292 aa
1>>>pF1KSDA0194 1292 - 1292 aa - 1292 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.9096+/- 0.001; mu= 10.8640+/- 0.060
mean_var=157.4650+/-31.739, 0's: 0 Z-trim(110.0): 30 B-trim: 4 in 1/49
Lambda= 0.102207
statistics sampled from 11269 (11287) to 11269 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.689), E-opt: 0.2 (0.347), width: 16
Scan time: 4.680
The best scores are:                                      opt   bits E(32554)
CCDS54935.1 HMGXB3 gene_id:22993|Hs108|chr5       (1292) 8555 1274.5        0
>>CCDS54935.1 HMGXB3 gene_id:22993|Hs108|chr5 (1292 aa)
initn: 8555 init1: 8555 opt: 8555 Z-score: 6822.0 bits: 1274.5 E(32554): 0
Smith-Waterman score: 8555; 99.9% identity (100.0% similar) in 1292 aa overlap (1-1292:1-1292)
10 20 30 40 50 60
pF1KSD MDASYDGTEVTVVMEEIEEAYCYTSPGPPKKKKKYKIHGEKTKKPRSAYLLYYYDIYLKV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MDASYDGTEVTVVMEEIEEAYCYTSPGPPKKKKKYKIHGEKTKKPRSAYLLYYYDIYLKV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD QQELPHLPQSEINKKISESWRLLSVAERSYYLEKAKLEKEGLDPNSKLSALTAVVPDIPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QQELPHLPQSEINKKISESWRLLSVAERSYYLEKAKLEKEGLDPNSKLSALTAVVPDIPG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KSD FRKILPRSDYIIIPKSSLQEDRSCPQLELCVAQNQMSPKGPPLVSNTAPETVPSHAGMAE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 FRKILPRSDYIIIPKSSLQEDRSCPQLELCVAQNQMSPKGPPLVSNTAPETVPSHAGMAE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KSD QCLAVEALAEEVGALTQSGAVQEIATSEILSQDVLLEDASLEVGESHQPYQTSLVIEETL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QCLAVEALAEEVGALTQSGAVQEIATSEILSQDVLLEDASLEVGESHQPYQTSLVIEETL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KSD VNGSPDLPTGSLAVPHPQVGESVSVVTVMRDSSESSSSAPATQFIMLPLPAYSVVENPTS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VNGSPDLPTGSLAVPHPQVGESVSVVTVMRDSSESSSSAPATQFIMLPLPAYSVVENPTS
250 260 270 280 290 300
310 320 330 340 350 360
pF1KSD IKLTTTYTRRGHGTCTSPGCSFTYVTRHKPPKCPTCGNFLGGKWIPKEKPAKVKVELASG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 IKLTTTYTRRGHGTCTSPGCSFTYVTRHKPPKCPTCGNFLGGKWIPKEKPAKVKVELASG
310 320 330 340 350 360
370 380 390 400 410 420
pF1KSD VSSKGSVVKRNQQPVTTEQNSSKENASKLTLENSEAVSQLLNVAPPREVGEESEWEEVII
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VSSKGSVVKRNQQPVTTEQNSSKENASKLTLENSEAVSQLLNVAPPREVGEESEWEEVII
370 380 390 400 410 420
430 440 450 460 470 480
pF1KSD SDAHVLVKEAPGNCGTAVTKTPVVKSGVQPEVTLGTTDNDSPGADVPTPSEGTSTSSPLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SDAHVLVKEAPGNCGTAVTKTPVVKSGVQPEVTLGTTDNDSPGADVPTPSEGTSTSSPLP
430 440 450 460 470 480
490 500 510 520 530 540
pF1KSD APKKPTGVDLLTPGSRAPELKGRARGKPSLLAAARPMRAILPAPVNVGRGSSMGLPRARQ
:::::::.::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 APKKPTGADLLTPGSRAPELKGRARGKPSLLAAARPMRAILPAPVNVGRGSSMGLPRARQ
490 500 510 520 530 540
550 560 570 580 590 600
pF1KSD AFSLSDKTPSVRTCGLKPSTLKQLGQPIQQPSGPGEVKLPSGPSNRTSQVKVVEVKPDMF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 AFSLSDKTPSVRTCGLKPSTLKQLGQPIQQPSGPGEVKLPSGPSNRTSQVKVVEVKPDMF
550 560 570 580 590 600
610 620 630 640 650 660
pF1KSD PPYKYSCTVTLDLGLATSRGRGKCKNPSCSYVYTNRHKPRICPSCGVNLAKDRTEKTTKA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PPYKYSCTVTLDLGLATSRGRGKCKNPSCSYVYTNRHKPRICPSCGVNLAKDRTEKTTKA
610 620 630 640 650 660
670 680 690 700 710 720
pF1KSD IEVSSPLPDVLNATEPLSTAQREIQRQSTLQLLRKVLQIPENESELAEVFALIHELNSSR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 IEVSSPLPDVLNATEPLSTAQREIQRQSTLQLLRKVLQIPENESELAEVFALIHELNSSR
670 680 690 700 710 720
730 740 750 760 770 780
pF1KSD LILSNVSEETVTIEQTSWSNYYESPSTQCLLCSSPLFKGGQNSLAGPQECWLLTASRLQT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LILSNVSEETVTIEQTSWSNYYESPSTQCLLCSSPLFKGGQNSLAGPQECWLLTASRLQT
730 740 750 760 770 780
790 800 810 820 830 840
pF1KSD VTAQVKMCLNPHCLALHSFIDIYTGLFNVGNKLLVSLDLLFAIRNQIKLGEDPRVSINVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VTAQVKMCLNPHCLALHSFIDIYTGLFNVGNKLLVSLDLLFAIRNQIKLGEDPRVSINVV
790 800 810 820 830 840
850 860 870 880 890 900
pF1KSD LKSVQEQTEKTLTSEELSQLQELLCNGYWAFECLTVRDYNDMICGICGVAPKVEMAQRSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LKSVQEQTEKTLTSEELSQLQELLCNGYWAFECLTVRDYNDMICGICGVAPKVEMAQRSE
850 860 870 880 890 900
910 920 930 940 950 960
pF1KSD ENVLALKSVEFTWPEFLGSNEVNVEDFWATMETEVIEQVAFPASIPITKFDASVIAPFFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ENVLALKSVEFTWPEFLGSNEVNVEDFWATMETEVIEQVAFPASIPITKFDASVIAPFFP
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KSD PLMRGAVVVNTEKDKNLDVQPVPGSGSALVRLLQEGTCKLDEIGSYSEEKLQHLLRQCGI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PLMRGAVVVNTEKDKNLDVQPVPGSGSALVRLLQEGTCKLDEIGSYSEEKLQHLLRQCGI
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KSD PFGAEDSKDQLCFSLLALYESVQNGARAIRPPRHFTGGKIYKVCPHQVVCGSKYLVRGES
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PFGAEDSKDQLCFSLLALYESVQNGARAIRPPRHFTGGKIYKVCPHQVVCGSKYLVRGES
1030 1040 1050 1060 1070 1080
1090 1100 1110 1120 1130 1140
pF1KSD ARDHVDLLASSRHWPPVYVVDMATSVALCADLCYPELTNQMWGRNQGCFSSPTEPPVSVS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ARDHVDLLASSRHWPPVYVVDMATSVALCADLCYPELTNQMWGRNQGCFSSPTEPPVSVS
1090 1100 1110 1120 1130 1140
1150 1160 1170 1180 1190 1200
pF1KSD CPELLDQHYTVDMTETEHSIQHPVTKTATRRIVHAGLQPNPGDPSAGHHSLALCPELAPY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 CPELLDQHYTVDMTETEHSIQHPVTKTATRRIVHAGLQPNPGDPSAGHHSLALCPELAPY
1150 1160 1170 1180 1190 1200
1210 1220 1230 1240 1250 1260
pF1KSD ATILASIVDSKPNGVRQRPIAFDNATHYYLYNRLMDFLTSREIVNRQIHDIVQSCQPGEV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 ATILASIVDSKPNGVRQRPIAFDNATHYYLYNRLMDFLTSREIVNRQIHDIVQSCQPGEV
1210 1220 1230 1240 1250 1260
1270 1280 1290
pF1KSD VIRDTLYRLGVAQIKTETEEEGEEEEVAAVAE
::::::::::::::::::::::::::::::::
CCDS54 VIRDTLYRLGVAQIKTETEEEGEEEEVAAVAE
1270 1280 1290
1292 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 18:27:20 2016 done: Thu Nov 3 18:27:20 2016
Total Scan time: 4.680 Total Display time: 0.060
Function used was FASTA [36.3.4 Apr, 2011]