FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6431, 312 aa
1>>>pF1KB6431 312 - 312 aa - 312 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.9971+/-0.000303; mu= 13.5702+/- 0.019
mean_var=91.9773+/-17.901, 0's: 0 Z-trim(119.0): 33 B-trim: 100 in 1/55
Lambda= 0.133732
statistics sampled from 32540 (32573) to 32540 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.739), E-opt: 0.2 (0.382), width: 16
Scan time: 7.880
The best scores are: opt bits E(85289)
NP_002519 (OMIM: 602656,616415) endonuclease III-l ( 312) 2126 419.7 3.8e-117
XP_016878742 (OMIM: 602656,616415) PREDICTED: endo ( 356) 1829 362.4 7.4e-100
NP_001305123 (OMIM: 602656,616415) endonuclease II ( 194) 1254 251.3 1.1e-66
NP_001305122 (OMIM: 602656,616415) endonuclease II ( 255) 890 181.2 2e-45
>>NP_002519 (OMIM: 602656,616415) endonuclease III-like (312 aa)
initn: 2126 init1: 2126 opt: 2126 Z-score: 2224.2 bits: 419.7 E(85289): 3.8e-117
Smith-Waterman score: 2126; 100.0% identity (100.0% similar) in 312 aa overlap (1-312:1-312)
10 20 30 40 50 60
pF1KB6 MCSPQESGMTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKSHSPVKRPRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MCSPQESGMTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKSHSPVKRPRK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 AQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPVDHLGTEHCYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 AQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPVDHLGTEHCYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 SSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLRARGLTVDSILQTDDATLGKLIYPVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 SSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLRARGLTVDSILQTDDATLGKLIYPVG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB6 FWRSKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLAMAVAWGTVSGIAVDT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 FWRSKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLAMAVAWGTVSGIAVDT
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB6 HVHRIANRLRWTKKATKSPEETRAALEEWLPRELWHEINGLLVGFGQQTCLPVHPRCHAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 HVHRIANRLRWTKKATKSPEETRAALEEWLPRELWHEINGLLVGFGQQTCLPVHPRCHAC
250 260 270 280 290 300
310
pF1KB6 LNQALCPAAQGL
::::::::::::
NP_002 LNQALCPAAQGL
310
>>XP_016878742 (OMIM: 602656,616415) PREDICTED: endonucl (356 aa)
initn: 1823 init1: 1823 opt: 1829 Z-score: 1913.7 bits: 362.4 E(85289): 7.4e-100
Smith-Waterman score: 1832; 91.0% identity (92.9% similar) in 311 aa overlap (1-311:1-301)
10 20 30 40 50 60
pF1KB6 MCSPQESGMTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKSHSPVKRPRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MCSPQESGMTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKSHSPVKRPRK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 AQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPVDHLGTEHCYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPVDHLGTEHCYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 SSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLRARGLTVDSILQTDDATLGKLIYPVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 SSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLRARGLTVDSILQTDDATLGKLIYPVG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB6 FWRSKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLAMAVAWGTVSGIAVDT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 FWRSKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLAMAVAWGTVSGIAVDT
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB6 HVHRIANRLRWTKKATKSPEETRAALEEWLPRELWHEINGLLVGFGQQTCLPVHPRCHAC
:::::::::::::::::::::::::::::::: .: : :: . :..: .
XP_016 HVHRIANRLRWTKKATKSPEETRAALEEWLPR---YEYVG---GFLGRRAHPARPL--SA
250 260 270 280 290
310
pF1KB6 LNQALCPAAQGL
:.: : ::
XP_016 LTQP--PPPQGAVARDQWTLGGLRPADLSACAPSLPRLPQPSPLPGRPGSLMAAWLWPRC
300 310 320 330 340 350
>>NP_001305123 (OMIM: 602656,616415) endonuclease III-li (194 aa)
initn: 1254 init1: 1254 opt: 1254 Z-score: 1318.0 bits: 251.3 E(85289): 1.1e-66
Smith-Waterman score: 1254; 100.0% identity (100.0% similar) in 187 aa overlap (126-312:8-194)
100 110 120 130 140 150
pF1KB6 LVNIRAMRNKKDAPVDHLGTEHCYDSSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLR
::::::::::::::::::::::::::::::
NP_001 MRARTVRKVRRYQVLLSLMLSSQTKDQVTAGAMQRLR
10 20 30
160 170 180 190 200 210
pF1KB6 ARGLTVDSILQTDDATLGKLIYPVGFWRSKVKYIKQTSAILQQHYGGDIPASVAELVALP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 ARGLTVDSILQTDDATLGKLIYPVGFWRSKVKYIKQTSAILQQHYGGDIPASVAELVALP
40 50 60 70 80 90
220 230 240 250 260 270
pF1KB6 GVGPKMAHLAMAVAWGTVSGIAVDTHVHRIANRLRWTKKATKSPEETRAALEEWLPRELW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GVGPKMAHLAMAVAWGTVSGIAVDTHVHRIANRLRWTKKATKSPEETRAALEEWLPRELW
100 110 120 130 140 150
280 290 300 310
pF1KB6 HEINGLLVGFGQQTCLPVHPRCHACLNQALCPAAQGL
:::::::::::::::::::::::::::::::::::::
NP_001 HEINGLLVGFGQQTCLPVHPRCHACLNQALCPAAQGL
160 170 180 190
>>NP_001305122 (OMIM: 602656,616415) endonuclease III-li (255 aa)
initn: 890 init1: 890 opt: 890 Z-score: 936.7 bits: 181.2 E(85289): 2e-45
Smith-Waterman score: 1641; 81.7% identity (81.7% similar) in 312 aa overlap (1-312:1-255)
10 20 30 40 50 60
pF1KB6 MCSPQESGMTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKSHSPVKRPRK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MCSPQESGMTALSARMLTRSRSLGPGAGPRGCREEPGPLRRREAAAEARKSHSPVKRPRK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 AQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPVDHLGTEHCYD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AQRLRVAYEGSDSEKGEGAEPLKVPVWEPQDWQQQLVNIRAMRNKKDAPVDHLGTEHCYD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 SSAPPKVRRYQVLLSLMLSSQTKDQVTAGAMQRLRARGLTVDSILQTDDATLGKLIYPVG
::::::
NP_001 SSAPPK------------------------------------------------------
190 200 210 220 230 240
pF1KB6 FWRSKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLAMAVAWGTVSGIAVDT
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 ---SKVKYIKQTSAILQQHYGGDIPASVAELVALPGVGPKMAHLAMAVAWGTVSGIAVDT
130 140 150 160 170 180
250 260 270 280 290 300
pF1KB6 HVHRIANRLRWTKKATKSPEETRAALEEWLPRELWHEINGLLVGFGQQTCLPVHPRCHAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HVHRIANRLRWTKKATKSPEETRAALEEWLPRELWHEINGLLVGFGQQTCLPVHPRCHAC
190 200 210 220 230 240
310
pF1KB6 LNQALCPAAQGL
::::::::::::
NP_001 LNQALCPAAQGL
250
312 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:37:24 2016 done: Fri Nov 4 17:37:25 2016
Total Scan time: 7.880 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]