FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5447, 413 aa
1>>>pF1KB5447 413 - 413 aa - 413 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3649+/-0.00092; mu= 16.0110+/- 0.055
mean_var=57.7077+/-11.909, 0's: 0 Z-trim(104.4): 24 B-trim: 0 in 0/49
Lambda= 0.168833
statistics sampled from 7893 (7903) to 7893 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.621), E-opt: 0.2 (0.243), width: 16
Scan time: 2.590
The best scores are: opt bits E(32554)
CCDS7479.1 GOT1 gene_id:2805|Hs108|chr10 ( 413) 2781 685.9 1.9e-197
CCDS10801.1 GOT2 gene_id:2806|Hs108|chr16 ( 430) 1323 330.7 1.6e-90
CCDS67045.1 GOT2 gene_id:2806|Hs108|chr16 ( 387) 1075 270.3 2.2e-72
CCDS47839.1 GOT1L1 gene_id:137362|Hs108|chr8 ( 421) 761 193.8 2.5e-49
>>CCDS7479.1 GOT1 gene_id:2805|Hs108|chr10 (413 aa)
initn: 2781 init1: 2781 opt: 2781 Z-score: 3658.3 bits: 685.9 E(32554): 1.9e-197
Smith-Waterman score: 2781; 100.0% identity (100.0% similar) in 413 aa overlap (1-413:1-413)
10 20 30 40 50 60
pF1KB5 MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKVNLGVGAYRTDDCHPWVLPVVKKVEQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKVNLGVGAYRTDDCHPWVLPVVKKVEQK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 IANDNSLNHEYLPILGLAEFRSCASRLALGDDSPALKEKRVGGVQSLGGTGALRIGADFL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 IANDNSLNHEYLPILGLAEFRSCASRLALGDDSPALKEKRVGGVQSLGGTGALRIGADFL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 ARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFKDIRSYRYWDAEKRGLDLQGFLNDLEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 ARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFKDIRSYRYWDAEKRGLDLQGFLNDLEN
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 APEFSIVVLHACAHNPTGIDPTPEQWKQIASVMKHRFLFPFFDSAYQGFASGNLERDAWA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 APEFSIVVLHACAHNPTGIDPTPEQWKQIASVMKHRFLFPFFDSAYQGFASGNLERDAWA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 IRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVVGKEPESILQVLSQMEKIVRITWSNPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 IRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVVGKEPESILQVLSQMEKIVRITWSNPP
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB5 AQGARIVASTLSNPELFEEWTGNVKTMADRILTMRSELRARLEALKTPGTWNHITDQIGM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 AQGARIVASTLSNPELFEEWTGNVKTMADRILTMRSELRARLEALKTPGTWNHITDQIGM
310 320 330 340 350 360
370 380 390 400 410
pF1KB5 FSFTGLNPKQVEYLVNEKHIYLLPSGRINVSGLTTKNLDYVATSIHEAVTKIQ
:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 FSFTGLNPKQVEYLVNEKHIYLLPSGRINVSGLTTKNLDYVATSIHEAVTKIQ
370 380 390 400 410
>>CCDS10801.1 GOT2 gene_id:2806|Hs108|chr16 (430 aa)
initn: 1293 init1: 914 opt: 1323 Z-score: 1738.7 bits: 330.7 E(32554): 1.6e-90
Smith-Waterman score: 1323; 48.9% identity (78.1% similar) in 407 aa overlap (5-411:31-430)
10 20 30
pF1KB5 MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKV
: ...: .. : .. .: :..: . .:.
CCDS10 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB5 NLGVGAYRTDDCHPWVLPVVKKVEQKIANDNSLNHEYLPILGLAEFRSCASRLALGDDSP
:::::::: :. .:.::: :.:.: .:: : :..::::: ::::: . ...::::..:
CCDS10 NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKN-LDKEYLPIGGLAEFCKASAELALGENSE
70 80 90 100 110
100 110 120 130 140 150
pF1KB5 ALKEKRVGGVQSLGGTGALRIGADFLARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFK
.:: : ::...:::::::::.:: :... . . :.. .::: ::. .: ::..
CCDS10 VLKSGRFVTVQTISGTGALRIGASFLQRFFKFSRD----VFLPKPTWGNHTPIFRDAGMQ
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB5 DIRSYRYWDAEKRGLDLQGFLNDLENAPEFSIVVLHACAHNPTGIDPTPEQWKQIASVMK
...:::.: . :.:. : ..:. . :: :...::::::::::.:: :::::.::.:.:
CCDS10 -LQGYRYYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVK
180 190 200 210 220 230
220 230 240 250 260 270
pF1KB5 HRFLFPFFDSAYQGFASGNLERDAWAIRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVV
.: :: ::: ::::::::. ..::::.:.:. .:.. ::..::.:::.:::: .:.:
CCDS10 KRNLFAFFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMV
240 250 260 270 280 290
280 290 300 310 320 330
pF1KB5 GKEPESILQVLSQMEKIVRITWSNPPAQGARIVASTLSNPELFEEWTGNVKTMADRILTM
:. . .: ::.. ..: .:::: .::::.:. :..:.: ..: .::.:::::. :
CCDS10 CKDADEAKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGM
300 310 320 330 340 350
340 350 360 370 380 390
pF1KB5 RSELRARLEALKTPGTWNHITDQIGMFSFTGLNPKQVEYLVNEKHIYLLPSGRINVSGLT
:..: . :. . .:.::::::::: ::::.:.::: :..: ::. .:::.:.:.:
CCDS10 RTQLVSNLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVT
360 370 380 390 400 410
400 410
pF1KB5 TKNLDYVATSIHEAVTKIQ
..:. :.: .::. :::
CCDS10 SSNVGYLAHAIHQ-VTK
420 430
>>CCDS67045.1 GOT2 gene_id:2806|Hs108|chr16 (387 aa)
initn: 1165 init1: 914 opt: 1075 Z-score: 1413.0 bits: 270.3 E(32554): 2.2e-72
Smith-Waterman score: 1107; 43.5% identity (70.3% similar) in 407 aa overlap (5-411:31-387)
10 20 30
pF1KB5 MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKV
: ...: .. : .. .: :..: . .:.
CCDS67 MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
10 20 30 40 50 60
40 50 60 70 80 90
pF1KB5 NLGVGAYRTDDCHPWVLPVVKKVEQKIANDNSLNHEYLPILGLAEFRSCASRLALGDDSP
:::::::: :. .:.::: :.: : .
CCDS67 NLGVGAYRDDNGKPYVLPSVRK-----------------------FVT------------
70 80
100 110 120 130 140 150
pF1KB5 ALKEKRVGGVQSLGGTGALRIGADFLARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFK
::...:::::::::.:: :... . . :.. .::: ::. .: ::..
CCDS67 ---------VQTISGTGALRIGASFLQRFFKFSRD----VFLPKPTWGNHTPIFRDAGMQ
90 100 110 120 130
160 170 180 190 200 210
pF1KB5 DIRSYRYWDAEKRGLDLQGFLNDLENAPEFSIVVLHACAHNPTGIDPTPEQWKQIASVMK
...:::.: . :.:. : ..:. . :: :...::::::::::.:: :::::.::.:.:
CCDS67 -LQGYRYYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVK
140 150 160 170 180 190
220 230 240 250 260 270
pF1KB5 HRFLFPFFDSAYQGFASGNLERDAWAIRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVV
.: :: ::: ::::::::. ..::::.:.:. .:.. ::..::.:::.:::: .:.:
CCDS67 KRNLFAFFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMV
200 210 220 230 240 250
280 290 300 310 320 330
pF1KB5 GKEPESILQVLSQMEKIVRITWSNPPAQGARIVASTLSNPELFEEWTGNVKTMADRILTM
:. . .: ::.. ..: .:::: .::::.:. :..:.: ..: .::.:::::. :
CCDS67 CKDADEAKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGM
260 270 280 290 300 310
340 350 360 370 380 390
pF1KB5 RSELRARLEALKTPGTWNHITDQIGMFSFTGLNPKQVEYLVNEKHIYLLPSGRINVSGLT
:..: . :. . .:.::::::::: ::::.:.::: :..: ::. .:::.:.:.:
CCDS67 RTQLVSNLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVT
320 330 340 350 360 370
400 410
pF1KB5 TKNLDYVATSIHEAVTKIQ
..:. :.: .::. :::
CCDS67 SSNVGYLAHAIHQ-VTK
380
>>CCDS47839.1 GOT1L1 gene_id:137362|Hs108|chr8 (421 aa)
initn: 1015 init1: 682 opt: 761 Z-score: 999.1 bits: 193.8 E(32554): 2.5e-49
Smith-Waterman score: 1038; 40.3% identity (68.5% similar) in 409 aa overlap (1-409:1-399)
10 20 30 40 50 60
pF1KB5 MAPPSVFAEVPQAQPVLVFKLTADFREDPDPRKVNLGVGAYRTDDCHPWVLPVVKKVEQK
: ::: .:: :. : .: ...: : :. :. . :.. :::: ::.:.. .
CCDS47 MPTLSVFMDVPLAHK-LEGSLLKTYKQDDYPNKIFLAYRVCMTNEGHPWVSLVVQKTRLQ
10 20 30 40 50
70 80 90 100 110 120
pF1KB5 IANDNSLNHEYLPILGLAEFRSCASRLALGDDSPALKEKRVGGVQSLGGTGALRIGADFL
:..: :::.:::: .:: : . . : .: : :. :.:::::...: .::...:..::
CCDS47 ISQDPSLNYEYLPTMGLKSFIQASLALLFGKHSQAIVENRVGGVHTVGDSGAFQLGVQFL
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB5 ARWYNGTNNKNTPVYVSSPTWENHNAVFSAAGFKDIRSYRYWDAEKRGLDLQGFLNDLEN
:.. . ::. : : :. ::. :: . : :: .: .: . .:: .:.
CCDS47 RAWHKDARI----VYIISSQKELHGLVFQDMGFT-VYEYSVWDPKKLCMDPDILLNVVEQ
120 130 140 150 160 170
190 200 210 220 230 240
pF1KB5 APEFSIVVLHACAHNPTGIDPTPEQWKQIASVMKHRFLFPFFDSAYQGFASGNLERDAWA
:. ..:. : :: : .. :..: . .::::: ::. ...::.:.
CCDS47 IPHGCVLVMG----NIIDCKLTPSGWAKLMSMIKSKQIFPFFDIPCQGLYTSDLEEDTRI
180 190 200 210 220 230
250 260 270 280 290 300
pF1KB5 IRYFVSEGFEFFCAQSFSKNFGLYNERVGNLTVVGKEPESILQVLSQMEKIVRITWSNPP
..::::.::::::.::.:::::.:.: :: :.::. . ...: ::::.: ... : :::
CCDS47 LQYFVSQGFEFFCSQSLSKNFGIYDEGVGMLVVVAVNNQQLLCVLSQLEGLAQALWLNPP
240 250 260 270 280 290
310 320 330 340 350 360
pF1KB5 AQGARIVASTLSNPELFEEWTGNVKTMADRILTMRSELRARLEALKTPGTWNHITDQIGM
:::...: : :: :. :: ..: ... :. . ... .:. : :::.:.:::.: :
CCDS47 NTGARVITSILCNPALLGEWKQSLKEVVENIMLTKEKVKEKLQLLGTPGSWGHITEQSGT
300 310 320 330 340 350
370 380 390 400 410
pF1KB5 FSFTGLNPKQVEYLVNEKHIYLLPSGRINVSGLTTKNLDYVATSIHEAVTKIQ
.. ::: .:::::: .::::. .:.:: : ....:..:.. .:.:::
CCDS47 HGYLGLNSQQVEYLVRKKHIYIPKNGQINFSCINANNINYITEGINEAVLLTESSEMCLP
360 370 380 390 400 410
CCDS47 KEKKTLIGIKL
420
413 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 19:39:35 2016 done: Sat Nov 5 19:39:35 2016
Total Scan time: 2.590 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]