FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6403, 304 aa
1>>>pF1KB6403 304 - 304 aa - 304 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.3608+/-0.000912; mu= 10.8500+/- 0.055
mean_var=85.5772+/-17.534, 0's: 0 Z-trim(107.1): 71 B-trim: 449 in 1/51
Lambda= 0.138642
statistics sampled from 9269 (9347) to 9269 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.656), E-opt: 0.2 (0.287), width: 16
Scan time: 2.420
The best scores are: opt bits E(32554)
CCDS3988.1 SGTB gene_id:54557|Hs108|chr5 ( 304) 1954 400.5 8.1e-112
CCDS12094.1 SGTA gene_id:6449|Hs108|chr19 ( 313) 1123 234.3 9e-62
CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12 ( 631) 307 71.2 2.3e-12
CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12 ( 665) 307 71.2 2.4e-12
CCDS8058.1 STIP1 gene_id:10963|Hs108|chr11 ( 543) 285 66.8 4.2e-11
CCDS60827.1 STIP1 gene_id:10963|Hs108|chr11 ( 590) 283 66.4 6e-11
>>CCDS3988.1 SGTB gene_id:54557|Hs108|chr5 (304 aa)
initn: 1954 init1: 1954 opt: 1954 Z-score: 2120.9 bits: 400.5 E(32554): 8.1e-112
Smith-Waterman score: 1954; 100.0% identity (100.0% similar) in 304 aa overlap (1-304:1-304)
10 20 30 40 50 60
pF1KB6 MSSIKHLVYAVIRFLREQSQMDTYTSDEQESLEVAIQCLETVFKISPEDTHLAVSQPLTE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 MSSIKHLVYAVIRFLREQSQMDTYTSDEQESLEVAIQCLETVFKISPEDTHLAVSQPLTE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB6 MFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIELDPNNAV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 MFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIELDPNNAV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB6 YYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVTSYQKAL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 YYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVTSYQKAL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB6 DLDPENDSYKSNLKIAEQKLREVSSPTGTGLSFDMASLINNPAFISMAASLMQNPQVQQL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 DLDPENDSYKSNLKIAEQKLREVSSPTGTGLSFDMASLINNPAFISMAASLMQNPQVQQL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB6 MSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHIRSRSFSSSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 MSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHIRSRSFSSSA
250 260 270 280 290 300
pF1KB6 EEHS
::::
CCDS39 EEHS
>>CCDS12094.1 SGTA gene_id:6449|Hs108|chr19 (313 aa)
initn: 1152 init1: 736 opt: 1123 Z-score: 1222.4 bits: 234.3 E(32554): 9e-62
Smith-Waterman score: 1123; 58.3% identity (81.4% similar) in 312 aa overlap (1-303:1-311)
10 20 30 40 50 60
pF1KB6 MSSIKHLVYAVIRFLREQSQMDTYTSDEQESLEVAIQCLETVFKISPEDTHLAVSQPLTE
:.. :.:.::.:.::..: . .:: :::::::::::::.: .. ::. ::. : : :
CCDS12 MDNKKRLAYAIIQFLHDQLRHGGLSSDAQESLEVAIQCLETAFGVTVEDSDLALPQTLPE
10 20 30 40 50 60
70 80 90 100 110
pF1KB6 MF----TSSFCKNDVLPLSNSVP--EDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIEL
.: :.. .:. . . : :: ..:..:: :::..:: ::. ::: : .::::
CCDS12 IFEAAATGKEMPQDLRSPARTPPSEEDSAEAERLKTEGNEQMKVENFEAAVHFYGKAIEL
70 80 90 100 110 120
120 130 140 150 160 170
pF1KB6 DPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVT
.: ::::.:::::: ::::.:. :..:::.:: :: ::::::::::::..::: :::.
CCDS12 NPANAVYFCNRAAAYSKLGNYAGAVQDCERAICIDPAYSKAYGRMGLALSSLNKHVEAVA
130 140 150 160 170 180
180 190 200 210 220 230
pF1KB6 SYQKALDLDPENDSYKSNLKIAEQKLREVSSPTGTGLSFDMASLINNPAFISMAASLMQN
:.:::.:::.:..::::::::: ::::. :::: :::.:.:.:::.:.:::..::.:
CCDS12 YYKKALELDPDNETYKSNLKIAELKLREAPSPTGGVGSFDIAGLLNNPGFMSMASNLMNN
190 200 210 220 230 240
240 250 260 270 280 290
pF1KB6 PQVQQLMSGMMT---NAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHI
::.:::::::.. : .: :... . .::.::::::::::::.::::::::::::..:
CCDS12 PQIQQLMSGMISGGNNPLGTPGTS-PSQNDLASLIQAGQQFAQQMQQQNPELIEQLRSQI
250 260 270 280 290
300
pF1KB6 RSRSFSSSAEEHS
:::. :.: ...
CCDS12 RSRTPSASNDDQQE
300 310
>>CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12 (631 aa)
initn: 350 init1: 291 opt: 307 Z-score: 335.5 bits: 71.2 E(32554): 2.3e-12
Smith-Waterman score: 307; 39.1% identity (69.9% similar) in 133 aa overlap (76-206:124-256)
50 60 70 80 90 100
pF1KB6 SPEDTHLAVSQPLTEMFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAV
... : :: ::..::...:. .: :.
CCDS53 AKLDVDRILDELDKDDSTHESLSQESESEEDGIHVDSQKALVLKEKGNKYFKQGKYDEAI
100 110 120 130 140 150
110 120 130 140 150 160
pF1KB6 DCYTQAIELDPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTA
::::.... :: : : :::.: .: ... : .::. :.:.. .:.:::.: : : :
CCDS53 DCYTKGMDADPYNPVLPTNRASAYFRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFA
160 170 180 190 200 210
170 180 190 200 210 220
pF1KB6 LNKFEEAVTSYQKALDLDPENDSYKSNLKIAEQKL--REVSSPTGTGLSFDMASLINNPA
:.:.::: .:...:.:.:.: ..:. : : .: : :
CCDS53 LQKLEEAKKDYERVLELEPNNFEATNELRKISQALASKENSYPKEADIVIKSTEGERKQI
220 230 240 250 260 270
230 240 250 260 270 280
pF1KB6 FISMAASLMQNPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPEL
CCDS53 EAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIAADGANALLPANRAMAYLKIQKYE
280 290 300 310 320 330
>>CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12 (665 aa)
initn: 602 init1: 291 opt: 307 Z-score: 335.2 bits: 71.2 E(32554): 2.4e-12
Smith-Waterman score: 307; 39.1% identity (69.9% similar) in 133 aa overlap (76-206:124-256)
50 60 70 80 90 100
pF1KB6 SPEDTHLAVSQPLTEMFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAV
... : :: ::..::...:. .: :.
CCDS87 AKLDVDRILDELDKDDSTHESLSQESESEEDGIHVDSQKALVLKEKGNKYFKQGKYDEAI
100 110 120 130 140 150
110 120 130 140 150 160
pF1KB6 DCYTQAIELDPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTA
::::.... :: : : :::.: .: ... : .::. :.:.. .:.:::.: : : :
CCDS87 DCYTKGMDADPYNPVLPTNRASAYFRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFA
160 170 180 190 200 210
170 180 190 200 210 220
pF1KB6 LNKFEEAVTSYQKALDLDPENDSYKSNLKIAEQKL--REVSSPTGTGLSFDMASLINNPA
:.:.::: .:...:.:.:.: ..:. : : .: : :
CCDS87 LQKLEEAKKDYERVLELEPNNFEATNELRKISQALASKENSYPKEADIVIKSTEGERKQI
220 230 240 250 260 270
230 240 250 260 270 280
pF1KB6 FISMAASLMQNPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPEL
CCDS87 EAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIAADGANALLPANRAMAYLKIQKYE
280 290 300 310 320 330
>>CCDS8058.1 STIP1 gene_id:10963|Hs108|chr11 (543 aa)
initn: 314 init1: 275 opt: 285 Z-score: 312.8 bits: 66.8 E(32554): 4.2e-11
Smith-Waterman score: 285; 32.5% identity (64.4% similar) in 160 aa overlap (84-241:3-156)
60 70 80 90 100 110
pF1KB6 VSQPLTEMFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIE
....::..::. .. : :..::..::.
CCDS80 MEQVNELKEKGNKALSVGNIDDALQCYSEAIK
10 20 30
120 130 140 150 160 170
pF1KB6 LDPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAV
:::.: : : ::.:: .: : : : .: :.. . ..:.:.: . :: ::.::::
CCDS80 LDPHNHVLYSNRSAAYAKKGDYQKAYEDGCKTVDLKPDWGKGYSRKAAALEFLNRFEEAK
40 50 60 70 80 90
180 190 200 210 220 230
pF1KB6 TSYQKALDLDPENDSYKSNLKIAEQKL--REVSSPTGTGLSFDMASLINNPAFISMAASL
.:...: . .: . : .:. : .: :. .: :.: .: .. . .:
CCDS80 RTYEEGLKHEANNPQLKEGLQNMEARLAERKFMNP------FNMPNLYQKLESDPRTRTL
100 110 120 130 140
240 250 260 270 280 290
pF1KB6 MQNPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHI
...: ..:.
CCDS80 LSDPTYRELIEQLRNKPSDLGTKLQDPRIMTTLSVLLGVDLGSMDEEEEIATPPPPPPPK
150 160 170 180 190 200
>>CCDS60827.1 STIP1 gene_id:10963|Hs108|chr11 (590 aa)
initn: 312 init1: 273 opt: 283 Z-score: 310.1 bits: 66.4 E(32554): 6e-11
Smith-Waterman score: 283; 32.9% identity (63.9% similar) in 158 aa overlap (86-241:52-203)
60 70 80 90 100 110
pF1KB6 QPLTEMFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIELD
..::..::. .. : :..::..::.::
CCDS60 GQRGYDWQCKRPIRVAEVRSSLHSWSLRWVNELKEKGNKALSVGNIDDALQCYSEAIKLD
30 40 50 60 70 80
120 130 140 150 160 170
pF1KB6 PNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVTS
:.: : : ::.:: .: : : : .: :.. . ..:.:.: . :: ::.:::: .
CCDS60 PHNHVLYSNRSAAYAKKGDYQKAYEDGCKTVDLKPDWGKGYSRKAAALEFLNRFEEAKRT
90 100 110 120 130 140
180 190 200 210 220 230
pF1KB6 YQKALDLDPENDSYKSNLKIAEQKL--REVSSPTGTGLSFDMASLINNPAFISMAASLMQ
:...: . .: . : .:. : .: :. .: :.: .: .. . .:..
CCDS60 YEEGLKHEANNPQLKEGLQNMEARLAERKFMNP------FNMPNLYQKLESDPRTRTLLS
150 160 170 180 190
240 250 260 270 280 290
pF1KB6 NPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHIRS
.: ..:.
CCDS60 DPTYRELIEQLRNKPSDLGTKLQDPRIMTTLSVLLGVDLGSMDEEEEIATPPPPPPPKKE
200 210 220 230 240 250
304 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 17:04:49 2016 done: Fri Nov 4 17:04:49 2016
Total Scan time: 2.420 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]