FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB6403, 304 aa
  1>>>pF1KB6403 304 - 304 aa - 304 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3608+/-0.000912; mu= 10.8500+/- 0.055
 mean_var=85.5772+/-17.534, 0's: 0 Z-trim(107.1): 71 B-trim: 449 in 1/51
 Lambda= 0.138642
 statistics sampled from 9269 (9347) to 9269 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.656), E-opt: 0.2 (0.287), width: 16
 Scan time: 2.420

The best scores are:                                 opt  bits E(32554)
CCDS3988.1 SGTB gene_id:54557|Hs108|chr5     ( 304) 1954 400.5 8.1e-112
CCDS12094.1 SGTA gene_id:6449|Hs108|chr19    ( 313) 1123 234.3    9e-62
CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12  ( 631)  307  71.2  2.3e-12
CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12   ( 665)  307  71.2  2.4e-12
CCDS8058.1 STIP1 gene_id:10963|Hs108|chr11   ( 543)  285  66.8  4.2e-11
CCDS60827.1 STIP1 gene_id:10963|Hs108|chr11  ( 590)  283  66.4    6e-11

>>CCDS3988.1 SGTB gene_id:54557|Hs108|chr5 (304 aa)
 initn: 1954 init1: 1954 opt: 1954 Z-score: 2120.9 bits: 400.5 E(32554): 8.1e-112
Smith-Waterman score: 1954; 100.0% identity (100.0% similar) in 304 aa overlap (1-304:1-304)

               10        20        30        40        50        60
pF1KB6 MSSIKHLVYAVIRFLREQSQMDTYTSDEQESLEVAIQCLETVFKISPEDTHLAVSQPLTE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 MSSIKHLVYAVIRFLREQSQMDTYTSDEQESLEVAIQCLETVFKISPEDTHLAVSQPLTE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB6 MFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIELDPNNAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 MFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIELDPNNAV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB6 YYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVTSYQKAL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS39 
YYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVTSYQKAL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB6 DLDPENDSYKSNLKIAEQKLREVSSPTGTGLSFDMASLINNPAFISMAASLMQNPQVQQL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 DLDPENDSYKSNLKIAEQKLREVSSPTGTGLSFDMASLINNPAFISMAASLMQNPQVQQL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB6 MSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHIRSRSFSSSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS39 MSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHIRSRSFSSSA 250 260 270 280 290 300 pF1KB6 EEHS :::: CCDS39 EEHS >>CCDS12094.1 SGTA gene_id:6449|Hs108|chr19 (313 aa) initn: 1152 init1: 736 opt: 1123 Z-score: 1222.4 bits: 234.3 E(32554): 9e-62 Smith-Waterman score: 1123; 58.3% identity (81.4% similar) in 312 aa overlap (1-303:1-311) 10 20 30 40 50 60 pF1KB6 MSSIKHLVYAVIRFLREQSQMDTYTSDEQESLEVAIQCLETVFKISPEDTHLAVSQPLTE :.. :.:.::.:.::..: . .:: :::::::::::::.: .. ::. ::. : : : CCDS12 MDNKKRLAYAIIQFLHDQLRHGGLSSDAQESLEVAIQCLETAFGVTVEDSDLALPQTLPE 10 20 30 40 50 60 70 80 90 100 110 pF1KB6 MF----TSSFCKNDVLPLSNSVP--EDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIEL .: :.. .:. . . : :: ..:..:: :::..:: ::. ::: : .:::: CCDS12 IFEAAATGKEMPQDLRSPARTPPSEEDSAEAERLKTEGNEQMKVENFEAAVHFYGKAIEL 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB6 DPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVT .: ::::.:::::: ::::.:. :..:::.:: :: ::::::::::::..::: :::. CCDS12 NPANAVYFCNRAAAYSKLGNYAGAVQDCERAICIDPAYSKAYGRMGLALSSLNKHVEAVA 130 140 150 160 170 180 180 190 200 210 220 230 pF1KB6 SYQKALDLDPENDSYKSNLKIAEQKLREVSSPTGTGLSFDMASLINNPAFISMAASLMQN :.:::.:::.:..::::::::: ::::. :::: :::.:.:.:::.:.:::..::.: CCDS12 YYKKALELDPDNETYKSNLKIAELKLREAPSPTGGVGSFDIAGLLNNPGFMSMASNLMNN 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB6 PQVQQLMSGMMT---NAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHI ::.:::::::.. : .: :... . 
.::.::::::::::::.::::::::::::..: CCDS12 PQIQQLMSGMISGGNNPLGTPGTS-PSQNDLASLIQAGQQFAQQMQQQNPELIEQLRSQI 250 260 270 280 290 300 pF1KB6 RSRSFSSSAEEHS :::. :.: ... CCDS12 RSRTPSASNDDQQE 300 310 >>CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12 (631 aa) initn: 350 init1: 291 opt: 307 Z-score: 335.5 bits: 71.2 E(32554): 2.3e-12 Smith-Waterman score: 307; 39.1% identity (69.9% similar) in 133 aa overlap (76-206:124-256) 50 60 70 80 90 100 pF1KB6 SPEDTHLAVSQPLTEMFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAV ... : :: ::..::...:. .: :. CCDS53 AKLDVDRILDELDKDDSTHESLSQESESEEDGIHVDSQKALVLKEKGNKYFKQGKYDEAI 100 110 120 130 140 150 110 120 130 140 150 160 pF1KB6 DCYTQAIELDPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTA ::::.... :: : : :::.: .: ... : .::. :.:.. .:.:::.: : : : CCDS53 DCYTKGMDADPYNPVLPTNRASAYFRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFA 160 170 180 190 200 210 170 180 190 200 210 220 pF1KB6 LNKFEEAVTSYQKALDLDPENDSYKSNLKIAEQKL--REVSSPTGTGLSFDMASLINNPA :.:.::: .:...:.:.:.: ..:. : : .: : : CCDS53 LQKLEEAKKDYERVLELEPNNFEATNELRKISQALASKENSYPKEADIVIKSTEGERKQI 220 230 240 250 260 270 230 240 250 260 270 280 pF1KB6 FISMAASLMQNPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPEL CCDS53 EAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIAADGANALLPANRAMAYLKIQKYE 280 290 300 310 320 330 >>CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12 (665 aa) initn: 602 init1: 291 opt: 307 Z-score: 335.2 bits: 71.2 E(32554): 2.4e-12 Smith-Waterman score: 307; 39.1% identity (69.9% similar) in 133 aa overlap (76-206:124-256) 50 60 70 80 90 100 pF1KB6 SPEDTHLAVSQPLTEMFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAV ... : :: ::..::...:. .: :. CCDS87 AKLDVDRILDELDKDDSTHESLSQESESEEDGIHVDSQKALVLKEKGNKYFKQGKYDEAI 100 110 120 130 140 150 110 120 130 140 150 160 pF1KB6 DCYTQAIELDPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTA ::::.... :: : : :::.: .: ... : .::. :.:.. 
.:.:::.: : : : CCDS87 DCYTKGMDADPYNPVLPTNRASAYFRLKKFAVAESDCNLAVALNRSYTKAYSRRGAARFA 160 170 180 190 200 210 170 180 190 200 210 220 pF1KB6 LNKFEEAVTSYQKALDLDPENDSYKSNLKIAEQKL--REVSSPTGTGLSFDMASLINNPA :.:.::: .:...:.:.:.: ..:. : : .: : : CCDS87 LQKLEEAKKDYERVLELEPNNFEATNELRKISQALASKENSYPKEADIVIKSTEGERKQI 220 230 240 250 260 270 230 240 250 260 270 280 pF1KB6 FISMAASLMQNPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPEL CCDS87 EAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRGIAADGANALLPANRAMAYLKIQKYE 280 290 300 310 320 330 >>CCDS8058.1 STIP1 gene_id:10963|Hs108|chr11 (543 aa) initn: 314 init1: 275 opt: 285 Z-score: 312.8 bits: 66.8 E(32554): 4.2e-11 Smith-Waterman score: 285; 32.5% identity (64.4% similar) in 160 aa overlap (84-241:3-156) 60 70 80 90 100 110 pF1KB6 VSQPLTEMFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIE ....::..::. .. : :..::..::. CCDS80 MEQVNELKEKGNKALSVGNIDDALQCYSEAIK 10 20 30 120 130 140 150 160 170 pF1KB6 LDPNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAV :::.: : : ::.:: .: : : : .: :.. . ..:.:.: . :: ::.:::: CCDS80 LDPHNHVLYSNRSAAYAKKGDYQKAYEDGCKTVDLKPDWGKGYSRKAAALEFLNRFEEAK 40 50 60 70 80 90 180 190 200 210 220 230 pF1KB6 TSYQKALDLDPENDSYKSNLKIAEQKL--REVSSPTGTGLSFDMASLINNPAFISMAASL .:...: . .: . : .:. : .: :. .: :.: .: .. . .: CCDS80 RTYEEGLKHEANNPQLKEGLQNMEARLAERKFMNP------FNMPNLYQKLESDPRTRTL 100 110 120 130 140 240 250 260 270 280 290 pF1KB6 MQNPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHI ...: ..:. CCDS80 LSDPTYRELIEQLRNKPSDLGTKLQDPRIMTTLSVLLGVDLGSMDEEEEIATPPPPPPPK 150 160 170 180 190 200 >>CCDS60827.1 STIP1 gene_id:10963|Hs108|chr11 (590 aa) initn: 312 init1: 273 opt: 283 Z-score: 310.1 bits: 66.4 E(32554): 6e-11 Smith-Waterman score: 283; 32.9% identity (63.9% similar) in 158 aa overlap (86-241:52-203) 60 70 80 90 100 110 pF1KB6 QPLTEMFTSSFCKNDVLPLSNSVPEDVGKADQLKDEGNNHMKEENYAAAVDCYTQAIELD ..::..::. .. 
: :..::..::.:: CCDS60 GQRGYDWQCKRPIRVAEVRSSLHSWSLRWVNELKEKGNKALSVGNIDDALQCYSEAIKLD 30 40 50 60 70 80 120 130 140 150 160 170 pF1KB6 PNNAVYYCNRAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVTS :.: : : ::.:: .: : : : .: :.. . ..:.:.: . :: ::.:::: . CCDS60 PHNHVLYSNRSAAYAKKGDYQKAYEDGCKTVDLKPDWGKGYSRKAAALEFLNRFEEAKRT 90 100 110 120 130 140 180 190 200 210 220 230 pF1KB6 YQKALDLDPENDSYKSNLKIAEQKL--REVSSPTGTGLSFDMASLINNPAFISMAASLMQ :...: . .: . : .:. : .: :. .: :.: .: .. . .:.. CCDS60 YEEGLKHEANNPQLKEGLQNMEARLAERKFMNP------FNMPNLYQKLESDPRTRTLLS 150 160 170 180 190 240 250 260 270 280 290 pF1KB6 NPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSLIQAGQQFAQQIQQQNPELIEQLRNHIRS .: ..:. CCDS60 DPTYRELIEQLRNKPSDLGTKLQDPRIMTTLSVLLGVDLGSMDEEEEIATPPPPPPPKKE 200 210 220 230 240 250 304 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:04:49 2016 done: Fri Nov 4 17:04:49 2016 Total Scan time: 2.420 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]
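
Note on reading the summary table: each row under "The best scores are:" gives an accession, a gene symbol, an annotation field, the library sequence length in parentheses, and then the opt score, bit score, and E() value. The following is a minimal Python sketch, not part of the FASTA program, for pulling those rows into records. The names `Hit`, `HIT_RE`, and `parse_hits` are hypothetical, and the regular expression assumes the three-token description layout seen in this particular report; other libraries may format the description differently.

```python
import re
from typing import Iterable, List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'The best scores are:' table (hypothetical record type)."""
    accession: str
    gene: str
    annotation: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row layout: accession, gene, annotation, "( len)", opt, bits, E-value.
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<gene>\S+)\s+(?P<ann>\S+)\s+"
    r"\(\s*(?P<len>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<e>\d+(?:\.\d+)?(?:[eE][+-]?\d+)?)\s*$"
)


def parse_hits(lines: Iterable[str]) -> List[Hit]:
    """Return a Hit for every line that matches the summary-row layout."""
    hits = []
    for line in lines:
        m = HIT_RE.match(line.strip())
        if m is None:
            continue  # header lines, alignment lines, etc. are skipped
        hits.append(
            Hit(
                accession=m.group("acc"),
                gene=m.group("gene"),
                annotation=m.group("ann"),
                length=int(m.group("len")),
                opt=int(m.group("opt")),
                bits=float(m.group("bits")),
                evalue=float(m.group("e")),
            )
        )
    return hits


# The six summary rows from this report, copied verbatim.
SAMPLE_ROWS = [
    "CCDS3988.1 SGTB gene_id:54557|Hs108|chr5 ( 304) 1954 400.5 8.1e-112",
    "CCDS12094.1 SGTA gene_id:6449|Hs108|chr19 ( 313) 1123 234.3 9e-62",
    "CCDS53783.1 RPAP3 gene_id:79657|Hs108|chr12 ( 631) 307 71.2 2.3e-12",
    "CCDS8753.1 RPAP3 gene_id:79657|Hs108|chr12 ( 665) 307 71.2 2.4e-12",
    "CCDS8058.1 STIP1 gene_id:10963|Hs108|chr11 ( 543) 285 66.8 4.2e-11",
    "CCDS60827.1 STIP1 gene_id:10963|Hs108|chr11 ( 590) 283 66.4 6e-11",
]

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE_ROWS):
        print(hit.accession, hit.gene, hit.length, hit.opt, hit.bits, hit.evalue)
```

Alignment header lines such as ">>CCDS3988.1 SGTB gene_id:54557|Hs108|chr5 (304 aa)" are deliberately not matched, because their parenthesized field contains "aa" rather than a bare length.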