FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB4046, 445 aa
  1>>>pF1KB4046 445 - 445 aa - 445 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3588+/-0.00074; mu= 13.1752+/- 0.044
 mean_var=89.0577+/-18.252, 0's: 0 Z-trim(111.0): 8 B-trim: 575 in 1/52
 Lambda= 0.135906
 statistics sampled from 12001 (12005) to 12001 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.722), E-opt: 0.2 (0.369), width: 16
 Scan time: 3.300

The best scores are:                                   opt  bits E(32554)
CCDS4458.1 MGAT1 gene_id:4245|Hs108|chr5      ( 445)  3079 613.4 1.5e-175
CCDS531.1 POMGNT1 gene_id:55624|Hs108|chr1    ( 660)   476 103.1 8.9e-22
CCDS57995.1 POMGNT1 gene_id:55624|Hs108|chr1  ( 748)   476 103.1 9.9e-22

>>CCDS4458.1 MGAT1 gene_id:4245|Hs108|chr5 (445 aa)
 initn: 3079 init1: 3079 opt: 3079 Z-score: 3265.4 bits: 613.4 E(32554): 1.5e-175
Smith-Waterman score: 3079; 100.0% identity (100.0% similar) in 445 aa overlap (1-445:1-445)

               10        20        30        40        50        60
pF1KB4 MLKKQSAGLVLWGAILFVAWNALLLLFFWTRPAPGRPPSVSALDGDPASLTREVIRLAQD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MLKKQSAGLVLWGAILFVAWNALLLLFFWTRPAPGRPPSVSALDGDPASLTREVIRLAQD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB4 AEVELERQRGLLQQIGDALSSQRGRVPTAAPPAQPRVPVTPAPAVIPILVIACDRSTVRR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 AEVELERQRGLLQQIGDALSSQRGRVPTAAPPAQPRVPVTPAPAVIPILVIACDRSTVRR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB4 CLDKLLHYRPSAELFPIIVSQDCGHEETAQAIASYGSAVTHIRQPDLSSIAVPPDHRKFQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 CLDKLLHYRPSAELFPIIVSQDCGHEETAQAIASYGSAVTHIRQPDLSSIAVPPDHRKFQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB4 GYYKIARHYRWALGQVFRQFRFPAAVVVEDDLEVAPDFFEYFRATYPLLKADPSLWCVSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 GYYKIARHYRWALGQVFRQFRFPAAVVVEDDLEVAPDFFEYFRATYPLLKADPSLWCVSA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB4 WNDNGKEQMVDASRPELLYRTDFFPGLGWLLLAELWAELEPKWPKAFWDDWMRRPEQRQG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 WNDNGKEQMVDASRPELLYRTDFFPGLGWLLLAELWAELEPKWPKAFWDDWMRRPEQRQG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB4 RACIRPEISRTMTFGRKGVSHGQFFDQHLKFIKLNQQFVHFTQLDLSYLQREAYDRDFLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RACIRPEISRTMTFGRKGVSHGQFFDQHLKFIKLNQQFVHFTQLDLSYLQREAYDRDFLA
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB4 RVYGAPQLQVEKVRTNDRKELGEVRVQYTGRDSFKAFAKALGVMDDLKSGVPRAGYRGIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 RVYGAPQLQVEKVRTNDRKELGEVRVQYTGRDSFKAFAKALGVMDDLKSGVPRAGYRGIV
              370       380       390       400       410       420

              430       440
pF1KB4 TFQFRGRRVHLAPPLTWEGYDPSWN
       :::::::::::::::::::::::::
CCDS44 TFQFRGRRVHLAPPLTWEGYDPSWN
              430       440

>>CCDS531.1 POMGNT1 gene_id:55624|Hs108|chr1 (660 aa)
 initn: 425 init1: 170 opt: 476 Z-score: 504.5 bits: 103.1 E(32554): 8.9e-22
Smith-Waterman score: 519; 33.9% identity (59.6% similar) in 342 aa overlap (36-355:227-542)

       10 20 30 40 50 60
pF1KB4 SAGLVLWGAILFVAWNALLLLFFWTRPAPGRPPSVSALDGDPASLTREV-IRLAQDAE--
       . :..:. :::. : .: . :..::
CCDS53 GSQAGPALGWRDTWAFVGRKGGPVFGEKHSKSPALSSW-GDPVLLKTDVPLSSAEEAECH
       200 210 220 230 240 250

       70 80 90 100 110
pF1KB4 ---VELERQRGLLQQIGDALSSQRGRVPTAAPPAQPRVPVTPAPAV----IPILVIACDR
       .::.:.: ... . . . : : . :. . : : .:. ::: .:
CCDS53 WADTELNRRR---RRFCSKVEGY-GSVCSCKDPTPIEFSPDPLPDNKVLNVPVAVIAGNR
       260 270 280 290 300 310

       120 130 140 150 160 170
pF1KB4 ST-VRRCLDKLLHYRP-SAELFPIIVSQDCGHEETAQAIASYGSAVTHIRQPDLSSIAVP
       . . : : .:: . : ... ... : .:: ...: .: .: . . :..
CCDS53 PNYLYRMLRSLLSAQGVSPQMITVFI--DGYYEEPMDVVALFG-----LRGIQHTPISIK
       320 330 340 350 360

       180 190 200 210 220 230
pF1KB4 PDHRKFQGYYKIARHYRWALGQVFRQF---RFPAAVVVEDDLEVAPDFFEYFRATYPLLK
       ....::. .: .: : .: :::.:.::..: ::: .. . ::.
CCDS53 NA--------RVSQHYKASLTATFNLFPEAKF--AVVLEEDLDIAVDFFSFLSQSIHLLE
       370 380 390 400 410

       240 250 260 270 280
pF1KB4 ADPSLWCVSAWNDNGKEQMVDASRPELLYRTDFFPGLGWLLLAELWAE-LEPKWPKAF--
       : ::.:.:::::.: :. : : ::::.. .:::::.: :. : ::::::
CCDS53 EDDSLYCISAWNDQGYEH--TAEDPALLYRVETMPGLGWVLRRSLYKEELEPKWPTPEKL
       420 430 440 450 460 470

       290 300 310 320 330 340
pF1KB4 --WDDWMRRPEQRQGRACIRPEISRTMTFGRKGVS-HGQFFDQHLKFIKLNQQFVHFTQL
       :: ::: ::::.:: :: :..::.. :: :.. .: : . ..: :.: : .::
CCDS53 WDWDMWMRMPEQRRGRECIIPDVSRSYHFGIVGLNMNGYFHEAYFKKHKFNT--VPGVQL
       480 490 500 510 520 530

       350 360 370 380 390 400
pF1KB4 -DLSYLQREAYDRDFLARVYGAPQLQVEKVRTNDRKELGEVRVQYTGRDSFKAFAKALGV
       ... :..:::.
CCDS53 RNVDSLKKEAYEVEVHRLLSEAEVLDHSKNPCEDSFLPDTEGHTYVAFIRMEKDDDFTTW
       540 550 560 570 580 590

>>CCDS57995.1 POMGNT1 gene_id:55624|Hs108|chr1 (748 aa)
 initn: 425 init1: 170 opt: 476 Z-score: 503.6 bits: 103.1 E(32554): 9.9e-22
Smith-Waterman score: 519; 33.9% identity (59.6% similar) in 342 aa overlap (36-355:227-542)

       10 20 30 40 50 60
pF1KB4 SAGLVLWGAILFVAWNALLLLFFWTRPAPGRPPSVSALDGDPASLTREV-IRLAQDAE--
       . :..:. :::. : .: . :..::
CCDS57 GSQAGPALGWRDTWAFVGRKGGPVFGEKHSKSPALSSW-GDPVLLKTDVPLSSAEEAECH
       200 210 220 230 240 250

       70 80 90 100 110
pF1KB4 ---VELERQRGLLQQIGDALSSQRGRVPTAAPPAQPRVPVTPAPAV----IPILVIACDR
       .::.:.: ... . . . : : . :. . : : .:. ::: .:
CCDS57 WADTELNRRR---RRFCSKVEGY-GSVCSCKDPTPIEFSPDPLPDNKVLNVPVAVIAGNR
       260 270 280 290 300 310

       120 130 140 150 160 170
pF1KB4 ST-VRRCLDKLLHYRP-SAELFPIIVSQDCGHEETAQAIASYGSAVTHIRQPDLSSIAVP
       . . : : .:: . : ... ... : .:: ...: .: .: . . :..
CCDS57 PNYLYRMLRSLLSAQGVSPQMITVFI--DGYYEEPMDVVALFG-----LRGIQHTPISIK
       320 330 340 350 360

       180 190 200 210 220 230
pF1KB4 PDHRKFQGYYKIARHYRWALGQVFRQF---RFPAAVVVEDDLEVAPDFFEYFRATYPLLK
       ....::. .: .: : .: :::.:.::..: ::: .. . ::.
CCDS57 NA--------RVSQHYKASLTATFNLFPEAKF--AVVLEEDLDIAVDFFSFLSQSIHLLE
       370 380 390 400 410

       240 250 260 270 280
pF1KB4 ADPSLWCVSAWNDNGKEQMVDASRPELLYRTDFFPGLGWLLLAELWAE-LEPKWPKAF--
       : ::.:.:::::.: :. : : ::::.. .:::::.: :. : ::::::
CCDS57 EDDSLYCISAWNDQGYEH--TAEDPALLYRVETMPGLGWVLRRSLYKEELEPKWPTPEKL
       420 430 440 450 460 470

       290 300 310 320 330 340
pF1KB4 --WDDWMRRPEQRQGRACIRPEISRTMTFGRKGVS-HGQFFDQHLKFIKLNQQFVHFTQL
       :: ::: ::::.:: :: :..::.. :: :.. .: : . ..: :.: : .::
CCDS57 WDWDMWMRMPEQRRGRECIIPDVSRSYHFGIVGLNMNGYFHEAYFKKHKFNT--VPGVQL
       480 490 500 510 520 530

       350 360 370 380 390 400
pF1KB4 -DLSYLQREAYDRDFLARVYGAPQLQVEKVRTNDRKELGEVRVQYTGRDSFKAFAKALGV
       ... :..:::.
CCDS57 RNVDSLKKEAYEVEVHRLLSEAEVLDHSKNPCEDSFLPDTEGHTYVAFIRMEKDDDFTTW
       540 550 560 570 580 590

445 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 02:48:41 2016 done: Fri Nov 4 02:48:41 2016
 Total Scan time: 3.300 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]