FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9185, 376 aa
  1>>>pF1KB9185 376 - 376 aa - 376 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1646+/-0.000968; mu= 17.2183+/- 0.058
 mean_var=62.5109+/-12.661, 0's: 0 Z-trim(104.1): 14 B-trim: 0 in 0/49
 Lambda= 0.162217
 statistics sampled from 7719 (7725) to 7719 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.61), E-opt: 0.2 (0.237), width: 16
 Scan time: 2.610

The best scores are:                                  opt  bits E(32554)
CCDS13703.1 AGPAT3 gene_id:56894|Hs108|chr21  ( 376) 2553 606.3 1.4e-173
CCDS5280.1 AGPAT4 gene_id:56895|Hs108|chr6    ( 378) 1661 397.5 9.8e-111
CCDS42670.1 LCLAT1 gene_id:253558|Hs108|chr2  ( 376)  386  99.2 6.5e-21
CCDS1772.1 LCLAT1 gene_id:253558|Hs108|chr2   ( 414)  386  99.2 7.1e-21
CCDS34796.1 AGPAT5 gene_id:55326|Hs108|chr8   ( 364)  307  80.7 2.3e-15

>>CCDS13703.1 AGPAT3 gene_id:56894|Hs108|chr21 (376 aa)
 initn: 2553 init1: 2553 opt: 2553 Z-score: 3229.7 bits: 606.3 E(32554): 1.4e-173
Smith-Waterman score: 2553; 100.0% identity (100.0% similar) in 376 aa overlap (1-376:1-376)

               10        20        30        40        50        60
pF1KB9 MGLLAFLKTQFVLHLLVGFVFVVSGLVINFVQLCTLALWPVSKQLYRRLNCRLAYSLWSQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MGLLAFLKTQFVLHLLVGFVFVVSGLVINFVQLCTLALWPVSKQLYRRLNCRLAYSLWSQ
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 LVMLLEWWSCTECTLFTDQATVERFGKEHAVIILNHNFEIDFLCGWTMCERFGVLGSSKV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 LVMLLEWWSCTECTLFTDQATVERFGKEHAVIILNHNFEIDFLCGWTMCERFGVLGSSKV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 LAKKELLYVPLIGWTWYFLEIVFCKRKWEEDRDTVVEGLRRLSDYPEYMWFLLYCEGTRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 LAKKELLYVPLIGWTWYFLEIVFCKRKWEEDRDTVVEGLRRLSDYPEYMWFLLYCEGTRF
              130       140       150       160       170       180

              190       200       210       220       230
240 pF1KB9 TETKHRVSMEVAAAKGLPVLKYHLLPRTKGFTTAVKCLRGTVAAVYDVTLNFRGNKNPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 TETKHRVSMEVAAAKGLPVLKYHLLPRTKGFTTAVKCLRGTVAAVYDVTLNFRGNKNPSL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 LGILYGKKYEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDALQEIYNQKGMFPGEQFK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 LGILYGKKYEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDALQEIYNQKGMFPGEQFK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 PARRPWTLLNFLSWATILLSPLFSFVLGVFASGSPLLILTFLGFVGAASFGVRRLIGVTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 PARRPWTLLNFLSWATILLSPLFSFVLGVFASGSPLLILTFLGFVGAASFGVRRLIGVTE 310 320 330 340 350 360 370 pF1KB9 IEKGSSYGNQEFKKKE :::::::::::::::: CCDS13 IEKGSSYGNQEFKKKE 370 >>CCDS5280.1 AGPAT4 gene_id:56895|Hs108|chr6 (378 aa) initn: 1700 init1: 1651 opt: 1661 Z-score: 2101.5 bits: 397.5 E(32554): 9.8e-111 Smith-Waterman score: 1661; 61.9% identity (84.8% similar) in 375 aa overlap (1-375:1-375) 10 20 30 40 50 60 pF1KB9 MGLLAFLKTQFVLHLLVGFVFVVSGLVINFVQLCTLALWPVSKQLYRRLNCRLAYSLWSQ : : ..::.::. ::. .::..:::.:: .:: :: :::..:::.:..::::.: . :: CCDS52 MDLAGLLKSQFLCHLVFCYVFIASGLIINTIQLFTLLLWPINKQLFRKINCRLSYCISSQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 LVMLLEWWSCTECTLFTDQATVERFGKEHAVIILNHNFEIDFLCGWTMCERFGVLGSSKV ::::::::: ::::.::: . ..:::.:...:::.:::::::::.. ::::.::.::: CCDS52 LVMLLEWWSGTECTIFTDPRAYLKYGKENAIVVLNHKFEIDFLCGWSLSERFGLLGGSKV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 LAKKELLYVPLIGWTWYFLEIVFCKRKWEEDRDTVVEGLRRLSDYPEYMWFLLYCEGTRF :::::: :::.::: ::: :.:::.::::.:: ::. .:..: :::: ..::..:::::: CCDS52 LAKKELAYVPIIGWMWYFTEMVFCSRKWEQDRKTVATSLQHLRDYPEKYFFLIHCEGTRF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 TETKHRVSMEVAAAKGLPVLKYHLLPRTKGFTTAVKCLRGTVAAVYDVTLNFRGNKNPSL :: ::..::.:: ::::: ::.:::::::::. .:. 
::..:.:::: :::::.:.::.: CCDS52 TEKKHEISMQVARAKGLPRLKHHLLPRTKGFAITVRSLRNVVSAVYDCTLNFRNNENPTL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 LGILYGKKYEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDALQEIYNQKGMFPGEQFK ::.: ::::.::. :::.:::::: :. : . :::::::::::.:: : . : :: . CCDS52 LGVLNGKKYHADLYVRRIPLEDIPEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB9 PARRPWTLLNFLSWATILLSPLFSFVLGVFASGSPLLILTFLGFVGAASFGVRRLIGVTE : ::::::.:.: ::...: :.:.:..... ::: : . .:. .:: ::: .::::: CCDS52 PPRRPWTLVNWLFWASLVLYPFFQFLVSMIRSGSSLTLASFILVFFVASVGVRWMIGVTE 310 320 330 340 350 360 370 pF1KB9 IEKGSSYGNQEFKKKE :.:::.:::.. :.: CCDS52 IDKGSAYGNSDSKQKLND 370 >>CCDS42670.1 LCLAT1 gene_id:253558|Hs108|chr2 (376 aa) initn: 316 init1: 256 opt: 386 Z-score: 488.9 bits: 99.2 E(32554): 6.5e-21 Smith-Waterman score: 408; 30.5% identity (57.2% similar) in 348 aa overlap (11-345:9-344) 10 20 30 40 50 60 pF1KB9 MGLLAFLKTQFVLHLLVGFVFVVSGLVINFVQLCTLALWPVSKQLYRRLNCRLAYSLWSQ :.: :. : : : . :. : : :. . :: .: ::. . : CCDS42 MVSWKGIYFILTLFWGSFF---GSI--FMLSPFLPLMFVNPSWYRWINNRLVAT-WLT 10 20 30 40 50 70 80 90 100 110 pF1KB9 L-VMLLEWWSCTECTLFTDQATVERFGKEHAVIILNHNFEIDFLCGWTMCERFGVLGSSK : : ::: .. ...: .: : : :..:::.:: ..:.. :. :.. : : CCDS42 LPVALLETMFGVK-VIITGDAFVP--G-ERSVIIMNHRTRMDWMFLWNCLMRYSYLRLEK 60 70 80 90 100 120 130 140 150 160 170 pF1KB9 VLAKKELLYVPLIGWTWYFLEIVFCKRKWEEDRDTVVEGLRRLSDYPEYMWFLLYCEGTR . : : :: .::. .: .:::..:.. . . . : : . .:.. ::: CCDS42 ICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEPLQLLIFPEGTD 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB9 FTETKHRVSMEVAAAKGLPVLKYHLLPRTKGFTTAVKCLR-G-TVAAVYDVTLNFRGNKN .::... : : .:: .: : ::: ::: .: :: : .. ::.:.:. . : CCDS42 LTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFVVDRLREGKNLDAVHDITVAYPHNIP 170 180 190 200 210 220 240 250 260 270 280 290 pF1KB9 PSLLGILYGK-KYEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDA-LQEIYN-QKGM- : .: : : . :.:.:.. .: .... : :: ..::. :. .:. .:.. 
CCDS42 QSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNFY 230 240 250 260 270 280 300 310 320 330 340 pF1KB9 FPGEQFKPARRPWT------LLNFLSWATILLSPLFSFVLGVFASGSPLLILTFLGFVGA : :.. : . ::..: :. :.:: . ... ... . .:.:.. :: CCDS42 FTGQSVIPPCKSELRVLVVKLLSILYWT--LFSPAMCLLIYLYSLVKWYFIITIVIFVLQ 290 300 310 320 330 340 350 360 370 pF1KB9 ASFGVRRLIGVTEIEKGSSYGNQEFKKKE CCDS42 ERIFGGLEIIELACYRLLHKQPHLNSKKNE 350 360 370 >>CCDS1772.1 LCLAT1 gene_id:253558|Hs108|chr2 (414 aa) initn: 316 init1: 256 opt: 386 Z-score: 488.3 bits: 99.2 E(32554): 7.1e-21 Smith-Waterman score: 408; 30.5% identity (57.2% similar) in 348 aa overlap (11-345:47-382) 10 20 30 40 pF1KB9 MGLLAFLKTQFVLHLLVGFVFVVSGLVINFVQLCTLALWP :.: :. : : : . :. : : CCDS17 INEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFF---GSI--FMLSPFLPLMF 20 30 40 50 60 70 50 60 70 80 90 pF1KB9 VSKQLYRRLNCRLAYSLWSQL-VMLLEWWSCTECTLFTDQATVERFGKEHAVIILNHNFE :. . :: .: ::. . : : : ::: .. ...: .: : : :..:::.:: . CCDS17 VNPSWYRWINNRLVAT-WLTLPVALLETMFGVK-VIITGDAFVP--G-ERSVIIMNHRTR 80 90 100 110 120 100 110 120 130 140 150 pF1KB9 IDFLCGWTMCERFGVLGSSKVLAKKELLYVPLIGWTWYFLEIVFCKRKWEEDRDTVVEGL .:.. :. :.. : :. : : :: .::. .: .:::..:.. . . CCDS17 MDWMFLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMI 130 140 150 160 170 180 160 170 180 190 200 210 pF1KB9 RRLSDYPEYMWFLLYCEGTRFTETKHRVSMEVAAAKGLPVLKYHLLPRTKGFTTAVKCLR . : : . .:.. ::: .::... : : .:: .: : ::: ::: .: :: CCDS17 DYFCDIHEPLQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFVVDRLR 190 200 210 220 230 240 220 230 240 250 260 270 pF1KB9 -G-TVAAVYDVTLNFRGNKNPSLLGILYGK-KYEADMCVRRFPLEDIPLDEKEAAQWLHK : .. ::.:.:. . : : .: : : . :.:.:.. .: .... : :: CCDS17 EGKNLDAVHDITVAYPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHK 250 260 270 280 290 300 280 290 300 310 320 pF1KB9 LYQEKDA-LQEIYN-QKGM-FPGEQFKPARRPWT------LLNFLSWATILLSPLFSFVL ..::. :. .:. .:.. : :.. : . ::..: :. :.:: . ... 
CCDS17 RWEEKEERLRSFYQGEKNFYFTGQSVIPPCKSELRVLVVKLLSILYWT--LFSPAMCLLI 310 320 330 340 350 360 330 340 350 360 370 pF1KB9 GVFASGSPLLILTFLGFVGAASFGVRRLIGVTEIEKGSSYGNQEFKKKE ... . .:.:.. :: CCDS17 YLYSLVKWYFIITIVIFVLQERIFGGLEIIELACYRLLHKQPHLNSKKNE 370 380 390 400 410 >>CCDS34796.1 AGPAT5 gene_id:55326|Hs108|chr8 (364 aa) initn: 265 init1: 128 opt: 307 Z-score: 389.2 bits: 80.7 E(32554): 2.3e-15 Smith-Waterman score: 314; 25.4% identity (58.5% similar) in 272 aa overlap (44-297:45-310) 20 30 40 50 60 70 pF1KB9 HLLVGFVFVVSGLVINFVQLCTLALWPVSKQLYRRLNCRLAYSLWSQLVMLLEWWSCTEC ..:. :. :: :......: .. .. CCDS34 LLPSVVLLGTAPTYVLAWGVWRLLSAFLPARFYQALDDRLYCVYQSMVLFFFENYTGVQI 20 30 40 50 60 70 80 90 100 110 120 130 pF1KB9 TLFTDQATVERFGKEHAVIILNHNFEIDFLCGWTMCERFGVLGSSKVLAKKELLYVPLIG :. : .::. . . ::. .:.. . . : ..:: . . :. : ..:: : CCDS34 LLYGDLPK----NKENIIYLANHQSTVDWIVADILAIRQNALGHVRYVLKEGLKWLPLYG 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 WTWYFLEI--VFCKRKWEEDRDTVVEGLRRLSDYPEYMWFLLYCEGTRFTETKHRV---S :: . .. ::. . .. . . :. : :..... ::::.. . .: : CCDS34 C--YFAQHGGIYVKRSAKFNEKEMRNKLQSYVDAGTPMYLVIFPEGTRYNPEQTKVLSAS 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 MEVAAAKGLPVLKYHLLPRTKGFTTAVKCLRGTVAAVYDVTLNFRGN-------KNPSLL . :: .:: :::. : :: :. .: :... . :.::::. ..:. ..:.. CCDS34 QAFAAQRGLAVLKHVLTPRIKATHVAFDCMKNYLDAIYDVTVVYEGKDDGGQRRESPTMT 190 200 210 220 230 240 250 260 270 280 290 pF1KB9 GILYGKKYEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDA-LQEIYN-----QKGMFP .: . . . . :. .:.: .... .:::. .. :: : :.:. .. :: CCDS34 EFLCKECPKIHIHIDRIDKKDVPEEQEHMRRWLHERFEIKDKMLIEFYESPDPERRKRFP 250 260 270 280 290 300 300 310 320 330 340 350 pF1KB9 GEQFKPARRPWTLLNFLSWATILLSPLFSFVLGVFASGSPLLILTFLGFVGAASFGVRRL :. 
CCDS34 GKSVNSKLSIKKTLPSMLILSGLTAGMLMTDAGRKLYVNTWIYGTLLGCLWVTIKA
      310       320       330       340       350       360

376 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 19:40:49 2016 done: Sat Nov 5 19:40:50 2016
 Total Scan time: 2.610 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]