FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5673, 550 aa
  1>>>pF1KB5673 550 - 550 aa - 550 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4459+/- 0.001; mu= 17.8842+/- 0.060
 mean_var=66.7565+/-13.945, 0's: 0 Z-trim(103.7): 35 B-trim: 0 in 0/49
 Lambda= 0.156974
 statistics sampled from 7520 (7537) to 7520 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.589), E-opt: 0.2 (0.232), width: 16
 Scan time: 3.170

The best scores are:                                  opt  bits  E(32554)
CCDS1330.1 SOAT1 gene_id:6646|Hs108|chr1      ( 550) 3753 859.2         0
CCDS58047.1 SOAT1 gene_id:6646|Hs108|chr1     ( 492) 3385 775.9         0
CCDS58048.1 SOAT1 gene_id:6646|Hs108|chr1     ( 485) 3342 766.1         0
CCDS8847.1 SOAT2 gene_id:8435|Hs108|chr12     ( 522) 1711 396.8  3.4e-110
CCDS6420.1 DGAT1 gene_id:8694|Hs108|chr8      ( 488)  405 101.0   3.4e-21

>>CCDS1330.1 SOAT1 gene_id:6646|Hs108|chr1 (550 aa)
 initn: 3753 init1: 3753 opt: 3753 Z-score: 4590.8 bits: 859.2 E(32554): 0
Smith-Waterman score: 3753; 100.0% identity (100.0% similar) in 550 aa overlap (1-550:1-550)

               10        20        30        40        50        60
pF1KB5 MVGEEKMSLRNRLSKSRENPEEDEDQRNPAKESLETPSNGRIDIKQLIAKKIKLTAEAEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MVGEEKMSLRNRLSKSRENPEEDEDQRNPAKESLETPSNGRIDIKQLIAKKIKLTAEAEE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 LKPFFMKEVGSHFDDFVTNLIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 LKPFFMKEVGSHFDDFVTNLIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 IFIARRSLLDELLEVDHIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 IFIARRSLLDELLEVDHIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 FPTVVWTWWIMFLSTFSVPYFLFQHWATGYSKSSHPLIRSLFHGFLFMIFQIGVLGFGPT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FPTVVWTWWIMFLSTFSVPYFLFQHWATGYSKSSHPLIRSLFHGFLFMIFQIGVLGFGPT
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 YVVLAYTLPPASRFIIIFEQIRFVMKAHSFVRENVPRVLNSAKEKSSTVPIPTVNQYLYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 YVVLAYTLPPASRFIIIFEQIRFVMKAHSFVRENVPRVLNSAKEKSSTVPIPTVNQYLYF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 LFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYIFERLCAPLFRNIKQEPFSAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 LFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYIFERLCAPLFRNIKQEPFSAR
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 VLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWNSTSYSNYYRTW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 VLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWNSTSYSNYYRTW
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB5 NVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLSFFYPVLFVLFM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 NVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLSFFYPVLFVLFM
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB5 FFGMAFNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYARQHCPLKNPTFLDYVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 FFGMAFNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYARQHCPLKNPTFLDYVR
              490       500       510       520       530       540

              550
pF1KB5 PRSWTCRYVF
       ::::::::::
CCDS13 PRSWTCRYVF
              550

>>CCDS58047.1 SOAT1 gene_id:6646|Hs108|chr1 (492 aa)
 initn: 3385 init1: 3385 opt: 3385 Z-score: 4141.2 bits: 775.9 E(32554): 0
Smith-Waterman score: 3385; 100.0% identity (100.0% similar) in 491 aa overlap (60-550:2-492)

      30        40        50        60        70        80
pF1KB5 AKESLETPSNGRIDIKQLIAKKIKLTAEAEELKPFFMKEVGSHFDDFVTNLIEKSASLDN
                                     ::::::::::::::::::::::::::::::
CCDS58                             MELKPFFMKEVGSHFDDFVTNLIEKSASLDN
                                            10        20        30

      90       100       110       120       130       140
pF1KB5 GGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVDHIRTIYHMFIALL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 GGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVDHIRTIYHMFIALL
              40        50        60        70        80        90

     150       160       170       180       190       200
pF1KB5 ILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTWWIMFLSTFSVPYFLFQHWATG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTWWIMFLSTFSVPYFLFQHWATG
             100       110       120       130       140       150

     210       220       230       240       250       260
pF1KB5 YSKSSHPLIRSLFHGFLFMIFQIGVLGFGPTYVVLAYTLPPASRFIIIFEQIRFVMKAHS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 YSKSSHPLIRSLFHGFLFMIFQIGVLGFGPTYVVLAYTLPPASRFIIIFEQIRFVMKAHS
             160       170       180       190       200       210

     270       280       290       300       310       320
pF1KB5 FVRENVPRVLNSAKEKSSTVPIPTVNQYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 FVRENVPRVLNSAKEKSSTVPIPTVNQYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQ
             220       230       240       250       260       270

     330       340       350       360       370       380
pF1KB5 VFGCFFYVYYIFERLCAPLFRNIKQEPFSARVLVLCVFNSILPGVLILFLTFFAFLHCWL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 VFGCFFYVYYIFERLCAPLFRNIKQEPFSARVLVLCVFNSILPGVLILFLTFFAFLHCWL
             280       290       300       310       320       330

     390       400       410       420       430       440
pF1KB5 NAFAEMLRFGDRMFYKDWWNSTSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 NAFAEMLRFGDRMFYKDWWNSTSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAM
             340       350       360       370       380       390

     450       460       470       480       490       500
pF1KB5 LAVFAVSAVVHEYALAVCLSFFYPVLFVLFMFFGMAFNFIVNDSRKKPIWNVLMWTSLFL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LAVFAVSAVVHEYALAVCLSFFYPVLFVLFMFFGMAFNFIVNDSRKKPIWNVLMWTSLFL
             400       410       420       430       440       450

     510       520       530       540       550
pF1KB5 GNGVLLCFYSQEWYARQHCPLKNPTFLDYVRPRSWTCRYVF
       :::::::::::::::::::::::::::::::::::::::::
CCDS58 GNGVLLCFYSQEWYARQHCPLKNPTFLDYVRPRSWTCRYVF
             460       470       480       490

>>CCDS58048.1 SOAT1 gene_id:6646|Hs108|chr1 (485 aa)
 initn: 3342 init1: 3342 opt: 3342 Z-score: 4088.6 bits: 766.1 E(32554): 0
Smith-Waterman score: 3342; 100.0% identity (100.0% similar) in 485 aa overlap (66-550:1-485)

          40        50        60        70        80        90
pF1KB5 TPSNGRIDIKQLIAKKIKLTAEAEELKPFFMKEVGSHFDDFVTNLIEKSASLDNGGCALT
                                     ::::::::::::::::::::::::::::::
CCDS58                               MKEVGSHFDDFVTNLIEKSASLDNGGCALT
                                             10        20        30

         100       110       120       130       140       150
pF1KB5 TFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVDHIRTIYHMFIALLILFILS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 TFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVDHIRTIYHMFIALLILFILS
               40        50        60        70        80        90

         160       170       180       190       200       210
pF1KB5 TLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTWWIMFLSTFSVPYFLFQHWATGYSKSSH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 TLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTWWIMFLSTFSVPYFLFQHWATGYSKSSH
              100       110       120       130       140       150

         220       230       240       250       260       270
pF1KB5 PLIRSLFHGFLFMIFQIGVLGFGPTYVVLAYTLPPASRFIIIFEQIRFVMKAHSFVRENV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PLIRSLFHGFLFMIFQIGVLGFGPTYVVLAYTLPPASRFIIIFEQIRFVMKAHSFVRENV
              160       170       180       190       200       210

         280       290       300       310       320       330
pF1KB5 PRVLNSAKEKSSTVPIPTVNQYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PRVLNSAKEKSSTVPIPTVNQYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFF
              220       230       240       250       260       270

         340       350       360       370       380       390
pF1KB5 YVYYIFERLCAPLFRNIKQEPFSARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 YVYYIFERLCAPLFRNIKQEPFSARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEM
              280       290       300       310       320       330

         400       410       420       430       440       450
pF1KB5 LRFGDRMFYKDWWNSTSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 LRFGDRMFYKDWWNSTSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAV
              340       350       360       370       380       390

         460       470       480       490       500       510
pF1KB5 SAVVHEYALAVCLSFFYPVLFVLFMFFGMAFNFIVNDSRKKPIWNVLMWTSLFLGNGVLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 SAVVHEYALAVCLSFFYPVLFVLFMFFGMAFNFIVNDSRKKPIWNVLMWTSLFLGNGVLL
              400       410       420       430       440       450

         520       530       540       550
pF1KB5 CFYSQEWYARQHCPLKNPTFLDYVRPRSWTCRYVF
       :::::::::::::::::::::::::::::::::::
CCDS58 
CFYSQEWYARQHCPLKNPTFLDYVRPRSWTCRYVF 460 470 480 >>CCDS8847.1 SOAT2 gene_id:8435|Hs108|chr12 (522 aa) initn: 1748 init1: 1196 opt: 1711 Z-score: 2091.9 bits: 396.8 E(32554): 3.4e-110 Smith-Waterman score: 1711; 55.7% identity (80.1% similar) in 433 aa overlap (116-546:96-520) 90 100 110 120 130 140 pF1KB5 SLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGK--IFIARRSLLDELLEVDHIRTIYH : :: .:: :.::::::.::.:.::::: CCDS88 RAMREAIQSYPSQDKPLPPPPPGSLSRTQEPSLGKQKVFIIRKSLLDELMEVQHFRTIYH 70 80 90 100 110 120 150 160 170 180 190 200 pF1KB5 MFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTWWIMFLSTFSVPYFLF :::: : .::.:::..:.::::::.:::.:: ..::..: .. :: :::::. .:: . CCDS88 MFIAGLCVFIISTLAIDFIDEGRLLLEFDLLIFSFGQLPLALVTWVPMFLSTLLAPYQAL 130 140 150 160 170 180 210 220 230 240 250 260 pF1KB5 QHWATGYSKSSHPLIRSLFHGFLFMIFQIGVLGFGPTYVVLAYTLPPASRFIIIFEQIRF . :: : .. : : .. . :: :..:.. . :::::: ...:::.:: CCDS88 RLWARGTWTQATGL------GCALLAAHAVVLCALPVHVAVEHQLPPASRCVLVFEQVRF 190 200 210 220 230 270 280 290 300 310 320 pF1KB5 VMKAHSFVRENVPRVLNSAKEKSSTVPIPTVNQYLYFLFAPTLIYRDSYPRNPTVRWGYV .::..::.:: :: .: . ... . :. ..:::::: ::::::..:::.: :::.:: CCDS88 LMKSYSFLREAVPGTLRA--RRGEGIQAPSFSSYLYFLFCPTLIYRETYPRTPYVRWNYV 240 250 260 270 280 290 330 340 350 360 370 380 pF1KB5 AMKFAQVFGCFFYVYYIFERLCAPLFRNIKQEPFSARVLVLCVFNSILPGVLILFLTFFA : .:::..:: .:. .:. :::.:.: :...::::.:.::: .... :::...:.: ::: CCDS88 AKNFAQALGCVLYACFILGRLCVPVFANMSREPFSTRALVLSILHATLPGIFMLLLIFFA 300 310 320 330 340 350 390 400 410 420 430 440 pF1KB5 FLHCWLNAFAEMLRFGDRMFYKDWWNSTSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKR :::::::::::::::::::::.:::::::.:::::::::::::::: :.:.: : ... : CCDS88 FLHCWLNAFAEMLRFGDRMFYRDWWNSTSFSNYYRTWNVVVHDWLYSYVYQDGLRLLGAR 360 370 380 390 400 410 450 460 470 480 490 500 pF1KB5 FKSAAMLAVFAVSAVVHEYALAVCLSFFYPVLFVLFMFFGMAFNFIVNDSRKKPIWNVLM ...:::.:: ::::.::: . :.:::::...::. 
.: .::...:.: : ::::: CCDS88 ARGVAMLGVFLVSAVAHEYIFCFVLGFFYPVMLILFLVIGGMLNFMMHDQRTGPAWNVLM 420 430 440 450 460 470 510 520 530 540 550 pF1KB5 WTSLFLGNGVLLCFYSQEWYARQHCPLKNPTFLDYVRPRSWTCRYVF :: ::::.:. . .: ::::::.:::: . :: : ::::.: CCDS88 WTMLFLGQGIQVSLYCQEWYARRHCPLPQATFWGLVTPRSWSCHT 480 490 500 510 520 >>CCDS6420.1 DGAT1 gene_id:8694|Hs108|chr8 (488 aa) initn: 358 init1: 231 opt: 405 Z-score: 493.9 bits: 101.0 E(32554): 3.4e-21 Smith-Waterman score: 446; 26.6% identity (56.0% similar) in 448 aa overlap (110-523:53-477) 80 90 100 110 120 130 pF1KB5 LIEKSASLDNGGCALTTFSVLEGEKNNHRAKDLRAPPEQGKIFIARRSLLDELLEVD--- :: : .:. . . : : :. : CCDS64 GPAAAEEEVRDAAAGPDVGAAGDAPAPAPNKDGDAGVGSGHWELRCHRLQDSLFSSDSGF 30 40 50 60 70 80 140 150 160 170 180 190 pF1KB5 -HIRTIYHMFIALLILFILSTLVVDYIDEGRLVLEFSLLSYAFGKFPTVVWTW----WIM . : : . ...::: .. . : : :: ....: : : : ..: .. CCDS64 SNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSL-FLKDP---YSWPAPCLVI 90 100 110 120 130 200 210 220 230 240 pF1KB5 FLSTFSVPYF-LFQHWATGYSKSSHPLIRSLFHGFLFMIFQIG-VLGFGPTYVVLAYTLP ..:.: : . .. :.: . :.:. . ... .: : . :.:. .. CCDS64 AANVFAVAAFQVEKRLAVGALTEQA--------GLLLHVANLATILCFPAAVVLLVESIT 140 150 160 170 180 190 250 260 270 280 290 pF1KB5 PASRFI------IIFEQIRFVMKAHSFVRENVPRVLNSAKEKSS-----TVPIP---TVN :.. .. :.: .. ..:. :. .. ...:. :: :: : : CCDS64 PVGSLLALMAHTILFLKLFSYRDVNSWCRRARAKAASAGKKASSAAAPHTVSYPDNLTYR 200 210 220 230 240 250 300 310 320 330 340 350 pF1KB5 QYLYFLFAPTLIYRDSYPRNPTVRWGYVAMKFAQVFGCFFYVYYI--FERLCAPLFRNIK . :::::::: :. ..::.: .: .. .. ... :: . ... .: ..: . CCDS64 DLYYFLFAPTLCYELNFPRSPRIRKRFLLRRILEML--FFTQLQVGLIQQWMVPTIQN-S 260 270 280 290 300 360 370 380 390 400 pF1KB5 QEPFS----ARVLVLCVFNSILPGVLILFLTFFAFLHCWLNAFAEMLRFGDRMFYKDWWN ..::. .:. . ... .:. :: .. :. ..: ::: ::...:::: ::.:::: CCDS64 MKPFKDMDYSRI-IERLLKLAVPNHLIWLIFFYWLFHSCLNAVAELMQFGDREFYRDWWN 310 320 330 340 350 360 410 420 430 440 450 460 pF1KB5 STSYSNYYRTWNVVVHDWLYYYAYKDFLWFFSKRFKSAAMLAVFAVSAVVHEYALAVCLS : : . ....::. :: : . :: .: :... 
: .:: .:: ::: ..: : CCDS64 SESVTYFWQNWNIPVHKWCIRHFYKPMLRRGSSKW--MARTGVFLASAFFHEYLVSVPLR 370 380 390 400 410 420 470 480 490 500 510 520 pF1KB5 FFYPVLFVLFMFFGMA----FNFIVNDSRKKPIWNVLMWTSLFLGNGVLLCFYSQEWYAR .: :. : :: . ..:. . :. .: ::..:. . . .: ...: CCDS64 -----MFRLWAFTGMMAQIPLAWFVGRFFQGNYGNAAVWLSLIIGQPIAVLMYVHDYYVL 430 440 450 460 470 530 540 550 pF1KB5 QHCPLKNPTFLDYVRPRSWTCRYVF CCDS64 NYEAPAAEA 480 550 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 13:12:15 2016 done: Sat Nov 5 13:12:16 2016 Total Scan time: 3.170 Total Display time: 0.060 Function used was FASTA [36.3.4 Apr, 2011]