FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5440, 327 aa
  1>>>pF1KB5440 327 - 327 aa - 327 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.6729+/-0.000636; mu= 15.8750+/- 0.039
 mean_var=87.6984+/-17.248, 0's: 0 Z-trim(113.2): 11 B-trim: 0 in 0/53
 Lambda= 0.136955
 statistics sampled from 13848 (13859) to 13848 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.774), E-opt: 0.2 (0.426), width: 16
 Scan time: 2.710

The best scores are:                                 opt  bits E(32554)
CCDS4429.1 B4GALT7 gene_id:11285|Hs108|chr5  ( 327) 2315 466.6 1.2e-131
CCDS82245.1 B4GALT6 gene_id:9331|Hs108|chr18 ( 343)  433  94.7 1.1e-19
CCDS11900.1 B4GALT6 gene_id:9331|Hs108|chr18 ( 382)  433  94.7 1.2e-19
CCDS6535.1 B4GALT1 gene_id:2683|Hs108|chr9   ( 398)  430  94.2 1.9e-19
CCDS13420.1 B4GALT5 gene_id:9334|Hs108|chr20 ( 388)  426  93.4 3.2e-19
CCDS1222.1 B4GALT3 gene_id:8703|Hs108|chr1   ( 393)  405  89.2 5.8e-18
CCDS2986.1 B4GALT4 gene_id:8702|Hs108|chr3   ( 344)  396  87.4 1.8e-17
CCDS506.1 B4GALT2 gene_id:8704|Hs108|chr1    ( 372)  387  85.7 6.5e-17
CCDS55596.1 B4GALT2 gene_id:8704|Hs108|chr1  ( 401)  387  85.7 6.9e-17

>>CCDS4429.1 B4GALT7 gene_id:11285|Hs108|chr5 (327 aa) initn: 2315 init1: 2315 opt: 2315 Z-score: 2476.7 bits: 466.6 E(32554): 1.2e-131 Smith-Waterman score: 2315; 100.0% identity (100.0% similar) in 327 aa overlap (1-327:1-327) 10 20 30 40 50 60 pF1KB5 MFPSRRKAAQLPWEDGRSGLLSGGLPRKCSVFHLFVACLSLGFFSLLWLQLSCSGDVARA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MFPSRRKAAQLPWEDGRSGLLSGGLPRKCSVFHLFVACLSLGFFSLLWLQLSCSGDVARA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 VRGQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 VRGQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLS 70 80 90 100 110 
120 130 140 150 160 170 180 pF1KB5 RKKIRHHIYVLNQVDHFRFNRAALINVGFLESSNSTDYIAMHDVDLLPLNEELDYGFPEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 RKKIRHHIYVLNQVDHFRFNRAALINVGFLESSNSTDYIAMHDVDLLPLNEELDYGFPEA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 GPFHVASPELHPLYHYKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGAGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 GPFHVASPELHPLYHYKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGAGL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 QLFRPSGITTGYKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREGGLNTVKYHVASRTAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 QLFRPSGITTGYKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREGGLNTVKYHVASRTAL 250 260 270 280 290 300 310 320 pF1KB5 SVGGAPCTVLNIMLDCDKTATPWCTFS ::::::::::::::::::::::::::: CCDS44 SVGGAPCTVLNIMLDCDKTATPWCTFS 310 320 >>CCDS82245.1 B4GALT6 gene_id:9331|Hs108|chr18 (343 aa) initn: 404 init1: 202 opt: 433 Z-score: 466.7 bits: 94.7 E(32554): 1.1e-19 Smith-Waterman score: 433; 35.0% identity (61.3% similar) in 217 aa overlap (80-292:104-316) 50 60 70 80 90 100 pF1KB5 QLSCSGDVARAVRGQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELL : ::. ..:::.:::.: :.: CCDS82 PYMRGFLNVNVSEVSFDEIHQLFSKDLDIEPGGHWRPKDCKPRWKVAVLIPFRNRHEHLP 80 90 100 110 120 130 110 120 130 140 150 160 pF1KB5 VFVPHMRRFLSRKKIRHHIYVLNQVDHFRFNRAALINVGFLESSNST--DYIAMHDVDLL .: :. .:...... .::..:. :::: :.:::: :. ... : . .:::: : CCDS82 IFFLHLIPMLQKQRLEFAFYVIEQTGTQPFNRAMLFNVGFKEAMKDSVWDCVIFHDVDHL 140 150 160 170 180 190 170 180 190 200 210 220 pF1KB5 PLNEELDYGFPEAGPFHVASPELHPLY--HYKTYVGGILLLSKQHYRLCNGMSNRFWGWG : :.. :: : : : :. . .: :: . ::. :. ...: ::. : ::::: CCDS82 PENDRNYYGCGEM-PRHFAAKLDKYMYILPYKEFFGGVSGLTVEQFRKINGFPNAFWGWG 200 210 220 230 240 250 230 240 250 260 270 280 pF1KB5 REDDEFYRRIKGAGLQLFRPSGITTGYKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREG :::... :.. :: .. :: : ::.. : : . . .. ..:. 
.: CCDS82 GEDDDLWNRVHYAGYNVTRPEGDLGKYKSIPHHHRGEVQFLGRYKLLRYSKERQYID--- 260 270 280 290 300 290 300 310 320 pF1KB5 GLNTVKYHVASRTALSVGGAPCTVLNIMLDCDKTATPWCTFS :::.. : CCDS82 GLNNLIYRPKILVDRLYTNISVNLMPELAPIEDY 310 320 330 340 >>CCDS11900.1 B4GALT6 gene_id:9331|Hs108|chr18 (382 aa) initn: 404 init1: 202 opt: 433 Z-score: 466.1 bits: 94.7 E(32554): 1.2e-19 Smith-Waterman score: 433; 35.0% identity (61.3% similar) in 217 aa overlap (80-292:143-355) 50 60 70 80 90 100 pF1KB5 QLSCSGDVARAVRGQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELL : ::. ..:::.:::.: :.: CCDS11 PYMRGFLNVNVSEVSFDEIHQLFSKDLDIEPGGHWRPKDCKPRWKVAVLIPFRNRHEHLP 120 130 140 150 160 170 110 120 130 140 150 160 pF1KB5 VFVPHMRRFLSRKKIRHHIYVLNQVDHFRFNRAALINVGFLESSNST--DYIAMHDVDLL .: :. .:...... .::..:. :::: :.:::: :. ... : . .:::: : CCDS11 IFFLHLIPMLQKQRLEFAFYVIEQTGTQPFNRAMLFNVGFKEAMKDSVWDCVIFHDVDHL 180 190 200 210 220 230 170 180 190 200 210 220 pF1KB5 PLNEELDYGFPEAGPFHVASPELHPLY--HYKTYVGGILLLSKQHYRLCNGMSNRFWGWG : :.. :: : : : :. . .: :: . ::. :. ...: ::. : ::::: CCDS11 PENDRNYYGCGEM-PRHFAAKLDKYMYILPYKEFFGGVSGLTVEQFRKINGFPNAFWGWG 240 250 260 270 280 290 230 240 250 260 270 280 pF1KB5 REDDEFYRRIKGAGLQLFRPSGITTGYKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREG :::... :.. :: .. :: : ::.. : : . . .. ..:. .: CCDS11 GEDDDLWNRVHYAGYNVTRPEGDLGKYKSIPHHHRGEVQFLGRYKLLRYSKERQYID--- 300 310 320 330 340 290 300 310 320 pF1KB5 GLNTVKYHVASRTALSVGGAPCTVLNIMLDCDKTATPWCTFS :::.. : CCDS11 GLNNLIYRPKILVDRLYTNISVNLMPELAPIEDY 350 360 370 380 >>CCDS6535.1 B4GALT1 gene_id:2683|Hs108|chr9 (398 aa) initn: 398 init1: 193 opt: 430 Z-score: 462.6 bits: 94.2 E(32554): 1.9e-19 Smith-Waterman score: 430; 35.1% identity (64.9% similar) in 208 aa overlap (92-294:175-378) 70 80 90 100 110 120 pF1KB5 RGQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLSR ::..:...:::.: :.: .. ... 
:.: CCDS65 FNMPVDLELVAKQNPNVKMGGRYAPRDCVSPHKVAIIIPFRNRQEHLKYWLYYLHPVLQR 150 160 170 180 190 200 130 140 150 160 170 pF1KB5 KKIRHHIYVLNQVDHFRFNRAALINVGFLESSNSTDY--IAMHDVDLLPLNEELDYG-FP ... . :::.::. :::: :.:::: :. .. :: ... ::::.:.:.. : : CCDS65 QQLDYGIYVINQAGDTIFNRAKLLNVGFQEALKDYDYTCFVFSDVDLIPMNDHNAYRCFS 210 220 230 240 250 260 180 190 200 210 220 230 pF1KB5 EAGPFHVASPELHPLYHYKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGA . . :: .. : : ::. ::::.. ::. : .:::: :::... :. CCDS65 QPRHISVAMDKFGFSLPYVQYFGGVSALSKQQFLTINGFPNNYWGWGGEDDDIFNRLVFR 270 280 290 300 310 320 240 250 260 270 280 290 pF1KB5 GLQLFRPSGITTGYKTFRHLHDPAWRKRDQK--RIAAQKQEQFKVDREGGLNTVKYHVAS :... ::.... . .:: .: . :. ::: :. ... :::.. :.: CCDS65 GMSISRPNAVVGRCRMIRHSRDKKNEPNPQRFDRIAHTKETMLS----DGLNSLTYQVLD 330 340 350 360 370 380 300 310 320 pF1KB5 RTALSVGGAPCTVLNIMLDCDKTATPWCTFS CCDS65 VQRYPLYTQITVDIGTPS 390 >>CCDS13420.1 B4GALT5 gene_id:9334|Hs108|chr20 (388 aa) initn: 402 init1: 207 opt: 426 Z-score: 458.5 bits: 93.4 E(32554): 3.2e-19 Smith-Waterman score: 426; 34.1% identity (63.1% similar) in 214 aa overlap (83-292:152-361) 60 70 80 90 100 110 pF1KB5 CSGDVARAVRGQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFV ::. . ..:.:.:::.: :.: :. CCDS13 KGPIDINMSEIGMDYIHELFSKDPTIKLGGHWKPSDCMPRWKVAILIPFRNRHEHLPVLF 130 140 150 160 170 180 120 130 140 150 160 170 pF1KB5 PHMRRFLSRKKIRHHIYVLNQVDHFRFNRAALINVGFLESSNSTDY--IAMHDVDLLPLN :. .:.:.... .::..:: :::: :.:::: :. .. :. . .:::: .: . CCDS13 RHLLPMLQRQRLQFAFYVVEQVGTQPFNRAMLFNVGFQEAMKDLDWDCLIFHDVDHIPES 190 200 210 220 230 240 180 190 200 210 220 pF1KB5 EELDYGFPEAGPFHVASPELHPLY--HYKTYVGGILLLSKQHYRLCNGMSNRFWGWGRED .. :: . : : :. . .: : . ::. :. ...: ::. : ::::: :: CCDS13 DRNYYGCGQM-PRHFATKLDKYMYLLPYTEFFGGVSGLTVEQFRKINGFPNAFWGWGGED 250 260 270 280 290 300 230 240 250 260 270 280 pF1KB5 DEFYRRIKGAGLQLFRPSGITTGYKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREGGLN :... :...:: .. :: : : ::.. : : . . . ...:. 
.: ::: CCDS13 DDLWNRVQNAGYSVSRPEGDTGKYKSIPHHHRGEVQFLGRYALLRKSKERQGLD---GLN 310 320 330 340 350 290 300 310 320 pF1KB5 TVKYHVASRTALSVGGAPCTVLNIMLDCDKTATPWCTFS ...: CCDS13 NLNYFANITYDALYKNITVNLTPELAQVNEY 360 370 380 >>CCDS1222.1 B4GALT3 gene_id:8703|Hs108|chr1 (393 aa) initn: 405 init1: 183 opt: 405 Z-score: 436.0 bits: 89.2 E(32554): 5.8e-18 Smith-Waterman score: 405; 34.0% identity (62.7% similar) in 209 aa overlap (94-297:124-329) 70 80 90 100 110 120 pF1KB5 QGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLSRKK : :..:: : : ..: ... :.. ::.:.. CCDS12 PVPSLAEIVERNPRVEPGGRYRPAGCEPRSRTAIIVPHRAREHHLRLLLYHLHPFLQRQQ 100 110 120 130 140 150 130 140 150 160 170 180 pF1KB5 IRHHIYVLNQVDHFRFNRAALINVGFLES--SNSTDYIAMHDVDLLPLNEELDYGFPEAG . . :::..:. . :::: :.::: :. .. : . .::::::: :.. : : CCDS12 LAYGIYVIHQAGNGTFNRAKLLNVGVREALRDEEWDCLFLHDVDLLPENDHNLYVCDPRG 160 170 180 190 200 210 190 200 210 220 230 pF1KB5 PFHVASPELHPLYH--YKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGAG : ::: . : : : ::. :. ..: ::. :..:::: :::.. :.. :: CCDS12 PRHVAVAMNKFGYSLPYPQYFGGVSALTPDQYLKMNGFPNEYWGWGGEDDDIATRVRLAG 220 230 240 250 260 270 240 250 260 270 280 290 pF1KB5 LQLFRPSGITTGYKTFRHLHDPAWRKRDQK-RIAAQKQEQFKVDREGGLNTVKYHVASRT ... :: . :: .: : . .. .. . .. :... : :.:.. :.. .: CCDS12 MKISRPPTSVGHYKMVKHRGDKGNEENPHRFDLLVRTQNSWTQD---GMNSLTYQLLARE 280 290 300 310 320 330 300 310 320 pF1KB5 ALSVGGAPCTVLNIMLDCDKTATPWCTFS CCDS12 LGPLYTNITADIGTDPRGPRAPSGPRYPPGSSQAFRQEMLQRRPPARPGPLSTANHTALR 340 350 360 370 380 390 >>CCDS2986.1 B4GALT4 gene_id:8702|Hs108|chr3 (344 aa) initn: 374 init1: 187 opt: 396 Z-score: 427.2 bits: 87.4 E(32554): 1.8e-17 Smith-Waterman score: 396; 30.2% identity (61.2% similar) in 258 aa overlap (49-296:72-326) 20 30 40 50 60 70 pF1KB5 GLLSGGLPRKCSVFHLFVACLSLGFFSLLWLQLSCSGDVARAVRGQGQETSGPP------ ..:. .:. .:::.. : CCDS29 PKAKEFMANFHKTLILGKGKTLTNEASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEV 50 60 70 80 90 100 80 90 100 110 120 130 pF1KB5 RACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLSRKKIRHHIYVLN .: :. 
... . . .:.:.::: :.: ..:. .. :.. ::.:... . :::.. CCDS29 QAENPKVSRGRYRPQECKALQRVAILVPHRNREKHLMYLLEHLHPFLQRQQLDYGIYVIH 110 120 130 140 150 160 140 150 160 170 180 190 pF1KB5 QVDHFRFNRAALINVGFLES--SNSTDYIAMHDVDLLPLNEELDYGFPEAGPFHVASPEL :.. .:::: :.:::.::. .. : . .:::::.: :. : : : :.. . CCDS29 QAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDVDLVPENDFNLYKCEEH-PKHLVVGRN 170 180 190 200 210 220 200 210 220 230 240 pF1KB5 HPLYH--YKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGAGLQLFRPSGI :. :. : ::. ::.... ::.:: .:::: :::.. :.. ... :: CCDS29 STGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGEDDDLRLRVELQRMKISRPLPE 230 240 250 260 270 280 250 260 270 280 290 300 pF1KB5 TTGYKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREGGLNTVKYHVASRTALSVGGAPCT . : : .: . . . .:. .: . .: : ::.. .:...: CCDS29 VGKYTMVFHTRDKG-NEVNAERMKLLHQVS-RVWRTDGLSSCSYKLVSVEHNPLYINITV 290 300 310 320 330 310 320 pF1KB5 VLNIMLDCDKTATPWCTFS CCDS29 DFWFGA 340 >>CCDS506.1 B4GALT2 gene_id:8704|Hs108|chr1 (372 aa) initn: 364 init1: 183 opt: 387 Z-score: 417.1 bits: 85.7 E(32554): 6.5e-17 Smith-Waterman score: 389; 29.4% identity (58.8% similar) in 289 aa overlap (42-323:103-368) 20 30 40 50 60 70 pF1KB5 PWEDGRSGLLSGGLPRKCSVFHLFVACLSLGFFSLLWLQLSCSGDVARAVR-GQGQETSG :. . : .... . :. : . : .: CCDS50 TASSSGLPEVPSALPGPTAPTLPPCPDSPPGLVGRLLIEFTSPMPLERVQRENPGVLMGG 80 90 100 110 120 130 80 90 100 110 120 130 pF1KB5 PPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLSRKKIRHHIYV : ::. : . .::..:::.: ..: .. ... .: :...:. .:: CCDS50 --RYTPPDCTPAQ----------TVAVIIPFRHREHHLRYWLHYLHPILRRQRLRYGVYV 140 150 160 170 180 140 150 160 170 180 pF1KB5 LNQVDHFRFNRAALINVGFLES---SNSTDYIAMHDVDLLPLNEELDYGFPEAGPFH--V .:: . :::: :.::::::. . . : . . ::::.:.... : . : : . CCDS50 INQHGEDTFNRAKLLNVGFLEALKEDAAYDCFIFSDVDLVPMDDRNLYRCGDQ-PRHFAI 190 200 210 220 230 190 200 210 220 230 240 pF1KB5 ASPELHPLYHYKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGAGLQLFRP : .. : : ::. ::: .. ::. :..:::: :::... ::. .:... 
:: CCDS50 AMDKFGFRLPYAGYFGGVSGLSKAQFLRINGFPNEYWGWGGEDDDIFNRISLTGMKISRP 240 250 260 270 280 290 250 260 270 280 290 300 pF1KB5 SGITTG-YKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREGGLNTVKYHVASRTALSVGG . : : :. ..: .: . . .:.. .. .. . :.: ...:.:.: : :. CCDS50 D-IRIGRYRMIKHDRDKH-NEPNPQRFTKIQNTKLTMKRDG-IGSVRYQV-----LEVSR 300 310 320 330 340 350 310 320 pF1KB5 APCTVLNIMLDCDKTATPWCTFS : :: .: . . : CCDS50 QP-LFTNITVDIGRPPS-WPPRG 360 370 >>CCDS55596.1 B4GALT2 gene_id:8704|Hs108|chr1 (401 aa) initn: 364 init1: 183 opt: 387 Z-score: 416.6 bits: 85.7 E(32554): 6.9e-17 Smith-Waterman score: 389; 29.4% identity (58.8% similar) in 289 aa overlap (42-323:132-397) 20 30 40 50 60 70 pF1KB5 PWEDGRSGLLSGGLPRKCSVFHLFVACLSLGFFSLLWLQLSCSGDVARAVR-GQGQETSG :. . : .... . :. : . : .: CCDS55 TASSSGLPEVPSALPGPTAPTLPPCPDSPPGLVGRLLIEFTSPMPLERVQRENPGVLMGG 110 120 130 140 150 160 80 90 100 110 120 130 pF1KB5 PPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLSRKKIRHHIYV : ::. : . .::..:::.: ..: .. ... .: :...:. .:: CCDS55 --RYTPPDCTPAQ----------TVAVIIPFRHREHHLRYWLHYLHPILRRQRLRYGVYV 170 180 190 200 140 150 160 170 180 pF1KB5 LNQVDHFRFNRAALINVGFLES---SNSTDYIAMHDVDLLPLNEELDYGFPEAGPFH--V .:: . :::: :.::::::. . . : . . ::::.:.... : . : : . CCDS55 INQHGEDTFNRAKLLNVGFLEALKEDAAYDCFIFSDVDLVPMDDRNLYRCGDQ-PRHFAI 210 220 230 240 250 260 190 200 210 220 230 240 pF1KB5 ASPELHPLYHYKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGAGLQLFRP : .. : : ::. ::: .. ::. :..:::: :::... ::. .:... :: CCDS55 AMDKFGFRLPYAGYFGGVSGLSKAQFLRINGFPNEYWGWGGEDDDIFNRISLTGMKISRP 270 280 290 300 310 320 250 260 270 280 290 300 pF1KB5 SGITTG-YKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREGGLNTVKYHVASRTALSVGG . : : :. ..: .: . . .:.. .. .. . :.: ...:.:.: : :. CCDS55 D-IRIGRYRMIKHDRDKH-NEPNPQRFTKIQNTKLTMKRDG-IGSVRYQV-----LEVSR 330 340 350 360 370 380 310 320 pF1KB5 APCTVLNIMLDCDKTATPWCTFS : :: .: . . 
: CCDS55 QP-LFTNITVDIGRPPS-WPPRG 390 400

327 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 12:48:02 2016 done: Sat Nov 5 12:48:02 2016
 Total Scan time: 2.710 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]