FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8638, 374 aa
  1>>>pF1KB8638 374 - 374 aa - 374 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4159+/-0.000669; mu= 16.0665+/- 0.041
 mean_var=66.1039+/-13.789, 0's: 0 Z-trim(110.1): 143 B-trim: 418 in 1/50
 Lambda= 0.157747
 statistics sampled from 11234 (11396) to 11234 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.709), E-opt: 0.2 (0.35), width: 16
 Scan time: 2.720

The best scores are:                                 opt  bits E(32554)
CCDS11551.1 SPOP gene_id:8405|Hs108|chr17   ( 374) 2515 580.8 6.4e-166
CCDS33298.1 SPOPL gene_id:339745|Hs108|chr2 ( 392) 1932 448.2 5.8e-126
CCDS34609.1 KLHL7 gene_id:55975|Hs108|chr7  ( 586)  279  72.1 1.4e-12
CCDS3245.2 KLHL6 gene_id:89857|Hs108|chr3   ( 621)  272  70.5 4.6e-12
CCDS5378.2 KLHL7 gene_id:55975|Hs108|chr7   ( 538)  265  68.9 1.2e-11
CCDS47418.1 BTBD9 gene_id:114781|Hs108|chr6 ( 612)  258  67.3 4.1e-11
CCDS30575.1 KLHL21 gene_id:9903|Hs108|chr1  ( 597)  254  66.4 7.5e-11

>>CCDS11551.1 SPOP gene_id:8405|Hs108|chr17 (374 aa)
 initn: 2515 init1: 2515 opt: 2515 Z-score: 3092.3 bits: 580.8 E(32554): 6.4e-166
Smith-Waterman score: 2515; 100.0% identity (100.0% similar) in 374 aa overlap (1-374:1-374)

               10        20        30        40        50        60
pF1KB8 MSRVPSPPPPAEMSSGPVAESWCYTQIKVVKFSYMWTINNFSFCREEMGEVIKSSTFSSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MSRVPSPPPPAEMSSGPVAESWCYTQIKVVKFSYMWTINNFSFCREEMGEVIKSSTFSSG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 ANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFSILNAKGEETKAMESQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 ANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFSILNAKGEETKAMESQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 RAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGQNTMNMVK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 RAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGQNTMNMVK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 VPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPVFSAMFEHEMEESKKNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 VPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPVFSAMFEHEMEESKKNR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 VEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALERLKVMCEDALCSNLSVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 VEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALERLKVMCEDALCSNLSVE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 NAAEILILADLHSADQLKTQAVDFINYHASDVLETSGWKSMVVSHPHLVAEAYRSLASAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NAAEILILADLHSADQLKTQAVDFINYHASDVLETSGWKSMVVSHPHLVAEAYRSLASAQ
              310       320       330       340       350       360

              370
pF1KB8 CPFLGPPRKRLKQS
       ::::::::::::::
CCDS11 CPFLGPPRKRLKQS
              370

>>CCDS33298.1 SPOPL gene_id:339745|Hs108|chr2 (392 aa)
 initn: 2177 init1: 1932 opt: 1932 Z-score: 2374.9 bits: 448.2 E(32554): 5.8e-126
Smith-Waterman score: 2144; 80.6% identity (91.6% similar) in 392 aa overlap (1-374:1-392)

               10        20        30        40        50        60
pF1KB8 MSRVPSPPPPAEMSSGPVAESWCYTQIKVVKFSYMWTINNFSFCREEMGEVIKSSTFSSG
       ::: :.:: :..::.::.::::::::.::::::::::::::::::::::::.::::::::
CCDS33 MSREPTPPLPGDMSTGPIAESWCYTQVKVVKFSYMWTINNFSFCREEMGEVLKSSTFSSG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 ANDKLKWCLRVNPKGLDEESKDYLSLYLLLVSCPKSEVRAKFKFSILNAKGEETKAMESQ
        .::.::::::::::::.:::::::::::::::::::::::::::.:::: :::::::::
CCDS33 PSDKMKWCLRVNPKGLDDESKDYLSLYLLLVSCPKSEVRAKFKFSLLNAKREETKAMESQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 RAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGQNTMNMVK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::... 
: .: CCDS33 RAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDSVNISGHTNTNTLK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 VPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPVFSAMFEHEMEESKKNR :::::::..::.::::.::::: . : ::::.:::..::::::::.:::::::::::::: CCDS33 VPECRLAEDLGNLWENTRFTDCSFFVRGQEFKAHKSVLAARSPVFNAMFEHEMEESKKNR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 VEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALERLKVMCEDALCSNLSVE :::::..:::::::: :::::.:::::::::.::::::::::::::::::.::::::::: CCDS33 VEINDLDPEVFKEMMRFIYTGRAPNLDKMADNLLAAADKYALERLKVMCEEALCSNLSVE 250 260 270 280 290 300 310 320 330 340 pF1KB8 NAAEILILADLHSADQLKTQAVDFINY------------------HASDVLETSGWKSMV :.:. :.:::::::.:::.::.:::: .:.:..::::::::. CCDS33 NVADTLVLADLHSAEQLKAQAIDFINRCSVLRQLGCKDGKNWNSNQATDIMETSGWKSMI 310 320 330 340 350 360 350 360 370 pF1KB8 VSHPHLVAEAYRSLASAQCPFLGPPRKRLKQS :::::::::.:.::::::: .: :::::::: CCDS33 QSHPHLVAEAFRALASAQCPQFGIPRKRLKQS 370 380 390 >>CCDS34609.1 KLHL7 gene_id:55975|Hs108|chr7 (586 aa) initn: 297 init1: 273 opt: 279 Z-score: 339.1 bits: 72.1 E(32554): 1.4e-12 Smith-Waterman score: 279; 32.2% identity (64.9% similar) in 174 aa overlap (183-351:23-196) 160 170 180 190 200 pF1KB8 DKLTLFCEVSVVQDSVNISGQNTMNMVKVPECRLADELGGLWENSR----FTDCCLCVAG : .: . :. .: : . : : : CCDS34 MAASGVEKSSKKKTEKKLAAREEAKLLAGFMGVMNNMRKQKTLCDVILMVQE 10 20 30 40 50 210 220 230 240 250 260 pF1KB8 QEFQAHKAILAARSPVFSAMFEHEMEESKKNRVEINDVEPEVFKEMMCFIYTGKAPNLDK ... ::...::: : :. :: .: :::. .::..:.::....... : ::.. .. CCDS34 RKIPAHRVVLAAASHFFNLMFTTNMLESKSFEVELKDAEPDIIEQLVEFAYTARISVNSN 60 70 80 90 100 110 270 280 290 300 310 320 pF1KB8 MADDLLAAADKYALERLKVMCEDALCSNLSVENAAEILILADLHSADQLKTQAVDFINYH ...:: ::..: .: .: :: : : .... : : .::. . .::. : :::. : CCDS34 NVQSLLDAANQYQIEPVKKMCVDFLKEQVDASNCLGISVLAECLDCPELKATADDFIHQH 120 130 140 150 160 170 330 340 350 360 370 pF1KB8 ASDVLETSGWKSMVVSH-PHLVAEAYRSLASAQCPFLGPPRKRLKQS ..: .:. . .. :.. ::. . 
CCDS34 FTEVYKTDEFLQLDVKRVTHLLNQDTLTVRAEDQVYDAAVRWLKYDEPNRQPFMVDILAK 180 190 200 210 220 230 >>CCDS3245.2 KLHL6 gene_id:89857|Hs108|chr3 (621 aa) initn: 262 init1: 262 opt: 272 Z-score: 330.1 bits: 70.5 E(32554): 4.6e-12 Smith-Waterman score: 272; 33.5% identity (64.5% similar) in 155 aa overlap (195-349:68-221) 170 180 190 200 210 220 pF1KB8 QDSVNISGQNTMNMVKVPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPV ::. .:: ::: :::. :...::: : CCDS32 LVEILNGEKVKFDDAGLSLILQNGLETLRMENA-LTDVILCVDIQEFSCHRVVLAAASNY 40 50 60 70 80 90 230 240 250 260 270 280 pF1KB8 FSAMFEHEMEESKKNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALER : ::: ....:. ..:. :. :. :... .. . ::.:: . .. .: ::. . . : CCDS32 FRAMFCNDLKEKYEKRIIIKGVDAETMHTLLDYTYTSKALITKQNVQRVLEAANLFQFLR 100 110 120 130 140 150 290 300 310 320 330 340 pF1KB8 LKVMCEDALCSNLSVENAAEILILADLHSADQLKTQAVDFINYHASDVLETSGWKSMVVS . : . : :. :: . :: ::: :: :.:: :. ..: . ..:.. . .. :. CCDS32 MVDACASFLTEALNPENCVGILRLADTHSLDSLKKQVQSYIIQNFVQILNSEEFLDLPVD 160 170 180 190 200 210 350 360 370 pF1KB8 HPHLVAEAYRSLASAQCPFLGPPRKRLKQS : . CCDS32 TLHHILKSDDLYVTEEAQVFETVMSWVRHKPSERLCLLPYVLENVRLPLLDPWYFVETVE 220 230 240 250 260 270 >>CCDS5378.2 KLHL7 gene_id:55975|Hs108|chr7 (538 aa) initn: 286 init1: 262 opt: 265 Z-score: 322.5 bits: 68.9 E(32554): 1.2e-11 Smith-Waterman score: 265; 33.3% identity (68.7% similar) in 147 aa overlap (206-351:2-148) 180 190 200 210 220 230 pF1KB8 MNMVKVPECRLADELGGLWENSRFTDCCLCVAGQEFQAHKAILAARSPVFSAMFEHEMEE : ... ::...::: : :. :: .: : CCDS53 MVQERKIPAHRVVLAAASHFFNLMFTTNMLE 10 20 30 240 250 260 270 280 290 pF1KB8 SKKNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAADKYALERLKVMCEDALCS ::. .::..:.::....... : ::.. .. ...:: ::..: .: .: :: : : CCDS53 SKSFEVELKDAEPDIIEQLVEFAYTARISVNSNNVQSLLDAANQYQIEPVKKMCVDFLKE 40 50 60 70 80 90 300 310 320 330 340 350 pF1KB8 NLSVENAAEILILADLHSADQLKTQAVDFINYHASDVLETSGWKSMVVSH-PHLVAEAYR .... : : .::. . .::. : :::. : ..: .:. . .. :.. ::. . 
CCDS53 QVDASNCLGISVLAECLDCPELKATADDFIHQHFTEVYKTDEFLQLDVKRVTHLLNQDTL 100 110 120 130 140 150 360 370 pF1KB8 SLASAQCPFLGPPRKRLKQS CCDS53 TVRAEDQVYDAAVRWLKYDEPNRQPFMVDILAKVRFPLISKNFLSKTVQAEPLIQDNPEC 160 170 180 190 200 210 >>CCDS47418.1 BTBD9 gene_id:114781|Hs108|chr6 (612 aa) initn: 174 init1: 98 opt: 258 Z-score: 313.0 bits: 67.3 E(32554): 4.1e-11 Smith-Waterman score: 258; 29.4% identity (64.4% similar) in 160 aa overlap (186-341:22-181) 160 170 180 190 200 210 pF1KB8 TLFCEVSVVQDSVNISGQNTMNMVKVPECRLADELGGLWENSRFTDCCLCVAGQEFQAHK :....:.: . .. : . : ..: ::. CCDS47 MSNSHPLRPFTAVGEIDHVHILSEHIGALLIGEEYGDVTFVVEKKRFPAHR 10 20 30 40 50 220 230 240 250 260 270 pF1KB8 AILAARSPVFSAMFEHEMEESK-KNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMAD--- .::::: : :.. :.::. . .. ..:. :.: .. .::::.: :. . CCDS47 VILAARCQYFRALLYGGMRESQPEAEIPLQDTTAEAFTMLLKYIYTGRATLTDEKEEVLL 60 70 80 90 100 110 280 290 300 310 320 330 pF1KB8 DLLAAADKYALERLKVMCEDALCSNLSVENAAEILILADLHSADQLKTQAVDFINYHASD :.:. : ::.. .:. . ::. :...:. . .:.:.: .: . :.. .:.. CCDS47 DFLSLAHKYGFPELEDSTSEYLCTILNIQNVCMTFDVASLYSLPKLTCMCCMFMDRNAQE 120 130 140 150 160 170 340 350 360 370 pF1KB8 VLETSGWKSMVVSHPHLVAEAYRSLASAQCPFLGPPRKRLKQS :: . :. :. CCDS47 VLSSEGFLSLSKTALLNIVLRDSFAAPEKDIFLALLNWCKHNSKENHAEIMQAVRLPLMS 180 190 200 210 220 230 >>CCDS30575.1 KLHL21 gene_id:9903|Hs108|chr1 (597 aa) initn: 232 init1: 232 opt: 254 Z-score: 308.3 bits: 66.4 E(32554): 7.5e-11 Smith-Waterman score: 254; 33.3% identity (61.8% similar) in 144 aa overlap (190-332:25-168) 160 170 180 190 200 210 pF1KB8 EVSVVQDSVNISGQNTMNMVKVPECRLADELGGLWENSRFTDCCLCVAG-QEFQAHKAIL :. : . .: : : .:: ..: ::.:.: CCDS30 MERPAPLAVLPFSDPAHALSLLRGLSQLRAERKFLDVTLEAAGGRDFPAHRAVL 10 20 30 40 50 220 230 240 250 260 270 pF1KB8 AARSPVFSAMFEHEMEESKKNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMADDLLAAAD :: :: : ::: ...::. .::... : :.... .. : :::.. :. 
:: ::: CCDS30 AAASPYFRAMFAGQLRESRAERVRLHGVPPDMLQLLLDFSYTGRVAVSGDNAEPLLRAAD 60 70 80 90 100 110 280 290 300 310 320 330 pF1KB8 KYALERLKVMCEDALCSNLSVENAAEILILADLHSADQLKTQAVDFINYHASDVLETSGW . .: : : ..:.. : .. .:. : . : . : :: :.... CCDS30 LLQFPAVKEACGAFLQQQLDLANCLDMQDFAEAFSCSGLASAAQRFILRHVGELGAEQLE 120 130 140 150 160 170 340 350 360 370 pF1KB8 KSMVVSHPHLVAEAYRSLASAQCPFLGPPRKRLKQS CCDS30 RLPLARLLRYLRDDGLCVPKEEAAYQLALRWVRADPPRRAAHWPQLLEAVRLPFVRRFYL 180 190 200 210 220 230 374 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 14:05:33 2016 done: Fri Nov 4 14:05:34 2016 Total Scan time: 2.720 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]