FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8303, 600 aa
  1>>>pF1KB8303 600 - 600 aa - 600 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.7238+/-0.00106; mu= 16.7059+/- 0.064
 mean_var=68.0336+/-13.595, 0's: 0 Z-trim(102.9): 25 B-trim: 0 in 0/51
 Lambda= 0.155494
 statistics sampled from 7150 (7161) to 7150 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.588), E-opt: 0.2 (0.22), width: 16
 Scan time: 3.170

The best scores are:                                      opt  bits E(32554)
CCDS21.1 CPSF3L gene_id:54973|Hs108|chr1           ( 600) 4015 910.2 0
CCDS57960.1 CPSF3L gene_id:54973|Hs108|chr1        ( 606) 3955 896.7 0
CCDS57959.1 CPSF3L gene_id:54973|Hs108|chr1        ( 571) 3825 867.6 0
CCDS57961.1 CPSF3L gene_id:54973|Hs108|chr1        ( 499) 3032 689.7 2.4e-198
CCDS72678.1 CPSF3L gene_id:54973|Hs108|chr1        ( 502) 3020 687.0 1.6e-197
CCDS1664.1 CPSF3 gene_id:51692|Hs108|chr2          ( 684) 1147 266.8 6.3e-71
CCDS82417.1 CPSF3 gene_id:51692|Hs108|chr2         ( 647) 1039 242.6 1.2e-63
CCDS9902.1 CPSF2 gene_id:53981|Hs108|chr14         ( 782)  438 107.8 5.4e-23

>>CCDS21.1 CPSF3L gene_id:54973|Hs108|chr1 (600 aa)
 initn: 4015 init1: 4015 opt: 4015 Z-score: 4864.9 bits: 910.2 E(32554): 0
Smith-Waterman score: 4015; 99.8% identity (99.8% similar) in 600 aa overlap (1-600:1-600)

               10        20        30        40        50        60
pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS21 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANF
       :::::::::::::::::::::::::::::::::: :::::::::::::::::::::::::
CCDS21 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANF
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 
FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB8 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB8 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB8 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB8 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB8 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB8 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS21 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS 550 560 570 580 590 600 >>CCDS57960.1 CPSF3L gene_id:54973|Hs108|chr1 (606 aa) initn: 3955 
init1: 3955 opt: 3955 Z-score: 4792.0 bits: 896.7 E(32554): 0 Smith-Waterman score: 3955; 99.8% identity (99.8% similar) in 591 aa overlap (10-600:16-606) 10 20 30 40 50 pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQN ::::::::::::::::::::::::::::::::::::::::::::: CCDS57 MCGAGFGHFEWLAGGGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQN 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB8 GRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDK :::::::::::::::::::::::::::::::::::::::: ::::::::::::::::::: CCDS57 GRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDK 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB8 KGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 KGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVV 130 140 150 160 170 180 180 190 200 210 220 230 pF1KB8 YTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 YTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB8 VLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 VLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF 250 260 270 280 290 300 300 310 320 330 340 350 pF1KB8 VQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 VQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYC 310 320 330 340 350 360 360 370 380 390 400 410 pF1KB8 VQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 VQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVH 370 380 390 400 410 420 420 430 440 450 460 470 pF1KB8 GEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 
GEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAK 430 440 450 460 470 480 480 490 500 510 520 530 pF1KB8 KPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 KPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHL 490 500 510 520 530 540 540 550 560 570 580 590 pF1KB8 KSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 KSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKG 550 560 570 580 590 600 600 pF1KB8 LPQAPS :::::: CCDS57 LPQAPS >>CCDS57959.1 CPSF3L gene_id:54973|Hs108|chr1 (571 aa) initn: 3825 init1: 3825 opt: 3825 Z-score: 4634.9 bits: 867.6 E(32554): 0 Smith-Waterman score: 3825; 99.8% identity (99.8% similar) in 571 aa overlap (30-600:1-571) 10 20 30 40 50 60 pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF ::::::::::::::::::::::::::::::: CCDS57 MLDCGMHMGFNDDRRFPDFSYITQNGRLTDF 10 20 30 70 80 90 100 110 120 pF1KB8 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANF :::::::::::::::::::::::::::::::::: ::::::::::::::::::::::::: CCDS57 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANF 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB8 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 100 110 120 130 140 150 190 200 210 220 230 240 pF1KB8 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF 160 170 180 190 200 210 250 260 270 280 290 300 pF1KB8 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF 220 230 240 250 260 270 310 320 330 340 350 360 
pF1KB8 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG 280 290 300 310 320 330 370 380 390 400 410 420 pF1KB8 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM 340 350 360 370 380 390 430 440 450 460 470 480 pF1KB8 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH 400 410 420 430 440 450 490 500 510 520 530 540 pF1KB8 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD 460 470 480 490 500 510 550 560 570 580 590 600 pF1KB8 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS 520 530 540 550 560 570 >>CCDS57961.1 CPSF3L gene_id:54973|Hs108|chr1 (499 aa) initn: 3306 init1: 3026 opt: 3032 Z-score: 3674.4 bits: 689.7 E(32554): 2.4e-198 Smith-Waterman score: 3108; 83.2% identity (83.2% similar) in 600 aa overlap (1-600:1-499) 10 20 30 40 50 60 pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF :::::::::::::::::::::::::::::::::::::::::: CCDS57 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDD------------------ 10 20 30 40 70 80 90 100 110 120 pF1KB8 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANF CCDS57 ------------------------------------------------------------ 130 140 150 160 170 180 pF1KB8 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN ::::::::::::::::::::::::::::::::::::: CCDS57 -----------------------VDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 50 60 
70 190 200 210 220 230 240 pF1KB8 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF 80 90 100 110 120 130 250 260 270 280 290 300 pF1KB8 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 ALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF 140 150 160 170 180 190 310 320 330 340 350 360 pF1KB8 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 EFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVG 200 210 220 230 240 250 370 380 390 400 410 420 pF1KB8 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 HKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKM 260 270 280 290 300 310 430 440 450 460 470 480 pF1KB8 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 EFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRLLH 320 330 340 350 360 370 490 500 510 520 530 540 pF1KB8 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 GTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKD 380 390 400 410 420 430 550 560 570 580 590 600 pF1KB8 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS57 HCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS 440 450 460 470 480 490 >>CCDS72678.1 CPSF3L gene_id:54973|Hs108|chr1 (502 aa) initn: 3081 init1: 3020 opt: 3020 Z-score: 3659.8 bits: 687.0 E(32554): 1.6e-197 Smith-Waterman score: 3020; 99.8% identity (100.0% similar) in 457 aa overlap (144-600:46-502) 120 130 140 150 160 170 pF1KB8 
KKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESV :.:::::::::::::::::::::::::::: CCDS72 SWNLVLPPGRGSRAPSSTDTRTGFHVLPHGVNDELEIKAYYAGHVLGAAMFQIKVGSESV 20 30 40 50 60 70 180 190 200 210 220 230 pF1KB8 VYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 VYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGG 80 90 100 110 120 130 240 250 260 270 280 290 pF1KB8 KVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 KVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKT 140 150 160 170 180 190 300 310 320 330 340 350 pF1KB8 FVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 FVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGY 200 210 220 230 240 250 360 370 380 390 400 410 pF1KB8 CVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 CVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLV 260 270 280 290 300 310 420 430 440 450 460 470 pF1KB8 HGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 HGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEA 320 330 340 350 360 370 480 490 500 510 520 530 pF1KB8 KKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 KKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSH 380 390 400 410 420 430 540 550 560 570 580 590 pF1KB8 LKSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS72 LKSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKK 440 450 460 470 480 490 600 pF1KB8 GLPQAPS ::::::: CCDS72 GLPQAPS 500 >>CCDS1664.1 CPSF3 gene_id:51692|Hs108|chr2 (684 aa) initn: 
1017 init1: 378 opt: 1147 Z-score: 1386.9 bits: 266.8 E(32554): 6.3e-71 Smith-Waterman score: 1147; 38.2% identity (69.7% similar) in 502 aa overlap (3-494:11-496) 10 20 30 40 50 pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYIT .. . ::::::.::::::.. . :...:::::.: :.. .: .. : CCDS16 MSAIPAEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLID 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB8 QNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAV .: ..:::::::::::::.: . ... : .:::.:.:: :: :: :.. CCDS16 PAE-----IDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVS- 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 DKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSES . ... ..: ... : :. ....:.. .: ... :.::::::::::.:.... . CCDS16 NISADDMLYTETDLEESMDKIETINFHEVKEVAG-IKFWCYHAGHVLGAAMFMIEIAGVK 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB8 VVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERG ..::::.. :::: :: : . .:..:: ::::.: :..... :: : . ::. :.:: CCDS16 LLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRG 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB8 GKVLIPVFALGRAQELCILLETFWERM-NLK-VPIYFSTGLTEKANHYYKLFIPWTNQKI :. ::::::::::::: ..:. .:. .:. .:::....:..: :. .. :.:: CCDS16 GRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKI 240 250 260 270 280 290 300 310 320 330 340 pF1KB8 RKTFVQRNMFEFKHI---KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNM :: . : : :::: :..:. : :. :: ::.:.:::...: : ..:..: ...: CCDS16 RKQININNPFVFKHISNLKSMDH-F-DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNG 300 310 320 330 340 350 350 360 370 380 390 400 pF1KB8 VIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEP ::. ::::.::....:.: ... . : : .::.:.:.:::::.: . ... .: CCDS16 VIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKP 360 370 380 390 400 410 410 420 430 440 450 460 pF1KB8 ESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQ :.::::: ..: :: . .: . : :. .: ...:.. ...:. 
CCDS16 PHVILVHGEQNEMARLKAALIREYE------DNDEVHIEVHNPRNTEAVTLNFRGEKLAK 420 430 440 450 460 470 480 490 500 510 520 pF1KB8 --GLLPEAKKPRL---LHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRK :.: . :::. . : :. .. :....: CCDS16 VMGFLAD-KKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFN 470 480 490 500 510 520 530 540 550 560 570 580 pF1KB8 EQETALRVYSHLKSVLKDHCVQHLPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEE CCDS16 LLCYQLQKLTGDVEELEIQEKPALKVFKNITVIQEPGMVVLEWLANPSNDMYADTVTTVI 530 540 550 560 570 580 >>CCDS82417.1 CPSF3 gene_id:51692|Hs108|chr2 (647 aa) initn: 913 init1: 378 opt: 1039 Z-score: 1256.3 bits: 242.6 E(32554): 1.2e-63 Smith-Waterman score: 1039; 38.0% identity (68.7% similar) in 479 aa overlap (30-494:1-459) 10 20 30 40 50 60 pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDF :::::.: :.. .: .. : CCDS82 MLDCGIHPGLEGMDALPYIDLIDPAE----- 10 20 70 80 90 100 110 120 pF1KB8 LDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANF .: ..:::::::::::::.: . ... : .:::.:.:: :: :: :.. . ... . CCDS82 IDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVS-NISADDML 30 40 50 60 70 80 130 140 150 160 170 180 pF1KB8 FTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN .: ... : :. ....:.. .: ... :.::::::::::.:.... ...::::.. CCDS82 YTETDLEESMDKIETINFHEVKEVAG-IKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFS 90 100 110 120 130 140 190 200 210 220 230 240 pF1KB8 MTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF :::: :: : . .:..:: ::::.: :..... :: : . ::. :.:::. ::::: CCDS82 RQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVF 150 160 170 180 190 200 250 260 270 280 290 pF1KB8 ALGRAQELCILLETFWERM-NLK-VPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRN :::::::: ..:. .:. .:. .:::....:..: :. .. :.:::: . : CCDS82 ALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININN 210 220 230 240 250 260 300 310 320 330 340 350 pF1KB8 MFEFKHI---KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCV : :::: :..:. : :. :: ::.:.:::...: : ..:..: ...: ::. 
:::: CCDS82 PFVFKHISNLKSMDH-F-DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCV 270 280 290 300 310 320 360 370 380 390 400 410 pF1KB8 QGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHG .::....:.: ... . : : .::.:.:.:::::.: . ... .: :.:::: CCDS82 EGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHG 330 340 350 360 370 380 420 430 440 450 460 pF1KB8 EAKKMEFLKQKI------EQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGL : ..: :: . ..:.... . : : :.::: : .:. ..:. CCDS82 EQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFR-----GEKLA-----KVMGF 390 400 410 420 430 470 480 490 500 510 520 pF1KB8 LPEAKKPRL---LHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQET : . :::. . : :. .. :....: CCDS82 LAD-KKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLCY 440 450 460 470 480 490 >>CCDS9902.1 CPSF2 gene_id:53981|Hs108|chr14 (782 aa) initn: 219 init1: 145 opt: 438 Z-score: 526.3 bits: 107.8 E(32554): 5.4e-23 Smith-Waterman score: 451; 27.2% identity (56.9% similar) in 401 aa overlap (4-387:5-396) 10 20 30 40 50 pF1KB8 MPEIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTD :..: :.. :. . : :... .:::: :. : : . . . CCDS99 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMD-------IIDSLRKHVH 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 FLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEAN .: :..:: : ::::: .: . :: : . . ... : . . . . . CCDS99 QIDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 FFTSQMIKDCMKKVVAVHLHQTVQVDDE---LEIKAYYAGHVLGAAMFQI-KVGSESVVY .:: . . . :. ... : :.. . : : :::..:.....: : : : .:: CCDS99 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB8 TGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK . :.: . ::.. .. ::.::::.: :: .. .. :....: .: ::.. :. CCDS99 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGN 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB8 VLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANH----YYKLFIPWTNQKI ::: : . ::. :: ::. .:. . 
. .: : .: ..... . : . : ..:. CCDS99 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVY-SLALLNNVSYNVVEFSKSQVEWMSDKL 240 250 260 270 280 290 300 310 320 330 340 pF1KB8 RKTFVQR--NMFEFKHIKAFD--RAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKN . : .. : :.:.:.. .: :.: ::.:. :. : : ..: .: . :: CCDS99 MRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKN 300 310 320 330 340 350 350 360 370 380 390 400 pF1KB8 MVIMPGYCVQGTVGHKILSGQRK----LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV .:. . ::... .... . .:.. : :: : ::. CCDS99 SIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ 360 370 380 390 400 410 410 420 430 440 450 460 pF1KB8 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK CCDS99 SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERI 420 430 440 450 460 470 600 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 11:33:08 2016 done: Fri Nov 4 11:33:09 2016 Total Scan time: 3.170 Total Display time: 0.090 Function used was FASTA [36.3.4 Apr, 2011]