FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5995, 577 aa 1>>>pF1KB5995 577 - 577 aa - 577 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 11.4998+/-0.00129; mu= -5.5941+/- 0.078 mean_var=483.5272+/-99.616, 0's: 0 Z-trim(114.3): 70 B-trim: 0 in 0/51 Lambda= 0.058326 statistics sampled from 14846 (14900) to 14846 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.756), E-opt: 0.2 (0.458), width: 16 Scan time: 4.200 The best scores are: opt bits E(32554) CCDS14473.1 CSTF2 gene_id:1478|Hs108|chrX ( 577) 3935 345.7 9e-95 CCDS7245.1 CSTF2T gene_id:23283|Hs108|chr10 ( 616) 2535 228.0 2.7e-59 CCDS78498.1 CSTF2 gene_id:1478|Hs108|chrX ( 597) 2299 208.1 2.6e-53 >>CCDS14473.1 CSTF2 gene_id:1478|Hs108|chrX (577 aa) initn: 3935 init1: 3935 opt: 3935 Z-score: 1814.9 bits: 345.7 E(32554): 9e-95 Smith-Waterman score: 3935; 100.0% identity (100.0% similar) in 577 aa overlap (1-577:1-577) 10 20 30 40 50 60 pF1KB5 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 IVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSNVSMNQQNPQAPQAQSLGGMHVN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 IVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSNVSMNQQNPQAPQAQSLGGMHVN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 GAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSMERGQVPMQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSMERGQVPMQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 DPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTLLSVTGEVEPRGYLGPPHQGPPMHHVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 DPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTLLSVTGEVEPRGYLGPPHQGPPMHHVP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 GHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLDGRGGRDPRGIDARGMEARAM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLDGRGGRDPRGIDARGMEARAM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB5 EARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGMEARGMDTRGPVPGPRGPIPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 EARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGMEARGMDTRGPVPGPRGPIPS 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB5 GMQGPSPINMGAVVPQGSRQVPVMQGTGMQGASIQGGSQPGGFSPGQNQVTPQDHEKAAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 GMQGPSPINMGAVVPQGSRQVPVMQGTGMQGASIQGGSQPGGFSPGQNQVTPQDHEKAAL 490 500 510 520 530 540 550 560 570 pF1KB5 IMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP ::::::::::::::::::::::::::::::::::::: CCDS14 IMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP 550 560 570 >>CCDS7245.1 CSTF2T gene_id:23283|Hs108|chr10 (616 aa) initn: 1812 init1: 1221 opt: 2535 Z-score: 1177.9 bits: 228.0 E(32554): 2.7e-59 Smith-Waterman score: 2777; 69.4% identity (80.8% similar) in 625 aa overlap (1-570:1-609) 10 20 30 40 50 60 pF1KB5 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG :..:.:::::.::::::::::::::::::::::::::::: ::::::::::::::::::: CCDS72 MSSLAVRDPAMDRSLRSVFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS ::::::::::::::::::::::::::::::::::::::::::::: .::.:.::::. :. CCDS72 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGPAAPIIDSPYGDPID 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR ::::::::..:::::::::::::::::::::::: ::::::::::::::::::::::::: CCDS72 PEDAPESITRAVASLPPEQMFELMKQMKLCVQNSHQEARNMLLQNPQLAYALLQAQVVMR 130 140 150 160 170 180 190 200 210 220 230 pF1KB5 IVDPEIALKILHRQTNIPTLIAGNPQ------PVHGAGPG--SGSNVSMNQQNPQAPQAQ :.:::::::::::. .. :: :. : : : ::: : :: .::::: ::: : CCDS72 IMDPEIALKILHRKIHVTPLIPGKSQSVSVSGPGPGPGPGLCPGPNVLLNQQNPPAPQPQ 190 200 210 220 230 240 240 250 260 270 280 290 pF1KB5 SLGGMHVNGAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSM :. :. :::::. .:::.:::: .:::: : :::::.:::.:: :.:::: ::: . CCDS72 HLARRPVKDIPPLMQTPIQGGIPAPGPIPAAVPGAGPGSLTPGGAMQPQLGMPGVGPVPL 250 260 270 280 290 300 300 310 320 330 340 350 pF1KB5 ERGQVPMQDPRAAMQRGSL-PANVPTPRGLLGDAPNDPRGGTLLSVTGEVEPRGYLGPPH ::::: :.:::: . :: . :...: :::::::::::::::::::::::::::::::::: CCDS72 ERGQVQMSDPRAPIPRGPVTPGGLP-PRGLLGDAPNDPRGGTLLSVTGEVEPRGYLGPPH 310 320 330 340 350 360 370 380 390 400 410 pF1KB5 QGPPMHHVPGHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLDGRGGRDPRGID :::::::. ::..::: ::.::::: .:: :..::::::.:::: :.::::::: CCDS72 QGPPMHHASGHDTRGPSSHEMRGGPLGDPRLLIGEPRGPMIDQRGLPMDGRGGRD----- 360 370 380 390 400 410 420 430 440 450 460 470 pF1KB5 ARGMEARAMEARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGMEARGMDTRGPV .:.::.::::.. ..:.:.:: :.::. :::.:.::.:::.:::.. :::: CCDS72 SRAMETRAMETE----------VLETRVMERRGMETCAMETRGMEARGMDARGLEMRGPV 420 430 440 450 460 480 490 500 510 pF1KB5 PGPRGPIPSGMQGPSPINMGAV-VPQGSRQVPVM---------------QGTGMQGASIQ :. :::. .:.:::.:::.:: ::: :::: . ::::::::.:: CCDS72 PSSRGPMTGGIQGPGPINIGAGGPPQGPRQVPGISGVGNPGAGMQGTGIQGTGMQGAGIQ 470 480 490 500 510 520 520 530 540 pF1KB5 GG------------------------------SQPGGFSPGQNQVTPQDHEKAALIMQVL :: :::..:::::.::::::.:::::::::: CCDS72 GGGMQGAGIQGVSIQGGGIQGGGIQGASKQGGSQPSSFSPGQSQVTPQDQEKAALIMQVL 530 540 550 560 570 580 550 560 570 pF1KB5 QLTADQIAMLPPEQRQSILILKEQIQKSTGAP ::::::::::::::::::::::::: CCDS72 QLTADQIAMLPPEQRQSILILKEQIQKSTGAS 590 600 610 >>CCDS78498.1 CSTF2 gene_id:1478|Hs108|chrX (597 aa) initn: 2043 init1: 2043 opt: 2299 Z-score: 1070.7 bits: 208.1 E(32554): 2.6e-53 Smith-Waterman score: 3885; 96.6% identity (96.6% similar) in 597 aa overlap (1-577:1-597) 10 20 30 40 50 60 pF1KB5 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 MAGLTVRDPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 FCEYQDQETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVIESPYGETIS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 PEDAPESISKAVASLPPEQMFELMKQMKLCVQNSPQEARNMLLQNPQLAYALLQAQVVMR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 IVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSNVSMNQQNPQAPQAQSLGGMHVN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 IVDPEIALKILHRQTNIPTLIAGNPQPVHGAGPGSGSNVSMNQQNPQAPQAQSLGGMHVN 190 200 210 220 230 240 250 260 270 280 290 pF1KB5 GAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSMERGQ---- :::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 GAPPLMQASMQGGVPAPGQMPAAVTGPGPGSLAPGGGMQAQVGMPGSGPVSMERGQGTLQ 250 260 270 280 290 300 300 310 320 330 340 pF1KB5 ----------------VPMQDPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTLLSVTGE :::::::::::::::::::::::::::::::::::::::::::: CCDS78 HSPVGPAGPASIERVQVPMQDPRAAMQRGSLPANVPTPRGLLGDAPNDPRGGTLLSVTGE 310 320 330 340 350 360 350 360 370 380 390 400 pF1KB5 VEPRGYLGPPHQGPPMHHVPGHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 VEPRGYLGPPHQGPPMHHVPGHESRGPPPHELRGGPLPEPRPLMAEPRGPMLDQRGPPLD 370 380 390 400 410 420 410 420 430 440 450 460 pF1KB5 GRGGRDPRGIDARGMEARAMEARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 GRGGRDPRGIDARGMEARAMEARGLDARGLEARAMEARAMEARAMEARAMEARAMEVRGM 430 440 450 460 470 480 470 480 490 500 510 520 pF1KB5 EARGMDTRGPVPGPRGPIPSGMQGPSPINMGAVVPQGSRQVPVMQGTGMQGASIQGGSQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 EARGMDTRGPVPGPRGPIPSGMQGPSPINMGAVVPQGSRQVPVMQGTGMQGASIQGGSQP 490 500 510 520 530 540 530 540 550 560 570 pF1KB5 GGFSPGQNQVTPQDHEKAALIMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP ::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS78 GGFSPGQNQVTPQDHEKAALIMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP 550 560 570 580 590 577 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 10:52:52 2016 done: Sat Nov 5 10:52:52 2016 Total Scan time: 4.200 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]