FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA1234, 719 aa
1>>>pF1KSDA1234 719 - 719 aa - 719 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.3980+/-0.000951; mu= 6.9132+/- 0.057
mean_var=214.6436+/-42.949, 0's: 0 Z-trim(113.4): 24 B-trim: 0 in 0/53
Lambda= 0.087542
statistics sampled from 14048 (14069) to 14048 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.76), E-opt: 0.2 (0.432), width: 16
Scan time: 4.380
The best scores are:                                      opt  bits E(32554)
CCDS43297.1 AHRR gene_id:57491|Hs108|chr5          ( 719) 5037 649.3 5.7e-186
CCDS56355.1 AHRR gene_id:57491|Hs108|chr5          ( 701) 3273 426.5 6.6e-119
CCDS5366.1 AHR gene_id:196|Hs108|chr7              ( 848)  840 119.3 2.4e-26
>>CCDS43297.1 AHRR gene_id:57491|Hs108|chr5 (719 aa)
initn: 5037 init1: 5037 opt: 5037 Z-score: 3452.2 bits: 649.3 E(32554): 5.7e-186
Smith-Waterman score: 5037; 99.9% identity (99.9% similar) in 719 aa overlap (1-719:1-719)
               10        20        30        40        50        60
pF1KSD MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAEKSNPSKRHRDRLNAELDHLASLLPFPPD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAEKSNPSKRHRDRLNAELDHLASLLPFPPD
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KSD IISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLAGSAVLEGRLLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 IISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLAGSAVLEGRLLL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KSD ESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQLHWAMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQLHWAMD
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KSD PPQVVFGQAPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLLDSTSGFL
       :::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PPQVVFGQPPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLLDSTSGFL
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KSD ARGSQAWQLRLCCPEPLMTMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAAPVLLPSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ARGSQAWQLRLCCPEPLMTMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAAPVLLPSA
              250       260       270       280       290       300
              310       320       330       340       350       360
pF1KSD AEMKMRSALLRAKPRADTAATADAKVKATTSLCESELHGKPNYSAGRSSRESGVLVLREQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 AEMKMRSALLRAKPRADTAATADAKVKATTSLCESELHGKPNYSAGRSSRESGVLVLREQ
              310       320       330       340       350       360
              370       380       390       400       410       420
pF1KSD TDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDREEEQHRMLSRASGVTGRRETPGPTK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 TDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDREEEQHRMLSRASGVTGRRETPGPTK
              370       380       390       400       410       420
              430       440       450       460       470       480
pF1KSD PLPWTAGKHSEDGARPRLQPSKNDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PLPWTAGKHSEDGARPRLQPSKNDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPS
              430       440       450       460       470       480
              490       500       510       520       530       540
pF1KSD AYSSRTSRPMRDVGEDQVHPPLCHFPQRSLQHQLPQPGAQRFATRGYPMEDMKLQGVPMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 AYSSRTSRPMRDVGEDQVHPPLCHFPQRSLQHQLPQPGAQRFATRGYPMEDMKLQGVPMP
              490       500       510       520       530       540
              550       560       570       580       590       600
pF1KSD PGDLCGPTLLLDVSIKMEKDSGCEGAADGCVPSQVWLGASDRSHPATFPTRMHLKTEPDS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PGDLCGPTLLLDVSIKMEKDSGCEGAADGCVPSQVWLGASDRSHPATFPTRMHLKTEPDS
              550       560       570       580       590       600
              610       620       630       640       650       660
pF1KSD RQQVYISHLGHGVRGAQPHGRATAGRSRELTPFHPAHCACLEPTDGLPQSEPPHQLCARG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RQQVYISHLGHGVRGAQPHGRATAGRSRELTPFHPAHCACLEPTDGLPQSEPPHQLCARG
              610       620       630       640       650       660
              670       680       690       700       710
pF1KSD RGEQSCTCRAAEAAPVVKREPLDSPQWATHSQGMVPGMLPKSALATLVPPQASGCTFLP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RGEQSCTCRAAEAAPVVKREPLDSPQWATHSQGMVPGMLPKSALATLVPPQASGCTFLP
              670       680       690       700       710
>>CCDS56355.1 AHRR gene_id:57491|Hs108|chr5 (701 aa)
initn: 3273 init1: 3273 opt: 3273 Z-score: 2248.3 bits: 426.5 E(32554): 6.6e-119
Smith-Waterman score: 4851; 97.4% identity (97.4% similar) in 719 aa overlap (1-719:1-701)
               10        20        30        40        50        60
pF1KSD MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAEKSNPSKRHRDRLNAELDHLASLLPFPPD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAEKSNPSKRHRDRLNAELDHLASLLPFPPD
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KSD IISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLAGSAVLEGRLLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 IISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLAGSAVLEGRLLL
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KSD ESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQLHWAMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQLHWAMD
              130       140       150       160       170       180
              190       200       210       220       230       240
pF1KSD PPQVVFGQAPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLLDSTSGFL
       :::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PPQVVFGQPPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLLDSTSGFL
              190       200       210       220       230       240
              250       260       270       280       290       300
pF1KSD ARGSQAWQLRLCCPEPLMTMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAAPVLLPSA
                         ::::::::::::::::::::::::::::::::::::::::::
CCDS56 ------------------TMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAAPVLLPSA
                                250       260       270       280
              310       320       330       340       350       360
pF1KSD AEMKMRSALLRAKPRADTAATADAKVKATTSLCESELHGKPNYSAGRSSRESGVLVLREQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AEMKMRSALLRAKPRADTAATADAKVKATTSLCESELHGKPNYSAGRSSRESGVLVLREQ
            290       300       310       320       330       340
              370       380       390       400       410       420
pF1KSD TDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDREEEQHRMLSRASGVTGRRETPGPTK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 TDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDREEEQHRMLSRASGVTGRRETPGPTK
            350       360       370       380       390       400
              430       440       450       460       470       480
pF1KSD PLPWTAGKHSEDGARPRLQPSKNDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PLPWTAGKHSEDGARPRLQPSKNDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPS
            410       420       430       440       450       460
              490       500       510       520       530       540
pF1KSD AYSSRTSRPMRDVGEDQVHPPLCHFPQRSLQHQLPQPGAQRFATRGYPMEDMKLQGVPMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AYSSRTSRPMRDVGEDQVHPPLCHFPQRSLQHQLPQPGAQRFATRGYPMEDMKLQGVPMP
            470       480       490       500       510       520
              550       560       570       580       590       600
pF1KSD PGDLCGPTLLLDVSIKMEKDSGCEGAADGCVPSQVWLGASDRSHPATFPTRMHLKTEPDS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PGDLCGPTLLLDVSIKMEKDSGCEGAADGCVPSQVWLGASDRSHPATFPTRMHLKTEPDS
            530       540       550       560       570       580
              610       620       630       640       650       660
pF1KSD RQQVYISHLGHGVRGAQPHGRATAGRSRELTPFHPAHCACLEPTDGLPQSEPPHQLCARG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 RQQVYISHLGHGVRGAQPHGRATAGRSRELTPFHPAHCACLEPTDGLPQSEPPHQLCARG
            590       600       610       620       630       640
              670       680       690       700       710
pF1KSD RGEQSCTCRAAEAAPVVKREPLDSPQWATHSQGMVPGMLPKSALATLVPPQASGCTFLP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 RGEQSCTCRAAEAAPVVKREPLDSPQWATHSQGMVPGMLPKSALATLVPPQASGCTFLP
            650       660       670       680       690       700
>>CCDS5366.1 AHR gene_id:196|Hs108|chr7 (848 aa)
initn: 1089 init1: 608 opt: 840 Z-score: 586.5 bits: 119.3 E(32554): 2.4e-26
Smith-Waterman score: 1026; 42.1% identity (62.9% similar) in 475 aa overlap (12-448:9-456)
10 20 30 40 50
pF1KSD MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAE--KSNPSKRHRDRLNAELDHLASLLPFP
:::.::::.:.:: . :: :::::::::::::.:::.::::::::
CCDS53 MNSSSANITYASRKRRKPVQKTVKPIPAEGIKSNPSKRHRDRLNTELDRLASLLPFP
10 20 30 40 50
60 70 80 90 100 110
pF1KSD PDIISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLA----GSAVL
:.:.::::::::::::::::.::::.:. ..: . .: :.: : : .
CCDS53 QDVINKLDKLSVLRLSVSYLRAKSFFDVALKSSPTERNGGQ----DNCRAANFREGLNLQ
60 70 80 90 100 110
120 130 140 150 160 170
pF1KSD EGRLLLESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQ
::..::..::::.:::.... .::::.:: :::::.:.::.::..:. ::..:: .: ::
CCDS53 EGEFLLQALNGFVLVVTTDALVFYASSTIQDYLGFQQSDVIHQSVYELIHTEDRAEFQRQ
120 130 140 150 160 170
180 190 200 210 220 230
pF1KSD LHWAMDPPQVVF-GQAPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLL
::::..: : . ::. :: : . . . : : : .. ::::::.::::
CCDS53 LHWALNPSQCTESGQGIEEATG----LPQTVVCYN-PDQIPPENSPLMERCFICRLRCLL
180 190 200 210 220
240 250 260 270 280 290
pF1KSD DSTSGFLARGSQAWQLRLCCPEPLMTMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAA
:..::::: :.::::::.: :::::. .:..:::.:.:: ::.
CCDS53 DNSSGFLA------------------MNFQGKLKYLHGQKKKGKDGSILPPQLALFAIAT
230 240 250 260 270
300 310 320 330 340
pF1KSD PVLLPSAAEMKMRSALLRAKPRAD-TAATADAKVKATTSLCESELHGKPN----------
:. :: :.. .. ..:.: . : : ::: . . . :.:: . .
CCDS53 PLQPPSILEIRTKNFIFRTKHKLDFTPIGCDAKGRIVLGYTEAELCTRGSGYQFIHAADM
280 290 300 310 320 330
350 360 370 380 390
pF1KSD -YSAGRSSR-----ESGVLVLREQTDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDRE
: : : :::..:.: : .::. : . : : : :: .. . :.:
CCDS53 LYCAESHIRMIKTGESGMIVFRLLTKNNRWTWVQSNARLLYKNGRPDYIIVTQRPLTDEE
340 350 360 370 380 390
400 410 420 430 440
pF1KSD EEQHR---------MLSRASGVTGRRETPGPT--KPLPWTA--GKHSEDGARPR-LQPSK
.: :.. . .: . .: :. ::: . : ..:.: :. ..
CCDS53 GTEHLRKRNTKLPFMFTTGEAVLYEATNPFPAIMDPLPLRTKNGTSGKDSATTSTLSKDS
400 410 420 430 440 450
450 460 470 480 490 500
pF1KSD NDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPSAYSSRTSRPMRDVGEDQVHPPL
.: ::
CCDS53 LNPSSLLAAMMQQDESIYLYPASSTSSTAPFENNFFNESMNECRNWQDNTAPMGNDTILK
460 470 480 490 500 510
719 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 05:29:59 2016 done: Thu Nov 3 05:30:00 2016
Total Scan time: 4.380 Total Display time: 0.030
Function used was FASTA [36.3.4 Apr, 2011]