FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA1234, 719 aa
  1>>>pF1KSDA1234 719 - 719 aa - 719 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.3980+/-0.000951; mu= 6.9132+/- 0.057
 mean_var=214.6436+/-42.949, 0's: 0 Z-trim(113.4): 24 B-trim: 0 in 0/53
 Lambda= 0.087542
 statistics sampled from 14048 (14069) to 14048 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.76), E-opt: 0.2 (0.432), width: 16
 Scan time: 4.380

The best scores are:                                      opt bits E(32554)
CCDS43297.1 AHRR gene_id:57491|Hs108|chr5          ( 719) 5037 649.3 5.7e-186
CCDS56355.1 AHRR gene_id:57491|Hs108|chr5          ( 701) 3273 426.5 6.6e-119
CCDS5366.1 AHR gene_id:196|Hs108|chr7              ( 848)  840 119.3 2.4e-26

>>CCDS43297.1 AHRR gene_id:57491|Hs108|chr5 (719 aa)
 initn: 5037 init1: 5037 opt: 5037 Z-score: 3452.2 bits: 649.3 E(32554): 5.7e-186
Smith-Waterman score: 5037; 99.9% identity (99.9% similar) in 719 aa overlap (1-719:1-719)

               10        20        30        40        50        60
pF1KSD MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAEKSNPSKRHRDRLNAELDHLASLLPFPPD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAEKSNPSKRHRDRLNAELDHLASLLPFPPD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KSD IISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLAGSAVLEGRLLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 IISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLAGSAVLEGRLLL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KSD ESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQLHWAMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQLHWAMD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KSD PPQVVFGQAPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLLDSTSGFL
       :::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PPQVVFGQPPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLLDSTSGFL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KSD ARGSQAWQLRLCCPEPLMTMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAAPVLLPSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 ARGSQAWQLRLCCPEPLMTMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAAPVLLPSA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KSD AEMKMRSALLRAKPRADTAATADAKVKATTSLCESELHGKPNYSAGRSSRESGVLVLREQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 AEMKMRSALLRAKPRADTAATADAKVKATTSLCESELHGKPNYSAGRSSRESGVLVLREQ
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KSD TDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDREEEQHRMLSRASGVTGRRETPGPTK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 TDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDREEEQHRMLSRASGVTGRRETPGPTK
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KSD PLPWTAGKHSEDGARPRLQPSKNDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PLPWTAGKHSEDGARPRLQPSKNDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPS
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KSD AYSSRTSRPMRDVGEDQVHPPLCHFPQRSLQHQLPQPGAQRFATRGYPMEDMKLQGVPMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 AYSSRTSRPMRDVGEDQVHPPLCHFPQRSLQHQLPQPGAQRFATRGYPMEDMKLQGVPMP
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KSD PGDLCGPTLLLDVSIKMEKDSGCEGAADGCVPSQVWLGASDRSHPATFPTRMHLKTEPDS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 PGDLCGPTLLLDVSIKMEKDSGCEGAADGCVPSQVWLGASDRSHPATFPTRMHLKTEPDS
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KSD RQQVYISHLGHGVRGAQPHGRATAGRSRELTPFHPAHCACLEPTDGLPQSEPPHQLCARG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RQQVYISHLGHGVRGAQPHGRATAGRSRELTPFHPAHCACLEPTDGLPQSEPPHQLCARG
              610       620       630       640       650       660

              670       680       690       700       710
pF1KSD RGEQSCTCRAAEAAPVVKREPLDSPQWATHSQGMVPGMLPKSALATLVPPQASGCTFLP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RGEQSCTCRAAEAAPVVKREPLDSPQWATHSQGMVPGMLPKSALATLVPPQASGCTFLP
              670       680       690       700       710

>>CCDS56355.1 AHRR gene_id:57491|Hs108|chr5 (701 aa)
 initn: 3273 init1: 3273 opt: 3273 Z-score: 2248.3 bits: 426.5 E(32554): 6.6e-119
Smith-Waterman score: 4851; 97.4% identity (97.4% similar) in 719 aa overlap (1-719:1-701)

               10        20        30        40        50        60
pF1KSD MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAEKSNPSKRHRDRLNAELDHLASLLPFPPD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAEKSNPSKRHRDRLNAELDHLASLLPFPPD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KSD IISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLAGSAVLEGRLLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 IISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLAGSAVLEGRLLL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KSD ESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQLHWAMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQLHWAMD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KSD PPQVVFGQAPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLLDSTSGFL
       :::::::: :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PPQVVFGQPPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLLDSTSGFL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KSD ARGSQAWQLRLCCPEPLMTMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAAPVLLPSA
                         ::::::::::::::::::::::::::::::::::::::::::
CCDS56 ------------------TMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAAPVLLPSA
                                250       260       270       280

              310       320       330       340       350       360
pF1KSD AEMKMRSALLRAKPRADTAATADAKVKATTSLCESELHGKPNYSAGRSSRESGVLVLREQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AEMKMRSALLRAKPRADTAATADAKVKATTSLCESELHGKPNYSAGRSSRESGVLVLREQ
            290       300       310       320       330       340

              370       380       390       400       410       420
pF1KSD TDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDREEEQHRMLSRASGVTGRRETPGPTK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 TDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDREEEQHRMLSRASGVTGRRETPGPTK
            350       360       370       380       390       400

              430       440       450       460       470       480
pF1KSD PLPWTAGKHSEDGARPRLQPSKNDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PLPWTAGKHSEDGARPRLQPSKNDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPS
            410       420       430       440       450       460

              490       500       510       520       530       540
pF1KSD AYSSRTSRPMRDVGEDQVHPPLCHFPQRSLQHQLPQPGAQRFATRGYPMEDMKLQGVPMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AYSSRTSRPMRDVGEDQVHPPLCHFPQRSLQHQLPQPGAQRFATRGYPMEDMKLQGVPMP
            470       480       490       500       510       520

              550       560       570       580       590       600
pF1KSD PGDLCGPTLLLDVSIKMEKDSGCEGAADGCVPSQVWLGASDRSHPATFPTRMHLKTEPDS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PGDLCGPTLLLDVSIKMEKDSGCEGAADGCVPSQVWLGASDRSHPATFPTRMHLKTEPDS
            530       540       550       560       570       580

              610       620       630       640       650       660
pF1KSD RQQVYISHLGHGVRGAQPHGRATAGRSRELTPFHPAHCACLEPTDGLPQSEPPHQLCARG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 RQQVYISHLGHGVRGAQPHGRATAGRSRELTPFHPAHCACLEPTDGLPQSEPPHQLCARG
            590       600       610       620       630       640

              670       680       690       700       710
pF1KSD RGEQSCTCRAAEAAPVVKREPLDSPQWATHSQGMVPGMLPKSALATLVPPQASGCTFLP
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 RGEQSCTCRAAEAAPVVKREPLDSPQWATHSQGMVPGMLPKSALATLVPPQASGCTFLP
            650       660       670       680       690       700

>>CCDS5366.1 AHR gene_id:196|Hs108|chr7 (848 aa)
 initn: 1089 init1: 608 opt: 840 Z-score: 586.5 bits: 119.3 E(32554): 2.4e-26
Smith-Waterman score: 1026; 42.1% identity (62.9% similar) in 475 aa overlap (12-448:9-456)

               10        20        30          40        50
pF1KSD MPRTMIPPGECTYAGRKRRRPLQKQRPAVGAE--KSNPSKRHRDRLNAELDHLASLLPFP
                  :::.::::.:.::    . ::  :::::::::::::.:::.::::::::
CCDS53    MNSSSANITYASRKRRKPVQKTVKPIPAEGIKSNPSKRHRDRLNTELDRLASLLPFP
                  10        20        30        40        50

       60        70        80        90       100           110
pF1KSD PDIISKLDKLSVLRLSVSYLRVKSFFQVVQEQSSRQPAAGAPSPGDSCPLA----GSAVL
        :.:.::::::::::::::::.::::.:. ..:  .  .:     :.:  :    :  .
CCDS53 QDVINKLDKLSVLRLSVSYLRAKSFFDVALKSSPTERNGGQ----DNCRAANFREGLNLQ
        60        70        80        90           100       110

          120       130       140       150       160       170
pF1KSD EGRLLLESLNGFALVVSAEGTIFYASATIVDYLGFHQTDVMHQNIYDYIHVDDRQDFCRQ
       ::..::..::::.:::.... .::::.:: :::::.:.::.::..:. ::..:: .: ::
CCDS53 EGEFLLQALNGFVLVVTTDALVFYASSTIQDYLGFQQSDVIHQSVYELIHTEDRAEFQRQ
           120       130       140       150       160       170

          180        190       200       210       220       230
pF1KSD LHWAMDPPQVVF-GQAPPLETGDDAILGRLLRAQEWGTGTPTEYSAFLTRCFICRVRCLL
       ::::..: : .  ::.    ::    : . .   .     : : : .. ::::::.::::
CCDS53 LHWALNPSQCTESGQGIEEATG----LPQTVVCYN-PDQIPPENSPLMERCFICRLRCLL
           180       190           200        210       220

           240       250       260       270       280       290
pF1KSD DSTSGFLARGSQAWQLRLCCPEPLMTMQFQGKLKFLFGQKKKAPSGAMLPPRLSLFCIAA
       :..:::::                  :.::::::.: :::::. .:..:::.:.:: ::.
CCDS53 DNSSGFLA------------------MNFQGKLKYLHGQKKKGKDGSILPPQLALFAIAT
      230                         240       250       260       270

           300       310        320       330       340
pF1KSD PVLLPSAAEMKMRSALLRAKPRAD-TAATADAKVKATTSLCESELHGKPN----------
       :.  ::  :.. .. ..:.: . : :    ::: . . .  :.::  . .
CCDS53 PLQPPSILEIRTKNFIFRTKHKLDFTPIGCDAKGRIVLGYTEAELCTRGSGYQFIHAADM
              280       290       300       310       320       330

             350            360       370       380       390
pF1KSD -YSAGRSSR-----ESGVLVLREQTDAGRWAQVPARAPCLCLRGGPDLVLDPKGGSGDRE
        : :    :     :::..:.:  :  .::. : . :  :   : :: ..  .    :.:
CCDS53 LYCAESHIRMIKTGESGMIVFRLLTKNNRWTWVQSNARLLYKNGRPDYIIVTQRPLTDEE
              340       350       360       370       380       390

        400                410         420         430        440
pF1KSD EEQHR---------MLSRASGVTGRRETPGPT--KPLPWTA--GKHSEDGARPR-LQPSK
         .:          :.. . .:  .  .: :.   :::  .  :  ..:.:    :. ..
CCDS53 GTEHLRKRNTKLPFMFTTGEAVLYEATNPFPAIMDPLPLRTKNGTSGKDSATTSTLSKDS
              400       410       420       430       440       450

            450       460       470       480       490       500
pF1KSD NDPPSLRPMPRGSCLPCPCVQGTFRNSPISHPPSPSPSAYSSRTSRPMRDVGEDQVHPPL
        .: ::
CCDS53 LNPSSLLAAMMQQDESIYLYPASSTSSTAPFENNFFNESMNECRNWQDNTAPMGNDTILK
              460       470       480       490       500       510

719 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 05:29:59 2016 done: Thu Nov 3 05:30:00 2016
 Total Scan time: 4.380 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]