FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7702, 461 aa
  1>>>pF1KB7702 461 - 461 aa - 461 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 10.1715+/-0.00107; mu= -0.3616+/- 0.065
 mean_var=405.4588+/-82.565, 0's: 0 Z-trim(115.7): 118 B-trim: 433 in 1/54
 Lambda= 0.063694
 statistics sampled from 16120 (16238) to 16120 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.798), E-opt: 0.2 (0.499), width: 16
 Scan time: 3.130

The best scores are:                               opt  bits E(32554)
CCDS6856.1 NR5A1 gene_id:2516|Hs108|chr9   ( 461) 3209 308.7 8.1e-84
CCDS60383.1 NR5A2 gene_id:2494|Hs108|chr1  ( 469) 1900 188.4 1.3e-47
CCDS1400.1 NR5A2 gene_id:2494|Hs108|chr1   ( 495) 1900 188.5 1.4e-47
CCDS1401.1 NR5A2 gene_id:2494|Hs108|chr1   ( 541) 1900 188.5 1.5e-47

>>CCDS6856.1 NR5A1 gene_id:2516|Hs108|chr9 (461 aa)
 initn: 3209 init1: 3209 opt: 3209 Z-score: 1618.3 bits: 308.7 E(32554): 8.1e-84
Smith-Waterman score: 3209; 100.0% identity (100.0% similar) in 461 aa overlap (1-461:1-461)

               10        20        30        40        50        60
pF1KB7 MDYSYDEDLDELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNNKHYTCTESQSCKIDKT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 MDYSYDEDLDELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNNKHYTCTESQSCKIDKT
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 QRKRCPFCRFQKCLTVGMRLEAVRADRMRGGRNKFGPMYKRDRALKQQKKAQIRANGFKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 QRKRCPFCRFQKCLTVGMRLEAVRADRMRGGRNKFGPMYKRDRALKQQKKAQIRANGFKL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 ETGPPMGVPPPPPPAPDYVLPPSLHGPEPKGLAAGPPAGPLGDFGAPALPMAVPGAHGPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 ETGPPMGVPPPPPPAPDYVLPPSLHGPEPKGLAAGPPAGPLGDFGAPALPMAVPGAHGPL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB7 AGYLYPAFPGRAIKSEYPEPYASPPQPGLPYGYPEPFSGGPNVPELILQLLQLEPDEDQV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 AGYLYPAFPGRAIKSEYPEPYASPPQPGLPYGYPEPFSGGPNVPELILQLLQLEPDEDQV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB7 RARILGCLQEPTKSRPDQPAAFGLLCRMADQTFISIVDWARRCMVFKELEVADQMTLLQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 RARILGCLQEPTKSRPDQPAAFGLLCRMADQTFISIVDWARRCMVFKELEVADQMTLLQN
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB7 CWSELLVFDHIYRQVQHGKEGSILLVTGQEVELTTVATQAGSLLHSLVLRAQELVLQLLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 CWSELLVFDHIYRQVQHGKEGSILLVTGQEVELTTVATQAGSLLHSLVLRAQELVLQLLA
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB7 LQLDRQEFVCLKFIILFSLDLKFLNNHILVKDAQEKANAALLDYTLCHYPHCGDKFQQLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS68 LQLDRQEFVCLKFIILFSLDLKFLNNHILVKDAQEKANAALLDYTLCHYPHCGDKFQQLL
              370       380       390       400       410       420

              430       440       450       460
pF1KB7 LCLVEVRALSMQAKEYLYHKHLGNEMPRNNLLIEMLQAKQT
       :::::::::::::::::::::::::::::::::::::::::
CCDS68 LCLVEVRALSMQAKEYLYHKHLGNEMPRNNLLIEMLQAKQT
              430       440       450       460

>>CCDS60383.1 NR5A2 gene_id:2494|Hs108|chr1 (469 aa)
 initn: 1857 init1: 879 opt: 1900 Z-score: 968.1 bits: 188.4 E(32554): 1.3e-47
Smith-Waterman score: 1901; 61.5% identity (82.6% similar) in 470 aa overlap (1-460:2-468)

                10        20        30        40        50
pF1KB7  MDYSYDEDLDELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNNKHYTCTESQSCKIDK
        ..:::::::.:::::::::::::::::::::::::::::::::::.::: :.:.:.:::
CCDS60 MVNYSYDEDLEELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNNKRYTCIENQNCQIDK
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KB7 TQRKRCPFCRFQKCLTVGMRLEAVRADRMRGGRNKFGPMYKRDRALKQQKKAQIRANGFK
       :::::::.:::::::.:::.:::::::::::::::::::::::::::::::: :::::.:
CCDS60 TQRKRCPYCRFQKCLSVGMKLEAVRADRMRGGRNKFGPMYKRDRALKQQKKALIRANGLK
               70        80        90       100       110       120

     120       130       140       150            160       170
pF1KB7 LETGPPMGVPPPPPPAPDYVLPPSLHGPEPKGL----AAGPPAG-PLGDFGAPALPMAVP
       ::.   . .   :          ..:.   :::    :: ::.    . : .  . :..:
CCDS60 LEAMSQV-IQAMPSDLTISSAIQNIHSAS-KGLPLNHAALPPTDYDRSPFVTSPISMTMP
               130       140        150       160       170

          180        190       200       210        220        230
pF1KB7 GAHGPLAGY-LYPAFPGRAIKSEYPEPYASPPQPGLPYGYPEPF-SGGP-NVPELILQLL
         :: : ::  :  ::.::::::::.::.: :.  . :.: . . ...: ..:.:::.::
CCDS60 -PHGSLQGYQTYGHFPSRAIKSEYPDPYTSSPESIMGYSYMDSYQTSSPASIPHLILELL
       180       190       200       210       220       230

             240       250         260       270       280
pF1KB7 QLEPDEDQVRARILGCLQEP--TKSRPDQPAAFGLLCRMADQTFISIVDWARRCMVFKEL
       . :::: ::.:.:.. ::.   ..:. .. ..:::.:.:::::..:::.:::  . :.::
CCDS60 KCEPDEPQVQAKIMAYLQQEQANRSKHEKLSTFGLMCKMADQTLFSIVEWARSSIFFREL
       240       250       260       270       280       290

     290       300       310       320       330       340
pF1KB7 EVADQMTLLQNCWSELLVFDHIYRQVQHGKEGSILLVTGQEVELTTVATQAGSLLHSLVL
       .: ::: ::::::::::..::::::: :::::::.:::::.:. . .:.:::. :..:.
CCDS60 KVDDQMKLLQNCWSELLILDHIYRQVVHGKEGSIFLVTGQQVDYSIIASQAGATLNNLMS
       300       310       320       330       340       350

     350       360       370       380       390       400
pF1KB7 RAQELVLQLLALQLDRQEFVCLKFIILFSLDLKFLNNHILVKDAQEKANAALLDYTLCHY
       .::::: .: .::.:..:::::::..:::::.: :.:  ::. .::..::::::::.:.:
CCDS60 HAQELVAKLRSLQFDQREFVCLKFLVLFSLDVKNLENFQLVEGVQEQVNAALLDYTMCNY
       360       370       380       390       400       410

     410       420       430       440       450       460
pF1KB7 PHCGDKFQQLLLCLVEVRALSMQAKEYLYHKHLGNEMPRNNLLIEMLQAKQT
       :.  .:: :::: : :.::.::::.::::.:::....: ::::::::.::.
CCDS60 PQQTEKFGQLLLRLPEIRAISMQAEEYLYYKHLNGDVPYNNLLIEMLHAKRA
       420       430       440       450       460

>>CCDS1400.1 NR5A2 gene_id:2494|Hs108|chr1 (495 aa)
 initn: 1857 init1: 879 opt: 1900 Z-score: 967.8 bits: 188.5 E(32554): 1.4e-47
Smith-Waterman score: 1901; 61.5% identity (82.6% similar) in 470 aa overlap (1-460:28-494)

                                          10        20        30
pF1KB7                            MDYSYDEDLDELCPVCGDKVSGYHYGLLTCESC
                                  ..:::::::.:::::::::::::::::::::::
CCDS14 MSSNSDTGDLQESLKHGLTPIVSQFKMVNYSYDEDLEELCPVCGDKVSGYHYGLLTCESC
               10        20        30        40        50        60

            40        50        60        70        80        90
pF1KB7 KGFFKRTVQNNKHYTCTESQSCKIDKTQRKRCPFCRFQKCLTVGMRLEAVRADRMRGGRN
       ::::::::::::.::: :.:.:.::::::::::.:::::::.:::.::::::::::::::
CCDS14 KGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRFQKCLSVGMKLEAVRADRMRGGRN
               70        80        90       100       110       120

           100       110       120       130       140       150
pF1KB7 KFGPMYKRDRALKQQKKAQIRANGFKLETGPPMGVPPPPPPAPDYVLPPSLHGPEPKGL-
       :::::::::::::::::: :::::.:::.   . .   :          ..:.   :::
CCDS14 KFGPMYKRDRALKQQKKALIRANGLKLEAMSQV-IQAMPSDLTISSAIQNIHSAS-KGLP
              130       140       150        160       170

                160       170       180        190       200
pF1KB7 ---AAGPPAG-PLGDFGAPALPMAVPGAHGPLAGY-LYPAFPGRAIKSEYPEPYASPPQP
          :: ::.    . : .  . :..:  :: : ::  :  ::.::::::::.::.: :.
CCDS14 LNHAALPPTDYDRSPFVTSPISMTMP-PHGSLQGYQTYGHFPSRAIKSEYPDPYTSSPES
      180       190       200        210       220       230

       210        220        230       240       250         260
pF1KB7 GLPYGYPEPF-SGGP-NVPELILQLLQLEPDEDQVRARILGCLQEP--TKSRPDQPAAFG
        . :.: . . ...: ..:.:::.::. :::: ::.:.:.. ::.   ..:. .. ..::
CCDS14 IMGYSYMDSYQTSSPASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLSTFG
       240       250       260       270       280       290

           270       280       290       300       310       320
pF1KB7 LLCRMADQTFISIVDWARRCMVFKELEVADQMTLLQNCWSELLVFDHIYRQVQHGKEGSI
       :.:.:::::..:::.:::  . :.::.: ::: ::::::::::..::::::: :::::::
CCDS14 LMCKMADQTLFSIVEWARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKEGSI
       300       310       320       330       340       350

           330       340       350       360       370       380
pF1KB7 LLVTGQEVELTTVATQAGSLLHSLVLRAQELVLQLLALQLDRQEFVCLKFIILFSLDLKF
       .:::::.:. . .:.:::. :..:. .::::: .: .::.:..:::::::..:::::.:
CCDS14 FLVTGQQVDYSIIASQAGATLNNLMSHAQELVAKLRSLQFDQREFVCLKFLVLFSLDVKN
       360       370       380       390       400       410

           390       400       410       420       430       440
pF1KB7 LNNHILVKDAQEKANAALLDYTLCHYPHCGDKFQQLLLCLVEVRALSMQAKEYLYHKHLG
       :.:  ::. .::..::::::::.:.::.  .:: :::: : :.::.::::.::::.:::.
CCDS14 LENFQLVEGVQEQVNAALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQAEEYLYYKHLN
       420       430       440       450       460       470

           450       460
pF1KB7 NEMPRNNLLIEMLQAKQT
       ...: ::::::::.::.
CCDS14 GDVPYNNLLIEMLHAKRA
       480       490

>>CCDS1401.1 NR5A2 gene_id:2494|Hs108|chr1 (541 aa)
 initn: 1857 init1: 879 opt: 1900 Z-score: 967.4 bits: 188.5 E(32554): 1.5e-47
Smith-Waterman score: 1901; 61.5% identity (82.6% similar) in 470 aa overlap (1-460:74-540)

                                             10        20        30
pF1KB7                               MDYSYDEDLDELCPVCGDKVSGYHYGLLTC
                                     ..:::::::.::::::::::::::::::::
CCDS14 KVETEALGLARSHGEQGQMPENMQVSQFKMVNYSYDEDLEELCPVCGDKVSGYHYGLLTC
            50        60        70        80        90       100

               40        50        60        70        80        90
pF1KB7 ESCKGFFKRTVQNNKHYTCTESQSCKIDKTQRKRCPFCRFQKCLTVGMRLEAVRADRMRG
       :::::::::::::::.::: :.:.:.::::::::::.:::::::.:::.:::::::::::
CCDS14 ESCKGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRFQKCLSVGMKLEAVRADRMRG
           110       120       130       140       150       160

              100       110       120       130       140       150
pF1KB7 GRNKFGPMYKRDRALKQQKKAQIRANGFKLETGPPMGVPPPPPPAPDYVLPPSLHGPEPK
       ::::::::::::::::::::: :::::.:::.   . .   :          ..:.   :
CCDS14 GRNKFGPMYKRDRALKQQKKALIRANGLKLEAMSQV-IQAMPSDLTISSAIQNIHSAS-K
           170       180       190        200       210       220

                   160       170       180        190       200
pF1KB7 GL----AAGPPAG-PLGDFGAPALPMAVPGAHGPLAGY-LYPAFPGRAIKSEYPEPYASP
       ::    :: ::.    . : .  . :..:  :: : ::  :  ::.::::::::.::.:
CCDS14 GLPLNHAALPPTDYDRSPFVTSPISMTMP-PHGSLQGYQTYGHFPSRAIKSEYPDPYTSS
             230       240       250        260       270       280

          210        220        230       240       250         260
pF1KB7 PQPGLPYGYPEPF-SGGP-NVPELILQLLQLEPDEDQVRARILGCLQEP--TKSRPDQPA
       :.  . :.: . . ...: ..:.:::.::. :::: ::.:.:.. ::.   ..:. .. .
CCDS14 PESIMGYSYMDSYQTSSPASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLS
              290       300       310       320       330       340

              270       280       290       300       310       320
pF1KB7 AFGLLCRMADQTFISIVDWARRCMVFKELEVADQMTLLQNCWSELLVFDHIYRQVQHGKE
       .:::.:.:::::..:::.:::  . :.::.: ::: ::::::::::..::::::: ::::
CCDS14 TFGLMCKMADQTLFSIVEWARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKE
              350       360       370       380       390       400

              330       340       350       360       370       380
pF1KB7 GSILLVTGQEVELTTVATQAGSLLHSLVLRAQELVLQLLALQLDRQEFVCLKFIILFSLD
       :::.:::::.:. . .:.:::. :..:. .::::: .: .::.:..:::::::..:::::
CCDS14 GSIFLVTGQQVDYSIIASQAGATLNNLMSHAQELVAKLRSLQFDQREFVCLKFLVLFSLD
              410       420       430       440       450       460

              390       400       410       420       430       440
pF1KB7 LKFLNNHILVKDAQEKANAALLDYTLCHYPHCGDKFQQLLLCLVEVRALSMQAKEYLYHK
       .: :.:  ::. .::..::::::::.:.::.  .:: :::: : :.::.::::.::::.:
CCDS14 VKNLENFQLVEGVQEQVNAALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQAEEYLYYK
              470       480       490       500       510       520

              450       460
pF1KB7 HLGNEMPRNNLLIEMLQAKQT
       ::....: ::::::::.::.
CCDS14 HLNGDVPYNNLLIEMLHAKRA
              530       540

461 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 09:10:31 2016 done: Fri Nov 4 09:10:31 2016
 Total Scan time: 3.130 Total Display time: 0.030

Function used was FASTA [36.3.4 Apr, 2011]