FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9904, 589 aa
  1>>>pF1KB9904 589 - 589 aa - 589 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.4642+/-0.00105; mu= 3.8294+/- 0.064
 mean_var=342.3760+/-69.843, 0's: 0 Z-trim(115.0): 22 B-trim: 953 in 1/50
 Lambda= 0.069314
 statistics sampled from 15573 (15595) to 15573 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.767), E-opt: 0.2 (0.479), width: 16
 Scan time: 3.700

The best scores are:                                      opt bits E(32554)
CCDS33015.1 HNRNPL gene_id:3191|Hs108|chr19       ( 589) 4115 425.4  1e-118
CCDS33016.1 HNRNPL gene_id:3191|Hs108|chr19       ( 456) 3200 333.8   3e-91
CCDS46261.1 HNRNPLL gene_id:92906|Hs108|chr2      ( 537) 1025 116.3 9.8e-26
CCDS1796.2 HNRNPLL gene_id:92906|Hs108|chr2       ( 542) 1025 116.3 9.9e-26

>>CCDS33015.1 HNRNPL gene_id:3191|Hs108|chr19 (589 aa)
 initn: 4115 init1: 4115 opt: 4115 Z-score: 2245.0 bits: 425.4 E(32554): 1e-118
Smith-Waterman score: 4115; 100.0% identity (100.0% similar) in 589 aa overlap (1-589:1-589)

               10        20        30        40        50        60
pF1KB9 MSRRLLPRAEKRRRRLEQRQQPDEQRRRSGAMVKMAAAGGGGGGGRYYGGGSEGGRAPKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MSRRLLPRAEKRRRRLEQRQQPDEQRRRSGAMVKMAAAGGGGGGGRYYGGGSEGGRAPKR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 LKTDNAGDQHGGGGGGGGGAGAAGGGGGGENYDDPHKTPASPVVHIRGLIDGVVEADLVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 LKTDNAGDQHGGGGGGGGGAGAAGGGGGGENYDDPHKTPASPVVHIRGLIDGVVEADLVE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 ALQEFGPISYVVVMPKKRQALVEFEDVLGACNAVNYAADNQIYIAGHPAFVNYSTSQKIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 ALQEFGPISYVVVMPKKRQALVEFEDVLGACNAVNYAADNQIYIAGHPAFVNYSTSQKIS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 
       RPGDSDDSRSVNSVLLFTILNPIYSITTDVLYTICNPCGPVQRIVIFRKNGVQAMVEFDS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 RPGDSDDSRSVNSVLLFTILNPIYSITTDVLYTICNPCGPVQRIVIFRKNGVQAMVEFDS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 VQSAQRAKASLNGADIYSGCCTLKIEYAKPTRLNVFKNDQDTWDYTNPNLSGQGDPGSNP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 VQSAQRAKASLNGADIYSGCCTLKIEYAKPTRLNVFKNDQDTWDYTNPNLSGQGDPGSNP
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 NKRQRQPPLLGDHPAEYGGPHGGYHSHYHDEGYGPPPPHYEGRRMGPPVGGHRRGPSRYG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 NKRQRQPPLLGDHPAEYGGPHGGYHSHYHDEGYGPPPPHYEGRRMGPPVGGHRRGPSRYG
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB9 PQYGHPPPPPPPPEYGPHADSPVLMVYGLDQSKMNCDRVFNVFCLYGNVEKVKFMKSKPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PQYGHPPPPPPPPEYGPHADSPVLMVYGLDQSKMNCDRVFNVFCLYGNVEKVKFMKSKPG
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB9 AAMVEMADGYAVDRAITHLNNNFMFGQKLNVCVSKQPAIMPGQSYGLEDGSCSYKDFSES
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 AAMVEMADGYAVDRAITHLNNNFMFGQKLNVCVSKQPAIMPGQSYGLEDGSCSYKDFSES
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB9 RNNRFSTPEQAAKNRIQHPSNVLHFFNAPLEVTEENFFEICDELGVKRPSSVKVFSGKSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 RNNRFSTPEQAAKNRIQHPSNVLHFFNAPLEVTEENFFEICDELGVKRPSSVKVFSGKSE
              490       500       510       520       530       540

              550       560       570       580
pF1KB9 RSSSGLLEWESKSDALETLGFLNHYQMKNPNGPYPYTLKLCFSTAQHAS
       :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 RSSSGLLEWESKSDALETLGFLNHYQMKNPNGPYPYTLKLCFSTAQHAS
              550       560       570       580

>>CCDS33016.1 HNRNPL gene_id:3191|Hs108|chr19 (456 aa)
 initn: 3200 init1: 3200 opt: 3200 Z-score: 1751.8 bits: 333.8 E(32554): 3e-91
Smith-Waterman score: 3200; 100.0% identity (100.0% similar) in 456 aa overlap (134-589:1-456)

           110       120       130       140       150       160
pF1KB9 VHIRGLIDGVVEADLVEALQEFGPISYVVVMPKKRQALVEFEDVLGACNAVNYAADNQIY
                                     ::::::::::::::::::::::::::::::
CCDS33 
                                     MPKKRQALVEFEDVLGACNAVNYAADNQIY
                                             10        20        30

           170       180       190       200       210       220
pF1KB9 IAGHPAFVNYSTSQKISRPGDSDDSRSVNSVLLFTILNPIYSITTDVLYTICNPCGPVQR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 IAGHPAFVNYSTSQKISRPGDSDDSRSVNSVLLFTILNPIYSITTDVLYTICNPCGPVQR
               40        50        60        70        80        90

           230       240       250       260       270       280
pF1KB9 IVIFRKNGVQAMVEFDSVQSAQRAKASLNGADIYSGCCTLKIEYAKPTRLNVFKNDQDTW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 IVIFRKNGVQAMVEFDSVQSAQRAKASLNGADIYSGCCTLKIEYAKPTRLNVFKNDQDTW
              100       110       120       130       140       150

           290       300       310       320       330       340
pF1KB9 DYTNPNLSGQGDPGSNPNKRQRQPPLLGDHPAEYGGPHGGYHSHYHDEGYGPPPPHYEGR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 DYTNPNLSGQGDPGSNPNKRQRQPPLLGDHPAEYGGPHGGYHSHYHDEGYGPPPPHYEGR
              160       170       180       190       200       210

           350       360       370       380       390       400
pF1KB9 RMGPPVGGHRRGPSRYGPQYGHPPPPPPPPEYGPHADSPVLMVYGLDQSKMNCDRVFNVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 RMGPPVGGHRRGPSRYGPQYGHPPPPPPPPEYGPHADSPVLMVYGLDQSKMNCDRVFNVF
              220       230       240       250       260       270

           410       420       430       440       450       460
pF1KB9 CLYGNVEKVKFMKSKPGAAMVEMADGYAVDRAITHLNNNFMFGQKLNVCVSKQPAIMPGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 CLYGNVEKVKFMKSKPGAAMVEMADGYAVDRAITHLNNNFMFGQKLNVCVSKQPAIMPGQ
              280       290       300       310       320       330

           470       480       490       500       510       520
pF1KB9 SYGLEDGSCSYKDFSESRNNRFSTPEQAAKNRIQHPSNVLHFFNAPLEVTEENFFEICDE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 SYGLEDGSCSYKDFSESRNNRFSTPEQAAKNRIQHPSNVLHFFNAPLEVTEENFFEICDE
              340       350       360       370       380       390

           530       540       550       560       570       580
pF1KB9 LGVKRPSSVKVFSGKSERSSSGLLEWESKSDALETLGFLNHYQMKNPNGPYPYTLKLCFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 LGVKRPSSVKVFSGKSERSSSGLLEWESKSDALETLGFLNHYQMKNPNGPYPYTLKLCFS
              400       410       420       430       440       450

pF1KB9 TAQHAS
       ::::::
CCDS33 TAQHAS

>>CCDS46261.1 HNRNPLL gene_id:92906|Hs108|chr2 (537 aa)
 initn: 1767 init1: 934 opt: 1025 Z-score: 575.5 bits: 116.3 E(32554): 9.8e-26
Smith-Waterman score: 1884; 57.1% identity (76.7% similar) in 520 aa overlap 
(71-587:44-536) 50 60 70 80 90 100 pF1KB9 GGGGGRYYGGGSEGGRAPKRLKTDNAGDQHGGGGGGGGGAGAAGGGGGGENYDDPHKTPA ::: ::::: . . .:: . ::. . CCDS46 ESQAKRLKTEEGEIDYSAEEGENRREATPRGGGDGGGGGRSFSQPEAGGSH----HKVSV 20 30 40 50 60 110 120 130 140 150 160 pF1KB9 SPVVHIRGLIDGVVEADLVEALQEFGPISYVVVMPKKRQALVEFEDVLGACNAVNYAADN :::::.::: ..::::::::::..:: : ::..:: :::::::::.. .: . :..:::. CCDS46 SPVVHVRGLCESVVEADLVEALEKFGTICYVMMMPFKRQALVEFENIDSAKECVTFAADE 70 80 90 100 110 120 170 180 190 200 210 220 pF1KB9 QIYIAGHPAFVNYSTSQKISRPGDSDDSRSVNSVLLFTILNPIYSITTDVLYTICNPCGP .::::. :: :::::..:.:::..:: . :.:::..: ::.: ::.:::::.::: : CCDS46 PVYIAGQQAFFNYSTSKRITRPGNTDDPSGGNKVLLLSIQNPLYPITVDVLYTVCNPVGK 130 140 150 160 170 180 230 240 250 260 270 280 pF1KB9 VQRIVIFRKNGVQAMVEFDSVQSAQRAKASLNGADIYSGCCTLKIEYAKPTRLNVFKNDQ :::::::..::.::::::.:: ::.:::.:::::::.::::::::::.::::::..::. CCDS46 VQRIVIFKRNGIQAMVEFESVLCAQKAKAALNGADIYAGCCTLKIEYARPTRLNVIRNDN 190 200 210 220 230 240 290 300 310 320 330 340 pF1KB9 DTWDYTNPNLSGQGDPGSNPNKRQRQPPLLGDHPAEYGGPHGGYHSHYHDEGYGPPPPHY :.::::.: : :. : :.. :::: .::.::. . : :: :: :: : CCDS46 DSWDYTKPYL-GRRDRGKG---RQRQA-ILGEHPSSFR--HDGYGSH------GPLLPLP 250 260 270 280 290 350 360 370 380 390 pF1KB9 EGRRMGPPVGGHRRGPSRYGPQYGHPPPPPPPPEY--GPHADSPVLMVYGLDQSKMNCDR ::: :: :. : : : : . .. :.:: :: : ::::.: CCDS46 SRYRMG----------SRDTPELVAYPLPQASSSYMHGGNPSGSVVMVSGLHQLKMNCSR 300 310 320 330 340 400 410 420 430 440 450 pF1KB9 VFNVFCLYGNVEKVKFMKSKPGAAMVEMADGYAVDRAITHLNNNFMFGQKLNVCVSKQPA :::.::::::.:::::::. ::.:.:::.: :::.::.::::: .::..:::::::: . CCDS46 VFNLFCLYGNIEKVKFMKTIPGTALVEMGDEYAVERAVTHLNNVKLFGKRLNVCVSKQHS 350 360 370 380 390 400 460 470 480 490 500 510 pF1KB9 IMPGQSYGLEDGSCSYKDFSESRNNRFSTPEQAAKNRIQHPSNVLHFFNAPLEVTEENFF ..:.: . ::::. :::::. :.::::.. ::.:: :: :: :::..:.:: ::::.: CCDS46 VVPSQIFELEDGTSSYKDFAMSKNNRFTSAGQASKNIIQPPSCVLHYYNVPLCVTEETFT 410 420 430 440 450 460 520 530 540 550 560 570 pF1KB9 EICDELGVKRPSSVKVFSGK-SERSSSGLLEWESKSDALETLGFLNHYQMKNPNGPYPYT ..:.. : . :::..: : .. 
::::::: :.::.:.: :::::.. ::: ::: CCDS46 KLCNDHEVLTFIKYKVFDAKPSAKTLSGLLEWECKTDAVEALTALNHYQIRVPNGSNPYT 470 480 490 500 510 520 580 pF1KB9 LKLCFSTAQHAS :::::::..: CCDS46 LKLCFSTSSHL 530 >>CCDS1796.2 HNRNPLL gene_id:92906|Hs108|chr2 (542 aa) initn: 1767 init1: 934 opt: 1025 Z-score: 575.4 bits: 116.3 E(32554): 9.9e-26 Smith-Waterman score: 1884; 57.1% identity (76.7% similar) in 520 aa overlap (71-587:49-541) 50 60 70 80 90 100 pF1KB9 GGGGGRYYGGGSEGGRAPKRLKTDNAGDQHGGGGGGGGGAGAAGGGGGGENYDDPHKTPA ::: ::::: . . .:: . ::. . CCDS17 ESQAKRLKTEEGEIDYSAEEGENRREATPRGGGDGGGGGRSFSQPEAGGSH----HKVSV 20 30 40 50 60 70 110 120 130 140 150 160 pF1KB9 SPVVHIRGLIDGVVEADLVEALQEFGPISYVVVMPKKRQALVEFEDVLGACNAVNYAADN :::::.::: ..::::::::::..:: : ::..:: :::::::::.. .: . :..:::. CCDS17 SPVVHVRGLCESVVEADLVEALEKFGTICYVMMMPFKRQALVEFENIDSAKECVTFAADE 80 90 100 110 120 130 170 180 190 200 210 220 pF1KB9 QIYIAGHPAFVNYSTSQKISRPGDSDDSRSVNSVLLFTILNPIYSITTDVLYTICNPCGP .::::. :: :::::..:.:::..:: . :.:::..: ::.: ::.:::::.::: : CCDS17 PVYIAGQQAFFNYSTSKRITRPGNTDDPSGGNKVLLLSIQNPLYPITVDVLYTVCNPVGK 140 150 160 170 180 190 230 240 250 260 270 280 pF1KB9 VQRIVIFRKNGVQAMVEFDSVQSAQRAKASLNGADIYSGCCTLKIEYAKPTRLNVFKNDQ :::::::..::.::::::.:: ::.:::.:::::::.::::::::::.::::::..::. CCDS17 VQRIVIFKRNGIQAMVEFESVLCAQKAKAALNGADIYAGCCTLKIEYARPTRLNVIRNDN 200 210 220 230 240 250 290 300 310 320 330 340 pF1KB9 DTWDYTNPNLSGQGDPGSNPNKRQRQPPLLGDHPAEYGGPHGGYHSHYHDEGYGPPPPHY :.::::.: : :. : :.. :::: .::.::. . : :: :: :: : CCDS17 DSWDYTKPYL-GRRDRGKG---RQRQA-ILGEHPSSFR--HDGYGSH------GPLLPLP 260 270 280 290 300 350 360 370 380 390 pF1KB9 EGRRMGPPVGGHRRGPSRYGPQYGHPPPPPPPPEY--GPHADSPVLMVYGLDQSKMNCDR ::: :: :. : : : : . .. :.:: :: : ::::.: CCDS17 SRYRMG----------SRDTPELVAYPLPQASSSYMHGGNPSGSVVMVSGLHQLKMNCSR 310 320 330 340 350 400 410 420 430 440 450 pF1KB9 VFNVFCLYGNVEKVKFMKSKPGAAMVEMADGYAVDRAITHLNNNFMFGQKLNVCVSKQPA :::.::::::.:::::::. ::.:.:::.: :::.::.::::: .::..:::::::: . 
CCDS17 VFNLFCLYGNIEKVKFMKTIPGTALVEMGDEYAVERAVTHLNNVKLFGKRLNVCVSKQHS 360 370 380 390 400 410 460 470 480 490 500 510 pF1KB9 IMPGQSYGLEDGSCSYKDFSESRNNRFSTPEQAAKNRIQHPSNVLHFFNAPLEVTEENFF ..:.: . ::::. :::::. :.::::.. ::.:: :: :: :::..:.:: ::::.: CCDS17 VVPSQIFELEDGTSSYKDFAMSKNNRFTSAGQASKNIIQPPSCVLHYYNVPLCVTEETFT 420 430 440 450 460 470 520 530 540 550 560 570 pF1KB9 EICDELGVKRPSSVKVFSGK-SERSSSGLLEWESKSDALETLGFLNHYQMKNPNGPYPYT ..:.. : . :::..: : .. ::::::: :.::.:.: :::::.. ::: ::: CCDS17 KLCNDHEVLTFIKYKVFDAKPSAKTLSGLLEWECKTDAVEALTALNHYQIRVPNGSNPYT 480 490 500 510 520 530 580 pF1KB9 LKLCFSTAQHAS :::::::..: CCDS17 LKLCFSTSSHL 540 589 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 21:27:03 2016 done: Sat Nov 5 21:27:04 2016 Total Scan time: 3.700 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]
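
The "best scores" rows in the report above can be pulled into records with a short script. The following is a minimal sketch, not part of the FASTA package: the names `Hit`, `HIT_ROW` and `parse_best_scores` are hypothetical, and the pattern assumes every row carries a `gene_id:` field followed by `( length) opt bits E-value`, as the four rows in this report do. It matches on tokens rather than columns, so it tolerates collapsed whitespace.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'The best scores are:' table (hypothetical record type)."""
    accession: str
    gene: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed row shape, taken from this report only:
#   <accession> <gene> gene_id:<...> ( <len>) <opt> <bits> <E-value>
HIT_ROW = re.compile(
    r"(?<!\S)(?P<acc>\S+)\s+(?P<gene>\S+)\s+gene_id:\S+\s+"
    r"\(\s*(?P<len>\d+)\)\s+(?P<opt>\d+)\s+(?P<bits>\d+\.\d+)\s+"
    r"(?P<e>\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)"
)


def parse_best_scores(report: str) -> List[Hit]:
    """Return the summary rows between the table header and the first '>>' record."""
    start = report.find("The best scores are:")
    if start < 0:
        return []
    end = report.find(">>", start)  # first alignment record ends the table
    table = report[start:end] if end >= 0 else report[start:]
    return [
        Hit(m["acc"], m["gene"], int(m["len"]), int(m["opt"]),
            float(m["bits"]), float(m["e"]))
        for m in HIT_ROW.finditer(table)
    ]


if __name__ == "__main__":
    # Two rows copied from the report, with whitespace collapsed.
    sample = (
        "The best scores are: opt bits E(32554) "
        "CCDS33015.1 HNRNPL gene_id:3191|Hs108|chr19 ( 589) 4115 425.4 1e-118 "
        "CCDS46261.1 HNRNPLL gene_id:92906|Hs108|chr2 ( 537) 1025 116.3 9.8e-26 "
        ">>CCDS33015.1 HNRNPL gene_id:3191|Hs108|chr19 (589 aa)"
    )
    for hit in parse_best_scores(sample):
        print(hit)
```

Stopping at the first `>>` keeps the per-alignment header lines, which repeat the accession and gene, out of the result.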