FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0372, 565 aa
  1>>>pF1KE0372 565 - 565 aa - 565 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.1639+/-0.00114; mu= 13.2880+/- 0.068
 mean_var=61.1922+/-12.410, 0's: 0 Z-trim(102.3): 37 B-trim: 243 in 1/48
 Lambda= 0.163956
 statistics sampled from 6856 (6874) to 6856 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.579), E-opt: 0.2 (0.211), width: 16
 Scan time: 3.150

The best scores are:                                 opt  bits E(32554)
CCDS46911.1 COL6A6 gene_id:131873|Hs108|chr3 (2263) 1006 246.9    2e-64
CCDS33410.2 COL6A3 gene_id:1293|Hs108|chr2   (2570)  341  89.6    5e-17
CCDS33409.1 COL6A3 gene_id:1293|Hs108|chr2   (2971)  341  89.6  5.8e-17
CCDS33412.1 COL6A3 gene_id:1293|Hs108|chr2   (3177)  341  89.6  6.2e-17

>>CCDS46911.1 COL6A6 gene_id:131873|Hs108|chr3 (2263 aa)
 initn: 1137 init1: 644 opt: 1006 Z-score: 1270.3 bits: 246.9 E(32554): 2e-64
Smith-Waterman score: 1006; 59.2% identity (83.7% similar) in 245 aa overlap (312-555:1947-2191)

             290       300       310       320       330       340
pF1KE0 QKLMINYEKDQKSAEIASLTSGHENYGRKEEPDHTYEPGDVSLQEYYMDVAFLIDASQRV
                                     .:: . . .     . :::.:::.:::. .
CCDS46 TFQVIVVPSGADYIPALERLQRCTFCYDVCKPDASCDQARPPPVQSYMDAAFLLDASRNM
       1920      1930      1940      1950      1960      1970

             350       360       370       380       390       400
pF1KE0 GSDEFKEVKAFITSVLDYFHIAPTPLTSTLGDRVAVLSYSPPGYMPNTEECPVYLEFDLV
       :: ::....::. ..::.:.:.: : ::. :::::.::..:: ..:::.. ::  ::.:.
CCDS46 GSAEFEDIRAFLGALLDHFEITPEPETSVTGDRVALLSHAPPDFLPNTQKSPVRAEFNLT
       1980      1990      2000      2010      2020      2030

             410        420       430       440       450       460
pF1KE0 TYNSIHQMKHHLQDS-QQLNGDVFIGHALQWTIDNVFVGTPNLRKNKVIFVISAGETNSL
       :: : . ::.:...: .:::::.:::::::::.::::..:::::.::::::::::::. :
CCDS46 TYRSKRLMKRHVHESVKQLNGDAFIGHALQWTLDNVFLSTPNLRRNKVIFVISAGETSHL
       2040      2050      2060      2070      2080      2090

              470       480       490       500       510       520
pF1KE0 DKDVLRNVSLRAKCQGYSIFVFSFGPKHNDKELEELASHPLDHHLVQLGRTHKPDWNYII
       : ..:.. :::::::::..::::.::  .:::::.::::::::::::::: :::: .: .
CCDS46 DGEILKKESLRAKCQGYALFVFSLGPIWDDKELEDLASHPLDHHLVQLGRIHKPDHSYGV
       2100      2110      2120      2130      2140      2150

              530       540       550       560
pF1KE0 KFVKPFVHLIRRAINKYPTEDMKATCVNMTSPNPENGGTENTVLW
       :::: :.. :::::::::  ..:  :  ..: .:.
CCDS46 KFVKSFINSIRRAINKYPPINLKIKCNRLNSIDPKQPPRPFRSFVPGPLKATLKEDVLQK
       2160      2170      2180      2190      2200      2210

CCDS46 AKFFQDKKYLSRVARSGRDDAIQNFMRSTSHTFKNGRMIESAPKQHD
       2220      2230      2240      2250      2260

>>CCDS33410.2 COL6A3 gene_id:1293|Hs108|chr2 (2570 aa)
 initn: 405 init1: 190 opt: 341 Z-score: 419.2 bits: 89.6 E(32554): 5e-17
Smith-Waterman score: 341; 28.3% identity (63.9% similar) in 219 aa overlap (329-546:2011-2229)

      300       310       320       330       340       350
pF1KE0 SLTSGHENYGRKEEPDHTYEPGDVSLQEYYMDVAFLIDASQRVGSDEFKEVKAFITSVLD
                                     .:.::..:... .   .:.:.: .:. ..
CCDS33 LDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQFNEMKKYIAYLVR
             1990      2000      2010      2020      2030      2040

      360       370       380       390       400       410
pF1KE0 YFHIAPTPLTSTLGDRVAVLSYSPPGYMPNTEECPVYLEFDLVTYNSIHQMKHHLQDSQ-
        . ..: : .:    ::::....:   . :.   :: .::.:. :.: ...   :. ..
CCDS33 QLDMSPDPKASQHFARVAVVQHAPSESVDNASMPPVKVEFSLTDYGSKEKLVDFLSRGMT
             2050      2060      2070      2080      2090      2100

       420       430       440       450       460       470
pF1KE0 QLNGDVFIGHALQWTIDNVFVGTPNLRKNKVIFVISAGETNSLDKDVLRNVSLRAKCQGY
       ::.:   .: :...::.::: ..:: :  :.. .. .::.   . .  . : :.:::.::
CCDS33 QLQGTRALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVPEQQLEEAQRVILQAKCKGY
             2110      2120      2130      2140      2150      2160

       480       490       500       510       520       530
pF1KE0 SIFVFSFGPKHNDKELEELASHPLDHHLVQLGRTHKPDWNYIIKFVKPFVHLIRRAINKY
        . :...: : : ::.  .::.: :  .  . .. . . . ...: . .  ..      :
CCDS33 FFVVLGIGRKVNIKEVYTFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFVSSENAFY
             2170      2180      2190      2200      2210      2220

       540       550       560
pF1KE0 PTEDMKATCVNMTSPNPENGGTENTVLW
        . :..  :
CCDS33 LSPDIRKQCDWFQGDQPTKNLVKFGHKQVNVPNNVTSSPTSNPVTTTKPVTTTKPVTTTT
             2230      2240      2250      2260      2270      2280

>>CCDS33409.1 COL6A3 gene_id:1293|Hs108|chr2 (2971 aa)
 initn: 374 init1: 190 opt: 341 Z-score: 418.0 bits: 89.6 E(32554): 5.8e-17
Smith-Waterman score: 341; 28.3% identity (63.9% similar) in 219 aa overlap (329-546:2412-2630)

      300       310       320       330       340       350
pF1KE0 SLTSGHENYGRKEEPDHTYEPGDVSLQEYYMDVAFLIDASQRVGSDEFKEVKAFITSVLD
                                     .:.::..:... .   .:.:.: .:. ..
CCDS33 LDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQFNEMKKYIAYLVR
            2390      2400      2410      2420      2430      2440

      360       370       380       390       400       410
pF1KE0 YFHIAPTPLTSTLGDRVAVLSYSPPGYMPNTEECPVYLEFDLVTYNSIHQMKHHLQDSQ-
        . ..: : .:    ::::....:   . :.   :: .::.:. :.: ...   :. ..
CCDS33 QLDMSPDPKASQHFARVAVVQHAPSESVDNASMPPVKVEFSLTDYGSKEKLVDFLSRGMT
            2450      2460      2470      2480      2490      2500

       420       430       440       450       460       470
pF1KE0 QLNGDVFIGHALQWTIDNVFVGTPNLRKNKVIFVISAGETNSLDKDVLRNVSLRAKCQGY
       ::.:   .: :...::.::: ..:: :  :.. .. .::.   . .  . : :.:::.::
CCDS33 QLQGTRALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVPEQQLEEAQRVILQAKCKGY
            2510      2520      2530      2540      2550      2560

       480       490       500       510       520       530
pF1KE0 SIFVFSFGPKHNDKELEELASHPLDHHLVQLGRTHKPDWNYIIKFVKPFVHLIRRAINKY
        . :...: : : ::.  .::.: :  .  . .. . . . ...: . .  ..      :
CCDS33 FFVVLGIGRKVNIKEVYTFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFVSSENAFY
            2570      2580      2590      2600      2610      2620

       540       550       560
pF1KE0 PTEDMKATCVNMTSPNPENGGTENTVLW
        . :..  :
CCDS33 LSPDIRKQCDWFQGDQPTKNLVKFGHKQVNVPNNVTSSPTSNPVTTTKPVTTTKPVTTTT
            2630      2640      2650      2660      2670      2680

>>CCDS33412.1 COL6A3 gene_id:1293|Hs108|chr2 (3177 aa)
 initn: 374 init1: 190 opt: 341 Z-score: 417.5 bits: 89.6 E(32554): 6.2e-17
Smith-Waterman score: 341; 28.3% identity (63.9% similar) in 219 aa overlap (329-546:2618-2836)

      300       310       320       330       340       350
pF1KE0 SLTSGHENYGRKEEPDHTYEPGDVSLQEYYMDVAFLIDASQRVGSDEFKEVKAFITSVLD
                                     .:.::..:... .   .:.:.: .:. ..
CCDS33 LDICNIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQFNEMKKYIAYLVR
      2590      2600      2610      2620      2630      2640

      360       370       380       390       400       410
pF1KE0 YFHIAPTPLTSTLGDRVAVLSYSPPGYMPNTEECPVYLEFDLVTYNSIHQMKHHLQDSQ-
        . ..: : .:    ::::....:   . :.   :: .::.:. :.: ...   :. ..
CCDS33 QLDMSPDPKASQHFARVAVVQHAPSESVDNASMPPVKVEFSLTDYGSKEKLVDFLSRGMT
      2650      2660      2670      2680      2690      2700

       420       430       440       450       460       470
pF1KE0 QLNGDVFIGHALQWTIDNVFVGTPNLRKNKVIFVISAGETNSLDKDVLRNVSLRAKCQGY
       ::.:   .: :...::.::: ..:: :  :.. .. .::.   . .  . : :.:::.::
CCDS33 QLQGTRALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVPEQQLEEAQRVILQAKCKGY
      2710      2720      2730      2740      2750      2760

       480       490       500       510       520       530
pF1KE0 SIFVFSFGPKHNDKELEELASHPLDHHLVQLGRTHKPDWNYIIKFVKPFVHLIRRAINKY
        . :...: : : ::.  .::.: :  .  . .. . . . ...: . .  ..      :
CCDS33 FFVVLGIGRKVNIKEVYTFASEPNDVFFKLVDKSTELNEEPLMRFGRLLPSFVSSENAFY
      2770      2780      2790      2800      2810      2820

       540       550       560
pF1KE0 PTEDMKATCVNMTSPNPENGGTENTVLW
        . :..  :
CCDS33 LSPDIRKQCDWFQGDQPTKNLVKFGHKQVNVPNNVTSSPTSNPVTTTKPVTTTKPVTTTT
      2830      2840      2850      2860      2870      2880

565 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 13:37:54 2016 done: Thu Nov 3 13:37:54 2016
 Total Scan time: 3.150 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]