FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA1983, 406 aa
  1>>>pF1KA1983 406 - 406 aa - 406 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 10.5293+/-0.00044; mu= -2.9356+/- 0.028
 mean_var=533.0750+/-108.245, 0's: 0 Z-trim(124.4): 406 B-trim: 0 in 0/60
 Lambda= 0.055549
 statistics sampled from 45459 (45920) to 45459 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.804), E-opt: 0.2 (0.538), width: 16
 Scan time: 8.300

The best scores are:                                       opt  bits E(85289)
NP_597716 (OMIM: 235510,612753) collagen and calci ( 406) 2959 251.4   3e-66
XP_016881047 (OMIM: 235510,612753) PREDICTED: coll ( 366) 2419 208.0   3e-53
XP_016881045 (OMIM: 235510,612753) PREDICTED: coll ( 435) 1877 164.7   4e-40
XP_016881046 (OMIM: 235510,612753) PREDICTED: coll ( 435) 1877 164.7   4e-40
NP_000486 (OMIM: 301050,303630) collagen alpha-5(I (1685)  344  42.6  0.0085
XP_016884749 (OMIM: 301050,303630) PREDICTED: coll (1690)  344  42.6  0.0085

>>NP_597716 (OMIM: 235510,612753) collagen and calcium-b (406 aa)
 initn: 2959 init1: 2959 opt: 2959 Z-score: 1310.3 bits: 251.4 E(85289): 3e-66
Smith-Waterman score: 2959; 100.0% identity (100.0% similar) in 406 aa overlap (1-406:1-406)

               10        20        30        40        50        60
pF1KA1 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA1 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA1 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA1 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA1 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA1 RRGPVGPPGAPGRDGSKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_597 RRGPVGPPGAPGRDGSKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHR
              310       320       330       340       350       360

              370       380       390       400
pF1KA1 THSSAEEFPLPQEFPSYPEAMDLGSGDDHPRRTETRDLRAPRDFYP
       ::::::::::::::::::::::::::::::::::::::::::::::
NP_597 THSSAEEFPLPQEFPSYPEAMDLGSGDDHPRRTETRDLRAPRDFYP
              370       380       390       400

>>XP_016881047 (OMIM: 235510,612753) PREDICTED: collagen (366 aa)
 initn: 2650 init1: 2419 opt: 2419 Z-score: 1076.9 bits: 208.0 E(85289): 3e-53
Smith-Waterman score: 2419; 100.0% identity (100.0% similar) in 329 aa overlap (1-329:1-329)

               10        20        30        40        50        60
pF1KA1 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA1 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA1 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA1 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA1 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA1 RRGPVGPPGAPGRDGSKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHR
       :::::::::::::::::::::::::::::
XP_016 RRGPVGPPGAPGRDGSKGERGAPGPRGSPVSSTLCPASPGERSQGCSSDEPIGTPWFFRL
              310       320       330       340       350       360

>>XP_016881045 (OMIM: 235510,612753) PREDICTED: collagen (435 aa)
 initn: 1877 init1: 1877 opt: 1877 Z-score: 841.4 bits: 164.7 E(85289): 4e-40
Smith-Waterman score: 1877; 99.2% identity (99.2% similar) in 262 aa overlap (1-262:1-262)

               10        20        30        40        50        60
pF1KA1 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA1 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA1 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA1 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA1 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
       :::::::::::::::::::  :
XP_016 SNTYLPGPPGLPGGQGPPGECGRGHRRVITNRKSLPKAHICWGLDNVLQLPSRRNKAHQD
              250       260       270       280       290       300

>>XP_016881046 (OMIM: 235510,612753) PREDICTED: collagen (435 aa)
 initn: 1877 init1: 1877 opt: 1877 Z-score: 841.4 bits: 164.7 E(85289): 4e-40
Smith-Waterman score: 1877; 99.2% identity (99.2% similar) in 262 aa overlap (1-262:1-262)

               10        20        30        40        50        60
pF1KA1 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA1 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 KSSGELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYD
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA1 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RERHRKREKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKY
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA1 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 PNDTGHEKSENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLA
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA1 SNTYLPGPPGLPGGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQG
       :::::::::::::::::::  :
XP_016 SNTYLPGPPGLPGGQGPPGECGRGHRRVITNRKSLPKAHICWGLDNVLQLPSRRNKAHQD
              250       260       270       280       290       300

>>NP_000486 (OMIM: 301050,303630) collagen alpha-5(IV) c (1685 aa)
 initn: 806 init1: 333 opt: 344 Z-score: 171.2 bits: 42.6 E(85289): 0.0085
Smith-Waterman score: 349; 51.9% identity (57.5% similar) in 106 aa overlap (246-333:1192-1297)

         220       230       240       250                   260
pF1KA1 KQKIALLPNNAADLGKYITGDKVLASNTYLPGPPGLPG--GQ----------GPPGSPGP
                                     ::::::::  ::          : :: :::
NP_000 PPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGLSGQKGDGGLPGIPGNPGLPGP
            1170      1180      1190      1200      1210      1220

           270       280             290       300       310
pF1KA1 KGSPGFPGMPGPPGQPGPRGSMGPM------GPSPDLSHIKQGRRGPVGPPGAPGRDGSK
       :: ::: :.::  : ::: :: ::       .:.:.    . :  :: :::: ::  : :
NP_000 KGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPPGRPGLPGPEGPPGLPGNGGIK
            1230      1240      1250      1260      1270      1280

       320       330       340       350       360       370
pF1KA1 GERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHRTHSSAEEFPLPQEFPSY
       ::.: ::  : :: ::
NP_000 GEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMKGDPGLPGVPGFPGMKGPSGVP
            1290      1300      1310      1320      1330      1340

>>XP_016884749 (OMIM: 301050,303630) PREDICTED: collagen (1690 aa)
 initn: 806 init1: 333 opt: 344 Z-score: 171.2 bits: 42.6 E(85289): 0.0085
Smith-Waterman score: 349; 51.9% identity (57.5% similar) in 106 aa overlap (246-333:1197-1302)

         220       230       240       250                   260
pF1KA1 KQKIALLPNNAADLGKYITGDKVLASNTYLPGPPGLPG--GQ----------GPPGSPGP
                                     ::::::::  ::          : :: :::
XP_016 PPGEKGKPGQDGIPGPAGQKGEPGQPGFGNPGPPGLPGLSGQKGDGGLPGIPGNPGLPGP
       1170      1180      1190      1200      1210      1220

           270       280             290       300       310
pF1KA1 KGSPGFPGMPGPPGQPGPRGSMGPM------GPSPDLSHIKQGRRGPVGPPGAPGRDGSK
       :: ::: :.::  : ::: :: ::       .:.:.    . :  :: :::: ::  : :
XP_016 KGEPGFHGFPGVQGPPGPPGSPGPALEGPKGNPGPQGPPGRPGLPGPEGPPGLPGNGGIK
       1230      1240      1250      1260      1270      1280

       320       330       340       350       360       370
pF1KA1 GERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHRTHSSAEEFPLPQEFPSY
       ::.: ::  : :: ::
XP_016 GEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPGLNGMKGDPGLPGVPGFPGMKGPSGVP
       1290      1300      1310      1320      1330      1340

406 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Wed Nov 2 22:21:20 2016 done: Wed Nov 2 22:21:21 2016
 Total Scan time: 8.300 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]