FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7777, 453 aa
  1>>>pF1KB7777 453 - 453 aa - 453 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.7934+/-0.000903; mu= 12.7226+/- 0.055
 mean_var=131.1914+/-25.471, 0's: 0 Z-trim(110.5): 15 B-trim: 47 in 1/50
 Lambda= 0.111975
 statistics sampled from 11620 (11627) to 11620 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.705), E-opt: 0.2 (0.357), width: 16
 Scan time: 3.140

The best scores are:                                      opt  bits E(32554)
CCDS5374.1 CDCA7L gene_id:55536|Hs108|chr7        ( 454) 3015 498.3 6.8e-141
CCDS47559.1 CDCA7L gene_id:55536|Hs108|chr7       ( 420) 2785 461.1 9.9e-130
CCDS47558.1 CDCA7L gene_id:55536|Hs108|chr7       ( 408) 2367 393.5 2e-109
CCDS2252.1 CDCA7 gene_id:83879|Hs108|chr2         ( 450)  832 145.6 9.8e-35
CCDS2253.1 CDCA7 gene_id:83879|Hs108|chr2         ( 371)  739 130.5 2.8e-30

>>CCDS5374.1 CDCA7L gene_id:55536|Hs108|chr7 (454 aa)
 initn: 2672 init1: 2672 opt: 3015 Z-score: 2642.9 bits: 498.3 E(32554): 6.8e-141
Smith-Waterman score: 3015; 99.8% identity (99.8% similar) in 454 aa overlap (1-453:1-454)

               10        20        30        40        50
pF1KB7 MELATRYQIPKEVADIFNAPSDDEEFVGFRDDVPMETLSSEESCDSFDSLESGKQ-DVRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::: ::::
CCDS53 MELATRYQIPKEVADIFNAPSDDEEFVGFRDDVPMETLSSEESCDSFDSLESGKQQDVRF
               10        20        30        40        50        60

      60        70        80        90       100       110
pF1KB7 HSKYFTEELRRIFIEDTDSETEDFAGFTQSDLNGKTNPEVMVVESDLSDDGKASLVSEEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 HSKYFTEELRRIFIEDTDSETEDFAGFTQSDLNGKTNPEVMVVESDLSDDGKASLVSEEE
               70        80        90       100       110       120

     120       130       140       150       160       170
pF1KB7 EDEEEDKATPRRSRSRRSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTIL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EDEEEDKATPRRSRSRRSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTIL
              130       140       150       160       170       180

     180       190       200       210       220       230
pF1KB7 ERKKDCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 ERKKDCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPD
              190       200       210       220       230       240

     240       250       260       270       280       290
pF1KB7 FFPVRTPTSASRKKTVRRAFSEGQITRRMNPTRSARPPEKFALENFTVSAAKFAEEFYSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 FFPVRTPTSASRKKTVRRAFSEGQITRRMNPTRSARPPEKFALENFTVSAAKFAEEFYSF
              250       260       270       280       290       300

     300       310       320       330       340       350
pF1KB7 RRRKTIGGKCREYRRRHRISSFRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQCRQKT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 RRRKTIGGKCREYRRRHRISSFRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQCRQKT
              310       320       330       340       350       360

     360       370       380       390       400       410
pF1KB7 IDTKTVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYCRKRDG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 IDTKTVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYCRKRDG
              370       380       390       400       410       420

     420       430       440       450
pF1KB7 RCATGILIHLAKFYGYDNVKEYLESLQKELVEDN
       ::::::::::::::::::::::::::::::::::
CCDS53 RCATGILIHLAKFYGYDNVKEYLESLQKELVEDN
              430       440       450

>>CCDS47559.1 CDCA7L gene_id:55536|Hs108|chr7 (420 aa)
 initn: 2672 init1: 2672 opt: 2785 Z-score: 2442.6 bits: 461.1 E(32554): 9.9e-130
Smith-Waterman score: 2785; 99.8% identity (99.8% similar) in 420 aa overlap (35-453:1-420)

           10        20        30        40        50         60
pF1KB7 TRYQIPKEVADIFNAPSDDEEFVGFRDDVPMETLSSEESCDSFDSLESGKQ-DVRFHSKY
                                     ::::::::::::::::::::: ::::::::
CCDS47                               METLSSEESCDSFDSLESGKQQDVRFHSKY
                                             10        20        30

            70        80        90       100       110       120
pF1KB7 FTEELRRIFIEDTDSETEDFAGFTQSDLNGKTNPEVMVVESDLSDDGKASLVSEEEEDEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 FTEELRRIFIEDTDSETEDFAGFTQSDLNGKTNPEVMVVESDLSDDGKASLVSEEEEDEE
               40        50        60        70        80        90

           130       140       150       160       170       180
pF1KB7 EDKATPRRSRSRRSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTILERKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 EDKATPRRSRSRRSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTILERKK
              100       110       120       130       140       150

           190       200       210       220       230       240
pF1KB7 DCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPDFFPV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPDFFPV
              160       170       180       190       200       210

           250       260       270       280       290       300
pF1KB7 RTPTSASRKKTVRRAFSEGQITRRMNPTRSARPPEKFALENFTVSAAKFAEEFYSFRRRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RTPTSASRKKTVRRAFSEGQITRRMNPTRSARPPEKFALENFTVSAAKFAEEFYSFRRRK
              220       230       240       250       260       270

           310       320       330       340       350       360
pF1KB7 TIGGKCREYRRRHRISSFRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQCRQKTIDTK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 TIGGKCREYRRRHRISSFRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQCRQKTIDTK
              280       290       300       310       320       330

           370       380       390       400       410       420
pF1KB7 TVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYCRKRDGRCAT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 TVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYCRKRDGRCAT
              340       350       360       370       380       390

           430       440       450
pF1KB7 GILIHLAKFYGYDNVKEYLESLQKELVEDN
       ::::::::::::::::::::::::::::::
CCDS47 GILIHLAKFYGYDNVKEYLESLQKELVEDN
              400       410       420

>>CCDS47558.1 CDCA7L gene_id:55536|Hs108|chr7 (408 aa)
 initn: 2367 init1: 2367 opt: 2367 Z-score: 2077.8 bits: 393.5 E(32554): 2e-109
Smith-Waterman score: 2631; 90.1% identity (90.1% similar) in 453 aa overlap (1-453:1-408)

               10        20        30        40        50        60
pF1KB7 MELATRYQIPKEVADIFNAPSDDEEFVGFRDDVPMETLSSEESCDSFDSLESGKQDVRFH
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MELATRYQIPKEVADIFNAPSDDEEFVGFRDDVPMETLSSEESCDSFDSLESGKQ-----
               10        20        30        40        50

               70        80        90       100       110       120
pF1KB7 SKYFTEELRRIFIEDTDSETEDFAGFTQSDLNGKTNPEVMVVESDLSDDGKASLVSEEEE
                                               ::::::::::::::::::::
CCDS47 ----------------------------------------VVESDLSDDGKASLVSEEEE
                                                  60        70

              130       140       150       160       170       180
pF1KB7 DEEEDKATPRRSRSRRSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTILE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DEEEDKATPRRSRSRRSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTILE
          80        90       100       110       120       130

              190       200       210       220       230       240
pF1KB7
RKKDCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPDF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 RKKDCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPDF 140 150 160 170 180 190 250 260 270 280 290 300 pF1KB7 FPVRTPTSASRKKTVRRAFSEGQITRRMNPTRSARPPEKFALENFTVSAAKFAEEFYSFR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 FPVRTPTSASRKKTVRRAFSEGQITRRMNPTRSARPPEKFALENFTVSAAKFAEEFYSFR 200 210 220 230 240 250 310 320 330 340 350 360 pF1KB7 RRKTIGGKCREYRRRHRISSFRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQCRQKTI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 RRKTIGGKCREYRRRHRISSFRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQCRQKTI 260 270 280 290 300 310 370 380 390 400 410 420 pF1KB7 DTKTVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYCRKRDGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 DTKTVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYCRKRDGR 320 330 340 350 360 370 430 440 450 pF1KB7 CATGILIHLAKFYGYDNVKEYLESLQKELVEDN ::::::::::::::::::::::::::::::::: CCDS47 CATGILIHLAKFYGYDNVKEYLESLQKELVEDN 380 390 400 >>CCDS2252.1 CDCA7 gene_id:83879|Hs108|chr2 (450 aa) initn: 983 init1: 728 opt: 832 Z-score: 737.1 bits: 145.6 E(32554): 9.8e-35 Smith-Waterman score: 1056; 43.3% identity (64.5% similar) in 453 aa overlap (33-449:26-446) 10 20 30 40 50 60 pF1KB7 LATRYQIPKEVADIFNAPSDDEEFVGFRDDVPMETLSS-EESCDSFDSLESGKQDVRFHS . ::: :: ..::::: : . .. .:.: CCDS22 MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFANTKPKFRS 10 20 30 40 50 70 80 90 100 pF1KB7 KYFTEELRRIFIEDTDSETEDFAGFTQSDLNG---------KTNPEVM-----VVESDLS ..::: .: ::.:.:. : ::..:... : :.: . ..: : CCDS22 D-ISEELANVFYEDSDNES--FCGFSESEVQDVLDHCGFLQKPRPDVTNELAGIFHAD-S 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB7 DDGKASLVSEEEEDEEEDKATPR---RSRSR-RSSIGLRVAFQFPTKKLANKPDKNSSSE :: . :: : .. . : :.::. : : ::::..::... . .:.. :. 
CCDS22 DDESFCGFSESEIQDGMRLQSVREGCRTRSQCRHSGPLRVAMKFPARSTRGATNKKAESR 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB7 QLFSSARLQNEKKTILERKKDCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKEN : :.:...:..::.::: . . : ::..:::.: CCDS22 Q-------------------------PSENSVTDSNSDSEDES--GMNFLEKRALNIKQN 180 190 200 230 240 250 260 270 280 pF1KB7 KAMLAQLLAELNSMPDFFPVRTPT--SASRKKTVRRAFSEGQITRRMNPTRSARPPEKF- :::::.:..::.:.: : : : : :... :: : .:: :: : ::: . CCDS22 KAMLAKLMSELESFPGSFRGRHPLPGSDSQSRRPRRRTFPGVASRR-NPERRARPLTRSR 210 220 230 240 250 260 290 300 310 320 pF1KB7 -----ALENFTVSAAKFAEEFYSFRRRKTIGGKCRE----YRRRHRISS-----FRPVED .:. . . . .... :.:::. : : :: : : .::::. CCDS22 SRILGSLDALPMEEEEEEDKYMLVRKRKTVDGYMNEDDLPRSRRSRSSVTLPHIIRPVEE 270 280 290 300 310 320 330 340 350 360 370 380 pF1KB7 ITEEDLENVAITVRDKIYDKVLGNTCHQCRQKTIDTKTVCRNQGCCGVRGQFCGPCLRNR ::::.:::: . :.:::.. ::.:::::::::::::: ::: : :::::::::::::: CCDS22 ITEEELENVCSNSREKIYNRSLGSTCHQCRQKTIDTKTNCRNPDCWGVRGQFCGPCLRNR 330 340 350 360 370 380 390 400 410 420 430 440 pF1KB7 YGEDVRSALLDPDWVCPPCRGICNCSYCRKRDGRCATGILIHLAKFYGYDNVKEYLESLQ :::.::.:::::.: :::::::::::.::.::::::::.:..:::..:. ::. ::.::. CCDS22 YGEEVRDALLDPNWHCPPCRGICNCSFCRQRDGRCATGVLVYLAKYHGFGNVHAYLKSLK 390 400 410 420 430 440 450 pF1KB7 KELVEDN .:. CCDS22 QEFEMQA 450 >>CCDS2253.1 CDCA7 gene_id:83879|Hs108|chr2 (371 aa) initn: 937 init1: 728 opt: 739 Z-score: 657.0 bits: 130.5 E(32554): 2.8e-30 Smith-Waterman score: 957; 45.5% identity (67.4% similar) in 365 aa overlap (104-449:31-367) 80 90 100 110 120 130 pF1KB7 EDTDSETEDFAGFTQSDLNGKTNPEVMVVESDLSDDGKASLVSEEEEDEEEDKATPR-RS :. :::. :..:.. . . ... :. CCDS22 MDARRVPQKDLRVKKNLKKFRYVKLISMETSSSSDDSCDSFASDNFANTRLQSVREGCRT 10 20 30 40 50 60 140 150 160 170 180 190 pF1KB7 RSR-RSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTILERKKDCRQVIQR ::. : : ::::..::... . .:.. 
:.: CCDS22 RSQCRHSGPLRVAMKFPARSTRGATNKKAESRQ-------------------------PS 70 80 90 200 210 220 230 240 pF1KB7 EDSTSESEDDSRDESQESSDALLKRTMNIKENKAMLAQLLAELNSMPDFFPVRTPT--SA :.:...:..::.::: . . : ::..:::.::::::.:..::.:.: : : : : CCDS22 ENSVTDSNSDSEDES--GMNFLEKRALNIKQNKAMLAKLMSELESFPGSFRGRHPLPGSD 100 110 120 130 140 150 250 260 270 280 290 300 pF1KB7 SRKKTVRRAFSEGQITRRMNPTRSARPPEKF------ALENFTVSAAKFAEEFYSFRRRK :... :: : .:: :: : ::: . .:. . . . .... :.:: CCDS22 SQSRRPRRRTFPGVASRR-NPERRARPLTRSRSRILGSLDALPMEEEEEEDKYMLVRKRK 160 170 180 190 200 210 310 320 330 340 350 pF1KB7 TIGGKCRE----YRRRHRISS-----FRPVEDITEEDLENVAITVRDKIYDKVLGNTCHQ :. : : :: : : .::::.::::.:::: . :.:::.. ::.:::: CCDS22 TVDGYMNEDDLPRSRRSRSSVTLPHIIRPVEEITEEELENVCSNSREKIYNRSLGSTCHQ 220 230 240 250 260 270 360 370 380 390 400 410 pF1KB7 CRQKTIDTKTVCRNQGCCGVRGQFCGPCLRNRYGEDVRSALLDPDWVCPPCRGICNCSYC :::::::::: ::: : :::::::::::::::::.::.:::::.: :::::::::::.: CCDS22 CRQKTIDTKTNCRNPDCWGVRGQFCGPCLRNRYGEEVRDALLDPNWHCPPCRGICNCSFC 280 290 300 310 320 330 420 430 440 450 pF1KB7 RKRDGRCATGILIHLAKFYGYDNVKEYLESLQKELVEDN :.::::::::.:..:::..:. ::. ::.::..:. CCDS22 RQRDGRCATGVLVYLAKYHGFGNVHAYLKSLKQEFEMQA 340 350 360 370 453 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 22:15:55 2016 done: Fri Nov 4 22:15:56 2016 Total Scan time: 3.140 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]