Casbene Synthase Gene Cloned from Euphorbia fischeriana Steud

Casbene synthase may play an important role in the biosynthesis of tigliane type terpenoid. Casbene synthase gene, named as EfCS (GenBank accession number JN862821), was cloned from the root of Euphorbia fischeriana Steud. using RACE-PCR method. The fragment was 1 969 bp in length and contained an open reading frame coding a polypeptide of 602 amino acids, with a typical structure of terpenoid synthase gene. With biochemical analysis of the amino acid sequence coding by CS gene, the results indicated that the theoretical isoelectric point of CS was 5.36 and the molecular weight was 69.364 8 kD. By hydrophobic analysis, we found the overall performance of CS was hydrophilic. The lead peptide analysis revealed that the peptide had a chloroplast guide sequence. Secondary structure prediction showed that this CS protein was composed with α-helix, extended chain, β-turn and random coil, which accounted for 64.95%, 6.64%, 3.16% and 25.25%, respectively. Protein tertiary structure prediction showed that EfCS, the casbene synthase obtained in this study, belonged to class I terpene synthase.


Background
In Euphorbiaceae plants, there are a wide variety of terpenoids, ranging from monoterpene, sesquiterpene, diterpene to triterpene. Many of these compounds have been investigated for their toxicity or their potential therapeutic activity (Cox, 1993;Barlow et al., 2005;Cragg and Newman, 2005;Li et al., 2009). Prostratin (12-deoxyphorbal-13-acetate), which belongs to tigliane type diterpene, is a kind of phorbol esters. It was isolated by American National Cancer Institute (NCI) in 1992, and was confirmed with good resistance to AIDS virus (Gustafson et al., 1992). Therefore, tigliane biosynthesis pathway is a hot research topic in this field in recent year.
Casbene synthase (CS; EC 4.6.1.7), the key enzyme in the pathway of tigliane type diterpene biosynthesis, can produce intermediate product cembrane by catalyzed geranylgeranyl diphosphate (GGPP). Cembrane is then converted into casbene. Casbene is converted into tigliane through a series of electron transfer (Schmidt, 1987). Some CS genes were cloned from plants, such as Ricinus communis, Triadica sebifera, Euphorbia esula, Euphorbia resinifera and Homalanthus nutans (Kirby et al., 2010). Genome sequencing of Jatropha curcas was completed by S. Sato in 2011 and 9 CS genes were found in the J. curcas genome, one of the 9 CS genes was regarded as a pseudogene and the amino acid sequences of 6 CS genes were published from species (Sato et al., 2011). As far as we known, Shanghai Institute of Materia Medica had isolated prostratin from the root of E. fischeriana Steud (Liu et al., 1996;Ma et al., 1997;Wang et al., 2006), but the CS gene has not been cloned from this species.
In this study, we cloned a casbene synthase gene from E. fischeriana with RACE-PCR technique and analyzed this enzyme using bioinformatics methods.
1 Results and Analysis 1.1 CS gene fragment from E. fischeriana

Cloning and analysis of the full-length CS gene of E. fischeriana
The total RNA were isolated from the tuberous root of E. fischeriana and synthesized to cDNA. With the cDNA as template and a pair of specific primers RP1 and UPM, we carried out the 3'RACE PCR amplification. A fragment of CS about 1 201 bp in length was obtained ( Figure 1). We also used specific primers LP1 and UPM for the first run 5'RACE PCR and then used its PCR product as the template with primers LP2 and NUP for the next run for the nested PCR, a 803 bp in length of PCR product was obtained ( Figure 1). According to the sequence of these two fragments, we designed full-length primers (EfCS1-1 and EfCS2-1) for ORF amplification and obtained a 1 809 bp fragment after sequencing. With the sequences of fragments that we got by RT-PCR and RACE-PCR, after combining, the fulllength sequence of CS gene 1 969 bp was obtained, named as EfCS (GenBank accession No. JN862821). ORF Finder analysis (NCBI ORF finder website) indicated that the EfCS full-length sequence contained a 1 809 bp open reading frame, a 38 bp 5' untranslated region and a 122 base pair 3' untranslated region, encoding a protein with 602 amino acids.

Bioinformatics analysis
Using ProtParam online tools (Gasteiger et al., 2003), we analyzed the amino acid sequence coding by CS gene. We found the theoretical isoelectric point of CS was 5.36 and the molecular weight was 69.364 8 kD. ProtScal (Kyce and Doolittle, 1982) was common used to analyze protein hydrophobicity. With setting parameters as default, the ProtScal analysis results showed that the maximum value of hydrophobicity was 2.978 (V 347 ), and the minimum value was -3.067 (N 273 ).The overall performance of EfCS is hydrophobicity. Choosing online tool TargetP 1.1 Server (Emanuelsson et al., 2000) in plant version and with default setting parameters, we analyzed the EfCS amino acid sequence, and the result showed that the protein had chloroplasts guide peptide and the reliable level was Ⅱ. Predicting the second structure of CS by SOPMA (Geourjon and Deleage, 1995) suggested that the protein was mainly composed by 25 α-helices with 391 amino acids, 13 extended chains with 40 amino acids, 10 β-turns with 40 amino acids and 28 random coils with 192 amino acids, which accounted for 64.95%, 6.64%, 3.16% and 25.25% respectively. Using SMART (Schultz et al., 1998) to analyze the protein, we found that this gene had two function domains Terpene_synth and Terpene_synth_C which were specific to terpene synthase, locating at the 74 to 249 amino acids and the 279 to 547 amino acids respectively. Using an automated homology modeling program of ESyPred3D Web Server 1.0 (Lambert et al., 2002) and setting parameters as neural net and new screening, tertiary structure prediction was performed. As shown in figure 2, we got tertiary structure of CS protein. The result showed that our EfCS belonged to class Ⅰ terpene synthase (α domain, blue) (Cao et al., 2010), with two metal ion binding regions DDXXD and (N,D)DXX(S,T)XXXE (red and orange, respectively) (Köksal et al., 2011). At the same time, there was a small retrogressive region (ß domain, green), that always played a role in class Ⅱ terpene synthase (Wendt et al., 1997). The purple part was the first α-helix, which worked in class Ⅰ terpene synthase to cap the active center of enzyme (Köksal et al., 2011).

Discussion
Casbene synthase is a key enzyme in the tigliane biosynthesis pathway in all probability. In this report, CS gene was isolated from E. fischeriana using RACE-PCR. By bioinformatics analysis, we found CS was an acidic protein, with theoretical isoelectric point of 5.36, and the overall performance of CS was hydrophilic. These results would provide useful information for further study of CS. Because of the substrate for CS mainly locate in chloroplastid, the lead peptide for transporting to chloroplastid is a key domain for its appropriate function (Kirby et al., 2010). The prediction showed that CS had chloroplasts lead peptide, supported by the result of ChloroP 1.1 analysis. Secondary structure and tertiary structure prediction showed that EfCS belonged to class Ⅰ terpene synthase, providing evidence for testing its activity and analyzing its function in vitro.
There are many compounds have been isolated from E. fischeriana, a lot of them have insecticide, antiseptic and anticancer activity (Liu et al., 1988;Liu et al., 2000;Zhao et al.,2000;Ma et al., 2005, China Science and Technology Information, 12: 65,89). Prostratin had been isolated from the root of E. fischeriana by Shanghai Institute of Materia Medica, however, the prostratin yield of the natural plant is not enough for its commercial production. Therefore, as a candidate of key enzyme in the pathway of prostratin biosynthesis, isolation of casbene synthase gene is an important step towards improving the yield of prostratin in E. fischeriana.

Plant material
The tuberous root of E. fischeriana was collected from Jiagedaqi in Heilongjiang Province. DH5α cells (Escherichia coli) were bought from TAKARA Inc..

Extraction of total RNA and synthesis of cDNA
Following the manual instructions, total RNA was extracted by RNeasy Mini Kit (QIAGEN, Germany). Using SMARTer™ RACE cDNA Amplification Kit (Clontech, Dalian, China), we synthesized 5' and 3' RACE Ready cDNA according to the protocol provided by Clontech.

Sequence analysis of target gene
The target gene sequence was analyzed by online bioinformatics soft wares (Table 1). The analysis of CS open reading frame was worked out by NCBI-ORF Finder. The composition and physicochemical property of amino acids sequence was analyzed by ProtParam online tools. The prediction of lead peptide of protein was completed by TargetP 1.1 and ChloroP 1.1. Second structure and function domain were predicted by SMART and SOPMA, respectively. Protein tertiary structure prediction was completed by ESyPred3D online tool.