1 Introduction
Soybeans contain two major kinds of seed-storage proteins, β-conglycinin (7S proteins) and glycinins (11S proteins), of which the β-conglycinin accumulates up to 30% of the total seed proteins (Beachy et al., 1981). The β-conglycinin consists of three subunits, a-,α’- and β-subunits. Analysis of the promoter activities of these subunits indicated that the promoter of a’-subunit possesses more seed-specific activities than the others and it contains all of the motifs required for seed-specific expression in the upstream region of this α’subunit gene promoter (Beachy et al., 1985). Cahoon, et al. (1999, 2000, 2001) successfully expressed the fatty acid desaturase gene in soybean somatic embryos by means of this a’-subunit gene promoter. Therefore, it is a feasible way to control the interested genes in the transgenic plants temporally and specially with seed-specific promoter, and it is an important breakthrough and improvement in plant genetic engineering technology.
Except for the report by Yoshino, et al. (2001), in which the 489 bp fragment of the a-subunit gene promoter was merely cloned, there is no report about the function of α-subunit gene promoter nowadays. In order to further explore the function of the α-subunit gene promoter of the soybean β-conglycinin and develop a novel seed-specific promoter, the 489 bp fragment of the α-subunit gene promoter was firstly cloned by PCR method and successfully extended by Thermal Asymmetric Interlaced PCR (TAIL PCR) to gain 666 bp fragment, and then the sequence, structure and function of this fragment were analyzed in this study.
2 Materials and methods
2.1 Materials
2.1.1 Experimental materials
The seeds of the Soybean Jilin 43S were kindly provided by the Academy of Agricultural Sciences in Jilin province. The seeds of Arabidopsis thaliana were kindly provided by Prof. Wang Ningning from Nankai University. Escherichia coli DH5a, Agrobacterium LBA4404, and plant expression vector pBI121 were available in our labs. The sequencing vector pGEM-T was purchased from Promega Corporation.
2.1.2 Experimental reagents
DNA Restriction enzymes, T4 NDA ligase, Taq DNA polymerase, and DNA gel extraction kit were purchased from Takara Bio Inc, where the DNA sequencing was also accomplished. Other chemicals were commercially available at home and abroad. The PCR primers were synthesized in Shanghai Sangon Biotechnology Co. Ltd. PCR reactions were performed at Tgradient Thermoblock (Biometra). Fluorometrical analysis was accomplished with a Spectrofluorophotometer RF-540 (Shimadzu Corporation).
2.2 Methods
2.2.1 Extraction of soybean total DNA
Total DNA of the seeds of the Soybean Jilin 43 was extracted by SDS-CTAB method with some improvement (Liu et al., 1997). Its purity and concentration of 5 µL of the DNA samples were analyzed by gel electrophoresis.
2.2.2 PCR amplification
(1) Amplification of BCSP489 fragment The primer P1 and P2 were designed and synthesized according to the sequence of the α-subunit gene promoter (Yoshino et al., 2001), and the restriction enzyme site HindIII and XbaI were added to 5’-end of Primer P1 and P2 for further cloning,
Primer P1: 5’-GCG AAG CTT (HindIII ) AAG CAA CCA TAT CAG CAT ATC-3’, Primer P2: 5’-GAA TCT AGA(XbaI) AAC CGC GCT CTC ATC ATA GTA TAT-3’. And then the total DNA of Soybean Jilin 43 was used as template, PCR reaction was conducted in the following conditions, 97°C - 5 min for pre-denature, followed by 35 cycles of repeating 94°C - 30 sec, 52°C - 1 min, 72°C - 2 min, and then 72°C - 10 min for post-extension. The PCR products were extracted by a DNA gel extraction kit and ligated to pGEM-T, to obtain pT-BCSP489 and sequence it.
(2) TAIL PCR extension Three nested primers SP1, SP2, SP3 ( SP1: 5’-GTT GCG CAT GCA TGA TCC AAG AGA-3’; SP2: 5’-CTT GGA CAT TGC TTT CGA AAG GAT A-3’; SP3: 5’-TGA AGT GGG GTG AGG TTG CAT T-3’) and three arbitrary degenerate primers AD1, AD2, and AD3 (AD1: 5’-NTCGA(G/C)T(A/T)T(G/C)G(A/T) GTT-3’; AD2: 5’-NGTCGA(G/C)(A/T)GANA(A/T)GAA-3’; AD3: 5’-(A/T)GTGNAG(A/T)ANCA NAGA-3’) were synthesized respectively according to the sequence of BCSP489 obtained by Liu et al. (1995). TAIL PCR was conducted with the above primers and PCR conditions were referred to the report of Liu et al. (1995) with some alterations. The composition of TAIL PCR reaction is shown in Table 1 and the procedure of TAIL PCR is shown in Table 2. TAIL PCR products (named TAIL1, approximately 200 bp) were obtained and sequenced.
(3) Amplification of BCSP666 fragment The upstream primer P3, 5’-GCG AAG CTT ( HindIII) CAA AAA CGC AAT CAC ACA CA-3’ was synthesized according to the sequence of TAIL1, and then PCR products BCSP666 were obtained with the primer P3, P2 and the total DNA of Soybean Jilin 43 was used as template. The PCR conditions were as follows, 97°C - 5 min for pre-denature, followed by 35 cycles of repeating 94°C -30 secs, 52°C - 1 min, 72°C - 2 min, then 72°C - 10 min for post-extension. The PCR products BCSP666 were extracted by DNA gel extraction kit, ligated to sequencing vector pGEM-T to generate pT-BCSP666, and then sequence it.
2.2.3 Construction of the seed-specific expression vector with BCSP666 and transformation of Arabidopsis thaliana plant
The plasmid pT-BCSP666 and plant expression vector pBI121 were double-digested with
HindIII and
XbaI, respectively and ligated with each other so as to construct vector pBI121-666 (
Fig. 6). Then the vector pBI121-666 was transformed into
Agrobaterium LBA4404 by means of the freeze-and-thaw method (Hofen and Willmitzer, 1988). The positive recombined
Agrobaterium LBA4404 was selected to transform
Arabidopssi thaliana by Agrobacterium-mediated floral-dip method (Clough and Bent, 1998) and the positive plant strains were selected on the 1/2 MS medium containing 50 mg/L kanamycin. The positive
Arabidopsis thaliana plant strains were further confirmed by Southern Blot Analysis with the help of PCR products of the GUS gene as a probe. The amplification of GUS gene primers were P
gus1, 5’-GAA GAG GAT CCC CGG GTG GT-3’, and P
gus2, 5’-ACA GAG CTC GAT GGT GCG CCA GGA GAG TTG-3’.
2.2.4 Fluorescence analysis of the GUS protein in transgenic Arabidopsis thaliana plants
Fluorescence and histochemistry analyses of the GUS protein were referred to the work of Jefferson (1987). For the fluorescence analysis, GUS proteins were extracted from 100 mg leaves and seeds of the transgenic Arabidopsis thaliana plants, and same amount samples of the wild-type Arabidopsis thaliana strains as the negative controls respectively. Fluorescence activities of these samples were analyzed at Spectrofluorophotometer RF-540. For the histochemical analysis, the seeds of the transgenic Arabidopsis thaliana plants and wild-type plants were harvested respectively, and dipped in X-Gluc solution [ 0.1 mol/L phosphate buffer containing 1 mmol/L X-Gluc (5-bromo-4-chloro-3-indolyl glucuronide) ] at 37°C for a whole night. Then the samples were destained with 75% ethanol at 37°C for 4-6 h, and then observed under optical microscopes.
3 Results
3.1 Cloning of seed-specific promoter fragment BCSP666
The promoter fragment BCSP489 was obtained by PCR amplification via primers P1 and P2, and the result is shown in
Fig. 1. This fragment was ligated to pGEM-T to obtain pT-BCSP489 and sequence it.
According to the sequence of the BCSP489, the primers, required for TAIL PCR, were synthesized and the BCSP489 was extended by TAIL PCR, so that the fragment TAIL1 was obtained and then sequenced, as shown in
Fig. 2.
The upstream primer P3 was synthesized according to the sequence of TAIL1 (217 bp), and the PCR fragment BCSP666 was obtained by primer P2, P3 via the total DNA of Soybean Jilin 43 as the PCR template. The result is shown in
Fig. 3. Then the PCR fragment BCSP666 was ligated to sequencing vector pGEM-T, to get pT-BCSP666 obtained and sequenced.
3.2 Sequencing analysis of the seed-specific promoter fragment BCSP666
Sequencing analysis of the seed-specific promoter fragment BCSP666 indicated that this fragment contains 666 pairs of bases and several kinds of seed-specific promoter motifs as shown in
Fig. 4 and
Fig. 5. The motifs are as follows: (1) A/T rich motif, which is an important region for binding of transcriptional factors and required for seed-specific expression (Stalberg et al., 1996); (2) RY repeat element (or Legeumin box), which is also a sequence motif required for seed-specific expression (Chamberland et al., 1992); (3) AGCCCA motif, which is a sequence motif required for transcriptional regulation and seed-specific expression (Chen et al., 1986);
(4) TACACAT motif, which allows one base to mismatch and is responsible for activating seed-storage protein expression (Josefsson et al., 1987); (5) ACGT motif, which is also a sequence motif required for seed-specific expression (Vincentz et al., 1997); (6) E-box, which is co operated with other seed-specific promoter motifs to activate the seed-specific promoter for heterologous expression (Kawagoe and Murai, 1992). Although the sequence alignment between BCSP666 and β-conglycinin a’-subunit gene promoter was very low (40%), both of them exhibited similar sequence structure and similar amount of seed-specific promoter motifs, such as multi-copy of A/T rich motif, RY repeat element, AGCCCA motif and TACACAT motif as shown in
Fig. 4 and
Fig. 5. Based on the above analysis, it indicates that both of the promoter fragment BCSP666 and β-conglycinin a’-subunit gene promoter possess similar promoter activity. Therefore, the promoter fragment BCSP666 may promote the seed-specific expression of heterologous gene.
3.3 Construction of the seed-specific expression vector and transformation of Arabidopsis thaliana plants
The procedure to construct the seed-specific expression vector is shown in
Fig. 6. It proved that the seed-specific expression vector pBI121-666 was confirmed by double digestion via
HindIII and
XbaI, and that the promoter fragment BCSP666 inserted in front of the GUS gene of the plant expression vector pBI121 and substituted the original CaMV 35S promoter, as shown in
Fig. 7. The vector pBI121-666 was transformed into
Agrobacterium LBA4404 by the freeze-and-thaw method, further transformed into
Arabidopsis thaliana plants via
Agrobacterium-mediated fluoral-dip method and approximately 2 000 seeds of
Arabidopsis thaliana were harvested. The seeds were planted on the MS medium containing kanamycin (50 mg/L) and cultivated for about 15 days, and 20 transgenic plant strains were harvested. Then they were transplanted into a flowerpot to harvest their seeds. The Southern Blot Analysis of the total DNA from the transgenic
Arabidopsis thaliana plants indicated (
Fig. 8) that the GUS gene was inserted as a single copy into the genomic DNA of the transgenic
Arabidopsis thaliana plants.
The fluorescence analysis of leaves and seeds from the transgenic
Arabidopsis thaliana plants and wild-type strains indicated that fluorescence activities of the samples from the transgenic
Arabidopsis thaliana plants were higher than those of the wild-type strains (
Fig. 9), while the histochemistry analysis of the seeds from the transgenic
Arabidopsis thaliana plants and wild-type strains further confirmed the above mentioned result, as shown in
Fig. 10. Therefore, it is concluded that the GUS gene in the transgenic
Arabidopsis thaliana plants was expressed in a seed-specific manner under the control of the seed-specific promoter fragment BCSP666.
4 Discussion
The β-conglycinin α-subunit gene promoter BCSP666 was cloned from the total DNA of Soybean Jilin 43 in this study. Sequencing analysis indicated that the fragment BCSP666 contains several kinds of multi-copy of seed-specific promoter motifs, such as 5 copies of RY repeat motifs, 4 copies of AGCCCA motifs and TACACAT motifs, two copies of ACGT motifs and E-box, and one copy of A/T rich motif. These kinds of multi-copy of seed-specific promoter motifs in fragment BCSP666 may be responsible for its seed-specific promoter activity. It is interesting to find that there are two copies of TATA box and CAAT box in the promoter fragment BCSP666, as shown in
Fig. 4. And they seldom happen in other promoter sequences and their functions are still under study.
TAIL PCR protocol was successfully applied to extend the promoter fragment BCSP489 in this study. Although the extended fragment was not long enough, it provided us with a simple and convenient way to clone the unknown sequence adjacent to a known sequence fragment. And it is especially useful for cloning the unknown promoter fragment because it is laborious and consumptive to clone promoters via genomic library construction, or linker PCR (Siebert et al., 1995), or inverse PCR (Ochman et al., 1988). For TAIL PCR, it just requires to synthesize three gene-specific primers and 3 to 10 arbitrary degenerated primers, and the final results can be achieved in 6-8 h. Therefore, TAIL PCR possesses the nice cloning characters of convenience, quickness, efficiency, and specificity. The extended fragment of this study was only 217 bp. The possible reason was the quantity of the arbitrary degenerate primers (only three primers), and it was improved by increasing the quantity of the arbitrary degenerate primers. According to some related reports (Liu et al., 1998), the products of the TAIL PCR can be reached to 0.2-2 kb. Therefore, it comes to the conclusion that TAIL PCR is an effective and convenient protocol to clone unknown promoter fragments.