AJP - Regu Fuel your research with LabChart
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Am J Physiol Regul Integr Comp Physiol 290: R634-R641, 2006. First published October 13, 2005; doi:10.1152/ajpregu.00596.2005
0363-6119/06 $8.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
290/3/R634    most recent
00596.2005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via ISI Web of Science (3)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Zhou, L.
Right arrow Articles by Irwin, D. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zhou, L.
Right arrow Articles by Irwin, D. M.

APPETITE, OBESITY, DIGESTION, AND METABOLISM

Intron 1 sequences are required for pancreatic expression of the human proglucagon gene

Li Zhou, Min Nian, Jun Gu, and David M. Irwin

Departments of Laboratory Medicine and Pathobiology and the Banting and Best Diabetes Centre, University of Toronto, Toronto, Ontario, Canada

Submitted 16 August 2005 ; accepted in final form 11 October 2005


    ABSTRACT
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
The mammalian proglucagon gene is expressed in pancreatic islet A-cells, intestinal L-cells, and select neurons of the brain, where posttranslational processing results in the liberation of a unique profile of peptides. Despite the importance of proglucagon-derived peptides in human biology, little is known about the regulation of the human gene, as the rat gene has been the preferred model for understanding the regulation of proglucagon gene expression. Previously, we have shown that although the immediate promoter region of the rat proglucagon gene is sufficient for expression in pancreatic islet cells, the homologous human proglucagon promoter sequences are not sufficient. We have now used a comparative genomic approach to identify noncoding sequences near the human proglucagon gene that are conserved among mammals, and thus potentially are regulatory sequences. Our alignments identified three evolutionarily conserved noncoding regions (ECR), one is the immediate promoter region (ECR1), the second is about 5 kb 5' to the mRNA start site (ECR2), and the third is near the 3' end of the first intron (ECR3). Our in vitro transient transfection assays with reporter gene constructs that include the human ECR3 support expression in rodent islet cell lines. Complementary studies with transgenic mice possessing a reporter gene regulated by a human proglucagon gene promoter-intron 1 (including ECR3) sequences express the reporter gene in the pancreas, as well as the intestine and selected neurons. These studies suggest that conserved sequences within intron 1 of the human proglucagon gene are important for expression in the pancreas.

glucagon; gene expression; transgenic mice; diabetes


THE HUMAN PROGLUCAGON GENE is expressed in A-cells of the pancreatic islets, L-cells of the small and large intestine, and selected neurons of the brain (4, 15, 16). Tissue-specific posttranslational processing of proglucagon results in the production of glucagon in the pancreas and two glucagon-like peptides, GLP-1 and GLP-2, in the intestine (15, 27). The proglucagon-derived peptide hormones have essential and diverse roles in human physiology. Glucagon regulates carbohydrate, lipid, and amino acid metabolism and acts as a counter-regulatory hormone to insulin in regulating blood glucose levels (14). GLP-1 is an incretin hormone that potentiates insulin secretion from the pancreas and has additional roles in gastric emptying and feeding behavior (1, 2, 3). GLP-1 is currently under development as a potential therapy for the treatment of diabetes (7, 20). GLP-2 is a key regulator of intestinal epithelial growth and function (1, 2, 3). The biological functions of additional peptides produced from proglucagon, e.g., oxyntomodulin and glicentin, are less well established.

Extensive knowledge has been gained on the sequences and DNA-binding proteins that are required for expression of the rat proglucagon gene (4, 15, 16). Approximately 300 bases of rat proglucagon 5' flanking sequence have been found to be sufficient to support expression of reporter genes in glucagon gene-expressing rodent islet cell lines (5). Within the immediate promoter of the rat proglucagon gene, four enhancer elements (designated G2 through G5) and an A-cell-specific promoter element (G1), as well as a large number of DNA-binding proteins have been identified and characterized (4, 15, 16). In contrast, much less is known about the regulation of the human proglucagon gene. Despite extensive sequence similarity of the immediate promoter region, short human proglucagon promoter-reporter gene constructs fail to generate significant reporter gene activity in rodent islet cell lines (21). It was concluded that divergent mechanisms are used for tissue-specific regulation of the human and rat proglucagon genes, as more than 3,300 bases of human proglucagon 5'-flanking sequences were required for expression in rodent islet cell lines (21). Similar conclusions were also drawn from observations with transgenic mice. Although 1,300 bases of rat proglucagon gene 5' flanking sequences allowed expression of a transgene in islets of mice (6), 1,600 bases of human proglucagon 5' flanking sequence did not generate expression in islets of mice, yet this human proglucagon promoter construct did support expression in two other proglucagon gene expressing tissues, the intestine, and selected neurons of the brain (21).

To help identify which lineage, either the primate lineage leading to human, or rodent lineage leading to rat, produces the pancreatic islet cell transcriptional activity change, proglucagon gene promoters from the chicken and diverse mammals have been characterized (Tsai B, Yue S, and Irwin DM, unpublished observations, Ref. 30). The chicken proglucagon gene promoter was found to have extremely limited sequence similarity to mammalian promoter, yet it could support reporter gene expression in rodent islet cell lines (30). This work suggested that significant changes in the sequence of the proglucagon promoter could occur yet retain promoter activity. A survey of proglucagon promoters from diverse mammals that included two primates (human and rhesus monkey), three rodents (rat, mouse, and hamster), a carnivore (dog), and an artiodactyl (cow) found that only the primate proglucagon promoters failed to support reporter gene expression in transient-transfection studies with rodent islet cell lines (Tsai B, Yue S, and Irwin DM, unpublished observations). These two studies strongly suggest that promoter sequence changes have occurred on the lineage leading to primates that have yielded an immediate promoter that fails to support expression in rodent islet cell lines. It was concluded that additional sequences, either further 5' or 3' to the mRNA start site, are necessary for the regulation of expression of the human proglucagon gene (Tsai B, Yue S, and Irwin DM, unpublished observations) (21, 30).

In this study, we have compared proglucagon gene sequences from diverse mammals and the chicken to identify evolutionarily conserved noncoding regions (ECRs) that potentially regulate expression of the human proglucagon gene. ECRs are sequences that do not code for protein yet have changed little through evolution, presumably because they have been maintained because they have a function (8, 23), potentially a regulatory function. Our genomic comparisons have identified three ECRs: the immediate promoter region, a second ~5-kb upstream of the proglucagon promoter, and a third within intron 1. We have tested the transcription activity of these ECR sequences to determine whether they support gene expression in islet cells. We found that the ECR sequence present in intron 1 supports expression of the proglucagon gene in rodent islet cells, both in vitro and in vivo.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Reagents. Reagents and chemicals were purchased from BioShop Canada (Burlington, ON), Caledon (Georgetown, ON), Difco (Detroit, MI), AP Biotech (Baie d'Urfe, PQ), Sigma (St. Louis, MO), and Invitrogen (Burlington, ON). Restriction endonucleases and DNA-modifying enzymes were purchased from New England Biolabs (Mississauga, ON). Taq DNA polymerase, and deoxynucleotide triphosphates were obtained from Roche (Laval, PQ). Turbo Pfu was purchased from Stratagene (La Jolla, CA). Oligonucleotide primers and DNA sequencing were from ACGT (Toronto, ON). pGL3 expression vectors were purchased from Promega (Madison, WI).

Bioinformatic analysis. Genomic sequences that encoded proglucagon genes were downloaded from the ensemble Web site (http://www.ensemble.org). Genome databases for human (NCBI35), mouse (NCBIm33), rat (RGSC3.4), dog (BROADD1), cow (Btau1.0), opossum (BROAD0.5), and chicken (WASUC1), were searched for proglucagon genes. The cow and opossum gene databases had not been annotated; thus proglucagon gene sequences were identified through BLAST searches. The cow proglucagon genomic sequence was identified using the BLASTN algorithm to find genomic sequences identical to the cow proglucagon cDNA sequence. The opossum proglucagon gene sequence was identified using the TBLASTN algorithm to find genomic sequences that had similarity to mammalian proglucagon amino acid sequences. Proglucagon genomic sequences were downloaded and further characterized using MacDNASIS version 3.7 DNA and protein sequence analysis software (Hitachi, San Bruno, CA) using methods we have previously described (12, 13, 32). Multiple sequence alignments of the proglucagon genomic sequences were generated using the program MultiPipMaker (25, 26).

Reporter gene plasmids. Genomic fragments containing putative promoter and intron 1 regions of the human proglucagon gene were amplified by PCR using primers –602f and 3071r (see Table 1) from a P1 genomic clone that has previous been characterized (21) and cloned into pCR2.1 TA cloning vector (Invitrogen). Smaller promoter and intron fragments were amplified from this promoter-intron clone using the primers listed in Table 1, were released from the TA vector by digestion with appropriate restriction endonucleases (see Table 1), and cloned into pGL3 basic, pGL3 promoter, or modified pGL3 luciferase reporter gene vectors (Promega). Large genomic fragments (>1 kb) were amplified using Turbo Pfu (Stratagene), while smaller fragments were amplified with Taq (Roche, Laval PQ). The 602-base and minimal 82-base human promoters were amplified using the primers –602f and 58r and –82f and 58r, respectively. Intron 1 sequences were amplified using the primers 98f and 3063r. A series of 500-base intron fragments were amplified using the following pairs of primers: 98f and 552r, 553f and 1018r, 1018f and 1454r, 1455f and 1920r, 1921f and 2378r, 2379f and 2862r, and 2862f and 3063r. The DNA sequence of all promoter-reporter constructs was verified.


View this table:
[in this window]
[in a new window]
 
Table 1. Primers for construction and characterization of reporter genes

 
Cell culture and transfections. The hamster islet cell line InR1-G9, mouse islet cell line {alpha}TC-1, and baby hamster fibroblasts (BHK cells) were grown in DMEM (4.5 g glucose/l) supplemented with 5% fetal calf serum and 1% penicillin and streptomycin. Luciferase reporter gene constructs were transfected into the cell lines using lipofectamine (Invitrogen). Controls for transfections included the promoterless pGL3 basic, the positive transfection pGL3 control vector (containing the SV40 promoter and enhancer; Promega), and a construct containing 312 bases of promoter and 58 bases of exon 1 of the rat proglucagon gene cloned into pGL3 Basic. Reporter gene construct activity was measured by detecting luciferase activity using a Lumat LB 9501 luminometer (EG&G Berthold, Wellesley, MA).

Data analysis. Results are presented as means ± SE from at least three independent experiments. All luciferase assays were performed in triplicate. Data were analyzed by ANOVA followed by Tukey's test or paired Student's t-tests using the computer software InStat 2.0 (GraphPad Software, San Diego, CA). Differences were considered statistically significant at P < 0.05.

Generation of transgenic mice. The 5.7 glucagon-growth hormone (GLU-GH) transgene contained a 5.8-kb genomic fragment that included 5775 bp of 5' flanking sequence and 96 bp of exon 1 and was ligated upstream of the human growth hormone (hGH) reporter gene. The 3.7 GLUin1-betaGal transgene was a 3.7-kb genomic fragment containing 602 bases of 5' flanking sequence, all of exon 1 and intron 1, and 8 bases of untranslated region of exon 2 of the human proglucagon gene were ligated upstream of the beta-galactosidase (beta-gal) reporter gene. The University of Toronto Faculty of Medicine Transgene and Knockout Facility used the transgene constructs to generate transgenic founders in the FVB strain. Transgenic mice were characterized by Southern blot analysis or PCR analysis of tail DNA, as previously described (21). Primers for detection of transgene are listed in Table 1. Germ line transmission was identified from two founders for each transgene, with transgene expression characterized in mice from the second generation of each line. The University Health Network Animal Care Committee approved the experimental protocol for the generation and characterization of transgenic mice.

RNA isolation and analysis. Total cellular RNA was isolated from various tissues using Trizol reagent (Invitrogen). cDNA was synthesized by reverse transcription using a cDNA synthesis kit (AP Biotech, Baie d'Urfe, PQ). Primers for detection of transgene expression are listed in Table 1. For beta-gal, the primers were beta-galf and beta-galr. Human proglucagon intron 1 primers were 2379f and 2862r. Mouse proglucagon primers were mouse PG5' and mouse PG3'. GAPDH primers were GAPDH5' and GAPDH3'.


    RESULTS
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Identification of evolutionarily conserved regions near the proglucagon gene promoter. Human (31), mouse (24), rat (10), dog (11), cow (18), and chicken (30) proglucagon cDNAs and/or genes have previously been characterized and were used to identify genomic sequences that included proglucagon genes and flanking sequences. To identify the opossum proglucagon gene, we searched the draft opossum genomic sequence using TBLASTN with mammalian proglucagon amino acid sequences. Our searches of the opossum genome identified a proglucagon gene that is similar in structure to previously characterized mammalian proglucagon genes, as it consisted of six exons and encoded a predicted proglucagon polypeptide that potentially could be proteolytically processed to release hormones similar to glucagon, GLP-1, and GLP-2. In all of the genomes that we examined, the proglucagon gene was about 10 kb in length, with genes for fibroblast activation protein (FAP) located 5', and dipeptidyl peptidase IV (DPIV) located 3', thus maintaining the order seen in the human genome (12).

Conserved noncoding regions potentially have a role in the regulation of nearby genes (8, 23). To identify ECRs near the human proglucagon gene, we used the program MultiPipMaker (25, 26) to align genomic sequences that encode the human, mouse, rat, dog, cow, opossum, and chicken proglucagon genes. An output of the MultiPipMaker program is a PIP plot where for each species, the genomic sequence was compared with the human sequence, and portions that show gap-free alignments with more than 50% identity to the human sequence are indicated by dots, with the height indicating the level of identity. If longer sequences have similar identity to the human sequence, then they are shown as lines. Figure 1 shows the PIP plot of the mammalian and chicken proglucagon genes; similar PIP plots were generated using other mammalian sequences as the reference sequence (results not shown).


Figure 1
View larger version (34K):
[in this window]
[in a new window]
 
Fig. 1. Conservation of genomic sequences near the human proglucagon gene. Genomic sequences (40 kb) that encode the human proglucagon gene along with ~18 kb of 5' flanking and 12 kb of 3' flanking sequences were aligned to genomic sequences from mouse, rat, dog, cow, opossum, and chicken, using the program MultiPipMaker. Exons of the human proglucagon gene are represented by boxes (the gene is located between bases 18k and 28k). Open and solid boxes represent the untranslated and coding regions of the exons, respectively. The arrow above the exons indicates the direction of transcription. Below the human sequence, percent identity plots are presented for each of the species, in which dots or lines indicate the percentage sequence identity of each gap-free alignment to the human sequence (if above 50%). Vertical arrows identify the locations of ECR1, ECR2, and ECR3, which are three evolutionarily conserved noncoding sequences that are at least 100 bp in length and show at least 75% sequence identity among mammals.

 
Identification of ECRs requires comparisons of genomic sequences that have diverged enough to allow nonfunctional sequences to accumulate differences, whereas sequences under selection have accumulated few if any changes. Since sequences under selection can have differing levels of selective constraints, genome comparisons representing different amounts of evolutionary divergence can differentially detect conserved sequences with differing levels of constraints. The most conserved sequences will be detected by comparisons covering the greatest evolutionary divergence, whereas lower levels of conservation can be detected using more closely related genome sequences (28). When the chicken proglucagon genomic sequences is compared with mammalian proglucagon genomic sequences, the only sequences near the proglucagon gene that showed greater than 50% sequence identity are exons 2 through 6 of the human proglucagon gene, exons that encode the proglucagon polypeptide (Fig. 1). We were unable to identify any sequence with greater than 50% identity to exon 1. The vast majority of the intron and flanking sequences show less than 50% sequence identity; thus no ECRs could be identified by comparing the chicken and mammalian proglucagon genomic sequences.

Comparisons within placental mammals (human, mouse, rat, dog, and cow) showed that most of the genomic sequences near and within the proglucagon gene possessed greater than 50% sequence identity, with a few areas showing higher levels of sequence identity (Fig. 1). The four regions that showed the highest identities are (1) about 5 kb upstream of the mRNA start site, (2) the immediate promoter region, (3) near the 3' end of intron 1, and (4) the middle of intron 3. The highly conserved sequences about 16 kb upstream of the proglucagon mRNA start site in Fig. 1 are the coding and 3' untranslated sequences encoded by the upstream FAP gene.

Comparison of the proglucagon genomic sequence from the more distantly related opossum (a marsupial) to placental mammalian proglucagon genomic sequences identified fewer portions of the sequence that show greater than 50% sequence identity (Fig. 1). All six exons of the proglucagon gene are well conserved, although the untranslated sequences within exons 1 and 6 show less conservation than the protein coding exons (Fig. 1). In addition to the exon sequences, three noncoding genomic areas, identified as ECR1, ECR2, and ECR3 in Fig. 1, show very high sequence conservation. ECR1 is the 300-base immediate promoter region and is known to be important for expression of the rat proglucagon gene (5). ECR2 is ~5 kb upstream from the mRNA start site and contains a region and of more than 200 bases with greater than 90% identity between human and the opossum (and more than 300 bases showing greater than 90% identity within placental mammals). ECR3, which is found near the 3' end of intron 1 is even better conserved than the other two ECRs, composed of ~300 bases, showing greater than 92% identity between opossum and human (and ~300 bases with greater than 97% identity within placental mammals).

Expression of human proglucagon transgene with 5.7 kb of 5' flanking sequence. We have previously reported that 5.7 kb of human proglucagon gene 5' flanking sequence could drive expression of luciferase reporter gene in rodent islet cell lines (21). ECR2 is included within the 5.7 kb of 5' flanking sequence, thus potentially contributing to islet-specific expression of the proglucagon gene. To determine whether the 5.7 kb of 5' flanking sequence was sufficient to drive expression in glucagon-gene expressing cells in vivo, we generated the 5.7 GLU-GH transgene. Tissue-specific expression of the reporter gene was characterized in two founder lines of transgenic mice by RT-PCR (Fig. 2). hGH reporter transgene mRNA was detected in the stomach, duodenum, jejunum, ileum, colon, and brain, but not pancreas or liver of mice descending from founder no. 10 (Fig. 2A). In contrast, low levels of growth hormone transgene mRNA were detected by RT-PCR in all tested tissues of mice descendent from founder no. 6 (Fig. 2B). To confirm expression of the transgene product, immunocytochemistry was used to assay for expression of the transgene protein product hGH. hGH-immunoreactive cells were detected in the stomach, duodenum, jejunum, ileum, colon, and brain, but not in the pancreas or liver of mice from both line no. 10 and 6 (data not shown). Despite widespread expression of the transgene as detected by RT-PCR (Fig. 2B), immunocytochemistry could only detect the hGH reporter in the stomach, duodenum, jejunum, ileum, colon, and brain in mice from line no. 6, suggesting that only some tissues properly express the transgene to generate mRNA that can be translated to yield hGH.


Figure 2
View larger version (38K):
[in this window]
[in a new window]
 
Fig. 2. Tissue-specific expression of the human growth hormone (hGH) transgene in 5.7 human glucagon-growth hormone (hGLU-GH) transgenic mice. A and B: RT-PCR of hGH and GAPDH expression in various tissues from 5.7 hGLU-GH transgenic mice from line 10 (A) and line 6 (B). + and – indicates presence or absence, respectively, of RT in the preparation of the cDNA template for PCR. The source of RNA for the RT-PCR reaction is indicated above the lanes, whereas negative refers to a mock reaction without RNA, and marker identifies the DNA size ladder. For hGH, the upper panel is the ethidium bromide-stained agarose gel of the separated RT-PCR DNA fragments, whereas the lower panel is a Southern blot of this gel that was hybridized with a [{gamma}-32P]dATP end-labeled internal hGH oligonucleotide probe. For GAPDH, only the ethidium bromide stained gel is shown.

 
Enhancer activity of human proglucagon intron 1. In addition to the ECRs identified in the 5' flanking sequence of the proglucagon gene sequence, an ECR was identified within intron 1 (Fig. 1). To determine whether intron 1 sequences could promote expression on the proglucagon gene in glucagon gene-expressing pancreatic islet cells, we constructed a reporter gene plasmid that contained 602 bases of 5' flanking sequence, all 96 bp of exon 1, all 2,970 bp of intron 1, and the first 12 (untranslated) bases of exon 2 of the human proglucagon gene ligated upstream of a luciferase reporter gene (3.7 kb Glu-Luc). The 3.7-kb Glu-Luc, containing the intron 1 construct, was found to drive significantly higher levels of luciferase reporter gene activity than the promoterless construct or one that contained only 602 bases of human proglucagon promoter (Fig. 3A). The levels of reporter gene activity for the 3.7-kb Glu-Luc construct were 50–60% of the levels generated by a 312-base rat proglucagon promoter construct (Fig. 3A).


Figure 3
View larger version (26K):
[in this window]
[in a new window]
 
Fig. 3. Expression of human proglucagon promoter constructs that contain intron 1. A: transcription activity. Schematic structure (not to scale) of luciferase reporter genes are shown to the left with luciferase activity in glucagon-gene expressing hamster islet cell line InR1-G9 and mouse islet cell line {alpha}TC-1 on the right. Reporter gene activity is expressed relative to the activity of the promoterless pGL3 plasmid. *Activity is significantly greater than that of the promoterless plasmid. C: schematic structure (not to scale) of luciferase reporter genes is shown to the left with luciferase activity in glucagon-gene expressing hamster islet cell line InR1-G9 and mouse islet cell line {alpha}TC-1 on the right. pGL3B is a promoterless vector, whereas pGL3P is a pGL3 vector containing an SV40 promoter. Reporter gene activity is expressed relative to the activity of the promoterless pGL3B plasmid. *Activity is significantly greater than that of the promoterless plasmid.

 
To confirm that transcription initiated at the human proglucagon mRNA initiation site in our reporter gene construct, human proglucagon reporter gene cDNA was amplified by RT-PCR using a sense primers (7f) in exon 1 for the human proglucagon gene and an antisense primers (luc-r) for the luciferase sequence (see Table 1 for primers). The cDNA product generated by RT-PCR was the predicted size, and its sequence showed that the mRNA generated by this reporter gene construct initiated with the proglucagon exon 1 sequence and had properly spliced intron 1 to join human proglucagon exon 1 and exon 2 sequences (results not shown). As an alternative approach, to exclude the possibility that intron 1 was acting as a promoter element, we cloned intron 1 into luciferase reporter plasmids with (pGL3P) and without (pGL3B) promoters. Significant reporter gene activity was only detected when intron 1 was cloned into a promoter-containing vector and not with a vector that lacks a promoter (Fig. 3B).

Enhancer activity of the evolutionarily conserved region in intron 1. To determine which portions of intron 1 (bases 97 to 3,063 of the human proglucagon gene) contribute to the enhancer activity, a series of smaller fragments of intron 1 were tested for enhancer activity (Fig. 4A). A total of seven fragments, each of ~500 base pairs, spanning the intron were cloned upstream of the SV40 promoter in pGL3P. Only one fragment, spanning bases 2,379 to 2,862, significantly enhanced luciferase activity in rodent islet cell lines (Fig. 4A). This fragment from bases 2,379 to 2,862, which enhances reporter gene activity, corresponds to the ECR3 that was found within intron 1 of the proglucagon gene (Fig. 1). The short ECR3 portion of intron 1 was also found to enhance reporter gene activity when placed upstream of 332-base or 82-base human proglucagon gene promoters or a minimal 82-base rat proglucagon promoter, although to a lower extent than that generated from the 312-base rat promoter (Fig. 4B). No other fragment of intron 1 significantly enhanced reporter gene activity, but surprisingly, one portion of the intron, from bases 1,018 to 1,454, essentially silenced reporter activity (Fig. 4A). The sequence between bases 1,018 and 1,454 of the proglucagon gene does not appear to correspond to any highly conserved portion of intron 1, although most of intron 1 is better conserved than other noncoding sequences near the proglucagon gene (Fig. 1).


Figure 4
View larger version (24K):
[in this window]
[in a new window]
 
Fig. 4. ECR3 in intron 1 is an enhancer. A: enhancer activities of fragments of human proglucagon gene intron 1. Schematic structure (not to scale) of luciferase reporter genes are shown to the left with luciferase activity in glucagon gene-expressing hamster islet cell line InR1-G9 and mouse islet cell line {alpha}TC-1 on the right. Reporter gene activity is expressed relative to the activity of the pGL3P plasmid. *Activity is significantly greater than that of the promoterless plasmid. B: intron 1 enhancer work on minimal proglucagon promoters. Schematic structure (not to scale) of luciferase reporter genes are shown to the left with luciferase activity in glucagon gene-expressing hamster islet cell line InR1-G9 and mouse islet cell line {alpha}TC-1 on the right. Reporter gene activity is expressed relative to the activity of the human or rat proglucagon promoter plasmids without the intron fragment. *Activity is significantly greater than that of the human or rat proglucagon promoter plasmids without the intron fragment. The rat 312-base promoter generated about 15–20 times more reporter gene activity compared with the 82-base human or rat promoters or the 332-base human promoter.

 
Expression of human proglucagon transgene containing intron 1 sequences. To determine whether intron 1 sequences could direct tissue-specific expression of a reporter gene in vivo, a transgenic mouse was generated that contained 602 bases of human proglucagon 5' flanking sequence, all of exon 1 and intron 1, and the first 12 untranslated bases of exon 2 ligated upstream of a beta-gal reporter gene. This construct was identical to the construct in Fig. 3A, except for the reporter gene. A total of three founders were generated, two of which produced germ line transmission. To assess reporter gene expression, RT-PCR was conducted using primers for the beta-gal sequence. As shown in Fig. 5A, reporter gene expression was detected in both lines showing germ line transmission in brain and intestinal tissues, as previously detected in 1.6 kb (21) and 5.7 kb (Fig. 2), 5' flanking transgenic mice. In addition, both of our intron 1-containing transgenic mice also expressed the transgene in the pancreas and did not express the transgene in tissues, such as liver and muscle (Fig. 5A), suggesting that this intron 1-containing transgene construct expresses the reporter gene in all major sites of proglucagon expression. An attempt to amplify a portion of human proglucagon intron 1 (Fig. 5B) in these same cDNA samples failed to generate the expected band, indicating that the mRNA was not contaminated with genomic DNA or partially processed mRNA, yet GAPDH could be amplified from all samples (Fig. 5D), suggesting that the mRNA was intact and converted to cDNA. The pattern of expression of the transgene (Fig. 5A) was identical to that of the endogenous mouse proglucagon gene (Fig. 5C).


Figure 5
View larger version (52K):
[in this window]
[in a new window]
 
Fig. 5. Tissue-specific expression of beta-galactosidase (beta-gal) in an intron 1-containing human proglucagon promoter transgene. AD: RT-PCR for beta-gal, human proglucagon intron 1, mouse, proglucagon, and GAPDH, respectively, in various mouse tissues from a transgenic mouse that has beta-gal downstream of 602 bases of 5' flanking, all of exon 1 and intron 1, and 12 untranslated bases of exon 2 of the human proglucagon gene sequence. Tissues used for isolation of RNA for RT-PCR are indicated above the lanes, with negative referring to a mock reaction without RNA, and positive being an RT-PCR with a cloned DNA product. The positive control for A and B is the transgene used to make the mouse, for C, a cloned mouse proglucagon cDNA, and for D, a cloned mouse GAPDH cDNA.

 

    DISCUSSION
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
The availability of a number of mammalian genomes (e.g., human, mouse, rat, dog, cow, opossum, chicken, see www.ensemble.org) has allowed the use of comparative genomics to identify sequences that are evolving at a slower rate than expected for DNA sequences that are not functional (8, 19, 23, 28). Comparative genomics readily identifies protein coding regions of genes, but, in addition, it has found an equally large amount of the noncoding sequence that has been constrained by selective pressures, presumably because they have a biological function, such as regulatory sequences (19). We have applied these techniques to identify sequences near or within the human proglucagon gene that have been conserved within mammals (Fig. 1).

We first compared the genomic sequences encoding the proglucagon gene of the chicken to those sequences encoding proglucagon genes from mammals, thus representing sequence divergence of ~310 million years (9, 17), and found that the only conserved sequences are the exons of the gene (Fig. 1). The protein coding exons 2 through 5 that encode most of the proglucagon precursor show about 75% sequence conservation, while exon 6, which encodes the C-terminus of proglucagon and the 3' untranslated region, had more varied conservation (Fig. 1). Although our chicken proglucagon genomic sequence did contain the first exon of the gene (30), this exon was not conserved with the mammalian sequences. The mammalian proglucagon gene is flanked by the FAP gene, about 16 kb upstream, and the DPIV gene, about 65 kb downstream (12), with a similar structure observed in the chicken genome (30). The last exon of the upstream FAP gene is conserved between chicken and mammalian proglucagon genomic sequences (Fig. 1), whereas downstream, nothing similar was identified between the proglucagon and DPIV genes (data not shown). Similar results were observed with comparisons of proglucagon genomic sequences from fish or frogs to mammals (data not shown). Therefore, comparisons of chicken (or fish or frogs) and mammals fail to detect any ECRs, which potentially could be regulatory sequences.

Our next comparisons involved genomic sequences from the placental mammals (human, mouse, rat, dog, and cow), which diverged from each other over the past ~100 million years (9, 17). In comparisons within placental mammals, most of the nonrepetitive sequence was found to align between species (Fig. 1), with most of the "gaps" in the alignment due to deletion or insertion of sequence rather than low conservation. A large number of noncoding sequences are highly conserved between proglucagon genes of placental mammals (see Fig. 1). Comparisons that included the proglucagon gene sequence from the marsupial opossum, which diverged from placental mammals about 170 million years ago (9, 17), result in the identification of fewer ECRs (Fig. 2). In comparisons between the opossum and placental mammals, the noncoding sequences immediately upstream of exon 1 (ECR1), a sequence about 5 kb 5' to exon 1 (ECR2), and intron 1, especially near the 3' end of intron 1 (ECR3) (Fig. 2). The sequences of the three ECRs are better conserved than other flanking or intronic sequences, suggesting that they are being maintained by selection because they have a function, possibly as regulatory elements. In the opossum-placental mammal comparisons, the best conserved genomic sequence is ECR3 located in intron 1, which shows higher sequence identity than any of the protein coding exons, implying that it is undergoing a very slow rate of sequence evolution. Yet despite a slow rate of sequence evolution in mammals for ECR3, no sequence similar to ECR3 was found within the chicken proglucagon gene (see Fig. 1). This suggests that the selective constraints, and hence function of the sequence, have changed between chicken and mammals. A similar change appears to have also occurred for part of ECR1 in the immediate promoter region between these species (Tsai B, Yue S, and Irwin DM, unpublished observations).

ECR1 has a clear role in the regulation of rat proglucagon gene expression. ECR1 corresponds to the well-characterized immediate promoter region of the rat proglucagon gene (4, 15, 16), which in many mammals is sufficient to support expression in rodent islet cell lines (Tsai B, Yue S, and Irwin DM, unpublished observations). Although this sequence is well conserved in the human proglucagon gene, it is not sufficient to support reporter gene expression in vitro or in vivo, suggesting that a few changes may prevent transcriptional activity (21). Additional sequences have been hypothesized to be necessary for proper expression of the human proglucagon gene (21). In vitro transfection assays suggested that human proglucagon gene sequences 3.3 to 5.7 kb upstream of the mRNA start site could support expression of a reporter gene in rodent islet cell lines (21). ECR2 at about 5 kb upstream of the human proglucagon mRNA start site falls within the sequence defined by transient transfection assays as having enhancer activity. Transgenic mice, which include the 5.7 kb of 5' flanking sequence, including ECR2, failed to express the reporter gene in the pancreas, yet did express the reporter in the other major sites of endogenous proglucagon gene expression, the intestine and brain (Fig. 3). Thus the presence of both ECR1 and ECR2 was not sufficient to drive expression of a reporter gene in all sites of expression of the endogenous proglucagon gene.

Genomic comparisons indicated that the most conserved portion of mammalian proglucagon genes is not a protein coding exon but rather is a 450-base segment (ECR3) near the 3' end of intron 1 (Fig. 1) and potentially confers a regulatory function. Testing the enhancer activity of the proglucagon gene intron 1 is greatly facilitated by the fact that the initiator methionine for human (and other mammalian) proglucagon gene is encoded by exon 2 (31). Thus all of the exon 1 and intron 1, together with the 5' end of exon 2, could be included in our constructs, as they would only contribute 5' untranslated sequence and would not interfere with the expression, or function, of downstream reporter genes. Both in vitro and in vivo, reporter gene constructs that contained 602 bases of 5' flanking sequence (containing ECR1), all of exon 1 and intron 1 (containing ECR3), and the first 12 bases of exon 2, allowed expression of reporter genes in rodent islet cells (Figs. 3 and 5). Because the 602 bases of 5' flanking sequence, as well as exon 1 sequences, did not support reporter gene expression (Fig. 3), these results suggest that sequences within intron 1 were acting as an enhancer. Analysis of fragments of intron 1 suggests that most of the enhancer activity is located in a fragment between bases 2,379 and 2,860 of intron 1 (Fig. 5), the sequence that corresponds to ECR3 (Fig. 2). Searches of the ECR3 sequence identified potential binding sites for transcription factors such as HNF3beta, Cdx-2/3, GATA factors, homeobox-containing factors (e.g., Isl-1, Brn4), and Pax factors. The potential transcription factor binding sites included many factors that previously had been identified as having a role in the expression of the rat proglucagon gene (4, 15, 16).

Most of intron 1 is better conserved than most of the noncoding sequences flanking or within mammalian proglucagon genes (Fig. 1), suggesting that selective pressure is conserving the sequence, possibly for additional functions. The observation that a sequence between bases 1,018 and 1,454 may repress expression supports this suggestion. Sequences distributed through intron 1 may act to modulate expression of proglucagon rather than directly act as an enhancer. The disassociation of sequences necessary for expression and those necessary for regulation has previously been demonstrated for intestinal expression of the human proglucagon gene. While 1.6 kb of 5' flanking sequence is sufficient for expression in intestinal cell (21), these sequences were not sufficient for regulated expression via nutrient sensing (22).


    ACKNOWLEDGMENTS
 
This work was supported in part by an operating grant from the Canadian Institutes of Health Research (to D. M. Irwin) and Studentships from the Banting and Best Diabetes Centre (to L. Zhou and M. Nian). We thank Drs. D. Drucker, H. Elsholtz, T. Jin, and anonymous reviewers for suggestions.


    FOOTNOTES
 

Address for reprint requests and other correspondence: David M. Irwin, Dept. of Laboratory Medicine and Pathobiology, Univ. of Toronto, 100 College St., Toronto, Ontario, Canada, M5G 1L5 (e-mail: david.irwin{at}utoronto.ca)

The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.


    REFERENCES
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Baggio LL and Drucker DJ. Glucagon-like peptide-1 and glucagon-like peptide-2. Best Prac Res Clin Endocrinol Metab 18: 531–554, 2004.
  2. Drucker DJ. The glucagon-like peptides. Endocrinology 142: 521–527, 2001.[Abstract/Free Full Text]
  3. Drucker DJ. Biological actions and therapeutic potential of the glucagon-like peptides. Gastroenterology 122: 531–544, 2002.[CrossRef][ISI][Medline]
  4. Drucker DJ. Glucagon gene expression. In: Encyclopedia of Hormones, edited by Henry HL and Norman AW. San Diego, CA: Academic, vol. 2, 2003, p. 47–55.
  5. Drucker DJ, Philippe J, Jepal L, and Habener JF. Glucagon gene 5'-flanking sequences promote islet cell-specific glucagon gene transcription. J Biol Chem 262: 15659–15665, 1987.[Abstract/Free Full Text]
  6. Efrat S, Teitelman G, Anwar M, Ruggiero D, and Hanahan D. Glucagon gene regulatory region directs oncoprotein expression to neurons and pancreatic alpha cells. Neuron 1: 605–613, 1988.[CrossRef][ISI][Medline]
  7. Hansen PA and Corbert JA. Incretin hormones and insulin sensitivity. Trends Endocrinol Metab 16: 135–136, 2005.[Medline]
  8. Hardison RC. Conserved noncoding sequences are reliable guides to regulatory elements. Trends Genet 16: 369–372, 2000.[CrossRef][ISI][Medline]
  9. Hedges SB and Kumar S. Genomic clocks and evolutionary timescales. Trends Genet 19: 200–206, 2003.[CrossRef][ISI][Medline]
  10. Heinrich G, Gros P, Lund PK, Bentley RC, and Habener JF. Pre-proglucagon messenger ribonucleic acid: nucleotide and encoded amino acid sequences of the rat pancreatic complementary deoxyribonucleic acid. Endocrinology 115: 2176–2181, 1984.[Abstract]
  11. Irwin DM. cDNA cloning of proglucagon from the stomach and pancreas of the dog. DNA Seq 12: 253–260, 2001.[Medline]
  12. Irwin DM. Ancient duplication of the human proglucagon gene. Genomics 79: 741–746, 2002.[CrossRef][ISI][Medline]
  13. Irwin DM and Gong Z. Molecular evolution of vertebrate goose-type lysozyme genes. J Mol Evol 56: 234–242, 2003.[CrossRef][ISI][Medline]
  14. Jiang G and Zhang BB. Glucagon and regulation of glucose metabolism. Am J Physiol Endocrinol Metab 284: E671–E678, 2003.[Abstract/Free Full Text]
  15. Kieffer TJ and Habener JF. The glucagon-like peptides. Endocr Rev 20: 876–913, 1999.[Abstract/Free Full Text]
  16. Knepel W. The {alpha}-cell and regulation of glucagon gene transcriptions. In: Molecular Basis of Pancreas Development and Function, edited by Habener JF and Hussain M. Norwell, MA: Kluwer Academic, 2001, p. 67–89.
  17. Kumar S and Hedges SB. A molecular timescale for vertebrate evolution. Nature 392: 917–920, 1998.[CrossRef]
  18. Lopez LC, Frazier ML, Su CJ, Kumar A, and Saunders GF. Mammalian pancreatic preproglucagon contains three glucagon-related peptides. Proc Natl Acad Sci USA 80: 5485–5489, 1983.[Abstract/Free Full Text]
  19. Miller W, Makova KD, Nekrutenko A, and Hardison RC. Comparative genomics. Ann Rev Genomics Hum Genet 5: 15–56, 2004.[CrossRef][ISI][Medline]
  20. Nauck MA. Glucagon-like peptide 1 (GLP-1) in the treatment of diabetes. Horm Metab Res 36: 852–858, 2004.[CrossRef][Medline]
  21. Nian M, Drucker DJ, and Irwin D. Divergent regulation of human and rat proglucagon promoters in vivo. Am J Physiol Gastrointest Liver Physiol 277: G829–G837, 1999.[Abstract/Free Full Text]
  22. Nian M, Gu J, Irwin DM, and Drucker DJ. Human glucagon gene promoter sequences regulating tissue-specific versus nutrient-regulated gene expression. Am J Physiol Regul Integr Comp Physiol 282: R173–R183, 2002.[Abstract/Free Full Text]
  23. Pennacchio LA and Rubin EM. Genomic strategies to identify mammalian regulatory sequences. Nature Rev Genet 2: 100–109, 2001.[CrossRef][ISI][Medline]
  24. Rothenberg ME, Eilertson CD, Klein K, Zhou Y, Lindberg I, McDonald JK, Mackin RB, and Noe BD. Processing of mouse proglucagon by recombinant prohormone convertase 1 and immunopurified prohormone convertase 2 in vitro. J Biol Chem 270: 10136–10146, 1995.[Abstract/Free Full Text]
  25. Schwartz S, Zhang Z, Frazer KA, Smit A, Riemer C, Bouck J, Gibbs R, Hardison R, and Miller W. PipMaker–a web server for aligning two genomic DNA sequences. Genome Res 10: 577–586, 2000.[Abstract/Free Full Text]
  26. Schwartz S, Elnitski L, Li M, Weirauch M, Riemer C, Smit A, Green ED, Hardison RC, and Miller W. NISC comparative sequencing program. MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res 31: 3518–3524, 2003.[Abstract/Free Full Text]
  27. Steiner DF. Glucagon processing. In: Encyclopedia of Hormones, edited by Henry HL and Norman, AW. San Diego, CA: Academic, 2003, vol. 2, p. 65–74.
  28. Stone EA, Cooper GM, and Sidow A. Trade-offs in detecting evolutionarily constrained sequence by comparative genomics. Annu Rev Genomics Hum Genet 6: 143–164, 2005.[CrossRef][ISI][Medline]
  29. Yue S and Irwin DM. Structure and expression of the chicken proglucagon gene. Mol Cell Endocrinol 230: 69–76, 2005.[Medline]
  30. White JW and Saunders GF. Structure of the human glucagon gene. Nucleic Acids Res 14: 4719–4730, 1986.[Abstract/Free Full Text]
  31. Zhou L and Irwin DM. Fish proglucagon genes have differing coding potential. Comp Biochem Physiol 137B: 255–264, 2004.[Medline]




This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
290/3/R634    most recent
00596.2005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via ISI Web of Science (3)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Zhou, L.
Right arrow Articles by Irwin, D. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zhou, L.
Right arrow Articles by Irwin, D. M.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Visit Other APS Journals Online
Copyright © 2006 by the American Physiological Society.