However the technique is not without limitations. One of the biggest drawbacks of using pronuclear injection for generating transgenic mice is that the transgene cannot be directed to a specific chromosomal location of the mouse genome. Integration of the transgene is a random event.
Without knowing where integration takes place, it is impossible to completely predict the consequences of a given genetic modification.
Interpreting Phenotypic Data from Transgenic ModelsThere are several factors worth considering when interpreting the phenotypes of transgenic lines:
- The regulatory or coding region of a critical endogenous gene may be disrupted by insertion of the transgene3,4,5, potentially complicating the interpretation of phenotypes. It has been estimated that 5-10% of transgenic mice carry phenotypes unrelated to the function of the transgene6.
- The precise location where the integration event occurs may lead to mosaic patterns of transgene expression, a phenomenon known as position effect7.
- Multiple copies of the transgene are often inserted as head-to-tail concatemers resulting in variability in copy number between or within founder lines. High transgene copy numbers may lead to epigenetic modification and transgene silencing8.
Existing Methods for Mapping Transgene InsertionThe characterization of transgenic animals has historically been a challenging process. Mapping transgene insertion has typically been achieved using Fluorescence In-Situ Hybridization (FISH) or PCR-based methods — each of which bring their own limitations to the project.
FISH is a low resolution visualization technique that is labor intensive and limited by its inability to verify sequence integrity or detect tandem insertions at the integration site9. PCR-based approaches, such as inverse PCR or ligation-mediated PCR, offer better resolution than FISH, but require knowledge of the restriction sites within the transgene and specific sequence information10.
As the cost of sequencing has come down significantly, whole genome sequencing11 and sequencing with capture probes12 have recently been used to decipher transgene insertion sites. Although next generation sequencing (NGS)-based methods offer better clarity and quicker turnaround than FISH and PCR-based methods, the standard pair-end read (typically <400 bp) cannot reliably detect all structural variations in and around the transgene insertion site, particularly when the region is rich in repetitive sequences.
Efficiently Mapping Transgene Insertion Sites with TLAIn January 2017, an alternative NGS-based method — targeted locus amplification (TLA) — was successfully used to identify the transgene integration sites in seven commonly used Cre and CreERT2 transgenic lines13. TLA is a novel targeted enrichment strategy combining the principle of proximity ligation with NGS to selectively amplify and sequence the transgene and surrounding genomic region of sizes ranging from tens to hundreds of kilobases.
Using only one primer pair, complementary to a short sequence unique to the transgene, crosslinked and ligated DNA fragments surrounding the transgene insertion site are selectively amplified and sequenced. By analyzing the coverage profile of the sequencing reads and the breakpoint sequences, TLA allows the precise identification of the transgene integration site14.
The main advantage of TLA over conventional approaches is that it generates complete sequence information of a region of interest. TLA thus enables the detection of all Single Nucleotide Variants, structural changes (both in the transgene and integration site) and only requires very little prior knowledge of the transgene sequence.
In all seven transgenic lines, the TLA analyses detected structural changes — either deletions or genomic duplications — at the transgene integration sites. This illustrates the importance of testing for rearrangements around the site of transgene insertion. An example of TLA analyses is provided below.
Advantages of Transgene Mapping AnalysisBy fully characterizing the transgene insertion site, researchers gain better understanding of how the insertion site location of a transgene contributes to phenotypic outcomes and uncovers potential for instability in transgene expression.
Knowing the location of transgene insertion is also useful for planning intercrosses in the event that a transgene and the desired allele to be selected are located on the same chromosome. This can reduce the cost of maintaining a transgenic colony.
When integration site is unknown, a quantitative based copy number variation (CNV) analysis is utilized to distinguish wild-type from hemi- and homozygous animals. This can vary in resolution and reliability, depending on transgene size and copy number. Once the integration site is known, researchers can replace the quantitative analysis with targeted genotyping assays using standard PCR, which dramatically cuts colony management costs while also being more specific and reliable.
The power and simplicity of TLA technology has opened the door for researchers to efficiently identify transgene integration sites while also providing important data regarding transgene integrity and the integrity of the surrounding genome. This information is invaluable for interpreting all data generated from a transgenic model and for planning future studies and intercrosses.