RECODE: a programmable guide-free C-to-U RNA editing tool
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
http://datadryad.org/dataset/doi%253A10.5061%252Fdryad.cjsxksngv
下载链接
链接失效反馈官方服务:
资源简介:
Programmable RNA cytidine deaminase tools have been developed to convert cytidine-to-uridine (C-to-U) using CRISPR systems with guide RNAs. These tools, however, have limitations such as low editing efficiency, limited targetable sequence flexibility, and off-target RNA editing. Here, we present a novel guide-free C-to-U editing tool, named RECODE (RNA Editor for C-to-U with an Optimized DYW Enzyme), based on the RNA-binding pentatricopeptide repeat proteins, naturally fused to a C-terminal DYW cytidine deaminase domain. The RECODE specificity domain was engineered to enable retargeting, while its length and sequence were optimized to reduce off-target effects. Further optimization of the C-terminal catalytic region increased both the editing activity and the translation of the edited RNA. We showed that RECODE efficiently edits a wide range of targets in human cells, without affecting adjacent cytidines. It achieved over 50% editing efficiency for most sites, except those with an upstream guanine. Furthermore, we showed that RECODE is functional in mice, with high editing efficiency observed in specific tissues such as skeletal muscles using an AAV delivery system, suggesting its therapeutic potential for various diseases.
Methods
Alignment of C-terminal domains and construction of a phylogenetic tree
To design the C-terminal domain, the non-DYW:KP proteins identified in hornwort, lycophyte, and fern transcriptomes (Ichinose et al., 2022) were aligned using MAFFT in L-INS-i mode (v7.407) (Katoh and Standley, 2013). Alignments were then trimmed using TrimAL (v1.4.rev15) (Capella-Gutiérrez et al., 2009) with a minimum conservation threshold and a gap threshold of 20%. A tree was built for the trimmed alignment using IQ-TREE (v.2.0.3) (Minh et al., 2020) with the JTT+F+I+G4 substitution model, selected using the in-built automated test (Kalyaanamoorthy et al., 2017). The branch support was computed with 1000 bootstrap replicates. The tree was visualized using iTOL (Letunic and Bork, 2016).
Transfection
For the design of the PG and WW C-terminal domains and the analysis of several PPR code combinations (at the exogenous rpoA editing site), HEK293T cells were plated in 24-well plates (Thermo Fisher Scientific) at a density of approximately 8.0 × 104 cells/well 24 hrs prior to transfection. A mixture containing 500 ng of transfection plasmid combined with Opti-MEM® I Reduced Serum Medium (Thermo Fisher Scientific), and 1.5 µL of FuGENE® HD Transfection Reagent (Promega) was prepared in a total volume of 25 µL per well. This mixture was incubated at room temperature for 10 minutes before being added to the cells. After transfection, the cells were incubated at 37ºC for 24 hrs.
For the development of RECODE proteins (at the endogenous CTNNB1-T41I site), HEK293T cells were plated in 24-well plates (Thermo Fisher Scientific) at a density of approximately 1.0 × 105 cells/well 24 hrs prior to transfection. A mixture containing 700 ng of RECODE plasmid combined with Opti-MEM® I Reduced Serum Medium (Thermo Fisher Scientific), and 2.5 µL of FuGENE® HD Transfection Reagent (Promega) was added to the cells. After transfection, the cells were incubated at 37ºC for 48 hrs.
For the comparison between RECODE and RESCUE-S, HEK293T cells were plated in 96-well plates coated with poly-L-lysine (IWAKI) at approximately 104 cells/well 24 hrs prior to transfection. 140 ng RECODE plasmid or 90 ng Cas-ADARRESCUE-S / 180 ng gRNA plasmids were transfected with 0.5 µL of FuGENE® HD Transfection Reagent (Promega). The amounts of plasmid DNA for RECODE and RESCUE-S are based on Supplementary Figure S2. After transfection, the cells were incubated at 37ºC for 48 hrs.
For western blotting and RNA-seq analysis, 700 ng RECODE plasmid or 450 ng Cas-ADARRESCUE-S / 900 ng gRNA plasmids were transfected with 2.5 µL of FuGENE® HD Transfection Reagent (Promega) into cells plated in 24-well plates and incubated at 37ºC for 48 hrs.
Hepa1-6 cells were seeded into a well of a 24-well plate at approximately 1.0 × 105 cells/well the day before transfection. 500 ng RECODE plasmid was transfected with Lipofectamine 3000 (Thermo Fisher Scientific) and incubated for 24 hrs at 37ºC.
Analysis of in vitro RNA editing by Sanger sequencing
Total RNA was extracted from transfected cells according to the manufacturer’s instructions using the Maxwell® RSC simplyRNA Tissue Kit (Promega). cDNA was synthesized with 300 ng of RNA, ReverTra Ace® (TYOBO), and random primers. The editing site was PCR-amplified using PrimeSTAR Max DNA® polymerase (TaKaRa) with 1 µL of cDNA and primers. PCR products were purified with ExoSAP-ITTM Express PCR Cleanup Reagent (Thermo Fisher Scientific) and sequenced (Azenta) using forward or reverse primers specific to the gene. RNA editing was quantified from raw sequencing chromatograms using EditR (EditR software: http://baseeditr.com) (Kluesner et al., 2018). Editing efficiency was determined as the ratio of thymidine (T) to T and cytidine (C), based on the percentage of their respective peak areas after trimming low-quality data (P-value cutoff: 0.01). Areas not significantly different from background noise were assigned a value of 0.
In vivo study
The study using wild-type mice was outsourced to SHIN NIPPON BIOMEDICAL LABORATORIES, LTD., which approved the study protocol. The breeding and use of animals was carried out in accordance with the animal experiment regulations of the outsourcing company. The mice were kept under the non-SPF condition and were maintained on a 12-hour light/dark cycle at the controlled room temperature of 21.2–23.0°C with humidity of 40–57% during experiments. The FVB/NJcl strain mice used in this study were purchased from CLEA Japan Inc.
AAV9 packaged RECODE (RECODE-WW[F75F] and RECODE-PG[I83I]) were prepared by VectorBuilder. The study included three groups of three 8-week-old male WT mice (FVB/NJcl): a control substance group (PBS group) and two test substance groups (AAV9-RECODE-WW[F75F] and AAV9-RECODE-PG[I83I]). Mice in each group were administered a single intravenous dose of the test substance at a dose of 3×1014 vg/kg or the same volume of PBS as the AAV preparation as the control substance. Eight weeks after administration, the mice were euthanized by exsanguination under isoflurane inhalation anaesthesia (2.0–4.0%; Isoflurane Inhalation Anesthetic Solution “VTRS,” Mylan EPD LLC), and various tissues were collected and flash-frozen in liquid nitrogen.
RNA extraction from in vivo samples
10 mg of pieces of organs were homogenized in 250 µL of the 1-Thioglycerol/Homogenization (supplied by Maxwell RSC simplyRNA Tissue Kit; Promega) using a BioMasher tube (Funakoshi). The homogenate was centrifuged at 14,000 rpm for 15 min at 4ºC, and 200 µL of supernatant was collected. 200 µL Lysis buffer was added before extracting total RNA using Maxwell RSC simplyRNA Tissue Kit (Promega).
Analysis of transcriptome-wide expression changes
The NGS libraries were prepared from total RNA (as described in the ‘RNA extraction from in vivo samples section’) using poly(A) enrichment, and Illumina Novaseq 6000 sequencing was performed by Azenta. Reads were mapped to a customized reference sequence including GRCm39 (Ensembl) and each AAV9-RECODE sequence for in vivo study and GRCh38.105 (Ensembl) and the RECODE sequences for the in vitro study, using STAR (v2.7.10) (Dobin et al., 2013). Alignment parameters included: --quantMode TranscriptomeSAM –outFilterType BySJout –outFilterMultimapNmax 1 –outSAMstrandField intronMotif –outSAMattributes All. Estimated counts for all the transcripts were generated using RSEM (v1.2.28) (Li and Dewey, 2011) following STAR alignment (as described in the ‘Read mapping’ section). Genes were then pre-filtered to include only those with a CPM (counts per million) > 0.5 in at least half of the samples. Differential gene expression analysis was subsequently performed using DESeq2 (1.46.0) (Love et al., 2014) with default parameters. MA plots were generated by plotting the log2 fold change against the mean expression (baseMean).
Off-target analysis by RNA-seq
PCR duplicates were removed from the alignment generated by STAR (as described in ‘Read mapping’ section) using Picard MarkDuplicates tool (http://broadinstitute.github.io/picard/), RNA editing candidate sites were identified using REDItools (v1.3) (Picardi and Pesole, 2013) using the following parameters -t 17 -e -d -l -U [AG, TC, CT, GA] -G path/to/gtf -p -u -m 30 -T 6-0 -W -v 10 -n 0 -g 2 -s 1 removing the sites with a read count less than 10 and minimum mapping quality score of less than 30. Base frequencies were counted at all positions in the transcript sequences. Positions with significant differences in base frequency compared to the reference were identified using Fisher’s exact test with Benjamini-Hochberg correction (p-value < 0.01).
Protein analysis for in vitro samples
Total protein was extracted with RIPA buffer (ATTO), then adjusted to 1 mg/mL. Western blotting was performed with the Abby Simple Western system (ProteinSimple) according to the manufacturer’s instructions using the 12-230 kDa Separation Module, RePlex Module, and the Anti-Mouse or Anti-Rabbit Detection Module (ProteinSimple). Primary antibodies against β-catenin (sc-59737, Santa Cruz Biotechnology; 1:250), KRAS (12063-1-AP, Proteintech; 1:200) and BRG1/SMARCA4 (21634-1-AP, Proteintech; 1:100) were used. Antibodies were diluted with Can Get Signal Solution 1 (TOYOBO). Total protein was detected using the Total Protein labeling reagent (ProteinSimple). The peak area values of target proteins were normalized to the total protein values using Compass software (ProteinSimple).
Protein analysis for in vivo samples
Total proteins were extracted from 10 mg of a piece of organ with 120 µL RIPA buffer (ATTO) supplemented with 1% Proteinase Inhibitor Cocktail. After homogenization in a BioMasher tube (Funakoshi) and centrifugation at 14,000 rpm for 15 min at 4ºC, 80 µL supernatant and 20 µL 5× SDS buffer were mixed and heated at 95ºC for 5 min. Total protein and standard protein were quantified using the Protein Quantification Assay Kit (Thermo Fisher Scientific) with 0.05 g/ml Ionic Detergent Compatibility Reagent. The protein concentration of the sample was adjusted to 3 mg/mL with 1× SDS buffer, and protein samples were analysed using the Abby Simple Western system (ProteinSimple). Anti-HA monoclonal antibody (clone C29F4, Cell Signaling Technology; 1:500) and Anti-Rabbit Detection Module (ProteinSimple) were used to detect RECODE protein. Anti-GAPDH monoclonal antibody (clone 4F8-2, Thermo Fisher Scientific; 1:2000) and Anti-Mouse Detection Module (ProteinSimple) for Gapdh protein detection. Antibodies were diluted with Can Get Signal Solution 1 (TOYOBO). Total protein was detected with Total Protein labeling reagent (ProteinSimple) after detection of Gapdh protein on the same blot. These total protein values were then used to normalize both Gapdh protein and RECODE proteins, the latter of which were run on a separate blot using the same samples. The signal intensity (area) of the bands corresponding to RECODE or Gapdh protein was quantified and normalized to total protein using Compass software (ProteinSimple).
Protein expression and purification
The DNA sequences encoding His6-SUMO tag (synthesized by Eurofins Genomics) and RECODE’s C-terminal domain were amplified by PCR using primers with Esp3I restriction site. The resulting DNA fragments were subcloned into a modified pET21b+PA vector by Golden Gate assembly with Esp3I. The PPR domain was subsequently cloned using BpiI (as described in ‘Design of RECODE’). The His6-SUMO-RECODE fusion protein was expressed in E. coli Rosetta 2 (DE3) at 15ºC overnight following induction with 0.1 mM IPTG and 0.4 mM ZnSO4. The cells were harvested by centrifugation, and the pellets were resuspended in lysis buffer (50 mM Tris-HCl, pH 8.0, 500 mM NaCl, 20 mM imidazole, and 1 mM dithiothreitol) supplemented with 5 mg/ml lysozyme, then stored at -80ºC until use. The cells were disrupted by sonication followed by centrifugation to remove cell debris. The soluble fraction was applied to Ni-NTA beads (Qiagen) and thoroughly washed with lysis buffer. The SUMO fusion protein was eluted with lysis buffer containing 500 mM imidazole and then cleaved overnight with 5 µg of Ulp1 protease and dialyzed against buffer (50 mM Tris-HCl, pH 8.0, 200 mM NaCl, and 1 mM dithiothreitol). The cleaved protein was subsequently loaded onto a HiTrap Q column (Cytiva), and peak fractions containing the target protein were pooled. Further purification was performed using a Superdex 200 10/300 GL column (Cytiva), equilibrated with 50 mM Tris-HCl, pH 8.0, 200 mM NaCl, and 0.5 mM TCEP. The purified protein was concentrated using an Amicon Ultra centrifugal filter unit (Merck).
RNA electrophoretic mobility shift assay (REMSA)
Cy3- or Cy5-labeled synthetic 31-nt RNA oligonucleotides, CTNNB1-T41I (5’-Cy3-UCUGGAAUCCAUUCUGGUGCCACUACCACAG, the editing site is underlined) and SMARCA4-P88L (5’-Cy5-CAUGAGAAGGGCAUGUCGGACGACCCGCGCU, negative control), were used for REMSA. RNA probes (10 nmol) and purified protein at the indicated concentrations (0, 0.5, 1, 2, 5, 10, 20, 50, 100, and 200 nM) were mixed in 10 µL reaction buffer (20 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.5% NP40, 1 mM EDTA, pH 8.0 and 1 mM dithiothreitol) to a final reaction volume of 20 µL. The reactions were incubated at 25ºC for 20 min, after which 2 µL of 80% glycerol was added. A 10-µL aliquot of each reaction was loaded onto a 5-20% e-PAGEL (ATTO) and electrophoresed in 1 × TBE buffer at 4ºC. Gels were imaged using an iBright Imaging System (Invitrogen), and the fraction of oligonucleotide bound was quantified with Image Lab software (Bio-Rad).
Prediction of the E1-E2-DYW domain in complex with RNA
The structure of the complex between the E1-E2-DYW domain of RECODE-PG and RECODE-WW targeting the CTNNB1-T41I site and its target (5’-CUACC-3’, with the editing site at the second cytosine) was predicted using Protenix (v0.6.0) on the server (https://protenix-server.com/login) with default parameters (ByteDance AML AI4Science Team et al., 2025). The resulting models were then visualized using PyMOL (https://github.com/schrodinger/pymol-open-source).
创建时间:
2025-11-07



