The biological landscape from a DRS perspective

Transcriptome diversity is a fundamental feature of biological systems and a major source of phenotypic complexity in animal and plant species. Beyond gene-level expression, transcriptome diversity arises from alternative transcription initiation, AS, APA, allele-specific expression (ASE), RNA editing, chemical modification, and structural heterogeneity. Conventional transcriptomic approaches, dominated by short-read sequencing and cDNA-based protocols, fragment RNA molecules and decouple these regulatory layers, limiting their ability to reconstruct native transcript structures. Nanopore DRS overcomes these limitations by directly sequencing full-length native RNA molecules, thereby providing a single-molecule view of transcriptome diversity (Figure 4) [9].

Transcriptomic architecture from the DRS perspective

From the perspective of nanopore DRS, the transcriptome is not simply a collection of expressed genes but a structured population of individual RNA molecules, each defined by a specific combination of TSS, exon-intron architecture, 3′-end formation, poly(A) tail features, and chemical modifications [120], [199], [200], [201]. This molecule-centric view contrasts sharply with conventional srRNA-seq, which reconstructs transcript models indirectly through computational assembly and often obscures exon connectivity, transcript boundaries, and coordinated regulatory features. By directly observing intact transcripts, DRS reduces ambiguity inherent to short-read-based transcript reconstruction and enables systematic discovery of transcriptome features that are difficult to resolve from fragmented data [9], [202]. These include novel isoforms generated by alternative transcription initiation, AS, APA, and gene fusion events, which have been reported across plants, animals, and viral systems [33], [200], [201], [203], [204], [205], [206].

Application of DRS has revealed extensive isoform heterogeneity across species and biological contexts. In organisms such as A. thaliana, Caenorhabditis elegans, and in human cancer models, DRS studies have identified tens of thousands of previously unannotated full-length isoforms per species, tissue, or developmental condition [203], [204], [208]. This diversity reflects regulated variation in transcription initiation, exon usage, and transcript termination rather than solely stochastic transcriptional noise. DRS further uncovers fine-grained diversity at transcript 3′ ends, including widespread use of alternative and intronic polyadenylation sites. Such events can generate truncated transcripts encoding proteins with altered or novel functions, as exemplified by intronic polyadenylation of TLE1 in estrogen-responsive breast cancer, which produces functionally distinct protein variants [199], [203]. In parallel, DRS enables mapping of epitranscriptomic features, such as m6A and m5C modifications, within specific transcript contexts in animals, bacteria, and plants, where these modifications often correlate with tissue- or condition-specific expression patterns and characteristic poly(A) tail profiles [200], [209], [210], [211]. Across tissues, developmental stages, and environmental or pathological conditions, combined evidence from DRS and deep RNA sequencing demonstrates that most genes express multiple isoforms with regulated differences in TSS usage, untranslated regions, and terminal exons [65], [201], [202], [208], [212], [213]. Many of these isoforms alter protein-coding potential or introduce distinct post-transcriptional regulatory elements, thereby influencing protein truncation, localization, stability, and translational efficiency [203], [208], [214], [215].

Collectively, DRS reveals transcriptomes as ensembles of structurally and chemically distinct RNA molecules, whose isoform- and modification-level diversity encodes essential functional and context-specific information. This architectural view of the transcriptome provides a foundation for mechanistic studies of RNA regulation and highlights the limitations of gene-centric representations in capturing the full complexity of RNA-mediated control in biology and disease.

Alternative splicing and quantitative isoform regulation

AS is a dominant contributor to transcriptome diversity and a central mechanism in development and disease. More than 90% of multi-exon human genes and approximately 80% of plant genes undergo AS, including exon skipping, intron retention, alternative 5′ or 3′ splice-site usage, and mutually exclusive exons, which collectively reshape protein-coding potential and post-transcriptional regulatory elements [82], [216], [217], [218]. Accurately resolving such complexity is challenging for srRNA-seq, which infers isoforms indirectly from fragmented reads and often collapses distinct transcript variants at the exon or gene level. DRS enables direct, assembly-free identification of full-length splice isoforms, providing improved resolution of complex and coordinated splicing patterns that are difficult to reconstruct from short reads [82], [207], [208], [219]. This capability is especially useful for detecting novel or low-abundance isoforms that would otherwise be missed or misassembled [208], [220].

Full-length isoform definition also enables improved quantification of isoform usage [82], [207], [219]. Long-read-based tools such as LIQA and NanoCount leverage the properties of long reads to estimate isoform abundance and detect differential splicing or isoform switching with greater specificity than short-read-based approaches [86], [216], [221]. For example, during human neuronal differentiation, DRS-based analyses identified thousands of previously unannotated isoforms and revealed hundreds of differentially expressed isoforms and isoform switches, highlighting substantial transcript-level regulation beyond gene-level expression changes [86].

Because long-read sequencing captures complete transcript structures, including untranslated regions (UTRs), it provides a foundation for integrating splicing with downstream functional regulation. Isoform-resolved transcript models enable investigation of isoform-specific translational regulation, such as how alternative UTRs, retained introns, or nonsense-mediated decay-prone isoforms modulate translational efficiency without altering total RNA abundance [82], [221], [222]. Although most current translational efficiency metrics are computed at the gene level, the structural information provided by DRS and related long-read approaches makes transcript-level modeling of differential translational efficiency becoming increasingly feasible when integrated with ribosome profiling or proteomics data.

An additional strength of long-read sequencing is its ability to support ASE and haplotype-resolved isoform analysis [223], [224], [225]. Long reads spanning multiple heterozygous variants allow direct phasing of transcript isoforms to parental haplotypes, enabling detection of allele-specific splicing and expression. Methods such as IsoPhase, HapIso, and IDP-ASE use long-read transcriptomes to reconstruct haplotype-specific isoforms, identify imprinted genes and parent-of-origin effects, and dissect cis- and trans-regulatory influences in systems ranging from maize to human cells [223], [224], [225].

Together, nanopore DRS and related long-read approaches enable integrated analysis of AS, quantitative isoform regulation, translational control, and ASE at transcript resolution. This isoform-centric view reveals regulatory mechanisms that are largely invisible to gene-centric, short-read analyses and provides critical insights into RNA-mediated disease processes.

Poly(A) tail dynamics and 3’-end regulation

APA is a pervasive mechanism shaping transcriptome diversity and a major determinant of RNA stability, subcellular localization, and translational output [35]. By selecting distinct cleavage and PAS, APA generates mRNA isoforms with different 3′ UTR or, in some cases, altered coding sequences, thereby rewiring post-transcriptional regulatory programs [200], [226], [227]. Accurate characterization of APA is therefore essential for understanding RNA fate in both physiological and disease contexts. DRS is uniquely suited for studying APA because sequencing initiates at the native 3′ end of RNA molecules and proceeds through the entire poly(A) tail [60], [211], [228]. This provides direct, direct evidence of transcript termination sites, enabling improved mapping of poly(A) site usage across transcript isoforms, even in complex or heterogeneous samples. In contrast, short-read APA methods, including DaPars, IsoSCM, APAtrap, and IntMAP, typically infer poly(A) sites from read coverage patterns and often rely on prior annotations, which can obscure complex isoform architectures and coordinated RNA processing events [200].

By reading through native cleavage sites and poly(A) tails, long-read and DRS-based approaches directly resolve isoform-specific 3′ ends and avoid ambiguities associated with transcript reconstruction. When combined with dedicated tail analysis tools such as tailfindR, Nanopolish, Poly(A)tailor, and Ninetails, ONT DRS enables single-molecule assignment of poly(A) sites usage, tail length, and tail composition to specific transcript isoforms [60], [228], [229], [230]. This isoform-resolved perspective is particularly valuable given that more than half of human genes use multiple poly(A) sites. A large-scale plant full-length RNA atlas reported more than 120 million polyadenylated mRNA molecules across multiple tissues and species, uncovering conserved, tissue-specific poly(A) tail-length distributions, including reproducible peaks around ~20 nt and ~45 nt in many tissues [231]. Poly(A) tail length itself is dynamic and developmentally regulated, with important consequences for RNA stability and translational control. Genome-wide studies using approaches such as TAIL-seq [232], Poly(A)-seq [233], and DRS-based tail profiling have shown that highly expressed and efficiently translated mRNAs often carry relatively short steady-state poly(A) tails in many non-embryonic systems [226], [234], [235]. Rather than uniformly predicting translational efficiency, tail length is more closely associated with RNA half-life and pathway-specific regulatory programs [60], [234], [236]. In addition, widespread incorporation of non-adenosine residues, such as uridylation, guanylation, or mixed tails, adds another layer of regulation by modulating RNA decay and stability [232], [233]. Recent DRS-based methods, including Ninetails, now enable detection of internal non-A residues within poly(A) tails in endogenous and therapeutic mRNAs [229].

Importantly, APA and poly(A) tail regulation are closely interconnected with other RNA processing steps. AS and 3′-end processing exhibit extensive mechanistic crosstalk, and long-read technologies are essential for resolving coordinated AS-APA programs on individual transcripts [226], [227], [237]. DRS and PacBio-based Iso-seq analyses demonstrate that poly(A) tail length and composition can vary systematically between isoforms derived from the same gene, supporting a model in which splicing decisions, cleavage-site choice, and tail regulation are co-regulated to shape RNA stability and fate [211], [230], [232], [238].

These studies support the view that APA and poly(A) tail length and composition form a tightly coordinated 3′-end regulatory layer. By uniquely linking cleavage site selection, isoform structure, and tail features on the same RNA molecule, nanopore DRS enables an isoform-resolved understanding of 3′-end regulation and its role in transcriptome diversity, gene regulation, and disease.

RNA chemical modifications and epitranscriptomic diversity

RNA chemical modifications add a critical dimension to transcriptome diversity by modulating RNA structure, stability, splicing, localization, and translation [239]. Unlike antibody-based enrichment or chemical conversion approaches, nanopore DRS detects RNA modifications directly through characteristic perturbations in ionic current signals as native RNA molecules translocate through the nanopore [118], [120], [207], [240]. Consequently, DRS can be used to infer several classes of RNA modifications on single molecules and at single-nucleotide resolution from nanopore current signals, especially when combined with dedicated modification-calling models [116], [118], [207], [240], [241]. Recent advances in SQK-RNA004 pore chemistry and basecalling models such as Dorado now enable implements modification-aware basecalling models that infer m6A, Ψ, m5C, Nm, and inosine, improving accuracy and F1 scores in benchmark datasets [53]. Nevertheless, orthogonal validation remains essential, particularly for low-stoichiometry or context-dependent modifications.

A major advantage of DRS lies in its preservation of full-length transcript context. Long reads allow modification calls to be assigned to specific splice and APA isoforms rather than being aggregated at the gene or exon level [60], [118], [207], [240], [242]. This isoform-resolved view is crucial, as RNA modifications often exhibit variant-specific distributions that are obscured by short-read or enrichment-based methods. Community resources such as DirectRMDB now aggregate hundreds of thousands of DRS-derived modification sites spanning multiple modification types and explicitly support isoform-level exploration of epitranscriptomic patterns [240]. Analytical tools such as R2Dtool further integrate isoform-mapped modification sites with open reading frames, splice junctions, and untranslated regions, enabling systematic analysis of how modification landscapes shift with AS or 3′-end choice [242]. DRS-based studies have begun to illuminate coordinated regulation between RNA modifications and other layers of RNA processing. For example, in human leukemia cells, long-read DRS enabled joint profiling of m6A deposition, poly(A) tail length, transcript abundance, and splicing, revealing coordinated transcript-specific changes following perturbation of the m6A writer [60]. Such analyses demonstrate how chemical modifications interact with RNA processing and stability in a highly context-dependent manner. Widely studied modifications, including m6A, m5C, Ψ, and A-to-I editing, have been implicated in regulation of splicing, nuclear export, RNA decay, translation, and subcellular localization, and are linked to cancer, cardiovascular disorders, and inherited diseases [81], [116], [243], [244], [245]. In plants, DRS-based m6A profiling has progressed from proof-of-concept to quantitatively benchmarked applications. For example, in Populus trichocarpa stem-differentiating xylem, Nanopore-based m6A inference identified 3253 m6A-modified genes, of which 2626 overlapped with MeRIP-seq results, demonstrating substantial concordance at the gene level while preserving transcript-specific resolution [172]. Importantly, accumulating evidence indicates that dysregulated RNA modification often acts at the level of specific transcript variants rather than entire genes, leading to isoform-selective effects on RNA fate [81], [243], [244]. DRS-based protocols now enable differential modification analysis across conditions, perturbations, and cell states, facilitating dynamic studies of epitranscriptomic regulation in disease models [60], [118], [246].

Together, these advances establish nanopore DRS as an important technology for charting RNA chemical modifications in their native isoform context. By revealing variant-specific and combinatorial epitranscriptomic programs on individual RNA molecules, DRS provides insights into the functional diversity of the transcriptome and its roles in development, disease, and therapeutic response [177].

Discovery of noncanonical and noncoding RNA species

Beyond protein-coding transcripts, DRS has substantially expanded our understanding of transcriptome diversity by enabling systematic discovery and characterization of noncanonical RNA species. Long noncoding RNAs (lncRNAs), circular RNAs (circRNAs), transfer RNAs (tRNAs), and other small or highly structured RNAs are often poorly captured by conventional RNA-seq protocols owing to their size, secondary structure, low abundance, or lack of polyadenylation. DRS is particularly powerful for profiling lncRNAs, as it captures long, often low-abundance transcripts with their native 5′ and 3′ boundaries, splice patterns, and poly(A) tail features intact [247]. This overcomes biases associated with poly(A)-selected or non-strand-specific srRNA-seq, which frequently miss non-polyadenylated lncRNAs or mis-assign transcript orientation. In Arabidopsis, integration of DRS with low-abundance-aware isoform discovery identified more than 1100 previously unannotated lncRNAs and revealed that approximately one-third of lncRNAs lack poly(A) tails [211]. Moreover, poly(A) tail length and m6A modification status was shown to jointly regulate lncRNA abundance and stability, underscoring the importance of isoform-resolved analysis, particularly in disease contexts where only specific lncRNA variants may be functional [248], [249], [250].

circRNA represents another class of noncanonical transcripts that benefit from long-read sequencing. Because circRNAs are non-polyadenylated, their detection typically requires ribosomal RNA (rRNA) depletion or specialized enrichment strategies. Long-read nanopore approaches have become essential for reconstructing full-length circRNA sequences and defining their exon composition [251], [252], [253]. For example, the CIRI-long workflow combines circular RT with nanopore sequencing to achieve substantial enrichment of circRNAs and to resolve complex variants, including mitochondrial and read-through circRNAs in mouse brain [252]. These studies highlight the importance of long, processive reads for distinguishing lowly expressed circRNAs from linear isoforms and for precisely mapping back-splice junctions and internal structures, which are critical for functional characterization and biomarker discovery [251], [253], [254], [255]. tRNA poses distinct technical challenges owing to their short length, extensive secondary structure, and dense chemical modification. Customized nanopore DRS protocols and analytical pipelines, including direct tRNA adapters, Nano-tRNA-seq strategies, and updated pore chemistries, now enable end-to-end sequencing of full-length tRNAs at single-molecule resolution [36], [192], [256]. These approaches allow simultaneous quantification of tRNA isoacceptor abundances and detection of numerous modification types on individual molecules, revealing coordinated “modification circuits” within tRNA structural elements such as the T loop, as well as stress- or condition-specific remodeling of tRNA pools [192], [256]. Integration of DRS with LC-MS/MS and complementary epitranscriptomic assays further enables transcriptome-scale mapping of structured noncoding RNAs and their dynamic modification states [192], [256], [257], [258].

Collectively, evidence across lncRNAs, circRNAs, tRNAs, and other structured or noncanonical RNAs demonstrates that nanopore DRS substantially expands the detectable transcriptome. By delivering full-length transcript structures, native processing states, and modification landscapes that are largely inaccessible to conventional srRNA-seq, DRS provides a comprehensive framework for exploring the functional diversity of noncoding and noncanonical RNA species in biology and disease.

RNA structure and conformational heterogeneity

RNA structure is an intrinsic component of transcriptome diversity, shaping RNA stability, subcellular localization, and interactions with proteins and other RNAs. Classical approaches to RNA structure probing rely on chemical modification or enzymatic digestion and typically require dedicated experimental workflows that are separate from transcriptome profiling. DRS provides a complementary, signal-based strategy by capturing ionic current and dwell-time variations that reflect RNA secondary and tertiary structure during translocation through the nanopore [259]. In DRS, a motor enzyme feeds native RNA molecules through a biological nanopore, and both the nucleotide sequence and its structural context influence the resulting ionic current and translocation kinetics.

Base-paired regions, stable secondary structures, and chemical adducts used for structure probing can all perturb current intensity and dwell time in characteristic ways. Several proof-of-concept studies have demonstrated the feasibility of this approach. For example, the nanoSHAPE framework showed that SHAPE adducts and endogenous modifications in rRNA generate reproducible current and dwell-time shifts at single-molecule, long-read resolution, enabling simultaneous sequencing and structure probing [118]. In a complementary single-molecule system, an engineered reverse transcriptase coupled to an MspA nanopore revealed that enzyme stepping kinetics are sensitive to downstream RNA secondary structure, allowing direct detection of structured regions without prior cDNA conversion [260]. Related strategies exploiting dwell-time perturbations at helicase “brake” points have further been used to distinguish Ψ from uridine and to infer modification-dependent structural stabilization in viral and bacterial RNAs [261], [262]. Collectively, these studies indicate that structured or base-paired RNA regions produce distinct signal signatures compared with flexible, unstructured segments, although deconvolving structural effects from those of chemical modifications remains an active area of methodological development [118], [260]. Because DRS yields full-length reads, structure-sensitive signals can, in principle, be mapped continuously along entire transcripts and analyzed in conjunction with AS patterns, untranslated regions, and epitranscriptomic modifications [263], [264], [265], [266]. In this way, DRS-based approaches complement transcriptome-wide chemical and enzymatic structure-mapping methods by embedding structural information within native transcript architectures.

RNA structure plays a central role in regulating AS, RNA-protein interactions, and repeat-associated toxic gain-of-function mechanisms, and its disruption contributes to cancer, neurodegeneration, and repeat-expansion disorders [267], [268], [269]. Disease-associated single-nucleotide variants can act as riboSNitches, altering local or long-range RNA structure and thereby reshaping splicing decisions, RNA stability, or RNA-binding protein affinity [265], [266]. Integrating DRS-derived structural signals with isoform-resolved modification maps and transcript profiles offers a promising strategy to pinpoint variant- or isoform-specific structural changes that rewire post-transcriptional regulation and translational control in disease [264], [270], [271]. sm‑PORE‑cupine integrates chemical probing with direct RNA sequencing to map RNA structure ensembles at single‑molecule resolution. By uncovering isoform‑specific structural heterogeneity and linking these ensembles to translation efficiency and RNA stability, this approach provides valuable insights into RNA structure-function relationships across complex transcriptomes [272].

Taken together, current evidence supports nanopore DRS as a promising, though still maturing, signal-based complement to chemical and enzymatic RNA structure probing. Its ability to relate RNA structural features to full-length isoforms, chemical modifications, and disease-linked sequence variation on single molecules positions DRS as a promising emerging tool for studying RNA conformational heterogeneity in health and disease. Although systematic applications in plant systems remain limited, this capability holds significant promise for broader implementation.