Challenges and technical improvements of DRS

Despite its ability to deliver a variety of high-value analytical applications, the performance of DRS remains subject to sample type, library chemistry, nanopore properties, motor enzymes and basecalling models. Most related applications continue to rely heavily on model-driven inference and can be significantly affected by sequencing coverage, signal quality and available training data. DRS should therefore be considered not as an all-around or universally better substitute for traditional transcriptomic methods, but as a rapidly developing platform that shows clear strengths when native long-read data are specifically required.

In this section, we summarize the major technical constraints that currently limit DRS, with emphasis on sample input, transcript coverage, signal interpretation, reproducibility, and computational generalizability. We then discuss recent improvement directions in chemistry, hardware, algorithms, and standardization. To avoid overinterpretation, we also propose a task-oriented view of DRS outputs, distinguishing between readouts that are already relatively robust, features that remain strongly model-dependent, and biological conclusions that still require extensive orthogonal validation.

Current technical challenges in DRS

Sample input, library preparation, and transcriptome representation

DRS library preparation still requires relatively large amounts of input RNA. For example, the ONT SQK-RNA004 workflow typically requires approximately 300 ng of poly(A)-selected RNA or about 1 μg of total RNA, which remains challenging for many low-yield samples such as plasma RNA, extracellular vesicle RNA, needle biopsies, rare clinical specimens, or highly specific micro dissected tissues [60]. Although multiplexing and targeted enrichment may partially alleviate this constraint, these solutions are not yet standardized across laboratories or application settings.

This limitation is even more pronounced in single-cell and spatially resolved applications. The RNA content of a single cell is orders of magnitude below the input requirements of current commercial DRS kits, and low-input processing introduces substantial risks of adsorption loss, degradation, contamination, and stochastic transcript dropout [534], [673], [674]. As a result, the application of DRS in single cell has not been developed to date.

Limited performance for short RNAs

DRS remains poorly suited for the short RNAs. Molecules shorter than roughly 100−150 nt are often inefficiently captured or confidently basecalled in standard workflows [60] , even though many biologically important and clinically informative RNAs fall within or below this range. While recent updates to the MinKNOW software have improved detection sensitivity for RNA molecules longer than 50 nt, biologically relevant short RNAs, such as 22−45 nt cell-free RNAs in liquid biopsies that serve as potential disease biomarkers, remain largely undetectable [63]. Currently, laboratory-developed small-RNA DRS protocols are promising, but they are not yet broadly standardized and continue to face challenges in reproducibility, adapter discrimination, and general applicability. While laboratory-based non-poly(A) enrichment methods exist, their lack of commercialization restricts transcriptome comprehensiveness and highlights the need for inclusive capture strategies [36], [192].

Limitations of DRS in throughput, accuracy, and completeness

Although DRS has certain advantages, benchmarking this approach against mainstream transcriptomic technologies, such as Illumina short-read platforms, nanopore cDNA sequencing, and PacBio Iso-Seq exposes inherent technical bottlenecks [81]. The most notable limitations revolve around base-calling fidelity, 5’ transcript integrity, and sequencing throughput [162], [189].

An elevated base error rate remains a fundamental constraint of the DRS platform. Although the incorporation of the SQK-RNA004 sequencing kit has improved the overall read accuracy, the systematic errors remain largely identical compared to the previous kits. Some sequencing errors are attributable to signal insufficiency rather than algorithmic (basecalling) artefacts [54]. This performance contrasts starkly with NGS methodologies, nanopore cDNA sequencing, and modern PacBio Iso-Seq that utilizes Circular Consensus Sequencing (CCS/HiFi) to suppress error rates [39]. The high frequency of stochastic errors in DRS severely complicates single nucleotide variant (SNV) calling [54], [675]. Insertion and deletion artifacts in homopolymeric regions are particularly problematic and generate a prohibitive background of false-positive variants [54], [534], [676]. Therefore, for studies relying on high-precision detection, such as SNV identification, it is still recommended to use high-accuracy methods including NGS, nanopore cDNA sequencing, and PacBio Iso-seq [191], [468], [675], [677]. DRS should be used in conjunction with these high-precision approaches.

Transcript completeness presents another major hurdle. DRS elegantly circumvents artifacts introduced by reverse transcription and PCR amplification, yet it paradoxically struggles to capture true full-length transcripts when compared to nanopore cDNA sequencing and PacBio Iso-Seq [81], [84]. Researchers have well documented the systematic loss of coverage at the 5’ terminus during DRS runs [646], [678]. This structural truncation occurs because the motor protein driving the RNA strand dissociates or accelerates prematurely as the tail end of the molecule translocates through the pore. The sensor consequently misses approximately 10 to 15 nucleotides at the 5’ end [35], [60], [109], [191]. Pervasive in vitro RNA degradation during library preparation further exacerbates this physical limitation [481]. The resulting 5’ incompleteness critically impairs downstream transcriptomic interpretations [84]. Assembly algorithms frequently misclassify degraded or systematically truncated 5’ ends as novel alternative TSS [533]. This misinterpretation artificially inflates the perceived complexity of the transcriptome and hinders accurate isoform quantification. Moreover, although poly(A) tail profiling represents one of the more mature and accurate applications of DRS, particularly when using the Dorado basecaller (v0.9.0 or later), its reliability can still be affected by factors such as RNA degradation and transcript end integrity [468], [537]. Consequently, current poly(A) tail-length estimates should not be regarded as a fully comprehensive readout for interpreting post-transcriptional regulation across all transcript contexts.

In addition, the overall sequencing depth of DRS is also relatively lower compared with NGS [81]. For tasks requiring high coverage, such as transcriptome assembly and annotation in novel species, cDNA-based long-read sequencing supplemented with NGS for error correction may be more appropriate [84], [612].

Taken together, these comparisons demonstrate that while DRS holds unique advantages in retaining native RNA information its higher base error rate and relatively lower transcript completeness compared with NGS, nanopore cDNA, and PacBio Iso-seq restrict its utility in high-precision applications such as SNV identification and comprehensive transcriptome profiling. In these scenarios, NGS-based error correction would be helpful to enhance the reliably of DRS.

Ground truth limitations and unresolved challenges in RNA modification detection

Currently, there is a significant gap between expectations and routine practice in the field of epitranscriptome interpretation. Most DRS-based modification callers infer modifications indirectly from deviations in ionic current intensity, dwell time, or basecalling behavior relative to learned expectations [655]. This creates several sources of uncertainty.

A fundamental challenge is the lack of robust, broadly accepted ground truth datasets for many RNA modifications. Unlike canonical sequence benchmarking, where reference genomes and validated variant sets are often available, modification calling typically depends on synthetic constructs, enzyme perturbation experiments, antibody-based enrichment, or site-specific orthogonal assays, each of which captures only part of the issue and introduces its own biases [116], [119]. As a result, these reported methods’ performance may not transfer well across species, transcript contexts, chemistries, or laboratories. The second challenge is signal confounding. Modification-associated signatures are influenced by k-mer context, neighboring modifications, RNA secondary structure, translocation kinetics, and local noise [655]. Multiple nearby modifications may compress or distort the current profile, making deconvolution difficult [182]. This is particularly problematic for de novo discovery, where the search space is large and false positives can accumulate rapidly if candidate signals are not filtered against appropriate negative controls and matched reference backgrounds [34], [372]. The third challenge is quantification. Even when a site can be detected, accurate estimation of modification stoichiometry remains difficult, especially for low-abundance transcripts or heterogeneous transcript isoforms [62], [372]. Current methods vary in whether they classify reads, positions, or transcript-level events, and these analytical choices affect the biological conclusions that can be drawn [180]. This problem is even more pronounced for low-abundance or low-stoichiometry modifications, where limited read support and weak signal perturbation can jointly reduce sensitivity and precision, making confident detection particularly difficult in complex transcriptomes [62], [145]. Overall, the challenges that need to be addressed for the identification and quantification of RNA modifications using DRS include the construction of authentic and reliable training sets, the screening of effective prediction results, and the enhancement of comparability among different tools and models.

The rapid development of quantitative methods for RNA m6A modification based on chemical or enzymatic conversion combined with NGS, such as GLORI-seq, has generated extensive and reliable quantitative datasets of m6A sites [155], [679]. These resources enable model training for DRS-based m6A detection to avoid manual labeling of training sets. Models can be trained directly using m6A modification levels at corresponding sites as seen in methods like SingleMod [155], [179]. This progress has greatly improved the predictive performance of related models and serves as an important reason for the relatively high accuracy of DRS in m6A prediction [680]. Future development of identification and quantification models for RNA modifications using DRS should therefore prioritize the establishment of precise experimental approaches. Such methods can supply accurate and dependable training sets for model training and also act as an important means to validate model outputs. Effective filtering is required for prediction results generated by DRS models [179]. Preliminary screening can be performed following threshold combinations recommended by corresponding tools [62], [162]. Common filtering indicators include modification levels and sequencing coverage depth [162]. Stricter filtering criteria may also be applied according to specific research demands to achieve more reliable outcomes [119], [326]. Additional validation with the support of relevant databases and experimental methods is also helpful [521], [681]. GLORI-seq for instance can be adopted to verify predicted m6A modification signals [21], [155].

To improve the comparability of prediction outcomes across different tools or models researchers suggest integrating results from multiple tools for site-level identification:ref:[162] <ref162>. Consistent sites identified through intersection analysis can be regarded as confident modification positions [162]. In addition, modification ratio from different models and tools vary greatly. Such discrepancies are closely associated with model training datasets [162], [580], [655]. Currently, this issue is tough to resolve. To avoid this issue, groups should utilize the same tool in the same experiment to get comparable results.

Technological improvement directions of DRS

Hardware optimization

Hardware optimization containing sequencing chemistry, pore design, and motor enzymes, remains essential for DRS because many current analytical limitations originate upstream at the level of signal generation rather than downstream interpretation alone [54], [120].

In DRS, irregular motor stepping, incomplete control of RNA translocation, pore blockage by structured molecules, and chemistry-dependent signal drift can all distort the ionic current profile before basecalling begins [39], [194]. These effects reduce signal-to-noise ratio, compress differences between similar k-mers, and increase local instability at transcript ends or in structurally complex regions, thereby contributing to read truncation, base miscalls, and reduced confidence in modification-associated signal shifts.

Improvements in pore proteins, motor enzymes, and sequencing chemistry are therefore needed not simply to raise nominal accuracy, but to produce cleaner and more uniform signal traces. More stable pore-motor coupling and better-controlled translocation can reduce dwell-time variability and improve resolution of adjacent nucleotides, whereas motors with greater processivity and tolerance for structured RNA may lower blockage rates and improve read continuity across difficult regions [39]. In practical terms, these changes can enhance splice junction mapping, transcript boundary assignment, detection of subtle sequence variants such as RNA editing events, and discrimination among highly similar alleles, paralogs, or copy-number-related transcript copies [81], [682]. They may also improve the separability of modification signals from background noise, although this remains highly context dependent. Notably, these improvements should be interpreted as reducing specific technical bottlenecks rather than conferring uniformly high nucleotide-level accuracy across all applications, and conclusions involving subtle variants or epitranscriptomic features still need orthogonal and cross-platform validation [34]. For nucleotide-level applications in which even small residual errors can alter biological interpretation, short-read sequencing and other long-read approaches remain essential reference frameworks.

Algorithm optimization: From base calling to scalable, uncertainty-aware, and integrative modeling

The next stage of DRS development will depend not merely on adding new base or modification callers, but on establishing analysis frameworks that can (i) efficiently prioritize true signals from a large candidate space; (ii) distinguish chemically similar modifications under heterogeneous sequence contexts; (iii) quantify uncertainty explicitly; and (iv) connect read-level molecular features to RNA structure, function, and regulatory networks [518]. These challenges are particularly acute for native RNA, where the observed current at any given position reflects not only the focal nucleotide but also its surrounding k-mer context, neighboring modifications, translocation dynamics, and run-specific noise.

The first priority is the development of scalable strategies for candidate screening and false-positive control [62]. In practice, many current modification analyses still rely on broad signal scanning followed by threshold-based calling, an approach that becomes increasingly unstable when the number of candidate sites is large, coverage is uneven, or multiple modification types may coexist within the same transcript [175], [683]. A more robust computational workflow will likely require multi-stage filtering. One practical strategy is to combine a high-sensitivity first-pass detector with downstream evidence integration, in which candidate sites are retained only if they satisfy predefined criteria across several dimensions, such as read depth, replicate concordance, local signal consistency, transcript-context plausibility, and contrast against matched negative controls or perturbation datasets. Hierarchical statistical models, empirical Bayes shrinkage, and replicate-aware false discovery control may be especially useful here, because they can borrow information across sites or transcripts while preventing low-coverage outliers from being overinterpreted [34], [185]. Such frameworks would be particularly valuable for large-scale studies, where the main computational question is no longer whether some anomalous sites can be detected, but how to rank, filter, and validate thousands of weak candidate events without inflating false positives.

The second challenge is the discrimination of modifications with subtle or partially overlapping signal signatures. For example, closely related marks such as m6A and m6Am, or context-dependent signatures that resemble pseudouridylation- or m5C-associated perturbations, may not be separable by a single signal feature alone [390], [684]. Recent multi-modification models, such as ORCA [503], DirectRM [180], and NanoSpeech [180], have advanced DRS analysis beyond one-modification-at-a-time calling by enabling unified prediction of multiple RNA modification classes from shared nanopore signal features. Their main strength is that they capture common and modification-specific signal perturbations within a single framework, thereby improving the feasibility of multi-class inference on native RNA. However, their performance remains limited by the restricted diversity and scale of available training data, which often fail to represent endogenous transcript contexts, variable stoichiometries, neighboring modifications, and chemistry-dependent signal shifts. As a result, distinguishing closely related or partially overlapping modification states, such as m6A and m6Am, remains challenging, particularly when local RNA structure or adjacent modifications distort the signal. In addition, most current models still behave largely as closed-set classifiers and may over assign ambiguous or out-of-distribution signals to known classes [518]. Future progress will therefore require context-aware and uncertainty-aware frameworks that integrate local signal features with transcript position, motif environment, cap proximity, and RNA structural information, while also incorporating calibrated probability outputs or abstention categories for unresolved cases. Equally important, these models should be evaluated not only by classification accuracy, but also by cross-chemistry robustness, reproducibility across biological replicates, and consistency with orthogonal validation. Moreover, reliable information on modification sites obtained by methods based on NGS should be fully utilized in the construction of model training sets [179], [181].

The third requirement is explicit uncertainty modeling and cross-platform validation. Because many DRS modification calls remain model-dependent, algorithm outputs should ideally include confidence estimates at the read, site, and transcript levels [685]. This could be implemented through Bayesian formulations, ensemble prediction, conformal prediction, or calibration procedures that distinguish confident calls from ambiguous ones [162], [175], [579]. Such uncertainty estimates are not only statistically desirable; they are essential for determining which candidates should advance to orthogonal validation. In this context, computational analysis should be coupled more tightly to experimental design. For high-confidence sites, validation may involve writer/eraser perturbation, synthetic or in vitro-transcribed reference RNAs, class-level mass spectrometry, or site-directed biochemical assays, depending on the modification under study [181], [685]. Equally important is validation across analytical environments: a candidate signal that disappears after chemistry change, basecaller update, or remapping is unlikely to be a robust biological event. Future best-practice pipelines should therefore treat reproducibility across biological replicates, software versions, and orthogonal assays as part of the calling framework itself rather than as an optional downstream step.

The fourth frontier lies in modeling combinatorial and context-dependent RNA regulation at the single-molecule level. At present, single-molecule integrative modeling with DRS remains largely at the proof-of-principle stage rather than a routine analytical reality. Although DRS can in principle link multiple transcript features within the same native RNA molecule, this integration is still limited by the limitations outlined above, such as basecalling errors, systematic loss of 5’ terminal information and relatively low sequencing throughput. In addition, these features are rarely measured with comparable coverage or confidence on the same molecules, making downstream network inference highly sensitive to missing data and error propagation [445]. Near-term progress will therefore depend less on simply adding more data layers and more on improving the reliability of each layer and their cross-modal alignment [189], [685]. Practically, this will require higher-fidelity basecalling, transcript-end recovery, and confidence-aware modification calling with replicate and perturbation support. A realistic path forward is stepwise integration, beginning with better-supported combinations, such as isoform-poly(A) or isoform-modification coupling, and then extending to RNA structure, RBP occupancy, translation, and stability through matched multi-omics datasets and perturbation-based validation.

Finally, multi-omics integration will be essential for constructing hierarchical maps of RNA regulation. In future studies, DRS data should not be modeled only as an alternative transcriptomic readout, but as one molecular layer within a broader regulatory system that also includes genome variation, chromatin state, transcriptional activity, RNA processing, translation, and proteome output [503]. A practical analytical framework may involve three levels: first, read-level inference of transcript structure, poly(A) tail features, and candidate modification states [179]; second, transcript- or gene-level integration with abundance, allele, and structural information; and third, network-level modeling that links these RNA features to upstream regulators and downstream phenotypes. This hierarchy may be especially important in complex biological settings such as development, stress adaptation, tumor evolution, or host-pathogen interactions, where the functional significance of an RNA modification cannot be inferred from its presence alone [285].

Overall, the most pressing computational need is not only higher predictive accuracy, but a shift toward scalable candidate prioritization, uncertainty-aware classification, reproducibility-centered filtering, and mechanistically informed integration.

Standardization and benchmarking

Standardization and benchmarking are essential in DRS not only for technical harmonization, but because many biologically important outputs, especially modification calls, stoichiometry estimates, allele-specific analyses, and even some transcript-end assignments, remain highly sensitive to chemistry version, basecaller, alignment strategy, and training data [516], [686]. The central problem is therefore not the absence of a single universal benchmark, but the lack of task-specific truth standards and evidence frameworks. Future benchmarking should be organized by analytical task, since transcript structure reconstruction, 5’ and 3’ end definition, poly(A) measurement, modification detection, stoichiometry estimation, and allele-aware inference differ substantially in their error modes and validation requirements [81], [84]. A useful next step would be to establish community reference sets that combine synthetic RNAs, perturbation-derived controls, and biologically matched samples processed across laboratories, chemistries, and software versions, so that both accuracy and robustness can be assessed explicitly. Just as importantly, DRS studies would benefit from an evidence hierarchy in which relatively direct readouts are distinguished from model-dependent inferences and from higher-level biological claims that require orthogonal validation. Such a framework would make cross-study comparisons more meaningful, reduce overinterpretation of version-specific results, and provide a more realistic basis for future clinical or regulatory adoption.

A task-oriented evidence framework for interpreting DRS data

Given the uneven maturity of different DRS outputs, we suggest that interpretation should follow a task-oriented evidence framework. Tier 1: relatively robust readouts (High Confidence). These include long-read-supported transcript structures, broad APA patterns, poly(A) tail estimates with dorado software, and quantification of gene expression in sufficiently depth and well-behaved transcript regions [84], [200], [537].

Tier 2: model-dependent inferences (Moderate Confidence). These include identification and quantification of well-characterized RNA modifications (e.g., m6A), low-abundance isoform quantification, splicing events, prediction of RNA structure, and RNA editing sites. Tier 3: partial single-base-level output. These applications encompass SNV detection, transcription start site identification [506], as well as the identification and quantification of emerging RNA Modifications that cannot be confidently resolved using NGS-based methods [685].

This framework does not undermine the value or the substantial technological advancements represented by DRS; instead, it helps align data interpretation with current technical maturity and may mitigate overinterpretation in both basic and translational research. Moreover, the framework is not absolute. For example, alternative splicing sites identified by DRS show significantly higher confidence when cross-validated with reliable platforms such as Illumina sequencing [84], [365]. Notably, this framework is only suitable for the present stage, and the corresponding classification will need to be adjusted accordingly as DRS technology continues to improve.