Module 02 · 60 min

Reading the Genome: Platforms & Modalities

Short-read, long-read, optical mapping, single-cell, methylation — what each modality actually sees.

FoundationsClinicalResearch

Core topics

What's covered

T01Illumina short-read SBS: chemistry, error model, throughput economics
T02PacBio HiFi (CCS): consensus accuracy, methylation calling
T03Oxford Nanopore: ultra-long reads, native modifications, real-time
T04Optical mapping (Bionano), Hi-C, Strand-seq for structural variation
T05Single-cell genomics: scRNA-seq, scATAC-seq, multiomics
T06Targeted approaches: exome, panels, adaptive sampling

Learning objectives

By the end of this module you will be able to

L01Match a clinical question (germline SNV, repeat expansion, methylation, structural variant) to the appropriate sequencing modality.
L02Quantify the cost-vs-resolution trade-offs between WGS, WES, and gene panels in 2026.
L03Describe why long reads are required for accurate structural-variant detection and pseudogene resolution.

Key takeaways

What you should walk away believing

Short-read WGS misses ~30–50% of structural variants and most repeat expansions; long-read WGS now finds them at clinical sensitivity.
PacBio HiFi and ONT both call 5mC and 6mA natively without bisulfite conversion — methylome and genome from a single library.
Adaptive sampling on ONT lets you target arbitrary regions in silico with no library redesign.
Single-cell multiomics (10x Multiome, Parse, BD Rhapsody) is now standard for tissue heterogeneity studies but rarely clinical.

Lesson · Foundations emphasis

What this means at your level

Foundations

Different sequencing technologies see different parts of biology. Short-read sequencing (Illumina) is cheap and accurate for single-base changes. Long-read sequencing (PacBio, Oxford Nanopore) is required to resolve repeat expansions, large structural variants, and DNA methylation natively. The right test depends on the clinical question.

Clinician deep-dive

Default to a gene panel for known-disease workups; default to exome (WES) when the phenotype is broad; reserve genome (WGS) for unsolved cases or when structural variation, intronic variants, or mitochondrial DNA are likely. For Fragile X, Huntington, FSHD, repeat-expansion ALS — short-read is inadequate; order long-read or a targeted repeat-primed PCR.

Research note

HiFi error rates (~Q30) match short-read while preserving 15–25 kb read length, making it the de facto standard for de novo assembly. ONT's PromethION + Dorado basecalling now achieves modal Q30 simplex and Q35+ duplex with native 5mC/6mA. Adaptive sampling enables programmable enrichment without wet-lab capture.

Myth-buster

Whole-genome sequencing finds all variants.

Reality

Standard short-read 30x WGS systematically misses repeat expansions, large insertions, complex SVs, and many regions of segmental duplication. Diagnostic yield improvements from short-read WGS over WES are modest (~5–10%); long-read WGS adds a meaningful additional yield in unsolved Mendelian cases.

Evidence-graded claims

What the data say

Evidence summary

4 graded claims · evidence index 88%

Open all →

Strongest evidence

Long-read sequencing resolves structural variants missed by short-read WGS

Established across multiple benchmarks; HG002 truthset sensitivity gain is well documented.

Audit grade A in this module →

Native methylation calling on PacBio and ONT is accurate enough to replace bisulfite sequencing

Concordance >95% with WGBS at >5x coverage; saves a sample and is non-destructive.

Audit grade A in this module →

Whole-exome sequencing has higher diagnostic yield than gene panels for most rare-disease workups

True for broad/atypical phenotypes; panels still preferred for well-defined disease with clear-cut gene lists.

Audit grade B in this module →

Clinical-grade nanopore sequencing is ready for primary diagnostic use today

Increasingly true for SVs and methylation; SNV calling is competitive but not yet a default reimbursable replacement.

Audit grade C in this module →

See all claims for this module in the Evidence Grader →

Quick check

Test yourself

Q1A patient presents with adult-onset cerebellar ataxia and a strong family history. What's the highest-yield single test?

Q2PacBio HiFi achieves ~Q30 accuracy by:

Glossary

Key terms & abbreviations

WGS / WES: Whole-genome / whole-exome sequencing. Exome captures ~1–2% of the genome (coding regions plus splice sites).
HiFi: PacBio circular-consensus reads, typically 15–25 kb at >Q30 — currently the gold standard for de novo assembly.
Adaptive sampling: Oxford Nanopore feature that ejects molecules in real time based on early-read mapping, enabling target enrichment without library prep changes.
Hi-C: Proximity-ligation method to map 3D genome organization; resolves structural variants, scaffolds assemblies, and identifies TAD boundaries.

Anchor references

Long-read human genome sequencing and its applications — Logsdon, Vollger & Eichler, Nat Rev Genet 2020
Nanopore sequencing technology, bioinformatics and applications — Wang et al., Nat Biotechnol 2021

The Human Genome at Scale

Variant Interpretation: ACMG, VUS, and Penetrance