Editorial cover image for single cell figure design: How to Design Readable Figures Without Losing Meaning
Data Visualization

single cell figure design: How to Design Readable Figures Without Losing Meaning

SA
Shobajo AbdulAzeez
14 min read2,869 words
In This Article

Why single cell figure design needs a readability-first mindset

single cell figure design is often treated as a final formatting task, but it is really part of the analysis. Your UMAP, dot plot, heatmap, trajectory, or marker panel is not just decoration. It is the argument readers use to judge whether your interpretation is believable. If the figure is crowded, inconsistent, or visually noisy, the science becomes harder to trust.

Single-cell and omics datasets are naturally dense. You may have dozens of clusters, thousands of genes, multiple samples, time points, perturbations, batches, and cell states. The temptation is to show everything because everything took effort to generate. We sympathize, but readable figures are selective. They do not hide complexity. They organize it.

single cell figure design comparison showing a crowded multi-panel UMAP figure beside a cleaner version with clearer hierarchy, labels, and legends
Photo by Phong Thanh on Pexels, via Pexels

A readable single-cell figure answers one primary question first. It then gives readers enough supporting detail to evaluate that answer. That may mean splitting one overloaded panel into two simpler panels, moving secondary markers to supplement, or grouping annotations into a cleaner hierarchy. The goal is not minimalism for its own sake. The goal is scientific clarity.

This guide focuses on practical decisions you can make before export, during layout, and while polishing a figure for slides, preprints, papers, or grant applications. If you want to speed up the visual assembly step, you can also create with Graffiy and build cleaner scientific figures from structured visual components.

Start with the claim, not the plot type

Before choosing a visual format, write the sentence the figure must support. For example: “Inflammatory macrophages expand after treatment,” “cell state transitions follow a continuous gradient,” or “sample donor effects are smaller than disease effects.” This sentence becomes your filter. Anything that does not help test or explain that claim should be reduced, moved, or removed.

This is especially important for UMAP and t-SNE panels. These plots are familiar, but they are often overused as proof of every possible finding. A UMAP is useful for showing broad structure, annotation, and relative neighborhood patterns. It is weaker for showing quantitative abundance changes unless paired with statistics, proportions, or sample-aware summaries.

Good single cell figure design often pairs a “where” plot with a “how much” plot. Use the embedding to show cell identity or state location. Then use a bar chart, box plot, ridge plot, or model-based estimate to show abundance, expression, or enrichment. This pairing helps readers separate visual impression from measurable effect.

Ask three questions before placing a panel in the figure. What should the reader notice first? What comparison should they make second? What evidence lets them believe the conclusion? If a panel cannot answer one of those questions, it may not belong in the main figure.

Use visual hierarchy to reduce cognitive load

Dense figures become readable when the hierarchy is obvious. Readers should know where to look first, then where to look next. You can create hierarchy through panel order, title wording, color intensity, annotation size, and whitespace. None of these choices require changing the data. They change how quickly the data can be understood.

Put the most important panel in the strongest position, usually top left for left-to-right reading layouts. Use short, explanatory panel titles rather than file-like labels. “T cells separate by activation state” is more useful than “UMAP by cluster.” Panel letters are necessary for manuscript reference, but they should not carry the burden of meaning.

Whitespace is not wasted space. It is the visual pause that prevents one panel from colliding with another. In multi-panel single-cell figures, legends, color bars, labels, and axis text often compete with the data. Give legends a consistent home, align panels to a grid, and avoid squeezing every panel to the same size if the data needs differ.

Visual hierarchy also applies inside each panel. If all labels, points, outlines, and annotations have equal weight, nothing feels important. Make the primary signal strongest. Let supporting structure remain quieter. For example, show all cells in light gray, then highlight selected clusters in saturated colors when discussing a specific lineage.

Make UMAPs readable without overselling them

UMAPs are often the visual center of a single-cell story. They are also easy to make unreadable. Too many colors, tiny points, overlapping labels, and long legends can turn a useful embedding into confetti. A better approach is to decide whether the UMAP is showing identity, expression, condition, trajectory, or quality control. It should rarely do all of those at once.

For cell type annotation, use distinct colors only for categories that matter in the figure. If you have 28 clusters but the claim is about 6 broad compartments, show the broad compartments in the main panel. Then provide cluster-level annotation elsewhere. Readers can inspect detail when needed, but the main figure stays interpretable.

For gene expression overlays, avoid using the same color scale across unrelated genes if it creates false comparisons. Use clear color bars, state whether values are normalized, and consider showing violin plots or dot plots beside the embedding. Expression on an embedding is spatially intuitive, but it can hide distributional differences and dropout patterns.

UMAP readability example showing highlighted cell populations in color on a muted gray background, with concise labels and a compact legend
Photo by turek on Pexels, via Pexels

Labels should help, not decorate. Direct labels near clusters can be faster than legends, but only if they do not overlap points. Use abbreviations carefully. If a label requires readers to memorize a glossary, it may save space while increasing mental work. For publication figures, test the UMAP at final print size, not only on your large monitor.

Remember that UMAP distance is not a universal measurement of biological distance. It is a representation shaped by parameters, preprocessing, and the dataset. The UMAP documentation is a useful reference for understanding how the method preserves local and global structure. Your figure should avoid implying more precision than the embedding supports.

single cell figure design for heatmaps, dot plots, and marker panels

Heatmaps and dot plots are powerful because they compress many measurements into one view. They are also where readability often collapses. A heatmap with 80 genes, 30 clusters, and tiny labels may prove that you did a lot of work, but it does not help readers understand the pattern. The cure is grouping, ordering, and trimming.

Start by grouping genes by biological program, lineage, pathway, or marker class. Then order cell types or clusters to match the story. If the figure discusses epithelial differentiation, order columns along that trajectory. If it compares immune compartments, group related cell types together. Alphabetical ordering is easy, but it rarely supports interpretation.

Dot plots need special care because they encode two values at once, usually expression level and percentage of expressing cells. That is useful, but only when the legend is clear and the dot range is distinguishable. Avoid too many dot sizes. Readers cannot reliably compare ten size levels. Three to five meaningful size steps are usually enough.

Marker panels should be edited like text. Keep canonical markers that identify cell types, markers that distinguish confusing populations, and markers that support the key biological claim. Remove redundant markers if they repeat the same pattern. If two genes tell the same story, ask whether both are needed in the main figure.

Visual typeCommon readability problemBetter design choice
HeatmapToo many genes and tiny labelsGroup genes by program and show only essential markers
Dot plotUnclear color and size meaningsUse a simple legend and limit size categories
UMAPToo many cluster colorsHighlight relevant groups and mute the rest
Trajectory plotUnlabeled branches and vague directionAdd branch labels, arrows, and supporting marker trends

Color choices that preserve meaning

Color is one of the fastest ways to improve or damage a figure. In single-cell visuals, color often carries categorical identity, quantitative expression, condition, sample, or pseudotime. Mixing these roles carelessly creates confusion. Decide what color means in each panel, and keep that meaning consistent across the figure.

Use categorical palettes for cell types and sequential palettes for expression or scores. Use diverging palettes only when there is a meaningful center, such as zero log fold change or a scaled expression midpoint. Do not use red and green as the only distinction between key groups. Many readers have color vision differences, and red-green pairs also reproduce poorly in some formats.

Consistency matters more than novelty. If B cells are blue in panel A, they should not become orange in panel C. If treated samples are purple in one chart, keep them purple across related panels. This simple discipline makes complex multi-panel figures feel coherent.

color palette guide for omics figures showing categorical cell type colors, sequential expression scales, and a colorblind-aware condition palette
Photo by Леся Терехова on Pexels, via Pexels

Also control saturation. Saturated colors attract attention, so reserve them for the main comparison. Background cells, reference populations, and secondary categories can be pale or gray. This lets you show context without making the figure feel crowded. A figure can contain many data points and still feel calm if color hierarchy is disciplined.

Annotations should explain, not clutter

Annotations are often added late, after the plot is already busy. That is why they become clutter. Plan annotations early. Decide which labels, arrows, brackets, and callouts are essential for interpretation. If an annotation does not change what the reader understands, remove it.

Use direct labels when they reduce eye movement. For example, placing cell type names near clusters can be more readable than forcing readers to match 20 colors to a legend. However, direct labels should not cover dense data regions. Place them in nearby whitespace, use subtle connector lines if needed, and keep typography consistent.

Statistical annotations deserve restraint. Asterisks alone are not enough for many omics comparisons. Whenever possible, show effect size, uncertainty, sample size, or adjusted P values in a readable format. If the full model output is too detailed for the main figure, place the key result in the panel and the full table in supplemental materials.

For trajectory and pseudotime figures, add direction cues. Readers should not have to guess where a process begins or ends. Use arrows, labeled branch points, and small marker trend panels. Trajectory plots often look persuasive, so your annotations should make assumptions visible rather than smoothing them away.

Simplify multi-panel stories with a clear layout

Most single-cell papers rely on multi-panel figures because no single plot can carry the full story. The challenge is making panels feel like a sequence rather than a collage. Start with overview, then evidence, then validation or quantification. This structure helps readers build confidence step by step.

A common layout works well: panel A introduces the dataset or workflow, panel B shows major cell populations, panel C focuses on the relevant subset, panel D quantifies the change, and panel E validates markers or pathways. You can adapt this sequence, but the logic should be visible. Do not make readers reconstruct the story from scattered panels.

Keep repeated elements stable. Use the same sample order, condition order, and cell type order across charts. If control appears before treatment in one panel, keep that order elsewhere. Small inconsistencies cause unnecessary friction. They also make readers wonder whether differences are analytical or accidental.

Workflow panels can help, but only if they are specific. Avoid generic boxes that say “analysis” or “visualization.” Instead, show the actual steps that matter for interpretation, such as filtering, integration, annotation, differential abundance testing, and pathway scoring. A compact workflow can reduce methods confusion and prepare readers for the figures that follow.

Design for the final size, not your analysis screen

A plot that looks readable in RStudio, Python, or a browser can fail once placed into a manuscript. Final size changes everything. Labels shrink, point clouds merge, legends become cramped, and color differences fade. Always review figures at the size readers will see them.

Export early test versions at journal column width, slide size, and preprint PDF scale. Print a page if the figure is for a manuscript. This old habit still catches problems that screens hide. If you cannot read axis labels, legends, and cluster names without zooming, the figure is not ready.

Use vector formats for line art and text when possible, such as PDF or SVG. Use high-resolution raster formats for dense point clouds when vector files become too heavy. For UMAPs with hundreds of thousands of points, rasterizing the point layer while keeping labels as vector text can preserve both performance and clarity.

Typography should be boring in the best way. Use one font family, a small set of font sizes, and consistent label styling. Avoid decorative fonts. Scientific figures already carry enough complexity. The type should quietly support reading.

What to remove, what to keep, and what to move

Simplification feels risky because you do not want to lose scientific meaning. The answer is not to delete evidence blindly. Instead, classify each element as keep, reduce, or move. Keep what supports the main claim. Reduce what provides context. Move what is valid but secondary.

Keep experimental design details that affect interpretation, such as sample number, condition, time point, tissue, and major processing choices. Keep labels for populations central to the claim. Keep uncertainty and sample-aware quantification where conclusions depend on comparisons between groups.

Reduce visual weight for background populations, secondary labels, dense gridlines, repeated legends, and decorative borders. A gray reference layer can be useful. A heavy black outline around every point is usually not. A legend repeated across five panels can often appear once.

Move extended marker sets, sensitivity analyses, alternative clustering resolutions, and full pathway lists to supplements when they are not essential to the main reading path. Supplemental figures are not a junk drawer. They are where deeper inspection belongs. The main figure should guide, and the supplement should document.

Readable single-cell figures are not less rigorous. They are more honest about what the reader can reasonably process in one view.

A practical checklist before you export

Use this checklist after the analysis is complete but before final export. It works for manuscripts, conference posters, talks, and grant figures. The point is to catch readability problems while they are still easy to fix.

  • One main claim: Can you state the figure message in one sentence?
  • Panel sequence: Does the figure move from overview to evidence to quantification?
  • Consistent mapping: Are colors, sample orders, and labels stable across panels?
  • Readable scale: Can labels and legends be read at final size?
  • Color discipline: Are categorical and quantitative palettes used correctly?
  • Evidence balance: Does each visual impression have statistical or sample-aware support when needed?
  • Annotation restraint: Do labels, arrows, and callouts explain something necessary?
  • Supplement strategy: Have valid but secondary details been moved rather than crammed into the main figure?

If a figure fails several checklist items, do not polish it yet. Redesign the structure first. Cosmetic fixes cannot rescue a figure with no hierarchy. Once the logic is clear, polishing becomes faster and more productive.

Common mistakes in single cell figure design

The first common mistake is using too many categories in one view. If every cluster receives a bright color, readers cannot identify the important ones. Group related populations or highlight only the comparison under discussion. Detail can still exist in a separate annotation panel.

The second mistake is treating pooled cells as independent evidence for sample-level claims. A beautiful plot with thousands of cells can still be misleading if the biological replicate structure is unclear. When comparing conditions, show sample-aware summaries whenever possible. This is especially important for differential abundance and expression claims.

The third mistake is overusing default settings. Defaults are useful starting points, not final design decisions. Default point size, legend placement, color scales, and label sizes often need adjustment for publication. Your figure should reflect your question, not your software’s first suggestion.

The fourth mistake is making supplemental figures unreadable because they are “only supplemental.” Reviewers and readers often inspect supplements closely. They should be simpler than main figures in purpose, not sloppier in execution. A clean supplemental figure can prevent confusion and support trust.

Final thoughts: simplify the view, not the biology

Good single cell figure design respects both the data and the reader. It accepts that omics biology is complex, but it does not force all complexity into one frame. Instead, it builds a clear reading path through the evidence.

Your job is to decide what must be seen first, what should be compared, and what belongs in supporting detail. That requires judgment. It also requires a willingness to remove visual noise, even when the noise came from real analysis work.

When figures are readable, readers spend less energy decoding the layout and more energy evaluating the science. That is the best outcome. A clear figure does not make a weak result strong, but it gives a strong result the chance to be understood.

If you are preparing UMAPs, heatmaps, dot plots, trajectories, or multi-panel omics stories, treat design as part of the scientific workflow. Start with the claim, build hierarchy, use color carefully, annotate with restraint, and test at final size. That is how dense visuals become readable without losing their meaning.

Frequently Asked Questions

What is the most important rule in single cell figure design?

Start with the scientific claim the figure needs to support. Once that claim is clear, choose panels, labels, colors, and annotations that help readers evaluate it. Anything secondary can be reduced or moved to supplemental material.

How do I make a UMAP readable when I have many clusters?

Group related clusters when the main story does not require fine resolution. You can show broad cell types in the main figure and provide cluster-level detail in a secondary panel or supplement. Muting background cells and highlighting relevant populations also helps.

Can simplifying a single-cell figure make it less scientifically rigorous?

Not if you simplify the visual presentation while preserving the evidence. Keep sample-aware quantification, key markers, uncertainty, and relevant methods visible where they matter. Move secondary details to supplements rather than deleting them entirely.

SA

Written by

Shobajo AbdulAzeez

More from Shobajo

Tags

Share this article

Related Articles

Create your own scientific designs with Graffiy

AI-powered figures, posters, and social content, designed for researchers by researchers.