Publications

2026

  • scellop: A Scalable Redesign of Cell Population Plots for Single-Cell Data
    Thomas C Smits, Nikolay Akhmetov, Tiffany S Liaw, Mark S Keller, Eric Moerth, Nils Gehlenborg
    Bioinformatics Advances, vbag083
    Three panels (A-C) with cell population plots. A) Heatmap with bar charts on the top and left and colored legends on the right and bottom. B) Same heatmap as A, but two rows are now bar charts. C) Stacked bar chart with colors for cell types.
  • Geranium: Multimodal Retrieval of Genomics Data Visualizations
    Huyen N Nguyen, Sehi L'Yi, Thomas C Smits, Shanghua Gao, Marinka Zitnik, Nils Gehlenborg
    OSF Preprints
    Schematic, top is an overview of the database system for retrieval and authoring genomics data visualizations, and bottom is a search interface (left) and authoring interface (right).

2025

  • HuBMAP Data Portal: A Resource for Multi-Modal Spatial and Single-Cell Data of Healthy Human Tissues
    Morgan L Turner, Thomas C Smits, Tiffany S Liaw, Brendan Honick, Bill Shirey, Lisa Choy, Nikolay Akhmetov, Shaokun An, David Betancur, Dominic Bordelon, Karl Burke, Ivan Cao-Berg, John Conroy, Chris Csonka, Penny Cuda, Sean Donahue, Stephen Fisher, Derek Furst, Ed Hanna, Josef Hardi, Tabassum Kakar, Mark S Keller, Xiang Li, Yan Ma, Allison McWilliams, Austen Money, Richard Morgan, Eric Moerth, Juan Muerto, Mark A Musen, Emily Nic, Martin J O'Connor, Gesina Phillips, Alex Ropelewski, Ryan Sablosky, Sravani Saripalli, Max Sibilla, Derek Simmel, Alan Simmons, Xu Tang, Joel Welling, Zhou Yuan, Martin Hemberg, Matt Ruffalo, Jonathan Silverstein, Philip Blood, Nils Gehlenborg
    arXiv
    Schematic of data flow, with data submitter on left and data consumer on right. In between, arrows flow from data ingestion (data submission and data processing) to resources (metadata and data) to interfaces (browsing and search, visualizations, workspaces, download, and APIs). Data, Browsing and Search, Visualizations, and Workspaces are highlighted with color.
  • GQVis: A Dataset of Genomics Data Questions and Visualizations for Generative AI
    Skylar Sargent Walters, Arthea Valderrama, Thomas C Smits, David Kouřil, Huyen N Nguyen, Sehi L'Yi, Devin Lange, Nils Gehlenborg
    IEEE 2025 1st Workshop on GenAI, Agents, and the Future of VIS (VIS x GenAI)
    Examples of visualizations in the GQVis dataset include interactive chromoscope visualizations, epigenetic data, and structural and functional data.
  • Ten Simple Rules for Making Biomedical Data Resources Accessible
    Thomas C Smits, Lawrence Weru, Sehi L'Yi, Nils Gehlenborg
    PLoS Computational Biology, 21(11): e1013657
    A visual overview of the ten rules for making biomedical data resources accessible. Three boxes labeled with assess, construct, and consider are underneath each other. Each contains smaller boxes that represent individual rules, as described in the main text. The six rules in the construct box are marked with one of the letters in the POUR acronym.
  • A comprehensive evaluation of life sciences data resources reveals significant accessibility barriers
    Sehi L’Yi, Harrison G Zhang, Andrew P Mar, Thomas C Smits, Lawrence Weru, Sofía Rojas, Alexander Lex, Nils Gehlenborg
    Scientific Reports, 15, 23676
    Eight subfigures are showing bar charts and pie charts. Subpanel A shows the proportion of pages with the overall impact of issues between two resource types, i.e., data portals and journal websites. Subpanels B and C show the proportion of pages with critical issues and data-related issues, respectively, in pie charts. Subpanels D and E are bar charts showing the proportion of pages by WCAG levels and the difficulty of fixing in post-deployment. Subpanel F shows the top 10 accessibility issues. Subpanel G shows the proportion of pages with missing labels, and subpanel H shows the proportion of issues related to images and tables.
  • The State of Single-Cell Atlas Data Visualization in the Biological Literature
    Mark S Keller, Eric Moerth, Thomas C Smits, Simon Warchol, Qianwen Wang, Robert Krueger, Hanspeter Pfister, Nils Gehlenborg
    IEEE computer graphics and applications, 45(5), 18-34
    Repository of single cell atlas visualizations, with key filter terms on left and search results on the right. Each search result is a visualization from a paper with different codes, such as 'genomic' and 'dot plot' underneath. The total number of papers is 45, the total number of subfigures is 1846, the total number of codes is 85.

2024

  • AltGosling: Automatic Generation of Text Descriptions for Accessible Genomics Data Visualization
    Thomas C Smits, Sehi L’Yi, Andrew P Mar, Nils Gehlenborg
    Bioinformatics, 40(12), btae670
    Award Runner-Up Abstract Award at ISMB BioVis 2025
    Schematic with two sections. On the top left is a file icon with ‘Gosling Spec’. This points to the right section, labeled ‘Gosling.js’ with a matrix visualization, and the bottom section, labeled ‘AltGosling’ with three text schematics.
  • Explaining Unfamiliar Genomics Data Visualizations to a Blind Individual through Transitions
    Thomas C Smits, Sehi L’Yi, Huyen N Nguyen, Andrew P Mar, Nils Gehlenborg
    IEEE 2024 1st Workshop on Accessible Data Visualization (AccessViz)
    Two-by-two grid with sketches (top row) and schematics (bottom row) from a direct (left) and gradual (right) approach. The direct approach has a few black and colored vertical lines, the gradual one has colored letters (A, C, T, G) stacked on top, next to each other.
  • Using OpenKeyNav to Enhance the Keyboard-Accessibility of Web-based Data Visualization Tools
    Lawrence Weru, Sehi L’Yi, Thomas C Smits, Nils Gehlenborg
    OSF Preprints
    Three cropped screenshots of Voyager, labelled A through C, with arrows pointing from A to B and from B to C. A shows a two-column layout, with a list of data field 'pills' on the left, and a list of empty data shelves (encodings, marks, facets and more) on the right. The data field pills on the left column are all given pink labels (unique alphabetic letters for each) and surrounded with a pink outline. B shows the same view but cropped to part of the layout. Here, only one of the pills in the left column is labeled, this time with a small dot instead of a letter. The right column's empty shelves are highlighted green with the words 'drop a field here' and have the same pink outlines with alphabetic labels. C shows the same view but cropped. Here, there are no pink highlights. Instead, on the encoding field called 'x', there is now a filled data field called 'Major_Genre'.

2023

  • Advances and prospects for the Human BioMolecular Atlas Program (HuBMAP)
    Sanjay Jain, Liming Pei, Jeffrey M Spraggins, Michael Angelo, James P. Carson, Nils Gehlenborg, Fiona Ginty, Joana P. Gonçalves, James S. Hagood, John W. Hickey, Neil L. Kelleher, Louise C. Laurent, Shin Lin, Yiing Lin, Huiping Liu, Alexandra Naba, Ernesto S. Nakayasu, Wei-Jun Qian, Andrea Radtke, Paul Robson, Brent R. Stockwell, Raf Van de Plas, Ioannis S. Vlachos, Mowei Zhou, HuBMAP Consortium, Katy Börner, Michael P. Snyder
    Nature Cell Biology 25, 1089–1100 (2023)
    Schematic overview of modalities and assays in the HuBMAP consortium. It ranges the scale of organ to subcellular (x-axis), and has molecular coverage of bulk, spatial proteomics and transcriptomics, imaging, single-cell, histology, and antibody. The captured analyte (gene, protein, etc.) is shown with color.
  • Digital Accessibility of Life Science Data Portals and Journal Websites
    Sehi L’Yi, Thomas C Smits, Alexander Lex, Nils Gehlenborg
    OSF Preprints

2022

  • Somatic Changes Prior to the Development of Hyperdiploidy Expose Mutation Accumulation Rate and Activated Processes in Multiple Myeloma
    Thomas C Smits, Anil Aktas Samur, Romain Lannes, Mariateresa Fulciniti, Masood Shammas, Jill Corre, Kenneth Anderson, Giovanni Parmigiani, Hervé Avet-Loiseau, Nikhil Munshi, Mehmet Samur
    Blood, 140(Supplement 1), 168837
    Award American Society of Hematology Abstract Achievement Award
    Two boxplots showing exposure for mutational signatures in pre and post hyperdyploid and subclonal groups.
  • PHF19 Inhibits Multiple Myeloma Cell Response to Immunotherapy Via Promoting Immunosuppressive Microenvironment
    Tengteng Yu, Mu Hao, Hailin Chen, Kenneth Wen, Tingjian Wang, Thomas Smits, Mehmet Samur, Eugenio Morelli, Lijie Xing, Liang Lin, Jun Qi, Gang An, Nikhil Munshi, Yu-Tzu Tai, Lugui Qiu, Kenneth Anderson
    Blood, 140(Supplement 1), 159137
    Award American Society of Hematology Abstract Achievement Award
  • OAB-017: Mutations accumulated before and after hyperdiploidy reveal timing and impact of chromosomal gains on multiple myeloma
    Thomas C Smits, Anil Aktas Samur, Romain Lannes, Mariateresa Fulciniti, Masood Shammas, Jill Corre, Kenneth Anderson, Giovanni Parmigiani, Hervé Avet-Loiseau, Nikhil Munshi, Mehmet Samur
    Clinical Lymphoma, Myeloma and Leukemia, Volume 22, S10 - S11
    Award International Myeloma Society Young Investigator Award
    Pseudotime sketch of events around hyperdiploidy with varying activity based on mutational signatures.
  • OAB-031: PHF19 promotes multiple myeloma cell resistant to daratumumab/isatuximab via upregulation in immunosuppressive microenvironment and reduced CD38 target expression
    Tengteng Yu, Hailin Chen, Kenneth Wen, Tingjian Wang, Phillip Hsieh, Thomas C Smits, Mehmet Samur, Lijie Xing, Liang Lin, Mu Hao, Lugui Qiu, Yu-Tzu Tai, Kenneth Anderson
    Clinical Lymphoma, Myeloma and Leukemia, Volume 22, S18 - S19
  • OAB-013: Universal loss of BCL7A allows release of its binding partner IRF4 inducing its transcriptional activity promoting MM cell growth
    Chandraditya Chakraborty, Srikanth Talluri, Eugenio Morelli, Sanika Derebail, Yan Xu, Charles Epstein, Thomas Smits, Moritz Binder, Kenneth Anderson, Masood Shammas, Mehmet Samur, Mariateresa Fulciniti, Nikhil Munshi
    Clinical Lymphoma, Myeloma and Leukemia, Volume 22, S8

Additional

  • Accessibility in Grammar-Based Genomics Visualization Language Gosling through Automatic Generation of Text Descriptions
    Thomas C Smits, Sehi L’Yi, Nils Gehlenborg
    HMS Master’s Programs Research Symposium
  • Workspaces in Portal: Data Linking and Templates in Jupyter Lab
    Thomas C Smits, Nikolay Akhmetov, Lisa Choy, John Conroy, Mark Keller, Tiffany Liaw, Juan Puerto, Samson Toor, Morgan L Turner, Philip Blood, Nils Gehlenborg
    HuBMAP Demo Day
  • Workspaces in Portal (in progress): templates allow for easy cell type composition exploration
    Thomas C Smits, HuBMAP Harvard HIVE-TC, HiDIVE Lab
    HuBMAP Annual Meeting