Latent Scope is built by Ian Johnson (enjalot), an independent data visualization researcher known earlier for d3.js-era work. The project describes itself as a scientific instrument for investigating latent spaces and packages a five-stage pipeline that turns an unstructured text dataset into an interactive map of clusters and themes.
The pipeline runs locally end-to-end. Stage one embeds each row of text using either a local transformer model (BGE, E5 are referenced) or an API embedding (OpenAI, Mistral). Stage two reduces the embedding to two dimensions with UMAP. Stage three clusters the 2D points with HDBSCAN. Stage four labels each cluster by passing representative points to an LLM (Zephyr 7B locally, or GPT-3.5-class via API) for summarization. Stage five renders the result as an interactive web scatterplot the user can pan, zoom, and query.
Implementation is Python on the backend (command-line utilities orchestrate the pipeline) and React on the frontend. Storage is intentionally flat-file: HDF5 for embeddings, Parquet for tabular data, with metadata logging so each run is reproducible and the artifacts can be moved between machines. Demonstrated scales run from a few hundred rows up to several hundred thousand, including a 400,000-row emotion classification dataset.
The Mozilla Builders Accelerator awarded Latent Scope up to $100K in the September 23, 2024 cohort. In the stack the project sits at the data and evaluation layers: it is a workbench for analysts who want to look at what an embedding model has actually clustered without uploading the dataset to a hosted vector service. The design choice to keep the full workflow on the user's machine matches the cohort's local-AI theme and makes the tool usable on sensitive datasets that cannot leave the host.
Recipient
Latent Scope
Funder
Mozilla Foundation / Builders / Mozilla.ai · foundation · US
Open-source AI tooling, developer-facing AI applications, democratic AI. Three audience-segmented sites (Builders, Mozilla.ai, AI Guide).
Primary source
Additional sources
More from Mozilla Foundation / Builders / Mozilla.ai
- Democracy x AI Cohort 2026 2026-02 · $1M / 10 teams + $250K Sustain Track
10 teams at $50K each plus a $250K Sustain Track top-up. Focus on civic AI infrastructure.
- Ente Photos 2024-09-23 · Up to $100K
Open-source on-device semantic search and image recognition for private photo storage. Mozilla Builders Accelerator cohort 1 (local AI theme); avoids cloud dependency for personal photo libraries.
- Transformer Lab 2024-09-23 · Up to $100K
Cross-platform desktop application for building, training, fine-tuning, evaluating, and exporting local LLMs on Windows, macOS, and Linux. Ships in Mozilla's first Builders Accelerator local-AI cohort.
- Sartify (Swahili LLM) 2024-09-23 · Up to $100K
Foundation-model and tooling work for Swahili-speaking regions. Creates general and domain-specific AI capabilities for a language poorly served by major commercial models.
- Ersilia Model Hub 2024-09-23 · Up to $100K
Curated catalog of AI/ML drug-discovery models that operate locally. Targets researchers in low- and middle-income countries who cannot rely on US cloud platforms.