AI Tooling / Methodology
This page describes the AI Tooling / Methodology column of the Papers.md matrix — general-purpose AI methods, foundation models, agent frameworks, benchmarks, and infrastructure that could be applied across many cellular-agriculture problems but haven’t yet been deployed against any specific one. As of late 2026 this is the highest-traffic column in the matrix, reflecting both the rapid expansion of AI-for-biology research and CAAIL’s curatorial decision to track these foundational methods even when their cell-ag application is implicit rather than demonstrated.
The boundary between this column and the applied columns is intentionally pragmatic: a paper that introduces a general method and demonstrates it on a cell-ag problem belongs in the applied column for that problem; a paper that introduces a general method without a cell-ag deployment belongs here. As the column has grown, it is now sub-divided into seven method-architecture clusters that each occupy their own matrix row. The sections below mirror those clusters, plus a short section for AI Tooling-column refs that live in other method rows and a final section cross-referencing unpublished agent software catalogued in Software.md. Closely paired is the AI Evaluation & Benchmarking column, which catalogues papers whose primary contribution is an evaluation suite, benchmark, or eval methodology for the tools in this column.
Foundation Models for Biology
Pretrained transformer models — distinct from agents in that they predict or generate without invoking external tools or planning multi-step workflows. They are the substrate downstream agentic systems build on, and the cluster where the boundary between “method paper” and “applied paper” is most often pragmatic rather than principled.
- #42 OmicsLM (Sypetkowski et al. 2026) — a multimodal LLM connecting quantitative transcriptomic profiles with natural-language biological tasks; a candidate substrate for any cell-ag work involving omics interpretation, particularly in cellular-engineering contexts where scRNA-seq is already the dominant readout (refs 5, 8, 9, 12, 13).
- #88 SpectraLLM (Su et al. 2026) — an LLM pretrained and fine-tuned to reason over multi-spectral data (IR, Raman, UV-Vis, NMR, MS) in a shared language space for end-to-end molecular structure prediction; the spectral analogue of OmicsLM for sensomics workflows (see also SensoryPrediction.md).
- #91 NP Foundation Model (Ding et al. 2026, Nat Machine Intelligence) — a pretrained foundation model for small-molecule natural products; substrate for downstream tasks ranging from media-component prediction to flavor-precursor discovery.
- #92 TranscriptFormer (Pearce et al. 2026, Science) — a generative cell atlas trained across 1.5 billion years of evolution and 12 species; the closest existing cross-species transcriptomic foundation model, with direct cell-ag utility for translating biological knowledge between bovine, porcine, chicken, salmonid, and other livestock cells where annotated reference data is sparse (the per-species pages in
Datasets/collect that cell-ag data substrate). Catalogued atFoundation Models × Cellular Engineeringrather than× AI Toolingbecause its primary cell-ag relevance is direct: cross-species transcriptomic reasoning is the cellular-engineering substrate. See also the TranscriptFormer software entry.
Scientific Literature & Discovery Agents
Agents focused on the scientific research process itself — retrieving and synthesizing literature, generating hypotheses, drafting papers. They are not domain-specific to biology, but the literature, methods, and ideation patterns relevant to cell-ag are squarely within their operating envelope.
- #44 PaperQA (Lála et al. 2023, FutureHouse) — canonical retrieval-augmented agent for answering questions over scientific literature with verified citations.
- #46 PaperQA2 (Skarlinski et al. 2024, FutureHouse) — extends PaperQA with superhuman synthesis of scientific knowledge.
- #45 The AI Scientist (Lu et al. 2024, Sakana AI) — first end-to-end framework for fully automated open-ended discovery: idea generation, experiment design, paper drafting.
- #47 The AI Scientist-v2 (Yamada et al. 2025, Sakana AI) — workshop-level extension using agentic tree search.
- #53 ARIEL (Liu et al. 2026) — biomedical AI research assistant with expert-involved learning.
General-Purpose Biomedical Agents
Broad biomedical-reasoning agents designed to handle multiple sub-domains within drug discovery, biomarker discovery, or generalist biomedical research. Distinct from the domain-specific cluster below in that they pitch themselves as horizontal platforms rather than vertical tools.
- #40 TxAgent (Gao et al. 2025, Zitnik lab) — AI agent for therapeutic reasoning across a universe of 211 biomedical tools.
- #49 Biomni (Huang et al. 2025, Stanford SNAP / Leskovec) — general-purpose biomedical AI agent across drug discovery, genomics, and clinical analysis.
- #94 BRAD (Pickard et al. 2025, Bioinformatics) — automatic biomarker discovery and enrichment.
- #95 OLAF (Riffle et al. 2025) — Open Life Science Analysis Framework for conversational bioinformatics; agent-pipe-router architecture.
- #96 STELLA (Jin et al. 2025) — self-evolving LLM agent for biomedical research.
- #98 BioMANIA (Dong et al. 2024) — conversational chatbot-per-Python-tool framework for simplifying bioinformatics data analysis.
Chemistry / Synthesis Agents
Foundational LLM-agent patterns from chemistry. Both papers share a common template (LLM + curated chemistry tools + autonomous experiment loop) and have been the dominant reference for tool-augmented LLM design in the broader biomedical-agent literature.
- #70 Coscientist (Boiko et al. 2023, Nature) — GPT-4 autonomous chemistry research system; the foundational pattern for tool-augmented LLMs in the natural sciences.
- #71 ChemCrow (Bran et al. 2024, Nat Machine Intelligence) — GPT-4 + 18 chemistry tools for synthesis planning, drug discovery, and materials design.
Domain-Specific Biomedical Agents
Agents specialized for a single biomedical task or sub-domain. Narrower scope than the general-purpose cluster above, but often the most directly transferable to cell-ag because the underlying tasks (sequencing analysis, perturbation prediction, metabolic-model reasoning, spatial biology) are also the tasks cell-ag workflows depend on.
- #43 Lee NGS (Lee et al. 2025) — agentic AI for NGS downstream analysis (differential-expression workflows for users without computational backgrounds).
- #50 Talk2Biomodels (Wehling et al. 2025, BMC Bioinformatics) — agent for kinetic biological models (SBML); the kinetic-modeling member of the AIAgents4Pharma Talk2X family.
- #51 Medea (Sui et al. 2026, Zitnik lab) — omics AI agent for therapeutic discovery.
- #54 ClockBase Agent (Ying et al. 2025) — autonomous agent mining methylation/RNA-seq data for aging interventions.
- #56 SpatialAgent (Wang et al. 2025) — autonomous agent for spatial biology research.
- #66 Lila (Singh et al. 2023, Carbonell group) — automated scientist for microbial strain design.
- #68 Li LLMs ME (Li et al. 2024) — RAG-augmented LLMs for metabolic-engineering design.
- #69 KinModGPT (Maeda & Kurata 2023) — GPT-driven generation of SBML kinetic models from natural-language texts.
Sibling refs in this row’s × Cellular Engineering cell — cell-engineering-specific agentic work — include #90 GenCellAgent (Yu et al. 2026, training-free cellular image segmentation), #93 CellForge (Tang et al. 2026, agentic design of virtual cell models), and #97 PerTurboAgent (Hao et al. 2025, self-planning agent for sequential Perturb-seq).
Robot Scientists & Lab Automation
Closed-loop autonomous research systems integrating LLM agents with wet-lab execution. The cluster spans 25+ years of “Robot Scientist” work from the King group (Adam → Eve → Genesis) extended by recent LLM-era multi-agent systems.
- #64 Genesis (Tiukova et al. 2024, King group) — third-generation robot scientist for systems biology.
- #65 AutonoMS (Brunnsåker et al. 2025) — agentic AI integrated with scientific knowledge for laboratory validation in systems biology.
Sibling refs in this row’s × Bioprocess Control cell — applied multi-agent lab automation for cell and organoid manufacturing — include #61 Agentic Lab (Wang et al. 2025) and #62 BioMARS (Qiu et al. 2025). See also MetabolicModeling.md for the broader closed-loop FBA / strain-design context.
Agent Infrastructure (Frameworks, KGs, Protocols)
Substrate platforms — agent frameworks, biomedical knowledge-graph backends, and tool-orchestration protocols — that downstream agents are built on. These are the papers that don’t introduce a specific agent but introduce the scaffolding that bespoke agent projects have historically had to reinvent.
- #41 ToolUniverse (Gao et al. 2025) — ecosystem for democratizing AI scientists from any open- or closed-weight model; companion to TxAgent (#40) and the rest of the Zitnik-lab agent stack.
- #48 BioCypher (Lobentanzer et al. 2025, Nat Biotech) — knowledge-graph platform purpose-built for biomedical applications of LLMs.
- #67 MCP Servers for biology (Ruscone et al. 2025, Saez-Rodriguez lab) — Model Context Protocol server implementations as AI-biology interfaces (NeKo, MaBoSS, PhysiCell).
Other AI methodology in the AI Tooling column
Two papers in the AI Tooling column live outside the LLM/agent taxonomy above and don’t fit any of the eight clusters:
- #52 BioMedReasoner (Mulyadi et al. 2025, NeurIPS 2025 AI4Science Workshop) — multi-hop reasoning via path-based relational learning on biomedical knowledge graphs (lives in the GNN row).
- #63 Pandi et al. (2022, Nat Comms) — versatile active-learning workflow for optimization of genetic and metabolic networks (lives in the Active Learning row).
Related Software (Not in Matrix)
A growing number of AI-agent tools are released as open-source platforms or commercial products without a separate companion paper. They don’t appear in the matrix (which catalogues papers, not tools) but are catalogued in Software.md, and they fill important gaps in the agent landscape:
- K-Dense-AI — co-scientist ecosystem combining the commercial K-Dense Web platform with an open-source stack of Agent Skills (
scientific-agent-skills, 120+ skills wrapping scientific Python libraries includingcobrapy,pyopenms,scanpy/scvi-tools,rdkit/datamol), thek-dense-byokdesktop client, themimeoskill-generation tool, andclaude-scientific-writer. Directly composable with Claude Code, Cursor, and other code-execution agents. - Superpowers (
obra/superpowers) — one of the most-starred general-purpose Claude Code skill collections (Jesse “obra” Vincent). Domain-agnostic skills for planning, debugging, code review, and execution that compose cleanly with the cell-ag-specific skill packs above. - Skill Seekers — meta-tool that converts documentation websites, GitHub repos, and PDFs into Claude AI skills with automatic conflict detection. The closest existing automation for the pattern an AI-augmented cell-ag lab needs as it scales: turn a new wet-lab protocol, bioinformatics package, or GitHub library into a skill that the lab’s agents can call directly.
- AI Research Skills Library (
orchestra-research/AI-research-SKILLs) — open-source skills library covering AI research and ML engineering infrastructure (vLLM, Megatron, GRPO, HuggingFace). Relevant for cell-ag teams building or fine-tuning their own biology foundation models or running large-scale agentic workflows. - AIAgents4Pharma — beyond the published Talk2Biomodels (#50), the platform hosts three sibling Talk2X agents — Talk2KnowledgeGraphs, Talk2Cells, and Talk2Scholars — that share infrastructure but lack standalone papers at time of curation.
- Seqera AI / Co-Scientist — Seqera Cloud’s AI assistant for Nextflow pipeline authoring, debugging, and workflow-run analysis.
- Dotmatics Luma — commercial lab-orchestration platform connecting laboratory instruments, data systems, and AI assistance; representative of the commercial-tooling layer cell-ag startups increasingly evaluate as they scale beyond bench-scale workflows.
Further reading
- Software: AI Agents & Foundation Models section in
Software.md. - Adjacent research areas: AI Evaluation & Benchmarking, Media Optimization, Cellular Engineering, Bioprocess Control, Sensory Prediction, Metabolic Modeling.
- Talks: AI Agents & Foundation Models for Biology section in
OtherResources.md.