Sensory Prediction
The prediction of sensory attributes is a critical challenge for cellular agriculture and alternative-protein development. Consumer acceptance is driven in large part by taste, smell, and texture — and bridging the gap from cell-level engineering (cell type, media formulation, bioprocess conditions, scaffolding choices) to the final organoleptic experience is one of the central commercial barriers in the field. The GFI 2024 State of Alternative Proteins and the NECTAR Taste of the Industry 2024 sensory benchmark both flag taste parity as the binding constraint for plant-based and cultivated meat products. AI offers one promising toolkit for compressing the experiment-and-iterate cycle by modeling the relationship between molecular composition and human perception.
Applied AI for cell-ag and alt-protein flavor
Recent work demonstrates the practical application of ML methods to cultivated and alternative-protein flavor problems. Sun et al. (2023, ref #26) developed a CNN to profile off-flavors in cultured salmonids using hyperspectral imaging, providing a quality-control pipeline that bypasses expensive panel work (see Datasets/Fish.md for finfish data resources). Shen et al. (2024, ref #11) combined chemometric methods with a GAN-based integrated deep-learning framework to discriminate salted goose breeds — illustrating how generative architectures can extend limited sensory training data. Du et al. (2025, ref #27) used ensemble machine learning to predict volatile compound profiles in Saccharomyces cerevisiae fermentation simulating canned meat, a directly precision-fermentation-relevant application. Sun et al. (2026, ref #28) leveraged ML for odor control in algal foods. Together these papers establish that ML approaches — CNN, GAN/VAE, ensemble, and k-NN — work on the kinds of metabolomic and sensory data that alt-protein labs already collect.
Foundational AI for olfactory perception
A second line of work develops AI methods for olfactory perception itself, with the goal of replacing or augmenting expensive human sensory panels. Lee et al. (2023, ref #14) introduced the Principal Odor Map using a graph neural network trained on >5,000 molecule-odor pairs (Monell dataset); the model outperformed human panels on unseen molecules and generalized to detection thresholds and cross-species olfaction. Qian et al. (2023, ref #36) dissected the Principal Odor Map to show that metabolic activity organizes olfactory representations — a follow-up that bridges molecular chemistry, evolutionary biology, and ML interpretability. Going further back, Keller et al. (2017, ref #80) launched the DREAM Olfaction Prediction Challenge with crowdsourced ML models predicting human olfactory perception from chemical features — a foundational benchmark dataset. On the breeding-and-flavor side, Colantonio et al. (2022, ref #72) used 18 ML models on tomato and blueberry metabolomes paired with consumer-panel ratings, capturing up to 56% of variance in consumer liking from volatiles alone — the closest published reproducible metabolome-to-flavor pipeline to date.
Sensomics methodology and reference work
The Schieberle / Hofmann school at TU Munich and the Leibniz-Institute for Food Systems Biology established the molecular sensory science paradigm: from the ~10,000 volatiles found in foods, only ~226 “Key Food Odorants” (KFOs) account for the aroma of ~230 documented foods, identifiable through GC-O + Aroma Extract Dilution Analysis + stable-isotope dilution + Odor Activity Value (OAV) ranking + recombination / omission tests. Nicolotti, Mall & Schieberle (2019, ref #73) introduced SEBES (Sensomics-Based Expert System), automating this workflow with GC×GC-TOF-MS + GC Image + OAV computation — though the implementation is not released as open-source software.
For cell-ag specifically, the foundational sensory-science work is Lew, Yuen, Zhang, Fuller, Frost & Kaplan (2024, ref #75) — the first GC-MS + GC-O + descriptive-panel characterization of cultivated porcine adipose tissue as a flavor enhancer for meat alternatives (Tufts group; see Datasets/Pig.md for porcine data resources). Zhou et al. (2025, ref #193) complements that tissue-level work at the cell level: spontaneously immortalized porcine myoblasts and fibroblasts adapted to suspension culture, processed via freeze-thaw flavor-precursor isolation followed by optimized Maillard and lipid thermal-degradation reactions, deliver a hybrid cultivated meat at 1.2% w/w cell incorporation with 78.5% sensory similarity to pork and an 80% production-cost reduction; myoblasts outperform fibroblasts on aroma fidelity. On the precision-fermentation side, Spaccasassi et al. (2024, ref #74) demonstrate microbial starter screening for pea-protein-based beverages using UHPLC-TOF-MS plus sensory profiles. And O’Neill et al. (2022, ref #77) report spent-media analysis suggesting cell-ag media will require species- and cell-type-specific optimization — directly tying sensory-relevant metabolite profiles to media formulation strategy.
Tools and data
The analytical stack for sensory prediction draws on mass-spectrometry preprocessing, chemometrics, and curated flavor databases:
- Mass-spec tooling: OpenMS / pyOpenMS and the broader Mass Spectrometry & Chemometrics section in Software.md.
- Chemometrics: ropls (PCA / PLS / OPLS / OPLS-DA), used as the multivariate engine in Workflow4Metabolomics.
- Flavor databases: FlavorDB / FlavorDB2, BitterDB, and Pherobase — see the Flavor & Taste Compound Databases section in Databases.md.
- Species metabolomes: HMDB and the Bovine Metabolome Database for species-specific metabolite profiling that grounds OAV calculations in physiologically relevant concentrations.
Open challenges for cell-ag
Several gaps remain conspicuous. The 2024 organoleptic-analysis landscape (see also #76 Wang, #78 Mittermeier, and #79 Alasi reviews) flags: no purpose-built reproducible workflow-manager pipeline (Nextflow / Snakemake / CWL) for organoleptic analysis; no standardized sensory data exchange format (no mzTab equivalent for sensory panels); a structural “two cultures” gap between sensometrics (food-science / Pangborn tradition) and the bioinformatics workflow-manager community; vendor-software dominance (Agilent, Thermo, LECO, Bruker, Waters) that constrains open-pipeline adoption; and confidentiality of industry data (Givaudan, Symrise, IFF, dsm-firmenich, Mane) that limits public benchmarks. For cell-ag specifically there is no equivalent of the BiGG Models / Cellosaurus / Human Cell Atlas canonical home for sensory-and-flavor data from cultivated tissues — most cultivated-meat sensory data is locked behind company R&D walls. A reproducible Snakemake or Nextflow pipeline that fuses GC-MS volatile data, microbiome data, and trained-panel scores into an OAV-ranked sensomics report is a substantial open opportunity, and one that this repo’s curated set of tools, databases, and reference work would directly support.
Further reading
- Adjacent research areas: Media Optimization, Cellular Engineering, Bioprocess Control, AI Tooling / Methodology, Metabolic Modeling.
- Software: Mass Spectrometry & Chemometrics section in
Software.md. - Data: Flavor & Taste Compound Databases and Pathways, Metabolism & Metabolic Models sections in
Databases.md. - Reference texts: the Flavor & sensory and Bioactives & nutrition chapter clusters of the Encyclopedia of Meat Sciences, 3rd ed. (Dikeman, ed., 2024) catalogued in
OtherResources.md— conventional-meat reference substrate for the cultivated-counterpart sensomics work above. Specifically the Flavor development, Measuring meat flavour, Spices and Flavorings, and Contribution of bioactive compounds from meat chapters.