Khula Platform · Case Study
Portfolio / Sawubona Mycelium

An AI workflow Khula built for Sawubona Mycelium — proof that the same approach that gives a professional their week back can compress weeks of lab research into hours.

“BRAP” Biotech Research Assistant Platform  |  AI-powered Metabolite Biomining  ·  Sawubona Mycelium

Turn your mass spectrometry data into research-ready insights

A bespoke AI-powered metabolite biomining service for microbiological sourced extracts — built on a purpose-designed multistep algorithm that cleans, classifies, and researches your compounds automatically. Provided by Sawubona Mycelium.

Up to 100compounds biomined per run
6+authoritative scientific databases cross-checked
5structured output sheets per analysis
Cultivated mushrooms — the fungal biomass Sawubona Mycelium analyses
Why this matters

One method. Many domains.

Khula Platform helps people reclaim hours every week by reworking the repetitive parts of their work with AI. This case study is that same promise, taken into a specialised scientific field.

For a fermentation-biotech, the slow, repetitive work isn’t admin — it’s manually researching hundreds of compounds from raw instrument data, one database at a time, for weeks on end.

Khula built a bespoke workflow that does it automatically: weeks of metabolite research compressed into a single, cited workbook. Different industry, same outcome — intelligence that scales, growth that lasts.

Speed

Weeks → hours

Manual literature research replaced by an automated, repeatable pipeline.

Range

Admin to advanced science

The same Khula method, applied from everyday workflows to cheminformatics.

Fit

Built around real work

Shaped with the client’s own data, instruments, and quality standards.

End-to-end data pipeline
01

Instrument

HPLC · GC-MS · LC-MS

02

Raw data

Hundreds of signals

03

Bespoke AI algorithm

Clean · Filter · Classify · Rank · Biomine

04

End-to-end report

Cited & scored workbook

The challenge

The challenge with raw mass spectrometry data

When a researcher or scientist run microbiological sourced extracts through a mass spectrometer, the instrument produces a large spreadsheet of detected signals — often hundreds or thousands of rows. Most of those rows are noise, duplicates, or low-confidence detections. The compounds that matter are buried in the data.

Turning that raw output into something useful — knowing which compounds are real, what they are, where they come from, and what they are used for — normally takes weeks of manual database searching, literature review, and quality checking. Our service does this automatically, using a bespoke AI-driven multistep algorithm designed specifically for this type of exploratory biomining and data analysis.

The service is compatible with standard mass spectrometry data processing software outputs, accepting the Excel files your existing instruments and workflows already produce.

Hundreds of signals, only some are real

Raw data is full of noise. Our algorithm filters it down to only the compounds that meet strict identification standards.

Manual research takes weeks

Searching databases for each compound one by one is slow, error-prone, and hard to repeat. We automate the entire biomining process.

Results that are hard to trust

Without a clear confidence rating for each compound, it is hard to know which findings to act on. We score every identification.

How the service works

A structured, repeatable multistep process

From raw data file to finished research workbook — every step automated and auditable.

Laboratory analysis — pipetting extract samples for mass-spectrometry workup
1

You send us your data file

Share the Excel file from your mass spectrometry software, with any focus preferences you have.

2

Data cleaning and filtering

Rows failing quality standards are removed. Only detections passing every check move forward.

3

Compound classification

Each passing compound is classified as a natural plant product, external contaminant, or uncertain.

4

Ranking and biomining

Top-ranked compounds — up to 100 per run — are biomined across multiple scientific databases and published literature.

5

Report delivered

A formatted Excel workbook with five structured sheets. Everything cited from authoritative sources.

What the output looks like

Five structured sheets. One complete workbook.

An end-to-end comprehensive report delivered as a structured Excel workbook.

Structured, chart-rich data output representing the delivered workbook

The workbook bundles cleaned compound data, research, instrument settings, charts, and a full audit trail.

Cleaned and classified compound list

Every compound that passed quality filtering, with molecular formula, structural identifiers, mass accuracy, identification score, classification, and confidence rank. Duplicates removed, one row per unique compound.

Detailed compound research

For each top-ranked compound: chemical family, natural occurrence, discovery history, commercial uses, and relevant industries — biomined from PubChem, HMDB, ClassyFire, and MassBank.

Instrument settings record

The original acquisition and processing parameters from your mass spectrometry software, preserved in a dedicated sheet for traceability and reproducibility.

Research summary with charts

Aggregated statistics on chemical group distribution, industry relevance, and confidence scores — with embedded charts for immediate visual insight into what your extract contains.

Full run metadata and audit trail

A complete record of every database queried, every threshold applied, every source consulted, and deduplication statistics. You can see exactly how the analysis was run and reproduce it.

Everything in one file

All five sheets in a single formatted Excel workbook — no scattered files, no manual assembly. One file, everything cited, scored, and ready for publication, formulation, or regulatory documentation.

Under the hood

How the algorithm works

A purpose-built seven-stage pipeline handles everything from your raw data file to the finished workbook — automatically, transparently, and with a full audit trail at every step.

1

Receive inputs

Your data file is checked and accepted for processing.

2

Clean and filter

Signals that do not meet identification standards are removed.

3

Classify compounds

Each retained compound is classified as a natural product, xenobiotic, or uncertain.

4

Rank compounds

Compounds are ranked by a composite confidence score.

5

Quality checkpoint

A scientist reviews the ranked list before biomining begins.

6

Biomine databases

Top compounds are researched across six authoritative scientific databases.

7

Score and report

Every identification receives a 0–100 confidence score. The final workbook is assembled with full citations and audit trail.

Databases cross-referenced in priority order

PubChem HMDB LIPID MAPS ClassyFire NIST WebBook MassBank / MoNA Primary literature

What the algorithm will not do

No fabricated citations. No vendor-site sources. No Wikipedia. Compounds with low confidence scores are explicitly flagged as uncertain rather than reported as confirmed identifications.

Representative analysis

From 400+ signals to 55 confirmed compounds

Two mycelium extract samples were processed through the BRAP Mycobiomolecule EDA pipeline. The before-and-after below illustrates the transformation from raw instrument output to a structured, cited research workbook.

Before — Raw output

400+ detected signals per sample. Most are noise, background, or duplicates — not immediately actionable.

Signal IDm/zScoreStatus
SIG-001182.080.31Noise
SIG-002251.100.41Low conf.
SIG-003251.110.43Duplicate
SIG-004309.220.38Low conf.
SIG-005252.090.62Uncertain
SIG-006183.090.28Noise
⋮ 394 more rows
After — BRAP Mycobiomolecule EDA output

55 confirmed compounds retained — classified, scored, cross-referenced, and fully cited.

CompoundGroupConf.Industry
CordycepinNucleoside94Pharma
UridineNucleoside91Nutrac.
ErgosterolSteroid87Cosmetics
CrotonosideNucleoside83Pharma
Oleic acidLipid79F&B
+ 50 more compounds →
55compounds identified across both samples
7chemical groups mapped
73–100confidence score range across all identifications
5industry categories mapped per compound

Water Extract profile

Alkaloids25%
Nucleosides22%
Lipids20%
Steroids & other33%

Enriched in alkaloids and nucleosides (cordycepin, crotonoside, uridine). Pharmaceutical relevance: 46%.

Diluted Filtrate profile

Phenolics35.5%
Terpenoids29%
Iridoid glycosides18%
Other groups17.5%

Greater phenolic and terpenoid diversity, expanded iridoid glycosides. Nutraceutical relevance: 29%.

Representative analysis. Compound names and figures are generalised for illustrative purposes.

Who this service is built for

Built for researchers and teams working with mushroom, fungal, and microbial data

Academic and university researchers

If you are studying the chemistry of mushrooms, fungi, or microbiological natural product leads — including partners at NMU and SMU — BRAP turns your mass spectrometry data into a structured literature review, saving weeks of manual work and producing a comprehensive, database-backed metabolite inventory.

Pharmaceutical and biotech companies

If you screen fungal or fermentation-derived extracts for active compounds as part of early drug discovery, BRAP helps you quickly see what metabolites are present, how confident the identifications are, and what is already known about each compound in the published literature.

Nutraceutical and supplement brands

If you need to know the mycobiomolecule profile of your mushroom or fermentation-derived ingredients — for formulation, labelling, or regulatory documentation — BRAP gives you a clear, database-backed metabolite inventory with confidence scores for every detection.

Cosmetic and personal care formulators

If you work with mycelium or mushroom-derived extracts and need to understand their bioactive compounds, BRAP connects your raw mass spectrometry data to published science about each ingredient’s properties and known uses — supporting claims, formulation, and ingredient transparency.

Where this has real value

Real-world scenarios

Scenario

Screening a medicinal mushroom extract

A university research team studying a mushroom used in traditional African or Asian medicine gets over 400 detected signals from their mass spectrometer. BRAP returns a cleaned list of confirmed metabolites — classified, cross-checked, and linked to published literature. Within hours they know which ergosterols, polysaccharides, terpenoids, or phenolics are present and what the science says about each.

Scenario

Verifying what is in a fermentation-derived ingredient

A nutraceutical company profiles a fermentation-derived mushroom extract before formulation. The output tells them whether expected bioactive metabolites are present, whether unexpected compounds showed up, and how confident each identification is — a science-backed basis for ingredient decisions and product documentation.

Scenario

Finding lead compounds in a fungal extract library

A biotech company screens a library of fungal and mycelium extract samples. BRAP quickly profiles which samples contain high-confidence mycobiomolecule identifications — terpenoids, polyketides, phenolics, beta-glucans — and what is already known, helping prioritise candidates before committing to expensive bioassay work.

About the science partner

The team behind the service

Sawubona Mycelium (Pty) Ltd was founded in 2018 by Busi Moloi (CEO, MSc, MBA) and Neo Moloi in Centurion, Gauteng, South Africa. The name Sawubona comes from the Zulu greeting meaning “I see you” — a recognition of the unseen value within Africa’s extraordinary biodiversity.

We operate a fermentation-based bioprocessing platform producing bio-based ingredients from mushroom mycelium, with access to more than 3,000 fungi strains native to Southern Africa. Our work spans cosmetics, pharmaceuticals, nutraceuticals, functional foods, and beverages — now deepened by AI-powered bioinformatics tools like Mycobiomolecule EDA.

CSIR Technology Innovation Agency Dept. of Science & Innovation BIO Africa / EuroQuity

Visit sawubonamycelium.com →

Key milestones

2018 — Company founded, combining expertise in fermentation science and medicinal mushrooms.

2019 — Joined the CSIR Biomanufacturing Industrial Development Centre (BIDC) programme.

2021–22 — First in Africa: 800 litres of liquid cultivated Enokitake mushrooms for bio-based cosmetics, in partnership with the CSIR.

2022 — Launched Blu Beryl — South Africa’s first fermentation-derived beta-glucan skincare products, supported by TIA, DSI, Innovation Hub and SEDA.

2023 — Exhibited at BIO2023 (Boston) and SynBioBeta2023, showcasing mycelium-derived bioactives internationally.

2025–26 — AI-powered Metabolite Biomining, including mushroom metabolite Effect Directed Analysis (EDA), to mine fermentation data for high-value biomolecules at scale.

Centurion, Gauteng, South Africa

Hands working with soil and new growth — Khula building alongside its partners
The Khula approach

We build it with you, not just for you.

Khula isn’t a faceless tool dropped in your lap. We sit beside the people who know the science — learning the workflow, shaping the algorithm around real data, and refining it together until it’s something the team trusts. This workflow was built hand-in-hand with the scientists at Sawubona Mycelium, the South African fermentation-biotech behind the BRAP service.

1-on-1Built with the team
BespokeShaped around real data
TransparentEvery result cited & scored
Start a conversation

The results of this service rest on a bespoke AI-driven algorithm built specifically for exploratory data analysis and biomining of mass spectrometry data. It was purpose-designed for the task of cleaning, classifying, ranking, and researching plant metabolite data — using a multistep approach that combines domain-specific quality thresholds with automated literature mining.

General data analysis tools are flexible but not specialised. Our algorithm understands the structure of mass spectrometry output, distinguishes between natural metabolites and environmental contaminants, applies domain-specific quality thresholds, and is connected directly to the databases that matter for plant chemistry. It is built to do one thing well — and it does it with the precision and transparency that scientific work demands.

Want a workflow like this?

From biotech labs to busy professionals, Khula builds AI that gives you your time back. Tell us what eats your week — the first discovery call is free.