Grounded, not generated
Every claim the operator makes traces to a Perplexity citation. If a human asks "where did that come from," the answer is one MongoDB query.
Perplexity is excellent at finding things on the live web. The job of the surrounding stack is to turn that into evidence operators can act on — with citations, recency checks, and an audit trail.
The stack
Updated · 2026-05-21
Perplexity does the live-web search and returns answers grounded in citations the operator can verify.
Anthropic Claude takes Perplexity's output and decides what to do with it — file a brief, update a record, escalate a flag.
Glama exposes Perplexity as one MCP tool. Caching and rate-limiting happen at the gateway, not in the prompt.
Supabase caches recent research per account to keep token cost down. MongoDB stores the citation set behind every action the operator took.
Critical research outputs land in an Obsidian vault humans can search, link, and annotate. The vault is the operator's working memory humans can audit at their own pace.
Every claim the operator makes traces to a Perplexity citation. If a human asks "where did that come from," the answer is one MongoDB query.
The operator queries fresh every time something matters; cached only when it doesn't. Glama enforces the policy so the model doesn't have to remember.
Supabase caching cuts redundant Perplexity calls during a single workflow run. Token cost stays linear with new information, not with operator chatter.
Obsidian vault means a human can read the operator's research notes the way they'd read another analyst's. Trust grows because the work is legible.
Competitive intelligence operators that brief sales teams nightly
Compliance and regulatory-update operators that watch a jurisdiction list
Prospect-research operators that enrich CRM records before outreach
Procurement and price-monitor operators that flag market moves
Pros
Cons
Pros
Cons
Pros
Cons
Set per-workflow freshness policies. Competitive price data needs sub-hour recency; regulatory monitoring may tolerate a daily refresh.
Use Glama's rate-limiting to keep Perplexity spend predictable. A runaway operator querying every minute is the most common cost surprise.
Cache Perplexity results in Supabase keyed by the canonicalised query plus a TTL. The operator only re-queries when the TTL expires or when the workflow demands fresh.
Pipe every cited URL into the MongoDB log against the action it informed. "Why did the operator say X" should be one cross-join.
For Obsidian, structure the vault as one note per account, one section per research run. Humans navigate by account; auditors navigate by action.
Add a recency check in the reasoning prompt — "if the most recent citation is older than X days for this workflow, escalate."
Industries it fits
Workflows it fits
Perplexity is excellent at retrieval and decent at summarisation. It's not designed to take action — file the brief, update the record, queue the alert. Claude does the deciding; Perplexity does the finding.
Perplexity covers public web. For paywalled sources we add a separate adapter inside Glama (vendor APIs, licensed feeds). The operator sees them as additional tools with the same calling pattern.
The reasoning prompt requires citations for every claim that drives action. If the citation doesn't support the claim, the operator escalates rather than acts. The MongoDB log shows both the citation and the operator's reasoning.
Yes — that's the default for high-stakes workflows. The operator drafts the action, attaches the citations, and waits for human sign-off. For low-stakes routine workflows the action proceeds and the human reviews after.
Three to four weeks. Week one is workflow + source mapping, week two is Glama wiring and cache policy, week three is operator-on-shadow against real queries, week four is staged handover with human review.
We don't advise on AI. We run it for you.