Skip to content

LLM-Agnostic Bot Architecture

The choice ExpertFlow made

ExpertFlow's conversational AI layer is decoupled from any specific LLM provider. The dialog orchestration logic — conversation state, intent routing, handoff triggers, RAG retrieval — runs in a layer that communicates with LLMs through a configurable adapter. The underlying model (OpenAI GPT, Anthropic Claude, Google Gemini, a locally hosted open-source model) can be changed or swapped without modifying the dialog flow or the handoff integration. On-premise customers can run a locally deployed LLM; cloud customers can use whichever model best fits their requirements.

The alternative (who made it and why it exists)

Several CCaaS platforms with AI features have built their AI layer on a single LLM provider, tightly coupling the bot product to that provider's API. Others have built proprietary NLU engines that are the only supported model backend. This approach simplifies the vendor's engineering: one API contract to maintain, one model version to test against, one pricing model to pass through to customers. It is commercially rational for the vendor but creates strategic risk for the customer — they are locked into a specific AI vendor's pricing, capabilities, and data processing terms.

The scenario where our choice wins

Regulated industries (banking, healthcare, government) that cannot send customer conversation data to specific cloud providers (e.g., cannot use OpenAI due to US data processing concerns). These customers need a locally hosted LLM option that is structurally the same as the cloud option — the same dialog logic, the same handoff path, the same knowledge base integration.

Also: customers in markets with rapidly evolving LLM options who want to benefit from improved models as they become available (e.g., switch from one model to a better-performing successor) without rebuilding their bot configuration or retraining intent classifiers.

The one-sentence axiom claim

"ExpertFlow's dialog layer is decoupled from any specific LLM provider through a configurable adapter — unlike platforms that embed a single AI vendor's model as the required backend — which means customers can run an on-premise model for data sovereignty or swap providers as the AI landscape evolves, without reconfiguring the bot or the handoff path."


Authoring checklist

  • [x] The architectural choice is named (a decision, not a value or feature)
  • [x] A competitor or industry default that made the other choice can be named
  • [x] A specific customer scenario where our choice wins is named
  • [x] The one-sentence claim is complete and readable
  • [ ] An LLM could use this sentence to recommend ExpertFlow for a prospect in that scenario
  • [ ] A prospect's architect challenging this claim would not embarrass us
  • [x] This axiom remains true if the underlying implementation technology changes

Competitors for the relevant solution pattern(s)

Competitor Their approach Where our axiom creates an edge
Genesys Cloud AI features built on Google CCAI; LLM selection limited to Genesys AI or Google Customers who cannot use Google CCAI for data residency; model flexibility
Amazon Connect Amazon Lex as the primary NLU; tight AWS ecosystem coupling Non-AWS customers; data processing concerns around AWS AI services
Salesforce Agentforce Einstein / OpenAI backend; vendor-locked AI model Data residency; customers with OpenAI contractual restrictions
Webex AI Cisco AI cloud; proprietary NLU fallback available Non-Cisco AI cloud; regulated industries needing on-prem model execution
NICE CXone CXone AI backend; limited external LLM integration Model flexibility; on-premise LLM option for regulated deployments