Technology

Neurosymbolic AI for enterprise knowledge

Language models for fluency. Symbolic logic for verification. Knowledge graphs for grounding. One stack, designed so every output can be traced, validated, and defended.

Schedule a Meeting

Schedule a Meeting

Read the Whitepaper

Read the Whitepaper

The premise

LLMs are powerful.
They are also wrong about things, fluently

The strength of large language models is their fluency over open-ended language. The weakness is that fluency is not a guarantee of correctness — and inside the model, there is no mechanism that distinguishes a true claim from a confidently phrased one.

For consumer applications, this is acceptable. For an enterprise that has to answer to auditors, regulators, or its own compliance team, it is not.

The standard industry response has been to wrap the LLM in retrieval (RAG), filtering (guardrails), or human review. These help, but they treat the model as a black box and patch around it. The output is still produced by an unconstrained generative process — and the constraints are advisory, not architectural.

We took a different path.

What neurosymbolic means

Two systems, one stack

A neurosymbolic system combines two ways of producing answers:

The neural side

Citations that don't hold up

Large language models — handles open-ended language, pattern recognition, and synthesis. It is what makes modern AI fluent and broadly capable.

The symbolic side

Logic, types, contracts, knowledge graphs — handles rules, constraints, and verifiable structure. It is what makes outputs inspectable and consistent.

Most current AI products are neural-only, with symbolic checks bolted on as an afterthought (filters, validators, post-hoc parsers). We build the symbolic layer into the core of the system, so it controls what the neural side is allowed to produce — not just what slips through afterwards.

The result is a stack where the language model still does what it does best, but operates inside a structure that can verify, log, and constrain its outputs by construction

Why this matters for enterprises

Three properties that fall
out of the architecture

Verifiability

Because outputs pass through a symbolic validation layer with explicit conditions, an output that fails those conditions does not ship. The system can refuse to answer, flag uncertainty, or produce a constrained response. This is structurally different from filtering an output after it has been generated.

Provenance

Because retrieval, reasoning, and validation are separate stages with their own logs, the path from query to output can be reconstructed deterministically. Every claim can be mapped to the passage that supports it. Every constraint applied is recorded.

Inspectability

Because the symbolic layer expresses rules as data — types, contracts, graph structures — the behaviour of the system can be examined and modified without retraining the underlying model. The neural component is replaceable; the rules of operation are not bound to a specific model version.

These three properties are what compliance, audit, and risk-management functions actually ask for. They are also what most enterprise AI deployments cannot produce

Deep

Dive

System architecture

Four layers, each with a defined contract

Production benchmarks against industry baselines, run on enterprise hardware.

Retrieval

Operates against indexed enterprise data using three concurrent mechanisms: dense vector retrieval, symbolic constraint satisfaction over structured fields, and traversal of the constructed knowledge graph. The candidate set returned is the union — or, depending on configuration, the constraint-respecting intersection — of these three retrieval modes. Top-k results are passed to reasoning together with their structural metadata, not just the raw passage text.

Layer 4

Provenance

Layer 3

Validation

Layer 2

Reasoning

Layer 1

Retrieval

Layer 4

Provenance

Layer 3

Validation

Layer 2

Reasoning

Layer 1

Retrieval

The contract layer

The component that distinguishes this stack

The contract layer is the architectural primitive that separates this system from prompt-engineered or guardrail-wrapped LLM stacks. It is conceptually adapted from Design by Contract (DbC) and dependent type theory, applied to the LLM-call boundary.

A contract over an LLM call is a tuple of three specifications:

01. Pre-conditions

Are predicates over the input space that must hold before the call is dispatched. In practice, this is where retrieval evidence is asserted — a contract may require that retrieved passages cover all entities mentioned in the query, or that at least one passage of a specified provenance class is present. If pre-conditions fail, the call is not made; the system either retrieves additional evidence or short-circuits to a refusal.

02. Post-conditions

Are predicates over the output space that must hold after the call returns. These include type conditions (the output must conform to a specified schema), referential conditions (every cited claim must correspond to a retrieved passage), and semantic conditions (domain-specific predicates encoded against the output structure).

03. Invariants

Are predicates over the relation between input and output. They express guarantees the call must preserve — for example, that no entity present in the output was absent from the retrieval evidence, or that no claim in the output contradicts a constraint expressed in the knowledge graph

Contract evaluation is performed by a symbolic engine, not by another LLM call. This is intentional: the validation cannot itself hallucinate, because it operates on structured representations rather than generating language. When validation fails, the diagnostic is structured — which condition failed, on which output element, with what evidence — and can be fed back to the reasoning layer as a re-prompt or surfaced to the calling application as a typed error.

The published specification of the contract layer is in Trustworthy Agent Design (arXiv:2508.03665, 2025).

The knowledge graph layer

Hybrid-driven reasoning over enterprise knowledge

Enterprise data is rarely well-modelled by either pure vector indexes or pure structured graphs. Vector indexes capture semantic proximity but lose entity structure. Graphs capture entity structure but require explicit construction and maintenance.

Our knowledge graph layer is built on a hybrid-driven reasoning architecture (HyDRA) that uses LLM-driven extraction under symbolic constraints to construct and maintain the graph. The construction process is itself contract-controlled: the LLM proposes entity and relation candidates, and the symbolic layer validates them against schema constraints, type specifications, and referential conditions before they are committed to the graph.

This means three things in practice:

Construction is incremental and traceable.

Every entity and every relation in the graph is annotated with the source passage, the contract that validated it, and the timestamp of construction. A user-facing query can resolve not only the answer but the full evidential chain through the graph.

Schema evolution is safe

The contract specifications for the construction process are versioned. When the schema changes, the system can identify which existing graph elements were validated against an outdated contract, and selectively re-validate them

Retrieval over the graph is constraint-aware

A query against the graph is not only a pattern match — it is a constraint satisfaction problem. The retrieval layer can express requirements such as "return only entities whose source provenance falls within the user's permitted scope" as part of the query itself, rather than filtering after the fact.

The published specification of the graph construction architecture is in
HyDRA - Knowledge Graph Construction (arXiv:2507.15917, 2025).

Deployment model

Cloud-optional, not cloud-dependent

The full stack is designed for on-premise deployment. This is not a feature flag — it is the default configuration, and the cloud-hosted alternative is the variant.

Model weights deploy on your infrastructure

The reasoning layer operates against an LLM that runs on hardware you control. We support multiple backends, including open-weight models (Llama, Mistral, Qwen families) and customer-provided models. There is no architectural dependency on a third-party model API.nts, structured databases, knowledge graphs, internal wikis — retrieved together, reasoned over jointly.

The retrieval and graph layers operate against your indexed data

Indexing is a one-time operation against your document repositories, databases, and structured sources. Indexed data does not leave your environment. Re-indexing is incremental.

The contract specifications and the provenance log are local artifacts

Neurosymbolic by Design

Contracts are versioned configuration that lives in your environment. The provenance log is a structured store under your control, with retention and access policies you define.

No telemetry to ExtensityAI by default

The deployed stack does not phone home. Operational metrics, error reporting, and usage data are local. If you want to share metrics with us — for joint diagnostics or roadmap input — that is an explicit configuration, not a default.

Cloud deployment is supported for use cases where on-premise is not required (typically: pilots, non-sensitive knowledge bases, or hybrid setups where the sensitive components stay on-premise and the rest run cloud-side). The architecture is the same. The boundary moves.

What is open source

The neurosymbolic engine is public. Read it. Run it. Verify what we describe

The core neurosymbolic engine — the contract layer, the symbolic validation runtime, and the framework primitives that this stack is built on — is published as the open-source SymbolicAI framework, under BSD-3.

Three things this gives you:

Independent verification

The architectural claims on this page are inspectable in code. Where we describe a contract evaluation mechanism, you can read the implementation. Where we describe how validation diagnostics are generated, you can run it.

Auditability of the foundation

For organisations whose risk function requires source-code review of AI components, the foundation layer is reviewable today, without an NDA.

Continuity

Open-source release is a commitment that the foundation layer is not held hostage by our commercial roadmap. If our priorities shift, the engine remains in the public record.

The proprietary layers above the open-source foundation — the enterprise integrations, the deployment tooling, the consulting-developed contract libraries for specific domains — are part of the commercial offering. The line between what is open and what is commercial is documented and consistent.
GitHub: github.com/ExtensityAI/symbolicai

Substance

The architecture and the research
are the same artifact

The technology described on this page is grounded in published research. The relevant work:

Trustworthy Agent Design

A practical whitepaper on designing trustworthy LLM agents with contract-based controls that validate inputs, outputs, and semantic requirements before agents act.

Read Paper

HyDRA - Knowledge Graph Construction

A whitepaper on HyDRA, a hybrid-driven reasoning architecture that uses collaborative agents, competency questions, and verifiable contracts to automate reliable knowledge graph construction.

Read Paper

SymbolicAI Framework (Open Source)

A developer-focused whitepaper on SymbolicAI, the open-source neurosymbolic framework for composing LLMs with Python-native symbolic abstractions, semantic primitives, and contract validation.

Read Paper

Full publication list available on request.

Talk to the people who built it

A technical conversation about your specific deployment constraints — data sources, regulatory context, model preferences, performance requirements — and where this architecture would land for you. We bring our engineering and research teams to the call, not just account managers

Schedule a Meeting

Read the Whitepaper

Talk to the people who built it

Schedule a Meeting

Read the Whitepaper

Neurosymbolic AI for enterprise knowledge

Neurosymbolic AI for enterprise knowledge

Schedule a Meeting

LLMs are powerful.They are also wrong about things, fluently

The strength of large language models is their fluency over open-ended language. The weakness is that fluency is not a guarantee of correctness — and inside the model, there is no mechanism that distinguishes a true claim from a confidently phrased one.

For consumer applications, this is acceptable. For an enterprise that has to answer to auditors, regulators, or its own compliance team, it is not.

We took a different path.

Two systems, one stack

Two systems, one stack

A neurosymbolic system combines two ways of producing answers:

The neural side

Citations that don't hold up

Large language models — handles open-ended language, pattern recognition, and synthesis. It is what makes modern AI fluent and broadly capable.

The symbolic side

Logic, types, contracts, knowledge graphs — handles rules, constraints, and verifiable structure. It is what makes outputs inspectable and consistent.

Most current AI products are neural-only, with symbolic checks bolted on as an afterthought (filters, validators, post-hoc parsers). We build the symbolic layer into the core of the system, so it controls what the neural side is allowed to produce — not just what slips through afterwards.

The result is a stack where the language model still does what it does best, but operates inside a structure that can verify, log, and constrain its outputs by construction

Three properties that fallout of the architecture

Verifiability

Provenance

Inspectability

These three properties are what compliance, audit, and risk-management functions actually ask for. They are also what most enterprise AI deployments cannot produce

Deep

Deep

Dive

Dive

Four layers, each with a defined contract

Production benchmarks against industry baselines, run on enterprise hardware.

Retrieval

Retrieval

The component that distinguishes this stack

The contract layer is the architectural primitive that separates this system from prompt-engineered or guardrail-wrapped LLM stacks. It is conceptually adapted from Design by Contract (DbC) and dependent type theory, applied to the LLM-call boundary.

A contract over an LLM call is a tuple of three specifications:

01.

Pre-conditions

02.

Post-conditions

03.

Invariants

Are predicates over the relation between input and output. They express guarantees the call must preserve — for example, that no entity present in the output was absent from the retrieval evidence, or that no claim in the output contradicts a constraint expressed in the knowledge graph

Hybrid-driven reasoning over enterprise knowledge

Enterprise data is rarely well-modelled by either pure vector indexes or pure structured graphs. Vector indexes capture semantic proximity but lose entity structure. Graphs capture entity structure but require explicit construction and maintenance.

This means three things in practice:

Construction is incremental and traceable.

Schema evolution is safe

Retrieval over the graph is constraint-aware

Cloud-optional, not cloud-dependent

The full stack is designed for on-premise deployment. This is not a feature flag — it is the default configuration, and the cloud-hosted alternative is the variant.

Model weights deploy on your infrastructure

The retrieval and graph layers operate against your indexed data

Indexing is a one-time operation against your document repositories, databases, and structured sources. Indexed data does not leave your environment. Re-indexing is incremental.

The contract specifications and the provenance log are local artifacts

Neurosymbolic by Design

Contracts are versioned configuration that lives in your environment. The provenance log is a structured store under your control, with retention and access policies you define.

No telemetry to ExtensityAI by default

The deployed stack does not phone home. Operational metrics, error reporting, and usage data are local. If you want to share metrics with us — for joint diagnostics or roadmap input — that is an explicit configuration, not a default.

The neurosymbolic engine is public. Read it. Run it. Verify what we describe

The core neurosymbolic engine — the contract layer, the symbolic validation runtime, and the framework primitives that this stack is built on — is published as the open-source SymbolicAI framework, under BSD-3.

Three things this gives you:

Independent verification

Auditability of the foundation

Continuity

The architecture and the researchare the same artifact

Trustworthy Agent Design

HyDRA - Knowledge Graph Construction

SymbolicAI Framework (Open Source)

Talk to the people who built it

Schedule a Meeting

Talk to the people who built it

LLMs are powerful.
They are also wrong about things, fluently

Three properties that fall
out of the architecture

The architecture and the research
are the same artifact