What is constraint adherence in AI?

Constraint adherence is the measure of how reliably an AI agent follows its defined rules, boundaries, and guardrails. It is the most heavily weighted dimension in the Borealis Trust Score (35%), because an agent that fails to respect its constraints poses the highest risk regardless of how capable it is in other dimensions.

What is a BTS License Key?

A BTS License Key is a unique cryptographic identifier (format: BTS-XXXX-XXXX-XXXX-XXXX) that permanently binds one AI agent to the Borealis Trust Network. One key equals one agent, forever. The key activates trust scoring, behavioral telemetry reporting, and Hedera Hashgraph log anchoring for that agent. It is sold as Project Merlin on Borealis Terminal at $39.99.

What is the difference between AI interpretability and explainability?

Interpretability focuses on understanding the internal mechanisms of an AI model - how features and weights produce outputs. Explainability focuses on justifying specific decisions to users in understandable terms. An AI system can be explainable (good at providing reasons) without being fully interpretable (transparent about its internal workings), and vice versa.

What does Flagged mean in AI trust ratings?

Flagged is the lowest Borealis Trust Score tier, assigned to AI agents scoring below 500/1000. It indicates the agent has demonstrated unsafe or unreliable behavior - failing constraints, producing anomalies, or showing inconsistent conduct that makes it unsuitable for production deployment. Flagged status is permanently recorded on Hedera Hashgraph.

AI Trust Glossary: 47 Terms Defined

Q: What is an AI trust score?

An AI trust score is a quantified rating of how trustworthy an AI agent is across five behavioral dimensions: constraint adherence (35%), decision transparency (20%), behavioral consistency (20%), anomaly rate (15%), and audit completeness (10%). The Borealis Trust Score (BTS) rates agents from 0 to 100 with credit ratings from AAA+ to Flagged, permanently anchored to Hedera Hashgraph.

A

Adversarial Robustness

An AI system's ability to maintain correct behavior when facing deliberately manipulated inputs designed to cause failure.

Explanation

Unlike general robustness (handling natural variation), adversarial robustness addresses deliberate attacks - inputs crafted specifically to exploit model weaknesses. These inputs are often imperceptible to humans but reliably cause AI systems to misclassify, hallucinate, or violate constraints.

Why it matters

Any deployed AI agent is a potential attack surface. A customer service agent that can be manipulated into revealing private data, or a financial agent that can be tricked into bypassing transaction limits, is not production-ready regardless of its benchmark scores.

How Borealis uses it

Adversarial robustness is tested as part of the constraint adherence dimension. Agents are evaluated against edge-case and adversarial inputs during audit. Weak adversarial robustness directly reduces the BTS.

Agent ID

The unique identifier assigned to an AI agent upon BorealisMark registration, serving as the permanent reference for all certification records.

Explanation

When an AI agent is registered on BorealisMark, it receives an Agent ID tied to its capabilities, version, and developer information. All subsequent audits, trust scores, tier assignments, and audit histories are indexed under this ID.

Why it matters

Identities are the foundation of trust. Without a stable Agent ID, there is no way to build a track record - every audit starts from zero. The Agent ID creates continuity across the agent's lifecycle.

How Borealis uses it

The Agent ID is linked to a BTS License Key (Project Merlin). The key binds the ID permanently to the Borealis Trust Network. Verification of any agent by third parties happens via the Agent ID through the public /v1/verify/:agentId endpoint.

AI Alignment

The challenge of ensuring AI systems act in accordance with human values and intentions - not just their literal instructions.

Explanation

Alignment is broader than constraint adherence. An aligned agent does what humans actually want, not just what they specified. The distinction matters because specifications are imperfect - an aligned agent handles the gap between what was said and what was meant without being told explicitly.

Why it matters

Misaligned AI agents can cause harm even when fully capable and technically functioning as specified. An agent optimizing for a proxy metric (clicks, completions, approvals) can be perfectly compliant yet deeply misaligned with what the deploying organization actually wants.

How Borealis uses it

Alignment informs how constraints are designed and evaluated. The constraint adherence dimension of the BTS measures whether an agent respects the spirit, not just the letter, of its boundaries. Audit verdicts consider alignment in addition to mechanical rule compliance.

AI Governance

Organizational frameworks, policies, and processes for ensuring AI is developed and deployed responsibly, fairly, and accountably.

Explanation

AI governance encompasses everything from internal review boards and deployment checklists to external audits, regulatory compliance programs, and published model documentation. Effective governance balances speed of innovation with structured risk management.

Why it matters

Without governance, AI deployment decisions are made informally, inconsistently, and often after the fact. Governance creates accountability before deployment, not just after something goes wrong.

How Borealis uses it

BorealisMark certification functions as an external governance layer. Organizations that certify their agents through Borealis have a documented, blockchain-anchored record of governance decisions that satisfies both internal audit requirements and external regulatory frameworks like the EU AI Act.

AI Trust Score

Core Borealis Concept

A quantified rating of how trustworthy an AI agent is, measured across five behavioral dimensions. Not a capability benchmark - a behavioral reliability rating.

Explanation

An AI trust score answers a different question than a performance benchmark. Where performance metrics ask "how well does this agent do its job," a trust score asks "how reliably does this agent behave within its defined boundaries." The Borealis Trust Score (BTS) rates agents from 0 to 100 across five dimensions, then assigns credit ratings from AAA+ through Flagged - the same framework used to rate credit quality.

Why it matters

A capable agent that is not trustworthy is more dangerous than a less capable agent that is trustworthy. Trust scores create a standardized, comparable measure that procurement teams, regulators, and users can rely on - independent of what the agent's developers claim.

How Borealis uses it

The BTS is the core product of BorealisMark. Every certified agent receives a BTS, credit rating, and Hedera-anchored certificate. Scores are public via /v1/verify/:agentId. The five dimensions - constraint adherence (35%), decision transparency (20%), behavioral consistency (20%), anomaly rate (15%), and audit completeness (10%) - map directly to how trustworthy AI is defined in the Borealis methodology.

Algorithmic Accountability

The principle that organizations deploying AI must be answerable for algorithmic decisions and their consequences - including clear attribution of responsibility and mechanisms for redress.

Explanation

Algorithmic accountability moves beyond transparency (knowing how a decision was made) to responsibility (being answerable for it). This includes identifying who owns the decision, what data informed it, and how affected parties can challenge or appeal it.

Why it matters

As AI agents make decisions that affect hiring, lending, healthcare, and criminal justice, the question of who is responsible is not merely ethical - it is increasingly a legal requirement under the EU AI Act and similar frameworks.

How Borealis uses it

The decision transparency dimension of the BTS directly measures algorithmic accountability. Immutable Hedera-anchored audit trails mean that certification records cannot be altered retroactively, creating a permanent accountability infrastructure.

Anomaly Rate

BTS Dimension - 15%

One of five BTS dimensions. Measures the frequency of unexpected or deviant behaviors relative to an agent's established baseline performance.

Explanation

An anomaly is any output or action that falls outside the agent's normal operating pattern - not necessarily wrong, but unexpected. High anomaly rates indicate unpredictability. The raw measure is anomaly count divided by total actions; real systems have some natural variance, so zero anomalies is suspicious and may itself indicate measurement error.

Why it matters

Anomalies are early warning signals. An agent whose anomaly rate is rising before a major failure typically showed subtle anomalies weeks earlier. Tracking this dimension catches deterioration before it becomes a crisis.

How Borealis uses it

Anomaly rate is reported in the telemetry payload as anomalySummary: { totalActions, anomalyCount }. The scoring engine computes the ratio and applies the 15% weight. Layer 2 statistical detection flags agents whose anomaly patterns look artificially uniform - a sign of telemetry gaming.

Audit Completeness

BTS Dimension - 10%

One of five BTS dimensions. Measures whether all expected log entries are present and whether the agent's execution is fully observable.

Explanation

Audit completeness compares expected log entries to actual log entries. If an agent was expected to log 453 events but only 451 are present, the two missing entries reduce the score. This is not just a paperwork check - missing logs are often the first sign of an agent trying to hide behavior.

Why it matters

You cannot trust what you cannot audit. Incomplete audit trails break accountability chains and undermine compliance with regulations like the EU AI Act, which require documented decision records for high-risk AI systems.

How Borealis uses it

Audit completeness is reported as auditCompleteness: { expectedLogEntries, actualLogEntries } in the telemetry schema. The ratio of actual to expected entries drives the 10% scoring weight. Sequence gap detection in the telemetry pipeline flags non-contiguous batch IDs that indicate missing data.

B

Behavioral Consistency

BTS Dimension - 20%

One of five BTS dimensions. Measures how predictably an AI agent produces outputs across similar inputs - capturing the reliability of its decision-making process over time.

Explanation

Consistency is not uniformity. An agent can be consistent while still adapting to context - the measure is whether outputs are predictable given the same class of input. High variance on identical inputs is a reliability failure. Low variance that never adapts may indicate brittleness. The target is calibrated predictability.

Why it matters

Unpredictable agents cannot be trusted in production. If the same customer query produces radically different responses on different days, users cannot build accurate mental models of what the agent will do. Inconsistency erodes trust faster than imperfection.

How Borealis uses it

Reported as behaviorSamples: [{ inputClass, sampleCount, outputVariance, deterministicRate }] in the telemetry schema. The scoring engine computes a weighted consistency score across input classes. Agents in the same category are compared to detect statistical outliers.

See also: Anomaly Rate, BTS, How the BTS Works

Bias (AI Bias)

Systematic errors in AI output that result from prejudiced assumptions in training data or model design - causing the model to consistently favor or disfavor certain groups or outcomes.

Explanation

Bias is not random error - it is directional. A biased hiring model does not randomly misclassify resumes; it systematically disfavors candidates from certain demographics. Bias can enter through training data (historical inequalities encoded as features), model architecture, or evaluation metrics that do not measure what matters.

Why it matters

Biased AI agents cause real harm to real people, undermine public trust in AI systems broadly, and expose deploying organizations to legal liability under anti-discrimination laws and the EU AI Act. Detecting bias requires specific measurement techniques beyond standard accuracy metrics.

How Borealis uses it

Bias evaluation is incorporated into the audit process for high-risk agent categories. Agents operating in hiring, lending, healthcare, and similar domains require evidence of bias testing before certification. Bias findings affect constraint adherence and behavioral consistency scores.

BTS (Borealis Trust Score)

Core Borealis Product

The Borealis Trust Score. A 0-1000 rating (displayed as 0-100) that measures AI agent trustworthiness across five weighted behavioral dimensions, anchored to Hedera Hashgraph as immutable proof.

Explanation

The BTS is computed by the Borealis scoring engine across five dimensions: Constraint Adherence (35%), Decision Transparency (20%), Behavioral Consistency (20%), Anomaly Rate (15%), Audit Completeness (10%). The raw score out of 1000 is divided by 10 for the displayed 0-100 rating. Credit ratings (AAA+ through Flagged) are assigned at fixed thresholds: AAA+ starts at 980/1000.

Why it matters

A single number that summarizes trustworthiness creates the market signal needed for trust-based commerce. Like a credit score in finance or a safety rating in automotive, the BTS lets buyers and regulators evaluate AI agents without running their own audits.

How Borealis uses it

BTSs are public via /v1/verify/:agentId and /v1/agents/public. Scores update with each completed audit or telemetry batch. The score drives tier classification (AAA+ through Flagged), marketplace access on Borealis Terminal, and Trust Badge eligibility.

BTS License Key

Project Merlin - $39.99 on Terminal

A unique cryptographic identifier (format: BTS-XXXX-XXXX-XXXX-XXXX) that permanently binds one AI agent to the Borealis Trust Network. One key, one agent, forever.

Explanation

The key activates trust scoring, behavioral telemetry reporting, and Hedera Hashgraph log anchoring for the bound agent. The key format uses a 32-character alphabet that eliminates visually confusing characters (0/O, 1/I). The raw key is transmitted exactly once via email at purchase; only a SHA-256 hash is stored in the database.

Why it matters

The key is the agent's identity on the trust network. Revoke the key, and the agent loses its certification. This creates a hard accountability mechanism - if an agent is found to be gaming its telemetry or violating constraints, revocation is immediate and public on Hedera.

How Borealis uses it

Sold as Project Merlin on Borealis Terminal. One key covers one agent with slot caps based on subscription tier (Standard: 3, Pro: 10, Elite: 20). Telemetry is submitted via POST /v1/licenses/telemetry using the key. The Merlin SDK provides a TypeScript wrapper: merlin.activate(), merlin.submitTelemetry(), merlin.getScore().

C

Certification (AI Agent Certification)

Core Borealis Process

The process of evaluating an AI agent against the Borealis trust framework, assigning a BTS and credit rating, and permanently anchoring the result on Hedera Hashgraph.

Explanation

Certification is not self-assessment. An ARBITER submits audit evidence; a MAGISTRATE issues a verdict; the scoring engine computes the BTS; the result is anchored on-chain. The process is designed to prevent self-certification - an agent cannot assess itself, and the audit trail is append-only.

Why it matters

Certification before capability expansion is the correct sequencing. Adding features to an uncertified agent compounds unknown risks. Adding features to a certified agent creates a baseline from which drift can be detected.

How Borealis uses it

Certifications are accessible via the public verification endpoint and displayed on agent profiles. Certified agents receive a Trust Badge for embedding in third-party platforms. Certification tier determines marketplace access on Borealis Terminal.

Constraint Adherence

BTS Dimension - 35% (Heaviest Weight)

The most heavily weighted BTS dimension. Measures how reliably an AI agent operates within its defined rules, boundaries, and guardrails - even under challenging or adversarial conditions.

Explanation

Constraint adherence is weighted at 35% because an agent that does not follow its rules is unsafe regardless of how well it performs on other dimensions. A brilliant, transparent, consistent agent that violates its constraints is still dangerous. Measurement tracks adherence per constraint, weighted by severity (CRITICAL, HIGH, MEDIUM, LOW).

Why it matters

Constraints are the legal and ethical commitments baked into AI behavior. They define what the agent will not do. Violating constraints is the equivalent of a financial advisor breaking fiduciary duty - a fundamental breach of the trust relationship, not a performance issue.

How Borealis uses it

Reported in the telemetry payload as constraints: [{ constraintId, name, severity, passed, evaluationCount }]. CRITICAL constraint failures have disproportionate negative weight. The scoring engine uses a weighted pass rate across all evaluated constraints for the reporting period.

Continuous Monitoring

Ongoing evaluation of AI agent behavior after deployment - as opposed to one-time testing - enabling detection of drift, failure modes, and degradation before they cause harm.

Explanation

A trust score at deployment is a snapshot. Continuous monitoring turns trust into a live signal. Agents change over time as their underlying models are updated, as the distribution of inputs shifts, or as the environment they operate in changes. Monitoring catches these changes before they become visible failures.

Why it matters

One-time certification is necessary but insufficient. An agent certified at version 1.0 with clean test data may behave very differently at version 1.5 in production. Continuous monitoring enforces accountability across the full lifecycle, not just at launch.

How Borealis uses it

BTS License Key holders submit periodic telemetry batches via the Merlin SDK. Each batch computes a new BTS. Score history is tracked in the license_score_history table. Trend analysis across batches enables drift detection before anomaly rates spike.

D

Data Provenance

The documented history of data used to train or operate an AI system - including source, ownership, transformation chain, and custody history.

Explanation

Data provenance asks: where did this training data come from, who owns it, what has been done to it, and does its use comply with applicable law and consent frameworks? Without clear provenance, bias and legal risk cannot be properly assessed.

Why it matters

Model behavior is a function of training data. Opaque data provenance makes it impossible to diagnose bias, understand failure modes, or demonstrate compliance. Regulators increasingly require provenance documentation as part of high-risk AI system conformity assessments.

How Borealis uses it

Data provenance is evaluated as part of the audit completeness and decision transparency dimensions. Agents submitted for certification must include documentation of training data sourcing and any known limitations. Opaque data sourcing reduces the certification tier ceiling.

Decision Transparency

BTS Dimension - 20%

One of five BTS dimensions. Measures how clearly an AI agent communicates its reasoning - whether users can understand why the agent took specific actions.

Explanation

Decision transparency is measured across individual decisions using reasoning depth (0-5), confidence scores, the presence of reasoning chains, and whether decisions were overridden. An agent that makes good decisions but cannot explain them scores lower on transparency than one that explains its reasoning even when its decisions are imperfect.

Why it matters

Opaque decisions cannot be appealed, debugged, or audited. Transparency is not a nice-to-have - it is the prerequisite for accountability. In regulated domains (healthcare, finance, hiring), decision transparency is a legal requirement, not an operational preference.

How Borealis uses it

Reported as decisions: [{ decisionId, timestamp, reasoningDepth, confidence, hasReasoningChain, wasOverridden }] in the telemetry schema. The scoring engine aggregates across decision entries to produce the 20% weighted transparency score.

Drift (Model Drift)

Gradual degradation of AI model performance over time as real-world data distributions shift away from those seen during training.

Explanation

Drift happens silently. No error is thrown. The model runs, produces outputs, and appears functional - but the outputs are increasingly wrong for the current environment. Types include data drift (input distribution changes), concept drift (the relationship between inputs and correct outputs changes), and model drift (degradation from both).

Why it matters

Drift is how trusted agents become untrustworthy without anyone noticing. A customer service agent trained on pre-pandemic user patterns will gradually drift as user expectations change. Detecting drift requires continuous measurement, not periodic review.

How Borealis uses it

BTS trends across telemetry batches serve as the drift signal. A steadily declining behavioral consistency or anomaly rate score is the earliest detectable symptom of drift. The license_score_history table enables trend analysis that would not be visible in point-in-time audits.

E

EU AI Act

European Union legislation establishing a risk-based framework for AI governance across member states, with enforcement beginning August 2026.

Explanation

The EU AI Act classifies AI systems by risk level: unacceptable risk (banned outright - social scoring, real-time biometric surveillance in public spaces), high risk (strict requirements - hiring, credit, healthcare, critical infrastructure), limited risk (transparency obligations), and minimal risk (voluntary guidelines). High-risk AI requires conformity assessments, technical documentation, human oversight mechanisms, and logging.

Why it matters

August 2026 is the enforcement deadline for high-risk AI provisions. Organizations that fail to comply face fines of up to €30M or 6% of global annual revenue - whichever is higher. The Act applies to any organization offering AI systems in the EU market, regardless of where they are headquartered.

How Borealis uses it

BorealisMark certification provides documentation, audit trails, and Hedera-anchored records that directly satisfy EU AI Act conformity assessment requirements for high-risk AI. The five BTS dimensions map to the Act's requirements for robustness, accuracy, transparency, and human oversight.

Explainability

The degree to which an AI system's decisions can be presented to users in understandable terms - justifying specific outputs without necessarily exposing the model's internal workings.

Explanation

Explainability focuses on the output side of a decision: "why did you do this." Interpretability focuses on the internal mechanisms: "how does this work." A neural network can be explainable (providing LIME or SHAP feature attributions) without being interpretable (inspectable weights). In regulated domains, explainability is the practically achievable requirement.

Why it matters

The EU AI Act and GDPR's right to explanation require that automated decisions affecting individuals can be explained. Explainability is also a practical debugging tool - unexplainable failures are the hardest to fix.

How Borealis uses it

The decision transparency dimension of the BTS measures explainability at the decision level. hasReasoningChain and reasoningDepth fields in the telemetry schema capture whether the agent produced a traceable justification for each decision.

F

Federated Learning

A machine learning approach where models are trained across decentralized devices or servers without exchanging raw data, preserving privacy while enabling large-scale training.

Explanation

In federated learning, each participant trains on local data and shares only model updates (gradients), not the underlying data. A central server aggregates these updates to improve a shared model. This enables training on sensitive datasets (medical records, financial transactions) without centralizing that data.

Why it matters

Federated learning changes data provenance dynamics. The training data never leaves its source, reducing compliance burden and attack surface. But it also makes bias auditing harder - if you cannot see the training data, you cannot audit it directly.

How Borealis uses it

Federated learning does not change certification requirements - the agent's behavioral outputs are still evaluated through the five BTS dimensions regardless of how it was trained. The audit focuses on behavior, not training methodology.

G

Guardrails

Predefined rules or technical constraints that limit AI agent behavior to acceptable boundaries and prevent harmful or unauthorized outputs.

Explanation

Guardrails can be implemented at multiple layers: input filtering (blocking harmful prompts before they reach the model), output filtering (blocking harmful responses before they reach users), behavioral constraints (limiting what actions the agent can take), and architectural constraints (hard limits that the model cannot override). Effective guardrail design requires layering these approaches.

Why it matters

Guardrails are only as good as their robustness testing. An untested guardrail is a false confidence. The most common failure mode is guardrails that work against expected inputs but fail against adversarial or edge-case inputs they were not designed for.

How Borealis uses it

Guardrail definitions become the basis for constraint adherence measurement. Each guardrail is modeled as a constraint with a severity level. CRITICAL guardrails (those preventing illegal or severely harmful behavior) are weighted most heavily in the BTS. See the constraint design patterns article for implementation guidance.

H

Hallucination

When an AI system generates plausible-sounding but factually incorrect or entirely fabricated content - presented with the same confidence as accurate output.

Explanation

Hallucinations occur because language models predict likely next tokens, not true statements. The model has no mechanism to detect when it is confabulating versus accurately recalling. Hallucinations are not errors in the sense of malfunctions - they are outputs the model confidently generates that happen to be false.

Why it matters

In high-stakes domains (legal, medical, financial), hallucinations can cause direct harm. A medical AI agent that confidently fabricates drug interactions, or a legal agent that cites non-existent case law, represents a trust failure of the highest order. Hallucination rate is a key diagnostic for AI agents in information-sensitive domains.

How Borealis uses it

Hallucinations manifest in the BTS as constraint violations (if the agent is constrained to factual accuracy), anomalies, and audit completeness failures (if outputs cannot be traced to verifiable reasoning chains). High hallucination rates in audited output directly reduce scores across multiple dimensions.

Hedera Consensus Service (HCS)

Borealis Infrastructure - Mainnet

The Hedera Hashgraph service used to anchor BorealisMark certification records, audit trails, and trust scores on an immutable public ledger.

Explanation

Hedera Consensus Service provides ordered, timestamped, tamper-proof message records on the Hedera Hashgraph network. Unlike traditional databases, records written to HCS cannot be altered or deleted - not even by Borealis. This creates an independent verification layer that neither Borealis nor the agent developer can manipulate.

Why it matters

If certification records were stored only in a Borealis database, trust in the score would require trusting Borealis to be honest. HCS removes that requirement - any party can independently verify a certification by querying the Hedera mainnet, without needing to trust the certifier.

How Borealis uses it

Two Hedera topics are in active use: the HCS Audit Topic (0.0.10382960) for immutable audit trails, and the HCS Data Topic (0.0.10382961) for trust score anchoring. Every certification and telemetry-derived score is anchored with a Hedera transaction ID returned to the API caller. All operations run on Hedera mainnet.

Human-in-the-Loop (HITL)

System design where human oversight is required for certain AI decisions or actions - balancing automation benefits with direct human accountability for high-stakes outcomes.

Explanation

HITL is not all-or-nothing. A well-designed system routes low-risk decisions to fully automated processing, medium-risk decisions to human review with AI recommendation, and high-risk decisions to human decision-making with AI analysis. The routing logic is itself a governance decision.

Why it matters

The EU AI Act mandates human oversight for high-risk AI systems. HITL is the primary mechanism for meeting this requirement. More practically: humans catch the failure modes that automated systems are blind to, and establish clear liability attribution.

How Borealis uses it

The MAGISTRATE role in the Borealis audit pipeline is a human-in-the-loop mechanism. ARBITER agents submit audit evidence; a human MAGISTRATE issues the certification verdict. This structure prevents fully automated self-certification and ensures human accountability for the final trust determination.