Rules Authoring

Summary
Quick Reference
Rule Authoring Workflow
NL-to-PyRel Translation Patterns
Rule Design Principles
Complex Multi-Entity Rule Design
Rule Chaining
Cross-Reasoner Integration
Handling Missing Data in Rules
Common Pitfalls (Rules-Specific)
Examples
Reference Files

Summary

What: Business rule authoring — translating natural language rules into PyRel derived ontology properties and concepts. Covers validation, classification, derivation, alerting, and reconciliation rule types.

When to use:

Translating a business rule from natural language to PyRel (e.g., "flag high-value customers")
Classifying which rule pattern to use (validation, classification, derivation, alerting, reconciliation)
Writing define() + where() expressions that encode business logic
Reviewing or debugging an existing rule definition
Chaining rules with other reasoner outputs (graph metrics, predictions)
Testing that a rule produces correct results on known data

When NOT to use:

Discovering what rules a model can support (reasoner classification, feasibility) — see rai-discovery
PyRel syntax reference (imports, types, property patterns) — see rai-pyrel-coding
Ontology modeling decisions (concept vs property, data mapping) — see rai-ontology-design
Optimization formulation (variables, constraints, objectives) — see rai-prescriptive-problem-formulation
Query syntax (select, aggregation, filtering) — see rai-querying

Overview (process steps):

Parse the natural language rule to identify intent, entities, conditions, and output
Classify the rule type (validation, classification, derivation, alerting, reconciliation)
Ground in the current model via inspect.schema(model) before mapping entities to ontology — catches duplicates, hallucinated surface, and wrong-type inference
Translate to PyRel using the appropriate define() + where() pattern
Validate the rule against known test cases
Connect to downstream consumers (queries, other rules, or reasoner chains)

Quick Reference

# Imports (rule authoring typically needs)
from relationalai.semantics import Model, Float, Integer, String
from relationalai.semantics.std import aggregates, math, strings
from relationalai.semantics.std.datetime import datetime
from relationalai.semantics.std import numbers

model = Model("my_model")

| Rule Type | Output Type | Canonical Pattern | |-----------|------------|-------------------| | Validation | Relationship (boolean) | model.where(cond).define(Entity.is_valid()) | | Classification | Relationship to segment concept | model.where(range).define(Entity.value_segment(SegmentSubtype)) | | Derivation | Property (computed value) | model.define(Entity.total(aggregates.sum(Child.val).per(Entity))) | | Alerting | Relationship (boolean + optional severity) | model.where(model.not_(Entity.resolved), elapsed > limit).define(Entity.is_breach()) | | Reconciliation | Property (delta) + Relationship (flag) | model.define(Match.delta(A.val - B.val)) then model.where(math.abs(Match.delta) > tol).define(Match.has_discrepancy()) |

Key decision rules:

Boolean yes/no output → Relationship (validation or alerting)
Category from fixed set → typed sub-concepts with Relationship (classification)
Computed number → Property (derivation)
Two-source comparison → Property for delta + Relationship for flag (reconciliation)

Rule Authoring Workflow

Step 1: Parse the Natural Language Rule

Extract four components from the NL statement:

| Component | Question | Example (NL: "Flag orders exceeding their customer's credit limit") | |-----------|----------|----------------------------------------------------------------------| | Subject entity | What concept does the rule apply to? | Order | | Condition properties | What data fields are evaluated? | Order.amount, Customer.credit_limit | | Threshold / logic | What boundary or logic is applied? | amount > credit_limit | | Output | What does the rule produce? | Boolean flag: Order.exceeds_credit |

Additional extraction hints:

"for each X" → the rule applies .per(X) or iterates over concept X
"across all Y" → aggregation is needed (sum, count, avg)
"if ... then ..." → condition → output mapping
"unless" / "except" → negation with model.not_()

Step 2: Classify the Rule Type

| Type | NL Signals | Output Pattern | PyRel Pattern | |------|-----------|----------------|---------------| | Validation | "is valid", "complies", "within policy", "meets requirement" | Boolean flag (Relationship) | model.where(cond).define(Entity.is_valid()) | | Classification | "categorize", "tier", "grade", "segment", "bucket" | Subtype or Relationship to segment concept | model.where(range).define(SegmentSubtype(Entity)) | | Derivation | "total", "calculated from", "sum of", "derived", "equals" | Computed value (Property) | model.define(Entity.total(expression)) | | Alerting | "flag", "alert", "overdue", "exceeds", "breach", "violation" | Boolean + optional severity | model.where(violation).define(Entity.is_flagged()) | | Reconciliation | "match", "agree", "discrepancy", "difference between" | Delta value (Property) | model.define(Entity.delta(A.val - B.val)) |

Disambiguation: If the NL rule produces a boolean yes/no answer, it is validation or alerting. If it assigns a category from a fixed set, it is classification. If it computes a numeric value, it is derivation.

Step 3: Map to Ontology and Explore Data

Inspect-before-authoring is the first action of this step. Before proposing any rule, ground in the current model:

from relationalai.semantics import inspect

schema = inspect.schema(model)
concept_info = schema[subject_concept_name]     # fails loudly if the concept doesn't exist
existing_properties = concept_info.properties   # tuple[RelationshipInfo, ...] — includes inherited
existing_names = {p.name for p in existing_properties}

This catches the three silent-failure modes that account for most rules-authoring errors:

Duplicate authoring — the property is already declared (possibly under a near-synonym, or inherited from a parent concept). Adding it again wastes work and can introduce conflicts.
Hallucinated surface — the concept or property name is plausible but not real (Customer.tier vs actual Customer.category). Fails silently until a downstream query returns wrong results.
Wrong-type inference — for properties created via Concept.new(table.to_schema()), inspect.schema() enriches the type view from the backing table (e.g., shows Integer / String / Date) even though the frontend model types them as Any. Read the enriched type from inspect when writing threshold logic or bound derivation; don't guess from column names.

Verification checklist:

| Check | How to verify | If missing | |-------|--------------|------------| | Concept exists | inspect.schema(model)["ConceptName"] (raises KeyError if absent) | Define the concept or flag as model gap | | Property exists on concept | "prop_name" in {p.name for p in inspect.schema(model)[concept].properties} | Add property or enrich model | | Property type | next(p.type_name for p in inspect.schema(model)[concept].properties if p.name == "prop") (type is a string like "Integer") | Cast with numbers.integer(), floats.float(), date.fromisoformat() | | Relationship path exists | Trace join from subject to related concept; use inspect.fields(rel) to see field shape | Define relationship or denormalize needed property | | Data distribution | Query min, max, avg of condition properties | Set thresholds informed by actual data, not assumptions |

See rai-querying/references/inspect-module.md for full API.

When to skip: greenfield authoring on an empty model, or single-shot rules on a fresh session. In those cases, inspect just adds a turn without payoff.

Data exploration is mandatory for threshold rules. See Complex Multi-Entity Rule Design for the pattern. Common scale mismatches: scores 0-10 vs 0-100, ratios > 1.0, percentages as 0-1 vs 0-100.

Step 4: Translate to PyRel

Use the canonical pattern for each rule type. See pyrel-rule-patterns.md for full examples and variants.

Validation — boolean flag via unary Relationship:

Entity.is_valid = model.Relationship(f"{Entity} is valid")
model.where(condition).define(Entity.is_valid())

Classification — typed sub-concepts with mutually exclusive conditions:

HighValue = model.Concept(f"{Entity} is high value")
model.where(Entity.score > threshold).define(HighValue(Entity))

Use typed sub-concepts (not string Properties) for derived classifications. For full subtype classification with multiple tiers, see pyrel-rule-patterns.md. For enumeration vs. subtyping guidance, see rai-ontology-design categorization-and-advanced.md.

Derivation — computed value via Property:

Entity.total = model.Property(f"{Entity} has {Float:total}")
total = aggregates.sum(Child.amount).per(Entity).where(Child.parent(Entity))
model.define(Entity.total(total))

Alerting — violation flag, often time-based:

Entity.is_breach = model.Relationship(f"{Entity} is breach")
model.where(violation_condition).define(Entity.is_breach())

Reconciliation — delta with tolerance:

Match.delta = model.Property(f"{Match} has {Float:delta}")
model.define(Match.delta(Source_A.amount - Source_B.amount))
Match.has_discrepancy = model.Relationship(f"{Match} has discrepancy")
model.where(math.abs(Match.delta) > tolerance).define(Match.has_discrepancy())

Step 5: Validate

Run these checks on every rule before considering it complete:

| Check | What to verify | Failure signal | |-------|---------------|----------------| | Output type correct | Relationship for boolean, Property for values | FDError if Property gets multiple values | | Conditions exhaustive | Classification covers all entities (if intended) | Entities with no assigned value | | Conditions exclusive | No entity matches two classification branches | FDError on Property | | Join paths valid | .where() relationships traverse existing paths | Zero results despite matching data | | Type alignment | Condition compares same types | Zero matches from silent type mismatch | | Aggregation scoped | .per() present when aggregating across entities | Single global result instead of per-entity |

For exhaustiveness validation (finding unclassified entities, diagnosing gaps, and adding catch-all rules), see rule-validation-and-testing.md.

Step 6: Connect to Downstream

Rules produce derived properties that downstream consumers can query or chain:

# Query rule output
violations = model.where(Order.exceeds_credit()).select(
    Order.id.alias("order_id"),
    Order.amount.alias("amount"),
).to_df()

# Chain: rule output feeds another rule
model.where(
    Customer.value_segment(ValueSegmentVIP),
    Customer.open_cases > 3,
).define(Customer.needs_escalation())

NL-to-PyRel Translation Patterns

Condition Mapping

| NL Phrase | PyRel Translation | |-----------|-------------------| | "X is Y when condition" | model.where(condition).define(X.is_Y()) | | "X has tier based on score" | Multiple model.where(range).define(X.tier(value)) | | "total X across children" | aggregates.sum(Child.x).per(Parent).where(Child.parent(Parent)) | | "flag X where threshold exceeded" | model.where(X.value > X.limit).define(X.is_flagged()) | | "X and Y agree" | model.where(math.abs(X.v - Y.v) < epsilon).define(Match.is_reconciled()) |

Logical Operators

| NL | PyRel | Notes | |----|-------|-------| | "A and B" | model.where(A, B) | Multiple args in .where() = conjunction | | "A or B" | model.union(branch_a, branch_b) | Separate .where() calls for each branch | | "not A" | model.not_(A) | Negates full conjunction; use parentheses for clarity | | "at least N" | >= N | | | "no more than N" | <= N | | | "exactly N" | == N | |

For detailed string, numeric, date, missing-data, and aggregation syntax with examples, see rai-pyrel-coding and its standard-library.md and expression-rules.md.

Rule Design Principles

Declare the output first. Define the Property or Relationship that the rule produces before writing the logic. This makes the rule's shape explicit and reviewable.
Use Relationship for boolean flags, Property for values. Boolean outputs (is_compliant, is_overdue) are unary Relationships. Categorical or numeric outputs (risk_tier, total_cost) are Properties. Mixing these up causes FDError or silent data loss.
Conditions are conjunctive by default. Multiple arguments in .where() are AND. Use model.union() for OR conditions. Never mix AND and OR in a single .where().
Classification rules must be mutually exclusive. When defining multiple categories, ensure conditions do not overlap — otherwise the Property receives multiple values for the same entity, causing FDError. Use < on one boundary and >= on the other.
Decide exhaustive vs partial. Should every entity get a classification? If yes, include a default/catch-all rule. If no, document which entities will have no value.
Test boundary conditions. Rules with >= / < boundaries must handle the boundary value. Always verify: what happens when the value is exactly at the threshold?
Prefer data-driven thresholds. Reference properties from the ontology where possible (Entity.amount > Entity.credit_limit) rather than hardcoding values (> 10000).
One rule per derived property. Keep all conditions that produce the same output property together in one place for readability and correctness verification.

Complex Multi-Entity Rule Design

Data Exploration Before Threshold Selection

CRITICAL: Always explore the actual data distribution before choosing threshold values for rules. Assumptions about data scales can be wrong:

# Step 1: Check the actual data range BEFORE setting thresholds
stats = model.select(
    aggregates.count(Entity).alias("total"),
    aggregates.min(Entity.score).alias("min"),
    aggregates.max(Entity.score).alias("max"),
    aggregates.avg(Entity.score).alias("avg"),
).to_df()
# Example: avg_foot_traffic_score ranges 1.2–9.7 (0-10 scale), NOT 0-100!
# Using >= 80.0 would yield zero results. Use >= 7.0 instead.

Common scale mismatches:

Scores may be 0-10 (not 0-100)
Ratios may be > 1.0 when numerator and denominator are on different scales
Percentages may be stored as 0-1 (not 0-100) or vice versa

For multi-entity subtype rules (cross-entity joins, existential checks, OR conditions) and rule dependency building blocks (layered derivations), see complex-multi-entity-rules.md.

Rule Chaining

Rule-to-Rule Chaining

Rules can consume other rules' output. The derived property from Rule A becomes a condition in Rule B. For a full rule-to-rule chaining example, see rule-chaining-patterns.md.

Ordering guarantee: PyRel definitions are declarative. The runtime resolves dependencies automatically. If Rule B references Rule A's output, the engine evaluates A before B. No explicit ordering is needed in code.

Cross-Reasoner Chaining

| Chain | How rules participate | Example | |-------|----------------------|---------| | Rules → Prescriptive | Rule output constrains optimization | Compliance flag filters which entities the solver can assign | | Predictive → Rules | Predicted score feeds rule threshold | predicted_risk > 0.8 triggers alert rule | | Graph → Rules | Graph metric feeds rule condition | Centrality score below threshold flags isolated nodes | | Rules → Predictive | Rule classification becomes a feature | risk_tier used as feature in churn prediction |

Cross-Reasoner Integration

Rule outputs (boolean flags, derived values, classifications) feed other reasoners as inputs. For code examples of each integration pattern, see rule-chaining-patterns.md.

Handling Missing Data in Rules

PyRel does not raise errors on missing values — conditions silently don't match. This can cause rules to skip entities unexpectedly.

# Detect missing values
model.where(model.not_(Ticket.priority)).define(Ticket.needs_triage())

# Provide defaults with fallback operator
priority = Ticket.priority | "unknown"
order_count = aggregates.count(Order).per(Customer).where(Order.customer(Customer)) | 0

# Presence flags for downstream rules
Ticket.has_assignee = model.Relationship(f"{Ticket} has assignee")
model.where(Ticket.assigned_to).define(Ticket.has_assignee())
model.where(model.not_(Ticket.has_assignee), Ticket.priority == "p0").define(
    Ticket.needs_urgent_escalation()
)

Aggregation in rules

# Count-based classification (| 0 for zero-match groups)
order_count = aggregates.count(Order).per(Customer).where(Order.customer(Customer)) | 0
model.where(order_count >= 10).define(Customer.value_segment(ValueSegmentVIP))

# Sum-based derivation
total = aggregates.sum(Order.amount).per(Customer).where(Order.customer(Customer))
model.define(Customer.total_spend(total))

Aggregation pitfalls: Missing .per() = global aggregate. Zero-match groups omitted — use | 0. Missing property values don't contribute to sum/avg/min/max. Use distinct() when joins expand matches. For full guidance, see rai-querying.

Common Pitfalls

| Mistake | Cause | Fix | |---------|-------|-----| | FDError on classification | Overlapping conditions assign two values to same entity | Ensure mutual exclusivity: use < not <= on one boundary | | Classification misses entities | Non-exhaustive conditions | Add a default catch-all .where() clause | | Boolean flag uses Property | Treating boolean as value type | Use unary Relationship for boolean; Property for typed values | | Threshold hardcoded | Not using data-driven boundaries | Reference Entity.limit property instead of literal value | | Rule chaining circular dependency | Rule A depends on B which depends on A | Refactor to break cycle; rule dependencies must form a DAG | | Rule silently skips entities | Condition property is missing for those entities | Check for missing data with model.not_(property); add presence flag | | Aggregation-based rule gives wrong counts | Missing .per() or wrong .where() scope | Validate contributing rows with model.select() before defining the rule | | Classification + aggregation: FDError | Overlapping ranges when aggregate values land on boundary | Use strict < on one boundary, >= on the other | | define() in a Python loop | Defining rules per entity in a for loop instead of declaratively | Use model.data() + .where().define(). See rai-pyrel-coding Common Pitfalls for before/after examples | | ~Relationship() for negation | TypeError: bad operand type for unary ~: 'Expression' — Python ~ doesn't work on RAI expressions | Use model.not_(Concept.relationship()) in .where(). For set-difference queries (entities matching flag A but not flag B), either nest model.not_() or query both sets and subtract in pandas | | Unground Variables from mixed numeric comparison | Comparing a Float property to an Integer property (or vice versa) in a rule condition — types must match exactly | Cast to a common type: Entity.float_prop < floats.float(Entity.int_prop) or numbers.integer(Entity.float_prop). See rai-pyrel-coding Common Pitfalls for general ungrounded variable debugging | | Boolean flags can't be selected as columns | Unary Relationships can only be used in .where() filters, not in .select().alias() | Query flagged entity IDs separately, then merge into the main DataFrame — see pattern below |

Projecting boolean flags into a compliance table: Since boolean Relationships can't appear in select(), query each flag's matching IDs separately and merge:

def query_flag(model, relationship, concept, flag_name):
    """Query entities matching a boolean Relationship, return df with flag column."""
    df = model.where(relationship()).select(concept.id.alias("id")).to_df()
    df[flag_name] = True
    return df

# Usage: build compliance table from multiple boolean rules
base_df = model.select(Entity.id.alias("id"), Entity.name.alias("name")).to_df()
flag_df = query_flag(model, Entity.fails_check, Entity, "fails_check")
base_df = base_df.merge(flag_df, on="id", how="left")
base_df["fails_check"] = base_df["fails_check"].fillna(False)

For general PyRel pitfalls (type mismatches, aggregation scoping, join expansion, missing data, f-string syntax, rai function availability, subtype limitations, boolean negation), see rai-pyrel-coding and rai-querying.

For subtype-specific pitfalls (OR operator crashes, aggregation chaining, dot-chain navigation, cross-entity property access), see pyrel-subtype-rules.md.

Examples

| Pattern | Description | File | |---------|-------------|------| | Validation | Threshold compliance with cross-entity join | validation_rule.py | | Classification | Multi-tier entity classification with mutually exclusive ranges | classification_rule.py | | Derivation | Computed total via aggregation with property materialization | derivation_rule.py | | Alerting | Temporal threshold breach detection with missing-data handling | alerting_rule.py | | Reconciliation | Two-source delta with tolerance and severity classification | reconciliation_rule.py | | Cross-entity alerting | Disjunctive OR flags via multiple define() calls + proportional comparison | cross_entity_alerting.py |

Reference files

| Reference | Description | File | |-----------|-------------|------| | Rule patterns | Detailed PyRel code patterns for all five rule types | pyrel-rule-patterns.md | | Validation & testing | Rule validation, testing, and debugging guidance | rule-validation-and-testing.md | | Subtype rules | PyRel v1 subtype rules, f-string syntax, rai functions, boolean negation | pyrel-subtype-rules.md | | Complex multi-entity rules | Multi-entity subtype rules, cross-entity joins, rule dependency building blocks | complex-multi-entity-rules.md | | Rule chaining patterns | Rule-to-rule chaining and cross-reasoner integration code examples | rule-chaining-patterns.md | | Complex rule example | Real-world 5-entity subtype rule with OR branches and layered dependencies | complex-rule-example.md |

Rules Authoring

Summary
Quick Reference
Rule Authoring Workflow
NL-to-PyRel Translation Patterns
Rule Design Principles
Complex Multi-Entity Rule Design
Rule Chaining
Cross-Reasoner Integration
Handling Missing Data in Rules
Common Pitfalls (Rules-Specific)
Examples
Reference Files

Summary

When to use:

Translating a business rule from natural language to PyRel (e.g., "flag high-value customers")
Classifying which rule pattern to use (validation, classification, derivation, alerting, reconciliation)
Writing define() + where() expressions that encode business logic
Reviewing or debugging an existing rule definition
Chaining rules with other reasoner outputs (graph metrics, predictions)
Testing that a rule produces correct results on known data

When NOT to use:

Discovering what rules a model can support (reasoner classification, feasibility) — see rai-discovery
PyRel syntax reference (imports, types, property patterns) — see rai-pyrel-coding
Ontology modeling decisions (concept vs property, data mapping) — see rai-ontology-design
Optimization formulation (variables, constraints, objectives) — see rai-prescriptive-problem-formulation
Query syntax (select, aggregation, filtering) — see rai-querying

Overview (process steps):

Parse the natural language rule to identify intent, entities, conditions, and output
Classify the rule type (validation, classification, derivation, alerting, reconciliation)
Ground in the current model via inspect.schema(model) before mapping entities to ontology — catches duplicates, hallucinated surface, and wrong-type inference
Translate to PyRel using the appropriate define() + where() pattern
Validate the rule against known test cases
Connect to downstream consumers (queries, other rules, or reasoner chains)

Quick Reference

# Imports (rule authoring typically needs)
from relationalai.semantics import Model, Float, Integer, String
from relationalai.semantics.std import aggregates, math, strings
from relationalai.semantics.std.datetime import datetime
from relationalai.semantics.std import numbers

model = Model("my_model")

Key decision rules:

Boolean yes/no output → Relationship (validation or alerting)
Category from fixed set → typed sub-concepts with Relationship (classification)
Computed number → Property (derivation)
Two-source comparison → Property for delta + Relationship for flag (reconciliation)

Rule Authoring Workflow

Step 1: Parse the Natural Language Rule

Extract four components from the NL statement:

Additional extraction hints:

"for each X" → the rule applies .per(X) or iterates over concept X
"across all Y" → aggregation is needed (sum, count, avg)
"if ... then ..." → condition → output mapping
"unless" / "except" → negation with model.not_()

Step 2: Classify the Rule Type

Step 3: Map to Ontology and Explore Data

Inspect-before-authoring is the first action of this step. Before proposing any rule, ground in the current model:

from relationalai.semantics import inspect

schema = inspect.schema(model)
concept_info = schema[subject_concept_name]     # fails loudly if the concept doesn't exist
existing_properties = concept_info.properties   # tuple[RelationshipInfo, ...] — includes inherited
existing_names = {p.name for p in existing_properties}

This catches the three silent-failure modes that account for most rules-authoring errors:

Duplicate authoring — the property is already declared (possibly under a near-synonym, or inherited from a parent concept). Adding it again wastes work and can introduce conflicts.
Hallucinated surface — the concept or property name is plausible but not real (Customer.tier vs actual Customer.category). Fails silently until a downstream query returns wrong results.
Wrong-type inference — for properties created via Concept.new(table.to_schema()), inspect.schema() enriches the type view from the backing table (e.g., shows Integer / String / Date) even though the frontend model types them as Any. Read the enriched type from inspect when writing threshold logic or bound derivation; don't guess from column names.

Verification checklist:

See rai-querying/references/inspect-module.md for full API.

When to skip: greenfield authoring on an empty model, or single-shot rules on a fresh session. In those cases, inspect just adds a turn without payoff.

Data exploration is mandatory for threshold rules. See Complex Multi-Entity Rule Design for the pattern. Common scale mismatches: scores 0-10 vs 0-100, ratios > 1.0, percentages as 0-1 vs 0-100.

Step 4: Translate to PyRel

Use the canonical pattern for each rule type. See pyrel-rule-patterns.md for full examples and variants.

Validation — boolean flag via unary Relationship:

Entity.is_valid = model.Relationship(f"{Entity} is valid")
model.where(condition).define(Entity.is_valid())

Classification — typed sub-concepts with mutually exclusive conditions:

HighValue = model.Concept(f"{Entity} is high value")
model.where(Entity.score > threshold).define(HighValue(Entity))

Derivation — computed value via Property:

Entity.total = model.Property(f"{Entity} has {Float:total}")
total = aggregates.sum(Child.amount).per(Entity).where(Child.parent(Entity))
model.define(Entity.total(total))

Alerting — violation flag, often time-based:

Entity.is_breach = model.Relationship(f"{Entity} is breach")
model.where(violation_condition).define(Entity.is_breach())

Reconciliation — delta with tolerance:

Match.delta = model.Property(f"{Match} has {Float:delta}")
model.define(Match.delta(Source_A.amount - Source_B.amount))
Match.has_discrepancy = model.Relationship(f"{Match} has discrepancy")
model.where(math.abs(Match.delta) > tolerance).define(Match.has_discrepancy())

Step 5: Validate

Run these checks on every rule before considering it complete:

For exhaustiveness validation (finding unclassified entities, diagnosing gaps, and adding catch-all rules), see rule-validation-and-testing.md.

Step 6: Connect to Downstream

Rules produce derived properties that downstream consumers can query or chain:

# Query rule output
violations = model.where(Order.exceeds_credit()).select(
    Order.id.alias("order_id"),
    Order.amount.alias("amount"),
).to_df()

# Chain: rule output feeds another rule
model.where(
    Customer.value_segment(ValueSegmentVIP),
    Customer.open_cases > 3,
).define(Customer.needs_escalation())

NL-to-PyRel Translation Patterns

Condition Mapping

Logical Operators

For detailed string, numeric, date, missing-data, and aggregation syntax with examples, see rai-pyrel-coding and its standard-library.md and expression-rules.md.

Rule Design Principles

Declare the output first. Define the Property or Relationship that the rule produces before writing the logic. This makes the rule's shape explicit and reviewable.
Use Relationship for boolean flags, Property for values. Boolean outputs (is_compliant, is_overdue) are unary Relationships. Categorical or numeric outputs (risk_tier, total_cost) are Properties. Mixing these up causes FDError or silent data loss.
Conditions are conjunctive by default. Multiple arguments in .where() are AND. Use model.union() for OR conditions. Never mix AND and OR in a single .where().
Classification rules must be mutually exclusive. When defining multiple categories, ensure conditions do not overlap — otherwise the Property receives multiple values for the same entity, causing FDError. Use < on one boundary and >= on the other.
Decide exhaustive vs partial. Should every entity get a classification? If yes, include a default/catch-all rule. If no, document which entities will have no value.
Test boundary conditions. Rules with >= / < boundaries must handle the boundary value. Always verify: what happens when the value is exactly at the threshold?
Prefer data-driven thresholds. Reference properties from the ontology where possible (Entity.amount > Entity.credit_limit) rather than hardcoding values (> 10000).
One rule per derived property. Keep all conditions that produce the same output property together in one place for readability and correctness verification.

Complex Multi-Entity Rule Design

Data Exploration Before Threshold Selection

CRITICAL: Always explore the actual data distribution before choosing threshold values for rules. Assumptions about data scales can be wrong:

# Step 1: Check the actual data range BEFORE setting thresholds
stats = model.select(
    aggregates.count(Entity).alias("total"),
    aggregates.min(Entity.score).alias("min"),
    aggregates.max(Entity.score).alias("max"),
    aggregates.avg(Entity.score).alias("avg"),
).to_df()
# Example: avg_foot_traffic_score ranges 1.2–9.7 (0-10 scale), NOT 0-100!
# Using >= 80.0 would yield zero results. Use >= 7.0 instead.

Common scale mismatches:

Scores may be 0-10 (not 0-100)
Ratios may be > 1.0 when numerator and denominator are on different scales
Percentages may be stored as 0-1 (not 0-100) or vice versa

For multi-entity subtype rules (cross-entity joins, existential checks, OR conditions) and rule dependency building blocks (layered derivations), see complex-multi-entity-rules.md.

Rule Chaining

Rule-to-Rule Chaining

Rules can consume other rules' output. The derived property from Rule A becomes a condition in Rule B. For a full rule-to-rule chaining example, see rule-chaining-patterns.md.

Cross-Reasoner Chaining

Cross-Reasoner Integration

Rule outputs (boolean flags, derived values, classifications) feed other reasoners as inputs. For code examples of each integration pattern, see rule-chaining-patterns.md.

Handling Missing Data in Rules

PyRel does not raise errors on missing values — conditions silently don't match. This can cause rules to skip entities unexpectedly.

# Detect missing values
model.where(model.not_(Ticket.priority)).define(Ticket.needs_triage())

# Provide defaults with fallback operator
priority = Ticket.priority | "unknown"
order_count = aggregates.count(Order).per(Customer).where(Order.customer(Customer)) | 0

# Presence flags for downstream rules
Ticket.has_assignee = model.Relationship(f"{Ticket} has assignee")
model.where(Ticket.assigned_to).define(Ticket.has_assignee())
model.where(model.not_(Ticket.has_assignee), Ticket.priority == "p0").define(
    Ticket.needs_urgent_escalation()
)

Aggregation in rules

# Count-based classification (| 0 for zero-match groups)
order_count = aggregates.count(Order).per(Customer).where(Order.customer(Customer)) | 0
model.where(order_count >= 10).define(Customer.value_segment(ValueSegmentVIP))

# Sum-based derivation
total = aggregates.sum(Order.amount).per(Customer).where(Order.customer(Customer))
model.define(Customer.total_spend(total))

Common Pitfalls

Projecting boolean flags into a compliance table: Since boolean Relationships can't appear in select(), query each flag's matching IDs separately and merge:

def query_flag(model, relationship, concept, flag_name):
    """Query entities matching a boolean Relationship, return df with flag column."""
    df = model.where(relationship()).select(concept.id.alias("id")).to_df()
    df[flag_name] = True
    return df

# Usage: build compliance table from multiple boolean rules
base_df = model.select(Entity.id.alias("id"), Entity.name.alias("name")).to_df()
flag_df = query_flag(model, Entity.fails_check, Entity, "fails_check")
base_df = base_df.merge(flag_df, on="id", how="left")
base_df["fails_check"] = base_df["fails_check"].fillna(False)

For subtype-specific pitfalls (OR operator crashes, aggregation chaining, dot-chain navigation, cross-entity property access), see pyrel-subtype-rules.md.

Adoption

RelationalAI/rai-rules-authoring

$ install --global

Security Scan Results

SKILL.md

Rules Authoring

Summary

Quick Reference

Rule Authoring Workflow

Step 1: Parse the Natural Language Rule

Step 2: Classify the Rule Type

Step 3: Map to Ontology and Explore Data

Step 4: Translate to PyRel

Step 5: Validate

Step 6: Connect to Downstream

NL-to-PyRel Translation Patterns

Condition Mapping

Logical Operators

Rule Design Principles

Complex Multi-Entity Rule Design

Data Exploration Before Threshold Selection

Rule Chaining

Rule-to-Rule Chaining

Cross-Reasoner Chaining

Cross-Reasoner Integration

Handling Missing Data in Rules

Aggregation in rules

Common Pitfalls

Examples

Reference files

Related Skills

RelationalAI/rai-predictive-training

RelationalAI/rai-predictive-modeling

RelationalAI/rai-setup