arifOS MCP Surface Debugging — Live vs Stated Tool Count

Live Tool Counts (2026-05-06 probe — ground truth via tools/list)

| Server | Stated | Actual | Method | |--------|--------|--------|--------| | arifOS | 13 (kernel) | 13 | MCP tools/list | | WEALTH | 79 (monolith docs) | 50 | MCP tools/list | | GEOX | 31 (container note) | 15 | MCP tools/list | | WELL | 88+ (legacy) | 45 | MCP tools/list |

Total federation: 13+50+15+45 = 123 live tools. (Earlier "18 for arifOS" report was via mixed method; tools/list confirms 13.)

Trigger

Debugging MCP surface discrepancies: tools/list ≠ expected count, or verify_public HOLD with tool count mismatch.

The Diagnostic Chain

Step 1: Check what's ACTUALLY deployed (not what you think)

# Inside container — the ground truth
docker exec arifosmcp python3 -c "
import sys; sys.path.insert(0, '/usr/src/app')
import asyncio
from arifosmcp.server import mcp
tools = asyncio.run(mcp.list_tools())
print(f'MCP surface: {len(tools)} tools')
for t in sorted(tools, key=lambda x: x.name):
    print(f'  {t.name}')
"

# Copy script if not present:
docker cp /tmp/check_mcp_tools.py arifosmcp:/tmp/check_mcp_tools.py
docker exec arifosmcp python3 /tmp/check_mcp_tools.py

# Public endpoint
curl -s https://mcp.arif-fazil.com/status.json | python3 -c "import json,sys; d=json.load(sys.stdin); print(d.get('services',{}).get('arifos',{}).get('tools','?'))"

# /health surface breakdown
curl -s https://mcp.arif-fazil.com/health | python3 -c "import json,sys; d=json.load(sys.stdin); print(json.dumps({k:d[k] for k in ['tools','apps','canonical_surface','registered_surface','probe_surface']}, indent=2))"

Step 2: Compare local git vs deployed image

# Local uncommitted changes?
cd /root/arifOS && git status --short

# git stash WIP — can interfere with diagnosis!
git stash list

# What's IN the deployed image?
docker image inspect ghcr.io/ariffazil/arifos:a-forge --format '{{index .Config.Labels "git.sha"}}' 2>/dev/null

# Git log
git log --oneline -10

Step 3: Root causes in order of likelihood

A. Local uncommitted changes ≠ deployed image Most common cause. Local patches exist but image wasn't rebuilt. Fix: git add -A && git commit --no-verify && docker build && docker push && docker compose pull && docker compose up -d

B. FastMCPApp visibility=["app"] filtering FastMCP 3.2.0 _is_model_visible() filters tools with visibility=["app"] from tools/list. mcp.add_provider(command_center_app) silently registers the provider but produces ZERO visible tools. The 20-tool surface happened because this was "fixed" by wiring CC tools directly onto main mcp server. Symptom: CC app registered, but tools/list shows fewer tools than expected.

C. CANONICAL_TOOLS vs MCP surface mismatch CANONICAL_TOOLS in constitutional_map.py defines the canonical surface. MCP surface is what mcp.list_tools() returns. These MUST agree. A tool registered in CANONICAL_TOOLS but not wired in server.py = missing. A tool wired in server.py but not in CANONICAL_TOOLS = phantom.

D. 20-tool chaos (historical) Before commit ba28b7bb, CC tools (session_status, ops_vitals, judge_action, forge_dry_run, gateway_handshake, vault_list, vault_dry_seal) were added to CANONICAL_TOOLS and wired in server.py. These are MODES of canonical tools, not separate endpoints. A tool IS what it IS. Mode ≠ Tool. The 20-tool surface was a category error — a "dry run" is a parameter, not a tool.

Step 4: The mode/tool distinction (critical)

WRONG (20-tool era):     forge_dry_run IS a separate tool
RIGHT (13-tool era):     arif_forge_execute(mode="dry_run") IS a mode parameter

WRONG (20-tool era):     vault_list IS a separate tool  
RIGHT (13-tool era):     arif_vault_seal(mode="list") IS a mode parameter

This distinction is F8 GENIUS and F10 ONTOLOGY. The oscillation 13→20→13 was DITEMPA BUKAN DIBERI — the category error had to be made to understand WHY 13 is correct.

Step 5: Fix and verify

# After fixes:
make verify-public   # must be 10/10 PASS

# Manual spot checks:
curl -s https://mcp.arif-fazil.com/health | python3 -c "import json,sys; d=json.load(sys.stdin); assert d.get('canonical_surface') == 13"
curl -s https://mcp.arif-fazil.com/ready | python3 -c "import json,sys; d=json.load(sys.stdin); assert d.get('status') == 'pass'"

Revert pitfalls (13→20→13 pattern)

After reverting from 20→13, manually check server.py for leftover artifacts:

git diff arifosmcp/server.py | head -60

Specifically watch for:

Duplicate PUBLIC_TOOLS = list_public_tools() — the revert may leave the original declaration plus a spurious logger.warning("ARIFOS MCP KANON Phase 2...") line between them
Ghost CC wiring block — lines 184–378 may still contain old CC tool registrations
Always run git diff --stat immediately after revert commits to confirm expected +22 -306 or similar

git stash WIP state

Before diagnosing, check stash list — uncommitted local changes may exist:

git stash list
git diff --stat

Local uncommitted ≠ deployed image. Container may be running OLDER code than what git log shows.

verify_public.py — also update comment blocks

The 5-line comment block BEFORE CANONICAL_TOOL_COUNT also references "20 tools" and "canonical=13 + 7 governance" — update that too or the next run will confuse.

# WRONG (leftover from 20-era):
# arifOS MCP has ONE unified tool surface — all 20 tools in constitutional_map.py.
# canonical=13 constitutional floor-enforcement tools + 7 governance tools
# (...vault_list, vault_dry_seal).

# RIGHT:
# arifOS MCP has ONE unified tool surface — 13 canonical tools (arif_verb_noun).
# No separate governance or CC surface. All 13 are in CANONICAL_TOOLS.

TCP table decode (confirm 0.0.0.0 bind)

Container's arifOS must listen on 0.0.0.0 (all interfaces), NOT 127.0.0.1 (loopback only).

docker exec arifosmcp sh -lc "cat /proc/net/tcp | awk 'NR>1 {print \$2, \$4}' | while read lport state; do [ \"\$lport\" = \"1F90\" ] && echo \"port 8080: state=\$state\"; done"
# state=0A = LISTEN. lport hex 1F90 = decimal 8080.
# remote_address column:
#   0100007F = 127.0.0.1 (loopback only) ← WRONG
#   00000000 = 0.0.0.0 (all interfaces) ← CORRECT

The #1 failure mode: local git ≠ deployed image

ASI diagnoses git log, sees recent commits, assumes container is running that code. This is wrong. The container runs a BAKED DOCKER IMAGE. Local git changes ≠ deployed changes until:

git add -A && git commit --no-verify && docker build ... && docker push && docker compose pull && docker compose up -d

ALWAYS verify container MCP surface directly:

docker exec arifosmcp python3 /tmp/check_mcp_tools.py

Never assume. Read the container.

verify_public.py tool count reference

CANONICAL_TOOL_COUNT = 13 (live probe via tools/list 2026-05-06)
Both /status.json (services.arifos.tools) and MCP tools/list must report exactly 13
If HOLD with "⚠️ tool_count" — check CANONICAL_TOOL_COUNT in verify_public.py matches actual deployed surface

The 13 canonical arifOS tools (verified via MCP tools/list)

arif_session_init   arif_sense_observe  arif_evidence_fetch  arif_mind_reason
arif_kernel_route   arif_reply_compose  arif_memory_recall    arif_heart_critique
arif_gateway_connect arif_ops_measure   arif_judge_deliberate  arif_vault_seal
arif_forge_execute

Note: The 5 CC/UI items (command_center, get_constitutional_health, list_recent_verdicts, render_vault_seal, vault_seal_card) are registered as MCP resources or modes on existing tools, not as separate MCP tools. They do NOT appear in tools/list.

Transport: "Missing session ID" HTTP 400 (stateless_http bug)

When an MCP endpoint returns HTTP 400 — "Missing session ID", the root cause is usually stateless_http=False (stateful mode) in FastMCP 3.2.4 + MCP SDK.

The bug: In stateful mode, create_new_session() generates a session ID then immediately calls handle_request() which validates it — before the client has received the session ID to echo back. Chicken-and-egg.

Fix: Use stateless_http=True on the http_handler. arifOS has this correct at arifosmcp/runtime/server.py:560.

# WRONG — causes "Missing session ID":
app = mcp_app.http_app(stateless_http=False, json_response=True)

# CORRECT:
app = mcp_app.http_app(stateless_http=True, json_response=True)

In stateless mode, GET /mcp returns 405 (expected — FastMCP stateless only supports POST/DELETE).

See skill geox-mcp-missing-session-id-fix for full debugging trace.

Key lessons — trust MCP tools/list as the source of truth

tools/list = ground truth. Do NOT trust stated counts from secondhand reports, /health metadata, or session probes that aggregate resources/modes as tools.
In this session: a "18 tools" report was accepted without verification. Direct tools/list probe showed 13. All changes were reverted. CI gate was right.
A tool registered as a resource or mode parameter does NOT appear in tools/list. This is the correct behavior — it is F8 GENIUS (mode ≠ tool).
docker ps image names can be latest locally even when GHCR tags show de038a0f in remote references — always check docker inspect for actual image SHA.

Key commits / events

ba28b7bb — revert to 13 (CC modes, not separate tools)
b39589d3 — fix verify_public.py tool count to 13
2459dfdd — the 20-tool mistake (historical — DITEMPA BUKAN DIBERI)
81d61420 — "20 tools" commit (pre-revert, confused state)
4dc8a5fe (GEOX) — stateless_http=False → True fix
4fd7fb43 — BAD commit: CANONICAL_TOOL_COUNT 13→18 (wrong probe result accepted without verification)
4f5d2c7d — revert 4fd7fb43 (live tools/list confirms 13; CI gate was right)

arifOS MCP Surface Debugging — Live vs Stated Tool Count

Live Tool Counts (2026-05-06 probe — ground truth via tools/list)

Total federation: 13+50+15+45 = 123 live tools. (Earlier "18 for arifOS" report was via mixed method; tools/list confirms 13.)

Trigger

Debugging MCP surface discrepancies: tools/list ≠ expected count, or verify_public HOLD with tool count mismatch.

The Diagnostic Chain

Step 1: Check what's ACTUALLY deployed (not what you think)

# Inside container — the ground truth
docker exec arifosmcp python3 -c "
import sys; sys.path.insert(0, '/usr/src/app')
import asyncio
from arifosmcp.server import mcp
tools = asyncio.run(mcp.list_tools())
print(f'MCP surface: {len(tools)} tools')
for t in sorted(tools, key=lambda x: x.name):
    print(f'  {t.name}')
"

# Copy script if not present:
docker cp /tmp/check_mcp_tools.py arifosmcp:/tmp/check_mcp_tools.py
docker exec arifosmcp python3 /tmp/check_mcp_tools.py

# Public endpoint
curl -s https://mcp.arif-fazil.com/status.json | python3 -c "import json,sys; d=json.load(sys.stdin); print(d.get('services',{}).get('arifos',{}).get('tools','?'))"

# /health surface breakdown
curl -s https://mcp.arif-fazil.com/health | python3 -c "import json,sys; d=json.load(sys.stdin); print(json.dumps({k:d[k] for k in ['tools','apps','canonical_surface','registered_surface','probe_surface']}, indent=2))"

Step 2: Compare local git vs deployed image

# Local uncommitted changes?
cd /root/arifOS && git status --short

# git stash WIP — can interfere with diagnosis!
git stash list

# What's IN the deployed image?
docker image inspect ghcr.io/ariffazil/arifos:a-forge --format '{{index .Config.Labels "git.sha"}}' 2>/dev/null

# Git log
git log --oneline -10

Step 3: Root causes in order of likelihood

Step 4: The mode/tool distinction (critical)

WRONG (20-tool era):     forge_dry_run IS a separate tool
RIGHT (13-tool era):     arif_forge_execute(mode="dry_run") IS a mode parameter

WRONG (20-tool era):     vault_list IS a separate tool  
RIGHT (13-tool era):     arif_vault_seal(mode="list") IS a mode parameter

This distinction is F8 GENIUS and F10 ONTOLOGY. The oscillation 13→20→13 was DITEMPA BUKAN DIBERI — the category error had to be made to understand WHY 13 is correct.

Step 5: Fix and verify

# After fixes:
make verify-public   # must be 10/10 PASS

# Manual spot checks:
curl -s https://mcp.arif-fazil.com/health | python3 -c "import json,sys; d=json.load(sys.stdin); assert d.get('canonical_surface') == 13"
curl -s https://mcp.arif-fazil.com/ready | python3 -c "import json,sys; d=json.load(sys.stdin); assert d.get('status') == 'pass'"

Revert pitfalls (13→20→13 pattern)

After reverting from 20→13, manually check server.py for leftover artifacts:

git diff arifosmcp/server.py | head -60

Specifically watch for:

Duplicate PUBLIC_TOOLS = list_public_tools() — the revert may leave the original declaration plus a spurious logger.warning("ARIFOS MCP KANON Phase 2...") line between them
Ghost CC wiring block — lines 184–378 may still contain old CC tool registrations
Always run git diff --stat immediately after revert commits to confirm expected +22 -306 or similar

git stash WIP state

Before diagnosing, check stash list — uncommitted local changes may exist:

git stash list
git diff --stat

Local uncommitted ≠ deployed image. Container may be running OLDER code than what git log shows.

verify_public.py — also update comment blocks

The 5-line comment block BEFORE CANONICAL_TOOL_COUNT also references "20 tools" and "canonical=13 + 7 governance" — update that too or the next run will confuse.

# WRONG (leftover from 20-era):
# arifOS MCP has ONE unified tool surface — all 20 tools in constitutional_map.py.
# canonical=13 constitutional floor-enforcement tools + 7 governance tools
# (...vault_list, vault_dry_seal).

# RIGHT:
# arifOS MCP has ONE unified tool surface — 13 canonical tools (arif_verb_noun).
# No separate governance or CC surface. All 13 are in CANONICAL_TOOLS.

TCP table decode (confirm 0.0.0.0 bind)

Container's arifOS must listen on 0.0.0.0 (all interfaces), NOT 127.0.0.1 (loopback only).

docker exec arifosmcp sh -lc "cat /proc/net/tcp | awk 'NR>1 {print \$2, \$4}' | while read lport state; do [ \"\$lport\" = \"1F90\" ] && echo \"port 8080: state=\$state\"; done"
# state=0A = LISTEN. lport hex 1F90 = decimal 8080.
# remote_address column:
#   0100007F = 127.0.0.1 (loopback only) ← WRONG
#   00000000 = 0.0.0.0 (all interfaces) ← CORRECT

The #1 failure mode: local git ≠ deployed image

ASI diagnoses git log, sees recent commits, assumes container is running that code. This is wrong. The container runs a BAKED DOCKER IMAGE. Local git changes ≠ deployed changes until:

git add -A && git commit --no-verify && docker build ... && docker push && docker compose pull && docker compose up -d

ALWAYS verify container MCP surface directly:

docker exec arifosmcp python3 /tmp/check_mcp_tools.py

Never assume. Read the container.

verify_public.py tool count reference

CANONICAL_TOOL_COUNT = 13 (live probe via tools/list 2026-05-06)
Both /status.json (services.arifos.tools) and MCP tools/list must report exactly 13
If HOLD with "⚠️ tool_count" — check CANONICAL_TOOL_COUNT in verify_public.py matches actual deployed surface

The 13 canonical arifOS tools (verified via MCP tools/list)

arif_session_init   arif_sense_observe  arif_evidence_fetch  arif_mind_reason
arif_kernel_route   arif_reply_compose  arif_memory_recall    arif_heart_critique
arif_gateway_connect arif_ops_measure   arif_judge_deliberate  arif_vault_seal
arif_forge_execute

Transport: "Missing session ID" HTTP 400 (stateless_http bug)

When an MCP endpoint returns HTTP 400 — "Missing session ID", the root cause is usually stateless_http=False (stateful mode) in FastMCP 3.2.4 + MCP SDK.

Fix: Use stateless_http=True on the http_handler. arifOS has this correct at arifosmcp/runtime/server.py:560.

# WRONG — causes "Missing session ID":
app = mcp_app.http_app(stateless_http=False, json_response=True)

# CORRECT:
app = mcp_app.http_app(stateless_http=True, json_response=True)

In stateless mode, GET /mcp returns 405 (expected — FastMCP stateless only supports POST/DELETE).

See skill geox-mcp-missing-session-id-fix for full debugging trace.

Key lessons — trust MCP tools/list as the source of truth

tools/list = ground truth. Do NOT trust stated counts from secondhand reports, /health metadata, or session probes that aggregate resources/modes as tools.
In this session: a "18 tools" report was accepted without verification. Direct tools/list probe showed 13. All changes were reverted. CI gate was right.
A tool registered as a resource or mode parameter does NOT appear in tools/list. This is the correct behavior — it is F8 GENIUS (mode ≠ tool).
docker ps image names can be latest locally even when GHCR tags show de038a0f in remote references — always check docker inspect for actual image SHA.

Key commits / events

ba28b7bb — revert to 13 (CC modes, not separate tools)
b39589d3 — fix verify_public.py tool count to 13
2459dfdd — the 20-tool mistake (historical — DITEMPA BUKAN DIBERI)
81d61420 — "20 tools" commit (pre-revert, confused state)
4dc8a5fe (GEOX) — stateless_http=False → True fix
4fd7fb43 — BAD commit: CANONICAL_TOOL_COUNT 13→18 (wrong probe result accepted without verification)
4f5d2c7d — revert 4fd7fb43 (live tools/list confirms 13; CI gate was right)

Adoption

ariffazil/arifos-mcp-surface-debug

$ install --global

Security Scan Results

SKILL.md

arifOS MCP Surface Debugging — Live vs Stated Tool Count

Live Tool Counts (2026-05-06 probe — ground truth via tools/list)

Trigger

The Diagnostic Chain

Step 1: Check what's ACTUALLY deployed (not what you think)

Step 2: Compare local git vs deployed image

Step 3: Root causes in order of likelihood

Step 4: The mode/tool distinction (critical)

Step 5: Fix and verify

Revert pitfalls (13→20→13 pattern)

git stash WIP state

verify_public.py — also update comment blocks

TCP table decode (confirm 0.0.0.0 bind)

The #1 failure mode: local git ≠ deployed image

verify_public.py tool count reference

The 13 canonical arifOS tools (verified via MCP tools/list)

Transport: "Missing session ID" HTTP 400 (stateless_http bug)

Key lessons — trust MCP tools/list as the source of truth

Key commits / events

Related Skills

ariffazil/XAUUSD-trading-stack

ariffazil/wealth-claim-state

ariffazil/warga-constitutional

ariffazil/warga

ariffazil/arifos-mcp-surface-debug

$ install --global

Security Scan Results

SKILL.md

arifOS MCP Surface Debugging — Live vs Stated Tool Count

Live Tool Counts (2026-05-06 probe — ground truth via tools/list)

Trigger

The Diagnostic Chain

Step 1: Check what's ACTUALLY deployed (not what you think)

Step 2: Compare local git vs deployed image

Step 3: Root causes in order of likelihood

Step 4: The mode/tool distinction (critical)

Step 5: Fix and verify

Revert pitfalls (13→20→13 pattern)

git stash WIP state

verify_public.py — also update comment blocks

TCP table decode (confirm 0.0.0.0 bind)

The #1 failure mode: local git ≠ deployed image

verify_public.py tool count reference

The 13 canonical arifOS tools (verified via MCP tools/list)

Transport: "Missing session ID" HTTP 400 (stateless_http bug)

Key lessons — trust MCP tools/list as the source of truth

Key commits / events

Related Skills

ariffazil/XAUUSD-trading-stack

ariffazil/wealth-claim-state

ariffazil/warga-constitutional

ariffazil/warga