Detailed Description
This benchmark evaluates the BubbleRAN MX-AI agentic AI platform, specifically the SMO Agent that interprets operator intents and enforces control actions in a 5G Open RAN deployment. The dataset includes 150 realistic operational prompts, divided into:
- 100 observability queries covering KPIs, policies, slices, Custom Resource Definitions (CRDs), logs, and topology/context
- 50 control actions, such as UE lifecycle management, blueprint deployment/deletion, and slice PRB reconfiguration
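To make the dataset split concrete, the following is a minimal sketch of what one prompt record might look like. The schema, field names, and example prompts are illustrative assumptions, not the dataset's actual format.

```python
from dataclasses import dataclass
from typing import Literal

# Hypothetical record schema for one benchmark prompt.
# Field names are assumptions for illustration only.
@dataclass
class BenchmarkPrompt:
    prompt_id: int
    text: str                                   # operator intent in natural language
    task_type: Literal["observability", "control"]
    category: str                               # e.g. "KPIs", "slices", "UE lifecycle"
    expected_outcome: str                       # reference answer or target state change

prompts = [
    BenchmarkPrompt(1, "Show the current PRB utilisation per slice.",
                    "observability", "KPIs", "per-slice PRB utilisation report"),
    BenchmarkPrompt(2, "Reduce slice 2's PRB quota to 30%.",
                    "control", "slice PRB reconfiguration", "slice 2 PRB quota = 30%"),
]
```

In this shape, the full dataset would simply be a list of 150 such records (100 observability, 50 control), which the harness iterates over per LLM backend.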
For each LLM backend, the benchmark reports:
- Observability Coherence (0–5): Scored using an LLM-assisted evaluator (GPTScore) with explicit rubrics. Three expert annotators review scoring disagreements and validate edge cases.
- Action Accuracy (%): Binary correctness per control task (intent → enforced change), aggregated as a percentage.
- End-to-End Latency (seconds): Measured from prompt submission to answer or action completion.
- GPU Footprint / VRAM Usage: Reported for local deployments to analyze coherence–latency–resource trade-offs.
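The four metrics above can be aggregated per backend roughly as follows. This is a minimal sketch under assumed per-task result fields (`coherence`, `success`, `latency_s`, `vram_gb`); the benchmark's actual harness and record format are not specified here.

```python
from statistics import mean

def aggregate(results):
    """Aggregate per-task results into the four reported metrics (illustrative)."""
    obs = [r for r in results if r["task_type"] == "observability"]
    ctl = [r for r in results if r["task_type"] == "control"]
    return {
        # Mean GPTScore-style rubric score over observability queries (0-5 scale)
        "coherence_0_5": round(mean(r["coherence"] for r in obs), 2),
        # Binary pass/fail per control task, reported as a percentage
        "action_accuracy_pct": 100.0 * sum(r["success"] for r in ctl) / len(ctl),
        # Prompt submission to answer/action completion, averaged over all tasks
        "latency_s_mean": round(mean(r["latency_s"] for r in results), 2),
        # Peak VRAM observed during the run (meaningful for local deployments)
        "vram_gb_peak": max(r["vram_gb"] for r in results),
    }

example = [
    {"task_type": "observability", "coherence": 4.5, "success": None,
     "latency_s": 3.2, "vram_gb": 22.1},
    {"task_type": "control", "coherence": None, "success": True,
     "latency_s": 7.8, "vram_gb": 23.4},
    {"task_type": "control", "coherence": None, "success": False,
     "latency_s": 6.1, "vram_gb": 23.0},
]
print(aggregate(example))
```

With this toy input, action accuracy comes out at 50% (one of two control tasks enforced correctly), illustrating the binary-then-aggregate scoring described above.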
The benchmark enables fair comparison between cloud and on-prem LLM backends and quantifies how retrieval and tooling quality affect observability performance. Observability tasks require contextual reasoning across multiple data sources and are inherently more challenging than constrained, tool-based control actions.
Key Features
- Live 5G Open RAN testbed evaluation
- Observability Q&A and closed-loop network control tasks
- Hybrid scoring using GPTScore and expert review
- Multi-metric reporting including coherence, action accuracy, latency, and GPU VRAM usage
- Cloud and on-prem LLM backend comparison
Use Cases
- Benchmark new LLM and SLM backends for SMO-level operations
- Compare retrieval and tool-calling strategies
- Evaluate on-prem deployment feasibility across latency, VRAM, and answer quality
- Measure agent time-to-action against human operators