Type something to search...
SMO Agent Operational Intent Benchmark
SMO Agent Operational Intent Benchmark

SMO Agent Operational Intent Benchmark

Ilias Chatzistefanidis

A benchmark suite for LLM-powered multi-agent RAN assistants in a 5G Open RAN deployment. It evaluates 150 realistic operator intents (100 observability, 50 control) and reports coherence (0–5), action accuracy, latency, and GPU/VRAM usage across cloud and local LLM backends.

release
84
66

Detailed Description

This benchmark evaluates the BubbleRAN MX-AI agentic AI platform, specifically the SMO Agent that interprets operator intents and enforces control actions in a 5G Open RAN deployment. The dataset includes 150 realistic operational prompts, divided into:

  • 100 observability queries covering KPIs, policies, slices, Custom Resource Definitions (CRDs), logs, and topology/context
  • 50 control actions, such as UE lifecycle management, blueprint deployment/deletion, and slice PRB reconfiguration

For each LLM backend, the benchmark reports:

  • Observability Coherence (0–5): Scored using an LLM-assisted evaluator (GPTScore) with explicit rubrics. Three expert annotators review disagreements and validate edge cases.
  • Action Accuracy (%): Binary correctness per control task (intent → enforced change), aggregated as a percentage.
  • End-to-End Latency (seconds): Measured from prompt submission to answer or action completion.
  • GPU Footprint / VRAM Usage: Reported for local deployments to analyze coherence–latency–resource trade-offs.

The benchmark enables fair comparison between cloud and on-prem LLM backends and quantifies how retrieval and tooling quality affect observability performance. Observability tasks require contextual reasoning across multiple data sources and are inherently more challenging than constrained, tool-based control actions.

Key Features

  • Live 5G Open RAN testbed evaluation
  • Observability Q&A and closed-loop network control tasks
  • Hybrid scoring using GPTScore and expert review
  • Multi-metric reporting including coherence, action accuracy, latency, and GPU VRAM usage
  • Cloud and on-prem LLM backend comparison

Use Cases

  • Benchmark new LLM and SLM backends for SMO-level operations
  • Compare retrieval and tool-calling strategies
  • Evaluate on-prem deployment feasibility across latency, VRAM, and answer quality
  • Measure agent time-to-action against human operators

External Resources

MX-AI: Agentic Observability and Control Platform for Open and AI-RAN


Published at IEEE ICC 2026
View Details

Technical Details


Version2026.01
Published01 Jan, 2026
Base
Platform:
BubbleRAN MX-AI
Network:
5G Standalone

Author


Ilias Chatzistefanidis
Ilias Chatzistefanidis

Tags


  • O-RAN
  • Monitor
  • Control
  • Artificial Intelligent
  • OpenAirInterface

Affiliation


  • BubbleRAN

Certified By


  • BubbleRANBubbleRAN

More from Ilias Chatzistefanidis

Uplink Power Control Agent

Uplink Power Control Agent

Ilias Chatzistefanidis

An uplink power control agent designed to perform adaptive power control for uplink channels (e.g., PUCCH and PUSCH) by dynamically adjusting parameters such as pZeroNominal in 5G networks.

O-RANMonitorControl+1
release
33
12
VM Observability Intent Agent

VM Observability Intent Agent

Ilias Chatzistefanidis

An intent-driven observability agent that translates high-level monitoring and troubleshooting intents into PromQL/MetricsQL queries against VictoriaMetrics. It delivers actionable insights, including KPIs, trends, anomaly detection, and supporting evidence, for RAN and network slicing environments in 5G O-RAN networks.

O-RANMonitorOpenAirInterface+4
release
25
13

Related Artifacts

SMO Agent

SMO Agent

Andrea Leone
Enrico Sbuttoni

The Service Management and Orchestration (SMO) Agent processes high-level intents for network lifecycle management, including deployment, monitoring, and deletion. It operates on top of the Hermes API Agent, translating service-level intents into structured orchestration requests.

O-RANArtificial IntelligentMonitor+7
develop
46
37