EXP ID: OGRB-2026-X8712
DATASET VER: v1.4.0-Stable
OPERATOR VER: O.i-Core v2.1
BENCHMARK: v2.5.0-Dev
COMMIT: git-9f12ac38
DOI: 10.1109/TOI.2026.39841
REPRODUCIBILITY STAMP: VALID
ACADEMIC VALIDATION RUNTIME Q1 Submission Protocol

Operator-Guided Reasoning (OGRB Platform)

Phan Thanh Trung • Independent Researcher, Vietnam • 0009-0000-7520-6781 • DOI:10.5281/zenodo.20669008 • Artifactsmaker/operatorization-framework

Reproducibility Engine

DETERMINISTIC CORES
Experiment Snapshot Profiles

Benchmark Corpus Manager

TOTAL LIBRARIES: 3 (ΩTuy, ΩBrauer, ΩDEO2)
SAMPLE DENSITY: 450 Tasks

Silicon Mapping Profile

ISA Instruction Set Profile: Unified O.i-v2
TCU-2: Active
BZNU-2: Active
CEU-2: Active

Reasoning Pipeline Topology

Deterministic Static Ready
INPUT: Context $\Psi$
DYNAMIC BOUNDS: Unified O.i Pipeline (Fully Stacked Operators)
SOLVER: Core LLM Engine
OUTPUT: Optimal Answer

Benchmark Custom Configuration

TASKS PER RUN: 120
REPEATED RUNS (N): 8
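
The configuration above can be read as drawing 120 tasks from the 450-task corpus for each of the 8 repeated runs. The following is a minimal sketch of one way to make such a plan deterministic. The sampling scheme (without replacement, one seeded generator per run), the function name `plan_runs`, and the parameter `base_seed` are illustrative assumptions, not the platform's documented behavior.

```python
import random

CORPUS_SIZE = 450     # "SAMPLE DENSITY" reported by the Benchmark Corpus Manager
TASKS_PER_RUN = 120   # "TASKS PER RUN"
REPEATED_RUNS = 8     # "REPEATED RUNS (N)"


def plan_runs(base_seed: int = 0) -> list[list[int]]:
    """Return one sorted list of task indices per run.

    Assumption: each run samples without replacement from the corpus,
    using its own generator seeded with base_seed + run index, so the
    same base_seed always reproduces the same plan.
    """
    plan = []
    for run in range(REPEATED_RUNS):
        rng = random.Random(base_seed + run)
        indices = rng.sample(range(CORPUS_SIZE), TASKS_PER_RUN)
        plan.append(sorted(indices))
    return plan
```

Calling `plan_runs()` twice with the same `base_seed` yields identical plans, which is the property a reproducibility stamp would need to rely on.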

Task Playground View

Active Sample: 1/4
Selected Reasoning Task Scenario: awaiting verification sweep. Select a task component or use "Run Custom Benchmark" to execute.
Polyhedral Invariant Bounds: idle
Theoretical Target Orbit: idle
Solver Execution Pipeline Trace: standby (the deterministic verification pipeline is ready)
Difficulty Level: Expert

Statistical Validation Module

RIGOROUS T-TEST
MEAN SUCCESS RATE: - (95% CI: [-])
STANDARD DEV (σ): - (from sample variance)
HYPOTHESIS TESTING (vs Baseline): Ready
Student t-value: -
Mann-Whitney U: -
Calculated p-value: -
Effect Size (Cohen's d): -
Awaiting a run. Once executed, the module applies a parametric Student's t-test and a non-parametric Mann-Whitney U test to the generated task distributions.
STABILITY RATE: -
VIOLATIONS RATE: -
EVALUATION SWEEP PROGRESS: 0%
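
As a rough illustration of the quantities this module lists, the sketch below computes a 95% confidence interval for the mean, a pooled-variance Student t statistic, Cohen's d, and the Mann-Whitney U statistic using only the Python standard library. The function names are hypothetical, and the hard-coded critical value assumes exactly N = 8 runs (7 degrees of freedom), matching the configured number of repeated runs. How the platform itself computes these values is not stated in the source.

```python
import math
import statistics

# Two-sided 95% critical value of Student's t with 7 degrees of freedom (N = 8).
T_CRIT_DF7 = 2.365


def pooled_variance(a, b):
    """Pooled sample variance of two independent samples."""
    na, nb = len(a), len(b)
    return ((na - 1) * statistics.variance(a)
            + (nb - 1) * statistics.variance(b)) / (na + nb - 2)


def mean_ci95(rates):
    """Mean and 95% CI for exactly 8 per-run success rates."""
    if len(rates) != 8:
        raise ValueError("T_CRIT_DF7 only applies to N = 8 runs")
    m = statistics.mean(rates)
    half = T_CRIT_DF7 * statistics.stdev(rates) / math.sqrt(len(rates))
    return m, (m - half, m + half)


def student_t(a, b):
    """Two-sample Student t statistic (equal-variance form)."""
    se = math.sqrt(pooled_variance(a, b) * (1 / len(a) + 1 / len(b)))
    return (statistics.mean(a) - statistics.mean(b)) / se


def cohens_d(a, b):
    """Effect size: mean difference in units of the pooled standard deviation."""
    return (statistics.mean(a) - statistics.mean(b)) / math.sqrt(pooled_variance(a, b))


def mann_whitney_u(a, b):
    """U statistic for sample a: pairs with a_i > b_j count 1, ties count 0.5."""
    return sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in a for y in b)
```

Turning the t or U statistic into a p-value requires the corresponding reference distribution, which the standard library does not provide. In practice, `scipy.stats.ttest_ind` and `scipy.stats.mannwhitneyu` return both the statistic and the p-value.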

Formal Operatorology Math

ΩTuy (Linear Cut Subspace Selector):
$$\Omega_{\mathrm{Tuy}}(\Psi) \rightarrow \operatorname{argmin}_{x \in \Psi} \mathcal{E}(x)$$
ΩBrauer (Invariance Orbit Projection):
$$\mathcal{H}_{0}^{\perp} = \lim_{H \to 0} g_H \quad \text{s.t.} \quad \det(g_H) \to 0$$
ΩDEO2 (Disciplined Second-Order Evolution):
$$\mathcal{D}_{\mathrm{DEO\text{-}2}}(T)=\lim_{n\to\infty}\prod_{k=1}^{n}\Big(\Pi_{t_k}\,\exp(\Delta t_k\,\Lambda(t_k))\,\Pi_{t_k}\Big)$$
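
To make the ΩDEO2 product formula concrete, here is a minimal numerical sketch that evaluates the product for a finite n on small dense matrices. Several things are assumptions, because the source does not define them: the time grid is uniform with $t_k = kT/n$, later factors are multiplied on the left, and `generator` (for $\Lambda$) and `projector` (for $\Pi$) are user-supplied callables returning square matrices as nested lists. The matrix exponential uses a truncated Taylor series, which is only adequate when $\Delta t_k\,\Lambda(t_k)$ has small entries.

```python
def matmul(a, b):
    """Dense matrix product of two nested-list matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def identity(n):
    """n-by-n identity matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def expm_taylor(m, terms=20):
    """exp(m) via a truncated Taylor series (assumes small entries in m)."""
    n = len(m)
    result = identity(n)
    term = identity(n)
    for k in range(1, terms):
        # term now holds m^k / k!
        term = [[v / k for v in row] for row in matmul(term, m)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result


def deo2_product(generator, projector, total_time, n):
    """Finite-n approximation of the projected exponential product.

    Assumptions: uniform steps t_k = k * total_time / n, and each new
    factor Pi * exp(dt * Lambda) * Pi is applied on the left.
    """
    dt = total_time / n
    acc = identity(len(generator(0.0)))
    for k in range(1, n + 1):
        t_k = k * dt
        p = projector(t_k)
        step = expm_taylor([[dt * v for v in row] for row in generator(t_k)])
        acc = matmul(matmul(matmul(p, step), p), acc)
    return acc
```

As a sanity check on the sketch, with an identity projector and a constant diagonal generator the product reduces to the ordinary exponential $\exp(T\Lambda)$, while a projector onto a subspace zeroes out every component outside it.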

Ablation Study Matrix: Configuration Comparison

ABLATION STRUCTURE | ΩTuy | ΩBrauer | ΩDEO2 | MEAN (%) | STABILITY (%) | VIOLATIONS (%) | P-VALUE vs BASELINE

Operator Contribution Gains

Success Rate comparison across configurations

Path stability over iterative epoch runs

Multi-Dimensional Performance Metric Radar

Academic Comparative Ledger

Showing benchmark results sorted by system performance • 95% Confidence Interval Listed
SYSTEM EVALUATION PIPELINE | REASONING CORE | MEAN ACCURACY (%) | STABILITY RATIO (%) | CONSISTENCY (%) | AVG LATENCY (MS) | FIDELITY SCORE (%) | CONSTRAINT VIOLATIONS (%) | 95% CI WIDTH