Technology / Legacy Modernisation
Catalyst is our proprietary multi-agent system and MCP for migrating and modernising legacy codebases. COBOL and IBM mainframe to Java and modern frameworks — verified, not hoped.
Hundreds of billions of lines of COBOL still run banking, insurance and government systems. The risk is never the language: it is that the behaviour is undocumented, the experts have left, and the system cannot be switched off while it is being replaced.
Catalyst treats modernisation as an engineering problem with a verification loop — not a translation exercise you check by eye.
COBOL on a mainframe is not a museum piece. It is processing transactions tonight. You cannot stop it to rewrite it.
The original authors retired. The documentation is partial. The behaviour lives only in the source — and in production.
Big-bang rewrites slip, blow budgets, and re-introduce bugs that the old system had quietly fixed twenty years ago.
Catalyst · proprietary multi-agent system & MCP
Catalyst is not one model translating files. It is a set of specialised agents — each scoped to a stage, each governed by our internal protocol for agent quality, readiness and security — working over a shared map of the codebase.
Catalyst reads the whole codebase and reconstructs the call graph, data flows and hidden coupling — the map nobody on the team currently holds in full.
It recovers the implicit architecture: what the modules really do, where the boundaries are, which behaviours are load-bearing and which are dead.
A sequenced plan — strangler-fig where it fits, module-by-module where it does not — so the system keeps running while it is modernised.
COBOL, IBM mainframe code and other legacy stacks translated into Java and modern frameworks — preserving behaviour, not just syntax.
Characterisation tests pinned to the legacy behaviour first, so the migration is verified against what the old system actually did — not against a spec that may not exist.
Outputs are validated against the generated tests and checked for provenance: every change is traceable to the legacy behaviour it preserves or deliberately changes.
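To make the test-first idea concrete, here is a minimal sketch of a characterisation test in Python. It is an illustration under stated assumptions, not Catalyst's implementation: `legacy_interest`, the two candidate functions, `record_golden` and `diff_against_golden` are all hypothetical names, and the "legacy" routine is a stand-in for behaviour that would in practice be captured from the running system.

```python
from decimal import Decimal, ROUND_DOWN, ROUND_HALF_UP


def legacy_interest(balance_cents: int, rate_bp: int) -> int:
    """Hypothetical stand-in for a legacy routine that truncates fractions of a cent."""
    return (balance_cents * rate_bp) // 10_000


def truncating_candidate(balance_cents: int, rate_bp: int) -> int:
    """Hypothetical migrated version that preserves the truncation."""
    amount = Decimal(balance_cents) * Decimal(rate_bp) / Decimal(10_000)
    return int(amount.quantize(Decimal("1"), rounding=ROUND_DOWN))


def rounding_candidate(balance_cents: int, rate_bp: int) -> int:
    """Hypothetical migrated version that 'tidies up' by rounding half up."""
    amount = Decimal(balance_cents) * Decimal(rate_bp) / Decimal(10_000)
    return int(amount.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


def record_golden(fn, inputs):
    """Pin the observed behaviour: map each input tuple to the output fn produces."""
    return {args: fn(*args) for args in inputs}


def diff_against_golden(fn, golden):
    """Return (inputs, expected, actual) for every case where fn departs from the pin."""
    mismatches = []
    for args, expected in golden.items():
        actual = fn(*args)
        if actual != expected:
            mismatches.append((args, expected, actual))
    return mismatches


# Illustrative, non-negative inputs only; a real battery would be drawn from production data.
INPUTS = [(100_000, 250), (12_345, 375), (1, 1), (999_999, 499)]
GOLDEN = record_golden(legacy_interest, INPUTS)

if __name__ == "__main__":
    print("truncating:", diff_against_golden(truncating_candidate, GOLDEN))
    print("rounding:  ", diff_against_golden(rounding_candidate, GOLDEN))
```

The point of the sketch is the ordering: the expected values come from what the old code does, not from a specification. A candidate that looks more correct on paper, such as the rounding version, is still reported as a behaviour change that someone must accept or reject deliberately.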
We hold Catalyst to public and internal benchmarks so a migration claim is backed by evidence a client can inspect.
Software engineering
Catalyst is evaluated on SWE-bench — resolving real GitHub issues against real repositories — as the public yardstick for end-to-end code work.
Internal · legacy translation
Internal task batteries on COBOL-to-Java and mainframe migration: behaviour preservation, test pass-through and review effort measured per module.
The full benchmark methodology and results are being written up in a technical paper, currently in preparation. Reach out if you want the current figures under NDA.
Catalyst applies the same multi-agent architecture and verification discipline we research across KVA — role-scoped agents, audited workflows, capability gated behind evidence.
We start with a Catalyst assessment: a dependency map and a sequenced modernisation plan, before a single line is migrated.