UDP offload
Inbound market data is received and offloaded directly in fabric off the 10/40 GbE MAC, with no kernel network stack involved.
From the moment a packet hits the wire to the moment an order leaves it, every stage that matters runs inside the FPGA. There's no kernel in the way and no PCIe round-trip, so the timing holds steady down to the nanosecond.
Each stage hands off to the next inside the same chip. The host sets things up and watches, but it never sits between the market and the matching engine.
Inbound market data is received and offloaded directly in fabric off the 10/40 GbE MAC, with no kernel network stack involved.
The ITCH feed handler decodes messages and reconstructs a 5–10 level L2 order book entirely in hardware.
The quoting engine and pre-trade risk evaluate the book in the same clock domain. MQC can pull every quote in sub-microsecond time.
Triggered orders are encoded into the exchange's OUCH protocol directly in fabric, ready for egress.
A custom hardware TCP/IP stack sends the order out through the dedicated TCP MAC to the matching engine.
The timing barely moves run to run, because the whole decision happens in the chip rather than in software.
Nothing here is a black box. Generate cores from C/C++ with HLS, or bring your own SystemVerilog and run it virtualized in the fabric. Either way, you can replay captured traffic through it before it goes near production.
| Specifications | |
| Wire-to-trade | < 150 ns, deterministic (FPGA-dependent) |
| Market data | ITCH UDP, in-fabric L2 book (5–10 levels) |
| Order entry | OUCH over hardware TCP/IP |
| Networking | 10 GbE & 40 GbE, dedicated UDP & TCP MACs |
| Risk | Hardware MQC, sub-microsecond reaction |
| Hardware | Xilinx / AMD FPGA accelerator cards |
| Scale | Multi-symbol, multi-card, horizontal per rack |
Latency depends on card, venue and configuration. Tell us your setup and we'll talk specifics.