Xelera Silva provides best-in-class throughput and latency for XGBoost, LightGBM and CatBoost inference.
Machine learning based on gradient-boosting frameworks such as XGBoost and LightGBM is increasingly used across application domains, including algorithmic trading systems, recommender systems, bioscience, and ransomware and DDoS detection systems.
Xelera Silva overcomes the latency and throughput limitations of machine learning inference: it enables users to take advantage of in-loop machine learning inference at ultra-low latency and to eliminate throughput bottlenecks.
Sub-microsecond inference latency
Bring your own model
Concurrent model execution
Model hot swapping
High-frequency traders use decision algorithms to automate trading instructions, and these automated decisions are increasingly made by AI models for which low latency is key. Silva overcomes the latency disadvantage of machine learning: inference of XGBoost, LightGBM and CatBoost models is performed with a latency of a few microseconds, enabling our clients to make better, more sophisticated trading decisions and win speed races.
The turn-key accelerator connects to the software-based trading system and offloads the XGBoost / LightGBM / CatBoost inference to a PCIe-attached hardware accelerator card (the ultra-low-latency PCIe transfer is included in the round-trip latency).
Benchmark:
Test model: XGBoost Regression
Number of features: 100
Number of trees: 1000
Number of levels: 8
Batch size: 1
Test setup:
Xelera Silva on hardware accelerator (AMD Alveo U55C)
In addition to the turnkey version, Xelera Silva is also available as an IP core. The inline machine learning accelerator is inserted into the fast path of network-bound hardware accelerators and receives its input directly from the card's network port. This way, no data needs to cross the PCIe bus, and the corresponding transfer latency is eliminated. This product is aimed at customers with their own FPGA teams and offers the lowest latency.
Developers of HFT hardware accelerators integrate the IP core into their FPGA design to benefit from AI inference at the lowest latency.
Benchmark:
Test model: XGBoost Regression
Number of features: 100
Number of trees: 1000
Number of levels: 8
Batch size: 1
Test setup:
Xelera Silva on hardware accelerator (AMD Alveo UL3524)
Xelera Silva is a turnkey full-stack solution designed to jumpstart best-in-class AI inference acceleration.
DEB / RPM packages and FPGA bitstreams for AMD Alveo U50, U55C, U200 and U250 accelerator cards and Azure NP-series virtual machines.
API: C/C++, Python, C#
Host library to load the model onto the FPGA and run inference
Jumpstart AI inference acceleration with the provided example design
Integration and full lifecycle maintenance support
Periodic software updates
We understand that integrating a solution requires not only exceptional functionality but also transparent pricing models and reliable support. As technology evolves, so do we: we are committed to continuous innovation, ensuring that our software remains at the forefront of machine learning acceleration. With regular updates and feature enhancements, you can trust that you are always leveraging the latest advancements in the field, staying ahead of the competition and unlocking new possibilities for your projects.

Contact us today to learn more about our pricing plans and support services, and unlock the full potential of your projects.
Do you have any questions?