- SUT ID: XLRA260312
- STAC-ML
STAC-ML™ Pack for Xelera Silva with AMD Alveo™ V80 on an HPE Proliant DL385 Gen10 Plus v2 server
Type: Audited
Specs: STAC-ML™ Markets (Inference)
STAC recently performed a STAC-ML™ Markets (Inference) benchmark audit on a stack including a STAC-ML™ Pack for Xelera Silva with AMD Alveo™ V80 on an HPE Proliant DL385 Gen10 Plus v2 server.
This report represents the first STAC-ML™ Audit utilizing Gradient Boosted Trees. A relatively recent suite, codenamed “El Popo”, added to STAC-ML™ to complement the existing LSTM based suites “Tacana” and “Sumaco”.
STAC-ML Markets (Inference) is the technology benchmark standard for solutions that can be used to run inference on realtime market data. Designed by quants and technologists from some of the world's leading financial firms, the benchmarks test the latency, throughput, energy efficiency, space efficiency, and algorithm quality of a technology stack across three model sizes and different numbers of model instances.
Highlights include:
- For the small (GBT_A) and medium (GBT_B) models, 99th percentile latencies were <= 1.95µs for all Numbers of Model Instances (NMI) tested, with worst-case instance throughput > 560K inferences per second at the highest NMIs tested.
- For the large (GBT_C) model, the 99th percentile latency was 2.88µs, with worst-case instance throughput of 379K inferences per second
- The maximum latency was <= 12.3µs across all models and NMI tested.
Further results and details can be found in the report.
The benchmark reports are available to all STAC Observer members. Additionally, Insights Subscribers have access to extensive visualizations of all test results, the micro-detailed configuration information for the solutions tested, the code used in this project, and the ability to run these same benchmarks in the privacy of their own labs. To learn about subscription options, please contact us.
