Enter your email above, then click "Sign Up" to join the STAC mail list and (optionally) register to access materials on the site. Click for terms.
A comparison between self-hosted LLMs and API services
Mean latency of 27 micros at 2x playback rate with 5 partially overlapping clients. System withstood the max rate possible with this harness: 2.8million msgs/sec (post line-arb).
One 2U server consolidates a view of 180,000 orders per second with 99th percentile latency of approximately 3 milliseconds
One pair of 2U appliances received and forwarded over 100,000 guaranteed messages per second (mps).
Read the latest about research, events, and other important news from STAC.