Skip to content
#

roofline-model

Here are 19 public repositories matching this topic...

Roofline analysis of BitNet b1.58 2B4T inference across H100, MI300X, Groq LPU, Cerebras WSE-3, and two hypothetical ternary chips. Five phases covering gate counts, precision sweeps, memory bandwidth, hybrid activation Pareto, and prefill regime, plus empirical weight-distribution validation against the published microsoft/bitnet weights.

  • Updated May 10, 2026
  • Python

High-performance discrete-event simulator (C++20/Python) for modeling agentic LLM traffic, KV cache dynamics, Prefill-Decode Disaggregation (PDD), and scheduling policies. Features roofline model analysis, K-Means request clustering, and a real-time web dashboard.

  • Updated Jun 22, 2026
  • C++

Improve this page

Add a description, image, and links to the roofline-model topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the roofline-model topic, visit your repo's landing page and select "manage topics."

Learn more