Anbeeld

Projects Articles Support Contact

KVarN KV Cache: Implementation and Benchmarks

Tests on Qwen 3.6 27B show how an experimental BeeLlama KVarN preview moves q4-class KV memory into higher tier and beats all the alternatives in the same weight class.

  1. What Changed
  2. What KVarN Is
  3. The BeeLlama Preview
  4. The Setup
  5. How To Read The Results
  6. The Main Result
  7. The K/V Matrix
  8. TurboQuant And TCQ
  9. What This Does Not Prove
  10. Practical Takeaway
  11. Benchmark Data
    1. Perplexity
    2. KL Divergence
  12. Reference Data
    1. Perplexity
    2. KL Divergence

Disclaimer: this is still a narrow benchmark. The KVarN path tested here is a preview implementation in BeeLlama v0.3.2, not a mature optimized runtime. The precision numbers are the meaningful part of the article, but may be subject to change with future updates. The tok/s column is prompt-processing behavior on a very raw preview of this specific fork, not a final verdict on KVarN's decode or generation speed.

1. What Changed

The original KV-cache benchmark had a pretty clear sub-6-bit story: normal q4/q5 cache quantization was stronger than TurboQuant at similar practical sizes, while TCQ made the 2-3 bit turbo modes less bad. You could save memory, or you could preserve the output distribution, and the tradeoff was mostly where you expected it to be.

KVarN changes that curve. On the main Q5_K_S 64k comparison, kvarn4-kvarn4 uses 27.9% of the bf16 KV-cache footprint and lands around the q5_0/q5_1 quality tier. kvarn4-kvarn3 uses 24.8%, below ordinary q4_0, and still beats q4_0 on mean KLD. kvarn3-kvarn3 sits near the turbo3 memory class and cuts the turbo3_tcq mean KLD by a third.

That does not make KVarN lossless. The bf16 KLD reference is still far away. But for VRAM-constrained long-context setups, "q5-ish quality at q4-ish memory" is the result people are happy to see for low-bit KV-cache methods. Where TurboQuant promised that kind of shift, KVarN is the first thing in these benches that actually looks like it.

2. What KVarN Is

KVarN is a KV-cache quantization method from the paper KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks. The paper's central claim is that current KV-cache quantizers make bad token-scale errors during autoregressive decoding, and those errors accumulate over time, while KVarN reduces the outlier errors that matter most.

The mechanism has two parts. First, KVarN applies a Hadamard rotation in the channel dimension, which spreads outlier energy across coordinates before quantization. That part is familiar from QuaRot-style weight work and from the broader low-bit KV-cache literature. Second, it applies dual-axis variance normalization across both dimensions of a KV tile, so both token and channel scale variation are brought under control before round-to-nearest quantization. The paper describes this as a Sinkhorn-inspired variance-normalization step.

In plain terms: ordinary per-axis quantization can preserve one shape of scale variation and still blow up a few token norms. KVarN spends extra metadata on a second scale so the quantizer can keep those token magnitudes closer to where they were. The paper argues that the worst few percent of quantization errors do disproportionate damage, especially in decode-heavy reasoning tasks, so suppressing those outliers matters more than lowering average reconstruction error everywhere.

3. The BeeLlama Preview

BeeLlama is my llama.cpp fork for long-context experiments. The earlier benchmark used it for q6_0, TurboQuant and TCQ cache modes. This KVarN run is the same kind of fork-level experiment, but the KVarN path is much younger. It was wired in as experimental cache types named kvarn2, kvarn3 and kvarn4, plus asymmetric K/V pairs.

The BeeLlama implementation tested here follows the same broad KVarN structure: 128-token groups, 2/3/4-bit payloads, Hadamard rotation, iterative variance normalization, and stored scale/zero-point metadata. The measured sizes in this article are from the actual BeeLlama run logs, which matters because this preview stores auxiliary values as fp16 records, while the paper reports a more optimized accounting with lower auxiliary precision.

KVarN is also not just one more GGUF-style cache quant like q8_0, q5_0, q4_0, or the turbo types. In BeeLlama, kvarn2, kvarn3, and kvarn4 are CLI pseudo-types that select a separate structured KVarN cache backend. The underlying cache tensors are kept as an fp16 staging path plus KVarN records, and the KVarN configuration is stored separately from normal type_k/type_v because each record spans a full 128-token K/V tile.

That is why there are no rows like q8_0-kvarn4 in the current preview. They are not impossible in principle, but they would require a real hybrid-cache architecture: one side allocated and served by the normal KV path, the other side by KVarN, with attention graph routing, CUDA kernels, state save/load, rollback, prompt cache, seq_cp/seq_rm, DFlash backup, SWA/iSWA, and multi-sequence behavior all updated for split ownership. That would be quite complex to implement, and it is unclear whether the partial compression would be worth the extra risk.

Any speed claims also get a caveat. The paper's speed story is about an optimized vLLM-oriented decode path, while the tok/s values in this article are prompt-processing throughput from llama-perplexity. BeeLlama's current path is a raw llama.cpp integration with custom KVarN store/materialize operations and conservative memory handling. If it is slower in prompt processing today, that is operationally true for this build, but it says little about optimized generation speed.

4. The Setup

The hardware and model setup matches the original article as closely as possible. Hardware: one RTX 3090 with 24 GB VRAM, Ryzen 7 5700X3D, and 32 GB system RAM. Model: Qwen 3.6 27B. The main comparison is Q5_K_S weights at 64k context, because that is the cleaner weight quant for this purpose. IQ4_XS at 64k and 128k is included mostly as a tie-breaker and noise check; it is useful, but I do not treat it as the primary ranking signal.

The PPL run tested only symmetric KVarN cache types: kvarn4, kvarn3, and kvarn2. The KLD run tested the full 3x3 K/V KVarN matrix: kvarn4-kvarn4, kvarn4-kvarn3, kvarn4-kvarn2, and so on down to kvarn2-kvarn2. KLD also included the control rows from the original benchmark to ensure there's no significant drift from the engine changes.

All KLD rows are measured against a bf16 KV-cache baseline using llama-perplexity --kl-divergence on Wikitext-2. That is not the paper's pseudo-decode evaluation and it is not an end-to-end reasoning benchmark. It is a local distribution-fidelity test. I like it here because it catches output-distribution movement that PPL tends to hide.

The tok/s column is prompt-processing/prefill throughput from the benchmark run, not generation speed. The table sizes are the actual KV-cache sizes reported by the run logs. KVarN could still behave differently during decode, including faster after optimized kernels; these tables do not decide that question.

5. How To Read The Results

PPL makes KVarN look nearly lossless at the top settings. On Q5_K_S 64k, bf16 PPL is 5.4800 and kvarn4 is 5.4807. That looks like nothing happened. But KLD says something did happen: bf16 mean KLD is 0.000375, while kvarn4-kvarn4 is 0.002974. That is not fp16 precision at the distribution level.

That distinction matters. The KVarN paper reports near-fp16 behavior on generative benchmarks such as MATH500, AIME24 and HumanEval. That can be true at the task-score level while still being false at the tensor or distribution level. At 2-4 bits plus stored scales, there is no mathematical reason to expect fp16-equivalent cache reconstruction. What KVarN can do is move the errors into less damaging shapes and preserve the task behavior better than older cache methods.

For this article, the main metric is mean KLD on Q5_K_S 64k, with 99.9% KLD as the tail check and IQ4_XS as a tie-breaker. If a row only wins PPL, I do not count that as much.

6. The Main Result

Here is the core Q5_K_S 64k comparison. The table is deliberately smaller than the full reference data (available at the end of the article), because this is the part that explains the curve shift.

CacheSize vs bf16Mean KLD99.9% KLD99.9% precision vs bf16Read
bf16100.0%0.0003750.023258100.00%Reference
q5_137.5%0.0029110.09835492.77%Best ordinary sub-6-bit row here
kvarn4-kvarn427.9%0.0029740.09481993.09%q5-ish fidelity at q4-ish memory
q5_034.4%0.0032060.09907392.70%Slightly worse mean than KVarN4, larger cache
q5_0-q4_031.3%0.0035810.11333291.39%Old asymmetric middle tier
kvarn4-kvarn324.8%0.0038240.13502889.42%Between q4_0 and q5_0-q4_0, below q4_0 memory
q4_028.1%0.0047110.13041989.84%Old q4 anchor
turbo425.8%0.0047600.13837089.13%Same memory band, worse than kvarn4-kvarn3
kvarn3-kvarn321.7%0.0053490.16813586.51%Better turbo3-class option
turbo3_tcq20.3%0.0079780.22710481.56%Old compact recommendation
kvarn2-kvarn215.4%0.0213950.63020854.50%Compression endpoint
turbo2_tcq14.1%0.0230730.63240154.38%Similar endpoint, slightly worse mean

kvarn4-kvarn4 is the headline. It almost matches q5_1 on mean KLD, beats q5_0, and uses less memory than q4_0. The q5_1 edge is real but narrow: 0.002911 versus 0.002974. In exchange, q5_1 uses 37.5% of bf16 KV while kvarn4-kvarn4 uses 27.9%. For a VRAM-constrained long-context setup, that is not a practical win for q5_1, it's a quality tie-breaker for people with memory to spare.

kvarn4-kvarn3 is an interesting middle point. It is not simply "smaller q4_0". It sits between q4_0 and q5_0-q4_0: worse than q5_0-q4_0 on the two 64k mean-KLD rows, basically tied on the 128k IQ4_XS row, and consistently better than q4_0. Its memory footprint is lower than both: 24.8% on the 64k runs, versus 28.1% for q4_0 and 31.3% for q5_0-q4_0.

kvarn3-kvarn3 is the compact sweet spot if you were previously looking at turbo3_tcq. On the main row it uses 21.7% instead of 20.3% of bf16 KV, but the mean KLD drops from 0.007978 to 0.005349 and the 99.9% KLD drops from 0.227104 to 0.168135. That is a large quality improvement for a small memory increase.

kvarn2-kvarn2 is not the revolution. It is useful as the low endpoint, and it beats turbo2_tcq on Q5 mean KLD, but the tails are not clean enough to make it the row I would recommend first.

7. The K/V Matrix

The full 3x3 matrix shows why KVarN should not be treated as a single symmetric setting. The first rule is familiar: K matters more than V. At the same total size, reducing K hurts more than reducing V.

Q5_K_S 64k pairSize vs bf16Mean KLD99.9% KLD99.9% precision vs bf16Read
kvarn3-kvarn424.8%0.0046520.14035888.95%Lower K, higher V
kvarn4-kvarn324.8%0.0038240.13502889.42%Higher K wins
kvarn2-kvarn421.7%0.0136390.41824067.37%2-bit K is painful
kvarn3-kvarn321.7%0.0053490.16813586.51%Balanced middle wins
kvarn4-kvarn221.7%0.0104490.34039272.82%2-bit V is also painful
kvarn2-kvarn318.6%0.0145890.44501465.59%Lower K, higher V
kvarn3-kvarn218.6%0.0111220.34599572.42%Higher K wins again

The second rule is more important: do not overreact and dump all the budget into K. At 21.7% memory, kvarn3-kvarn3 crushes both asymmetric extremes. kvarn2-kvarn4 damages K too much, while kvarn4-kvarn2 damages V too much. K deserves priority, but V cannot be abandoned.

That is why the two settings that matter most are kvarn4-kvarn4 and kvarn4-kvarn3. The first gives the q5-like result. The second keeps K at 4 bits, trims V to 3, and creates a memory tier that ordinary q4/q5 cache quantization did not have.

8. TurboQuant And TCQ

The turbo comparison is where KVarN looks most like the "TurboQuant that was promised". That is not a formal claim about the TurboQuant paper, but a practical read of this benchmark. The old turbo modes were attractive because they reached memory sizes normal q4/q5 could not. The problem was quality leakage. But KVarN reaches the same broad memory bands with much better KLD.

On the main Q5 row, kvarn4-kvarn3 is smaller than turbo4 and better: 24.8% memory and 0.003824 mean KLD versus 25.8% and 0.004760. On the IQ4_XS rows, the same comparison holds directionally. kvarn3-kvarn3 is slightly larger than turbo3_tcq, but much better: 0.005349 versus 0.007978 mean KLD on Q5, with a much better 99.9% tail.

The 2-bit end is less clean. kvarn2-kvarn2 beats turbo2_tcq on Q5 mean KLD, but it's... bad. The tails are still large, and the IQ4_XS checks are noisy enough that I would treat it as an emergency compression setting, not the main result.

9. What This Does Not Prove

First, this does not prove KVarN has fp16 precision. It obviously does not in KLD. kvarn4-kvarn4 is around the q5 cache tier, not around bf16. That is still strong because it gets there at q4-class memory, but the distribution has moved. If an article says "fp16-level" without specifying task score, not tensor fidelity or KLD, it is being loose with the phrase.

Second, this does not prove the paper's reasoning-benchmark results. The paper is about decode-time accumulation and reports results on generative tasks such as MATH500, AIME24, HumanEval and IFEval. This page uses Wikitext PPL and KL divergence against stored bf16 baseline logits. KLD is a strong local fidelity signal, and it is much more sensitive than PPL, but it is not a replacement for running the actual long-generation tasks.

Third, this does not settle speed. The current BeeLlama implementation is too raw, and these tok/s values are prompt-processing numbers rather than generation numbers. The preview path allocates the right cache shapes and produces meaningful quality data, but it is not the final optimized kernel. I would not use today's implementation neither to reject KVarN nor to claim a speedup.

10. Practical Takeaway

For VRAM-constrained long context, KVarN looks like a real frontier shift. The old sub-6-bit recommendation was a ladder of compromises: q5 if you could afford it, q4 if you needed memory, turbo3_tcq if you needed even more memory and accepted quality loss. KVarN rearranges that.

If memory allows it, kvarn4-kvarn4 is the row I would test first. It is the cleanest q5-class-at-q4-memory result. If memory is tighter, kvarn4-kvarn3 is the interesting new middle tier: below q4 memory, above q4 fidelity, approaching q5_0-q4_0. If the goal is to beat turbo-class memory without turbo-class error, kvarn3-kvarn3 is the better compact candidate.

11. Benchmark Data

Perplexity

Q5_K_S 64k

CacheKV cache (MiB)Size vs bf16Median PPLPrecision vs bf16PPL +/-Same top pTok/sElapsed (s)
kvarn41144.0027.9%5.480799.99%0.0346597.682% +/- 0.042%787.55364.80
kvarn3888.0021.7%5.483799.93%0.0346697.069% +/- 0.047%790.92363.25
kvarn2632.0015.4%5.549998.74%0.0352394.086% +/- 0.065%796.36360.77

IQ4_XS 64k

CacheKV cache (MiB)Size vs bf16Median PPLPrecision vs bf16PPL +/-Same top pTok/sElapsed (s)
kvarn41144.0027.9%5.519899.95%0.0350098.562% +/- 0.033%825.75347.93
kvarn3888.0021.7%5.521099.93%0.0349997.608% +/- 0.042%843.84340.47
kvarn2632.0015.4%5.586398.76%0.0355494.218% +/- 0.064%846.30339.48

IQ4_XS 128k

CacheKV cache (MiB)Size vs bf16Median PPLPrecision vs bf16PPL +/-Same top pTok/sElapsed (s)
kvarn42264.0027.6%5.275199.95%0.0327198.625% +/- 0.032%631.85445.80
kvarn31752.0021.4%5.280399.85%0.0327597.721% +/- 0.041%646.56435.66
kvarn21240.0015.1%5.338298.77%0.0332394.648% +/- 0.062%651.61432.28

KL Divergence

Q5_K_S 64k

CacheKV cache (MiB)Size vs bf16Mean KLDPrecision vs bf16KLD +/-90% KLD95% KLD99% KLD99.9% KLD99.9% precision vs bf16Maximum KLDSame top pTok/sElapsed (s)
kvarn4-kvarn41144.0027.9%0.00297499.74%0.0001760.0053960.0085550.0238570.09481993.09%17.85526597.682% +/- 0.042%760.88377.59
kvarn3-kvarn41016.0024.8%0.00465299.57%0.0002290.0086630.0137300.0358580.14035888.95%13.96718997.236% +/- 0.045%770.52372.87
kvarn4-kvarn31016.0024.8%0.00382499.66%0.0002030.0070340.0111310.0295290.13502889.42%19.54113497.510% +/- 0.043%765.23375.44
kvarn2-kvarn4888.0021.7%0.01363998.68%0.0003530.0292640.0477690.1207070.41824067.37%21.64638195.234% +/- 0.059%771.78372.25
kvarn3-kvarn3888.0021.7%0.00534999.50%0.0002370.0103580.0166150.0430530.16813586.51%19.51161097.069% +/- 0.047%773.12371.61
kvarn4-kvarn2888.0021.7%0.01044999.00%0.0004250.0190610.0297730.0768750.34039272.82%19.98859696.018% +/- 0.054%765.57375.28
kvarn2-kvarn3760.0018.6%0.01458998.59%0.0003460.0317860.0511140.1309330.44501465.59%21.72279295.102% +/- 0.060%773.83371.27
kvarn3-kvarn2760.0018.6%0.01112298.93%0.0002890.0227250.0355490.0900610.34599572.42%17.62851395.738% +/- 0.056%773.65371.36
kvarn2-kvarn2632.0015.4%0.02139597.92%0.0004290.0467740.0745460.1855040.63020854.50%28.28741194.086% +/- 0.065%776.81369.85

IQ4_XS 64k

CacheKV cache (MiB)Size vs bf16Mean KLDPrecision vs bf16KLD +/-90% KLD95% KLD99% KLD99.9% KLD99.9% precision vs bf16Maximum KLDSame top pTok/sElapsed (s)
kvarn4-kvarn41144.0027.9%0.00124599.89%0.0001140.0022400.0036250.0096080.03872396.60%10.49299198.562% +/- 0.033%806.72356.13
kvarn3-kvarn41016.0024.8%0.00283399.73%0.0001900.0055130.0090840.0234760.08528692.21%17.75322797.853% +/- 0.040%818.20351.14
kvarn4-kvarn31016.0024.8%0.00229099.78%0.0002000.0038590.0063100.0170610.07404493.25%20.20384498.152% +/- 0.037%811.53354.02
kvarn2-kvarn4888.0021.7%0.01196198.82%0.0002040.0267940.0442800.1142010.37772068.83%11.51254595.618% +/- 0.057%821.22349.85
kvarn3-kvarn3888.0021.7%0.00370899.64%0.0002250.0072540.0119070.0320990.11944189.11%21.54865697.608% +/- 0.042%822.71349.21
kvarn4-kvarn2888.0021.7%0.00757699.25%0.0001900.0156760.0246070.0623980.26007677.42%13.38846296.398% +/- 0.051%813.96352.97
kvarn2-kvarn3760.0018.6%0.01301798.72%0.0002900.0285700.0468460.1218750.41218866.50%21.72863695.438% +/- 0.058%824.46348.47
kvarn3-kvarn2760.0018.6%0.00904699.11%0.0001350.0194620.0307440.0790350.31861873.02%8.40475695.983% +/- 0.054%824.23348.57
kvarn2-kvarn2632.0015.4%0.01969398.06%0.0002790.0443480.0705860.1805580.59558855.35%12.98286294.218% +/- 0.064%824.45348.47

IQ4_XS 128k

CacheKV cache (MiB)Size vs bf16Mean KLDPrecision vs bf16KLD +/-90% KLD95% KLD99% KLD99.9% KLD99.9% precision vs bf16Maximum KLDSame top pTok/sElapsed (s)
kvarn4-kvarn42264.0027.6%0.00092899.91%0.0000110.0020180.0032190.0082770.03013697.04%0.83520398.625% +/- 0.032%614.31458.53
kvarn3-kvarn42008.0024.5%0.00218599.78%0.0000260.0049660.0081470.0218400.07267593.00%1.95691997.963% +/- 0.039%624.77450.85
kvarn4-kvarn32008.0024.5%0.00154399.85%0.0000170.0033770.0055020.0146990.05431794.72%1.11303998.352% +/- 0.035%616.44456.95
kvarn2-kvarn41752.0021.4%0.00991399.01%0.0000820.0231620.0383120.1057790.35041870.44%2.71553395.842% +/- 0.055%626.81449.39
kvarn3-kvarn31752.0021.4%0.00277699.72%0.0000230.0063210.0104810.0281740.09888290.59%0.61380697.721% +/- 0.041%627.52448.88
kvarn4-kvarn21752.0021.4%0.00618399.38%0.0000500.0139740.0219540.0551070.20257381.67%1.95429596.714% +/- 0.049%618.51455.42
kvarn2-kvarn31496.0018.3%0.01073798.93%0.0000900.0252210.0419050.1117610.36861369.17%3.49815895.653% +/- 0.056%630.69446.62
kvarn3-kvarn21496.0018.3%0.00755999.25%0.0000600.0172520.0273710.0683260.24488878.28%2.00895596.293% +/- 0.052%630.77446.56
kvarn2-kvarn21240.0015.1%0.01645298.37%0.0001280.0383500.0617900.1601030.54023458.26%3.97756094.648% +/- 0.062%631.58445.99

12. Reference Data

Perplexity

Q5_K_S 64k

CacheKV cache (MiB)Size vs bf16Median PPLPrecision vs bf16PPL +/-Same top pTok/sElapsed (s)
bf164096.00100.0%5.4800100.00%0.0346599.647% +/- 0.016%851.75326.30
q8_02176.0053.1%5.4774100.05%0.0346597.942% +/- 0.039%851.57331.27
q6_01664.0040.6%5.4778100.04%0.0346597.890% +/- 0.040%852.96336.74
q5_11536.0037.5%5.4777100.04%0.0346497.787% +/- 0.041%848.27332.64
q5_01408.0034.4%5.4802100.00%0.0346697.707% +/- 0.041%848.36332.45
q4_11280.0031.3%5.480899.99%0.0346797.259% +/- 0.045%853.49330.43
q4_01152.0028.1%5.487799.86%0.0347397.179% +/- 0.046%853.50330.76
turbo41056.0025.8%5.484199.93%0.0346897.037% +/- 0.047%705.06395.16
turbo3_tcq832.0020.3%5.505499.54%0.0348096.265% +/- 0.052%794.21353.31
turbo3800.0019.5%5.514999.37%0.0349395.517% +/- 0.057%802.71344.83
turbo2_tcq576.0014.1%5.570598.38%0.0356693.456% +/- 0.068%805.25348.78
turbo2544.0013.3%5.640397.16%0.0358191.646% +/- 0.076%840.34335.07

IQ4_XS 64k

CacheKV cache (MiB)Size vs bf16Median PPLPrecision vs bf16PPL +/-Same top pTok/sElapsed (s)
bf164096.00100.0%5.5169100.00%0.0349799.776% +/- 0.013%909.83336.43
q8_02176.0053.1%5.5157100.02%0.0349998.950% +/- 0.028%910.43309.49
q6_01664.0040.6%5.5171100.00%0.0350098.878% +/- 0.029%922.50311.45
q5_11536.0037.5%5.518199.98%0.0350198.618% +/- 0.032%906.38310.47
q5_01408.0034.4%5.517599.99%0.0350098.553% +/- 0.033%906.88310.27
q4_11280.0031.3%5.523799.88%0.0350597.880% +/- 0.040%911.35308.85
q4_01152.0028.1%5.525199.85%0.0350597.793% +/- 0.041%912.54308.61
turbo41056.0025.8%5.527799.80%0.0350897.652% +/- 0.042%746.00372.98
turbo3_tcq832.0020.3%5.542699.54%0.0351396.569% +/- 0.050%844.33331.64
turbo3800.0019.5%5.556199.29%0.0353395.746% +/- 0.056%882.76318.29
turbo2_tcq576.0014.1%5.608598.37%0.0359993.669% +/- 0.067%858.66326.24
turbo2544.0013.3%5.682397.09%0.0362191.865% +/- 0.076%898.66312.75

IQ4_XS 128k

CacheKV cache (MiB)Size vs bf16Median PPLPrecision vs bf16PPL +/-Same top pTok/sElapsed (s)
bf168192.00100.0%5.2724100.00%0.0326999.995% +/- 0.002%703.61389.94
q8_04352.0053.1%5.2716100.02%0.0327198.950% +/- 0.028%707.02387.84
q6_03328.0040.6%5.272999.99%0.0327298.855% +/- 0.029%720.53390.94
q5_13072.0037.5%5.2723100.00%0.0327198.603% +/- 0.032%702.53390.33
q5_02816.0034.4%5.273899.97%0.0327298.543% +/- 0.033%703.18390.05
q4_12560.0031.3%5.277299.91%0.0327697.961% +/- 0.039%708.53387.23
q4_02304.0028.1%5.280399.85%0.0327697.793% +/- 0.041%709.40386.58
turbo42112.0025.8%5.282299.81%0.0328197.639% +/- 0.042%520.46520.82
turbo3_tcq1664.0020.3%5.298599.51%0.0328196.591% +/- 0.050%647.40421.95
turbo31600.0019.5%5.308499.32%0.0330195.861% +/- 0.055%689.73396.86
turbo2_tcq1152.0014.1%5.351398.53%0.0336393.807% +/- 0.067%655.44416.90
turbo21088.0013.3%5.428797.12%0.0338692.033% +/- 0.075%696.53393.24

KL Divergence

Q5_K_S 64k

CacheKV cache (MiB)Size vs bf16Mean KLDPrecision vs bf16KLD +/-90% KLD95% KLD99% KLD99.9% KLD99.9% precision vs bf16Maximum KLDSame top pTok/sElapsed (s)
bf164096.00100.0%0.000375100.00%0.0000580.0005680.0016930.0052340.023258100.00%7.37404699.647% +/- 0.016%850.81337.59
bf16-q8_03136.0076.6%0.00247599.79%0.0001710.0042090.0066290.0192750.07982794.50%14.72976597.991% +/- 0.039%850.62337.75
bf16-q6_02880.0070.3%0.00239399.80%0.0001010.0043020.0067880.0194160.07885394.59%6.56477097.961% +/- 0.039%848.99338.40
bf16-q5_12816.0068.8%0.00244099.79%0.0000900.0045670.0071710.0200560.09680592.91%9.26014197.903% +/- 0.040%848.20338.72
bf16-q5_02752.0067.2%0.00249199.79%0.0001280.0045990.0071100.0206640.08975393.57%13.07955397.829% +/- 0.040%848.53338.58
bf16-q4_12688.0065.6%0.00322199.72%0.0002080.0056610.0086820.0234040.10592392.07%20.04471697.637% +/- 0.042%853.30336.69
bf16-q4_02624.0064.1%0.00333999.70%0.0001870.0059520.0091360.0254300.11016691.68%19.45288197.594% +/- 0.042%854.17336.35
bf16-turbo42576.0062.9%0.00354999.68%0.0001630.0065320.0098490.0265390.10105492.52%11.93411797.473% +/- 0.043%843.17340.74
bf16-turbo3_tcq2464.0060.2%0.00537799.50%0.0002810.0098150.0145090.0368720.14617588.43%23.30722296.962% +/- 0.047%823.83348.74
q8_02176.0053.1%0.00232899.80%0.0001250.0042330.0066560.0196690.07870994.61%14.35599697.942% +/- 0.039%851.11337.66
q8_0-q6_01920.0046.9%0.00249999.79%0.0001840.0042950.0067080.0193810.08161694.33%17.81619697.942% +/- 0.039%848.78338.40
q8_0-q5_11856.0045.3%0.00252999.78%0.0001430.0045570.0071080.0203460.08288094.21%15.36768397.884% +/- 0.040%828.63346.71
q8_0-q5_01792.0043.8%0.00265699.77%0.0001680.0046730.0073480.0210390.08848693.69%17.98765097.826% +/- 0.040%847.33338.90
q8_0-q4_11728.0042.2%0.00308099.73%0.0001150.0056450.0085870.0233900.09908092.70%8.07323197.655% +/- 0.042%786.54364.58
q8_0-q4_01664.0040.6%0.00331699.71%0.0001650.0059760.0090750.0248920.10468092.18%13.48150697.532% +/- 0.043%849.37338.13
q6_01664.0040.6%0.00261499.78%0.0001800.0044260.0069490.0200780.09080093.47%14.11258697.890% +/- 0.040%845.96339.52
q8_0-turbo41616.0039.5%0.00356199.68%0.0002150.0065180.0098340.0264260.10304192.33%23.10272497.460% +/- 0.043%838.90342.38
q6_0-q5_11600.0039.1%0.00278199.76%0.0002280.0046820.0073480.0209980.09044793.50%23.77049197.913% +/- 0.039%846.24339.41
q5_11536.0037.5%0.00291199.75%0.0001670.0050450.0079160.0226040.09835492.77%13.39706897.787% +/- 0.041%841.65341.63
q6_0-q5_01536.0037.5%0.00282099.76%0.0002090.0047480.0074570.0218830.09268293.29%23.18686797.788% +/- 0.041%846.86339.16
q8_0-turbo3_tcq1504.0036.7%0.00509099.53%0.0001880.0097360.0144010.0370560.14938788.15%20.12875296.899% +/- 0.048%817.57350.23
q6_0-q4_11472.0035.9%0.00331299.71%0.0002320.0057550.0088470.0243870.10458292.19%23.24465997.605% +/- 0.042%848.42338.54
q5_01408.0034.4%0.00320699.72%0.0002860.0052320.0081940.0227590.09907392.70%22.61989297.707% +/- 0.041%849.79338.00
q5_1-q4_11408.0034.4%0.00338099.70%0.0001950.0061400.0094790.0258860.09501193.08%21.39401197.529% +/- 0.043%846.27339.25
q6_0-q4_01408.0034.4%0.00328899.71%0.0001290.0060960.0092940.0254560.11156691.55%10.71110097.524% +/- 0.043%848.24338.61
q6_0-turbo41360.0033.2%0.00374899.66%0.0002240.0066420.0099970.0269020.10737791.93%16.44510397.465% +/- 0.043%837.77342.84
q5_0-q4_11344.0032.8%0.00347199.69%0.0002060.0063100.0095820.0258290.09961892.65%21.86311797.539% +/- 0.043%847.59339.65
q5_1-q4_01344.0032.8%0.00362699.68%0.0002120.0064410.0097730.0256680.10864991.82%15.80972697.515% +/- 0.043%846.91339.23
q4_11280.0031.3%0.00447699.59%0.0002670.0077160.0119010.0311660.14181388.82%18.15086997.259% +/- 0.045%854.33336.49
q5_0-q4_01280.0031.3%0.00358199.68%0.0001740.0066000.0100580.0274230.11333291.39%14.93859997.437% +/- 0.044%847.64338.79
q6_0-turbo3_tcq1248.0030.5%0.00537999.50%0.0002470.0099060.0145560.0372850.15468087.68%19.73954896.922% +/- 0.048%819.23350.60
q5_0-turbo41232.0030.1%0.00381299.66%0.0001760.0070680.0107350.0282030.11224991.49%17.03202497.371% +/- 0.044%837.52342.95
q5_1-turbo3_tcq1184.0028.9%0.00559499.48%0.0002910.0102640.0153240.0381750.14459188.57%24.68442996.878% +/- 0.048%816.05350.73
q4_01152.0028.1%0.00471199.57%0.0003010.0084390.0129490.0336630.13041989.84%21.63613597.179% +/- 0.046%855.08336.11
q5_0-turbo3_tcq1120.0027.3%0.00547199.49%0.0002650.0102590.0152290.0382140.15851487.35%22.26880196.865% +/- 0.048%815.80350.94
q5_0-turbo31104.0027.0%0.00709799.33%0.0002590.0137470.0202590.0487610.19242884.44%18.09429696.331% +/- 0.052%837.90342.47
q4_1-turbo3_tcq1056.0025.8%0.00618499.42%0.0002920.0116520.0173200.0429970.17483185.94%25.07903596.663% +/- 0.050%816.95350.43
turbo41056.0025.8%0.00476099.55%0.0002010.0090460.0136920.0352050.13837089.13%13.96749497.037% +/- 0.047%705.32401.18
q4_0-turbo3_tcq992.0024.2%0.00626999.41%0.0002700.0122200.0181730.0454210.18657284.93%23.15737596.622% +/- 0.050%821.89349.67
q4_0-turbo3976.0023.8%0.00823599.22%0.0003360.0155760.0228280.0565270.22215481.96%24.35326896.075% +/- 0.054%839.29341.78
q4_0-turbo2_tcq864.0021.1%0.01516898.53%0.0002880.0318260.0455690.1054610.39524468.94%20.74323894.591% +/- 0.062%826.07347.04
turbo3_tcq832.0020.3%0.00797899.24%0.0002670.0156630.0236280.0582860.22710481.56%20.51747196.265% +/- 0.052%795.20359.09
turbo3800.0019.5%0.01118198.93%0.0003040.0228050.0342090.0820150.29606076.12%22.97721195.517% +/- 0.057%836.75342.73
turbo3_tcq-turbo2_tcq704.0017.2%0.01638698.41%0.0002830.0341860.0491330.1150720.43704366.11%18.27553294.379% +/- 0.064%796.16358.86
turbo3-turbo2672.0016.4%0.02398597.67%0.0004030.0501000.0728500.1682580.60508755.89%20.81255393.154% +/- 0.070%831.88344.85
turbo2_tcq576.0014.1%0.02307397.76%0.0004200.0487770.0718650.1703500.63240154.38%24.77132093.456% +/- 0.068%807.25354.12
turbo2544.0013.3%0.03623096.48%0.0004650.0789420.1175450.2764380.90357641.47%26.50826391.646% +/- 0.076%842.29340.66

IQ4_XS 64k

CacheKV cache (MiB)Size vs bf16Mean KLDPrecision vs bf16KLD +/-90% KLD95% KLD99% KLD99.9% KLD99.9% precision vs bf16Maximum KLDSame top pTok/sElapsed (s)
bf164096.00100.0%0.000097100.00%0.0000200.0001860.0003980.0010620.004152100.00%2.34505699.776% +/- 0.013%909.80315.80
bf16-q8_03136.0076.6%0.00058799.95%0.0000920.0009240.0014080.0040180.01614898.81%11.61589298.952% +/- 0.028%909.80315.78
bf16-q6_02880.0070.3%0.00063699.95%0.0000790.0010290.0015560.0041480.01700598.72%8.44026698.875% +/- 0.029%908.22316.33
bf16-q5_12816.0068.8%0.00075799.93%0.0000740.0012830.0019210.0050450.02020698.41%8.46264998.798% +/- 0.030%907.39316.62
bf16-q5_02752.0067.2%0.00087399.92%0.0001010.0013840.0020880.0056880.02239498.19%10.60526798.764% +/- 0.031%908.46316.25
bf16-q4_12688.0065.6%0.00135799.87%0.0000950.0024550.0036070.0089770.03474196.99%10.62426898.392% +/- 0.035%913.53314.50
bf16-q4_02624.0064.1%0.00145999.86%0.0000730.0027560.0040210.0101210.03967696.51%6.61931398.367% +/- 0.035%914.07314.31
bf16-turbo42576.0062.9%0.00177099.83%0.0000970.0032860.0047410.0116790.04312796.18%8.78224898.144% +/- 0.037%901.27318.77
bf16-turbo3_tcq2464.0060.2%0.00325699.68%0.0000920.0065740.0094630.0222780.08444892.28%8.79170297.398% +/- 0.044%879.05326.83
q8_02176.0053.1%0.00057799.95%0.0000730.0009330.0014280.0040500.01737298.69%8.13080798.950% +/- 0.028%912.71314.76
q8_0-q6_01920.0046.9%0.00065999.94%0.0000930.0010320.0015780.0044190.01867098.56%10.62567298.906% +/- 0.029%908.70316.18
q8_0-q5_11856.0045.3%0.00083699.93%0.0001100.0012910.0019290.0052720.02154498.28%11.86198998.814% +/- 0.030%895.23320.91
q8_0-q5_01792.0043.8%0.00088199.92%0.0001070.0013920.0020920.0055740.02243598.19%10.86761498.714% +/- 0.031%906.00316.81
q8_0-q4_11728.0042.2%0.00131799.88%0.0001220.0024360.0035720.0089740.03470696.99%6.67526498.357% +/- 0.035%818.78346.30
q8_0-q4_01664.0040.6%0.00160699.85%0.0001180.0027930.0040840.0099690.03929996.55%8.99398698.309% +/- 0.036%908.09316.08
q6_01664.0040.6%0.00076699.93%0.0001090.0011790.0017910.0047620.02040798.39%11.36899598.878% +/- 0.029%906.47316.96
q8_0-turbo41616.0039.5%0.00184599.83%0.0001080.0033110.0047870.0115510.04612495.89%8.48830998.147% +/- 0.037%898.63319.72
q6_0-q5_11600.0039.1%0.00088299.92%0.0000990.0014310.0021690.0057460.02196898.23%10.72814898.772% +/- 0.030%906.67316.89
q5_11536.0037.5%0.00101999.91%0.0000750.0017870.0027240.0072620.02985497.46%6.70749398.618% +/- 0.032%907.45316.44
q6_0-q5_01536.0037.5%0.00093399.92%0.0001030.0015190.0022690.0060440.02358898.08%10.47576698.666% +/- 0.032%906.68316.89
q8_0-turbo3_tcq1504.0036.7%0.00333699.68%0.0001190.0065800.0094510.0223740.08481892.25%11.13049997.411% +/- 0.044%871.88328.10
q6_0-q4_11472.0035.9%0.00148899.87%0.0001150.0025930.0038300.0097630.03758196.71%10.88983598.378% +/- 0.035%908.06316.40
q5_01408.0034.4%0.00113599.90%0.0000880.0020280.0031130.0081130.03134897.32%8.00191398.553% +/- 0.033%908.72315.93
q5_1-q4_11408.0034.4%0.00168399.84%0.0001400.0029280.0043290.0109560.03897696.58%11.20368998.302% +/- 0.036%906.39316.62
q6_0-q4_01408.0034.4%0.00155599.85%0.0001130.0028930.0042000.0103920.03960196.52%11.93431098.279% +/- 0.036%909.10316.04
q6_0-turbo41360.0033.2%0.00193399.82%0.0001180.0034450.0050010.0120210.04472296.02%8.84209798.076% +/- 0.038%896.11320.62
q5_0-q4_11344.0032.8%0.00152999.86%0.0000370.0030730.0045930.0117950.04282896.21%2.32893398.227% +/- 0.036%905.93316.98
q5_1-q4_01344.0032.8%0.00181399.83%0.0001630.0032360.0047820.0117420.04289396.20%18.07721398.160% +/- 0.037%905.51317.08
q4_11280.0031.3%0.00231699.78%0.0001040.0044410.0067760.0167340.06885893.73%8.87420497.880% +/- 0.040%913.32314.50
q5_0-q4_01280.0031.3%0.00193699.82%0.0001470.0033680.0050130.0124190.04439396.06%14.36477998.125% +/- 0.037%906.57316.61
q6_0-turbo3_tcq1248.0030.5%0.00341299.67%0.0001210.0067020.0096420.0228860.08987491.78%9.69500397.394% +/- 0.044%874.99328.36
q5_0-turbo41232.0030.1%0.00212299.80%0.0001310.0039130.0057690.0139950.05231595.30%11.28628997.977% +/- 0.039%895.90320.70
q5_1-turbo3_tcq1184.0028.9%0.00356099.65%0.0001300.0070250.0101760.0240770.08870691.89%10.15408197.304% +/- 0.045%870.78328.52
q4_01152.0028.1%0.00275999.73%0.0001410.0052190.0079500.0198620.07666393.01%10.04576497.793% +/- 0.041%914.36314.20
q5_0-turbo3_tcq1120.0027.3%0.00360099.65%0.0000990.0071980.0104850.0247520.10210990.67%9.03382097.295% +/- 0.045%869.81328.94
q5_0-turbo31104.0027.0%0.00520999.49%0.0001500.0106020.0152450.0365800.13435987.79%11.50617496.750% +/- 0.049%894.86320.42
q4_1-turbo3_tcq1056.0025.8%0.00422699.59%0.0001070.0084650.0125130.0296180.12185488.90%8.72381897.117% +/- 0.046%871.93328.15
turbo41056.0025.8%0.00298899.71%0.0001300.0058810.0088390.0218680.07636393.03%9.16818397.652% +/- 0.042%744.85379.35
q4_0-turbo3_tcq992.0024.2%0.00446699.56%0.0001230.0090970.0134700.0322880.10866290.08%9.38369697.067% +/- 0.047%871.33328.57
q4_0-turbo3976.0023.8%0.00600799.41%0.0001360.0123520.0180440.0424280.16164485.43%11.09244696.508% +/- 0.051%897.34319.56
q4_0-turbo2_tcq864.0021.1%0.01359598.66%0.0002040.0286870.0413210.0953930.36782569.51%14.22252294.742% +/- 0.062%881.14325.07
turbo3_tcq832.0020.3%0.00603899.41%0.0001270.0127140.0190580.0462190.17248084.51%9.41598596.569% +/- 0.050%845.55337.45
turbo3800.0019.5%0.00910299.10%0.0001640.0195020.0293110.0682660.23647279.27%10.84707795.746% +/- 0.056%894.11320.59
turbo3_tcq-turbo2_tcq704.0017.2%0.01446198.57%0.0001650.0310140.0453300.1045590.37485469.02%10.15083294.578% +/- 0.063%847.59336.76
turbo3-turbo2672.0016.4%0.02216897.82%0.0002710.0466980.0680080.1603270.60264954.96%18.98519193.434% +/- 0.068%884.98323.44
turbo2_tcq576.0014.1%0.02073997.96%0.0002300.0454970.0680260.1612560.53819058.62%15.58235293.669% +/- 0.067%861.17331.75
turbo2544.0013.3%0.03438096.63%0.0003400.0758760.1137340.2655350.89538541.01%19.48207991.865% +/- 0.076%901.01318.44

IQ4_XS 128k

CacheKV cache (MiB)Size vs bf16Mean KLDPrecision vs bf16KLD +/-90% KLD95% KLD99% KLD99.9% KLD99.9% precision vs bf16Maximum KLDSame top pTok/sElapsed (s)
bf168192.00100.0%0.000000100.00%0.0000000.0000150.0000230.0000370.000051100.00%0.00006799.995% +/- 0.002%702.50400.97
bf16-q8_06272.0076.6%0.00047999.95%0.0000110.0009710.0014730.0039850.01495798.52%1.25558398.953% +/- 0.028%705.73399.13
bf16-q6_05760.0070.3%0.00052899.95%0.0000110.0010800.0016400.0042610.01691598.33%1.24646498.920% +/- 0.029%703.90400.17
bf16-q5_15632.0068.8%0.00064399.94%0.0000070.0013180.0019810.0051920.01960498.06%0.45965398.803% +/- 0.030%703.75400.25
bf16-q5_05504.0067.2%0.00068099.93%0.0000070.0014100.0020940.0053330.02099597.93%0.44867398.758% +/- 0.031%703.91400.16
bf16-q4_15376.0065.6%0.00114599.89%0.0000110.0024390.0035580.0083890.03061796.99%0.85612498.425% +/- 0.034%709.30397.13
bf16-q4_05248.0064.1%0.00129599.87%0.0000160.0027730.0040270.0094340.03489796.58%1.48167098.347% +/- 0.035%710.30396.57
bf16-turbo45152.0062.9%0.00153399.85%0.0000130.0033040.0047790.0112210.04151095.94%0.50520798.119% +/- 0.038%697.20404.02
bf16-turbo3_tcq4928.0060.2%0.00319199.68%0.0000270.0068560.0098580.0231020.08619691.75%1.07173297.420% +/- 0.044%678.44415.19
q8_04352.0053.1%0.00048299.95%0.0000070.0009830.0015080.0040610.01495198.52%0.47817598.950% +/- 0.028%708.31397.81
q8_0-q6_03840.0046.9%0.00053099.95%0.0000060.0010930.0016520.0043350.01740898.28%0.46899898.900% +/- 0.029%703.96400.14
q8_0-q5_13712.0045.3%0.00065199.93%0.0000120.0013350.0020100.0051610.01891898.13%1.27021298.779% +/- 0.030%694.87405.20
q8_0-q5_03584.0043.8%0.00070399.93%0.0000130.0014330.0021410.0055230.02036097.99%1.16698698.757% +/- 0.031%702.31400.84
q8_0-q4_13456.0042.2%0.00114999.89%0.0000120.0024530.0035820.0085680.02997097.05%0.96473398.407% +/- 0.035%637.52440.27
q8_0-q4_03328.0040.6%0.00129599.87%0.0000160.0027650.0039980.0095870.03574196.49%1.61493198.304% +/- 0.036%706.17398.89
q6_03328.0040.6%0.00058999.94%0.0000080.0012120.0018520.0047060.01917598.11%0.57320098.855% +/- 0.029%701.36401.62
q8_0-turbo43232.0039.5%0.00155499.84%0.0000140.0033350.0048100.0114730.04100695.99%0.63325298.171% +/- 0.037%694.04405.86
q6_0-q5_13200.0039.1%0.00070399.93%0.0000110.0014520.0021800.0055440.02182197.85%1.14590598.740% +/- 0.031%701.05401.80
q5_13072.0037.5%0.00082799.92%0.0000080.0017920.0026870.0067640.02329197.70%0.49684698.603% +/- 0.032%702.81400.71
q6_0-q5_03072.0037.5%0.00075299.92%0.0000130.0015520.0023110.0058720.02222797.81%1.08344598.700% +/- 0.031%701.69401.43
q8_0-turbo3_tcq3008.0036.7%0.00316799.68%0.0000290.0067910.0098600.0229350.08135092.19%2.32976497.407% +/- 0.044%672.90417.21
q6_0-q4_12944.0035.9%0.00119199.88%0.0000090.0025830.0037910.0088750.03103296.95%0.43163098.412% +/- 0.035%703.37400.48
q5_02816.0034.4%0.00092699.91%0.0000070.0020370.0030670.0074680.02741097.30%0.42794998.543% +/- 0.033%704.01400.14
q5_1-q4_12816.0034.4%0.00133599.87%0.0000130.0028840.0042210.0100470.03506296.56%0.85043998.334% +/- 0.035%703.75400.05
q6_0-q4_02816.0034.4%0.00131799.87%0.0000100.0028750.0041830.0097840.03186396.87%0.50489998.275% +/- 0.036%704.62399.76
q6_0-turbo42720.0033.2%0.00159099.84%0.0000160.0034350.0049790.0118590.03900196.18%1.39916198.063% +/- 0.038%691.93407.09
q5_0-q4_12688.0032.8%0.00138799.86%0.0000130.0030170.0044370.0104110.03570696.50%0.97147698.243% +/- 0.036%702.90400.58
q5_1-q4_02688.0032.8%0.00148599.85%0.0000140.0032000.0047460.0113420.04053096.03%0.97673898.200% +/- 0.037%702.58400.94
q4_12560.0031.3%0.00193399.81%0.0000130.0043180.0065950.0160460.05091895.04%0.43583797.961% +/- 0.039%709.38397.11
q5_0-q4_02560.0031.3%0.00152999.85%0.0000160.0033160.0048680.0116400.03903396.18%1.11660698.211% +/- 0.037%704.22399.88
q6_0-turbo3_tcq2496.0030.5%0.00323899.68%0.0000310.0069220.0099330.0232710.08734191.64%2.34683297.388% +/- 0.044%673.13418.46
q5_0-turbo42464.0030.1%0.00180999.82%0.0000230.0038910.0057420.0134250.04732895.38%2.02532398.004% +/- 0.039%691.62407.28
q5_1-turbo3_tcq2368.0028.9%0.00336099.66%0.0000290.0072290.0104960.0251040.08947491.45%2.23717097.375% +/- 0.044%670.63418.53
q4_02304.0028.1%0.00225999.77%0.0000170.0050580.0076970.0185050.05830194.34%1.07467197.793% +/- 0.041%710.51396.57
q5_0-turbo3_tcq2240.0027.3%0.00339199.66%0.0000300.0073210.0105670.0244220.09090191.32%2.25298797.384% +/- 0.044%670.54418.68
q5_0-turbo32208.0027.0%0.00472899.53%0.0000350.0103750.0147670.0341980.12134088.58%1.80996496.732% +/- 0.049%693.49405.71
q4_1-turbo3_tcq2112.0025.8%0.00398199.60%0.0000340.0086120.0127010.0301110.11281289.34%2.19368697.182% +/- 0.046%672.84417.30
turbo42112.0025.8%0.00260599.74%0.0000240.0057010.0086530.0203670.07190293.07%1.17926397.639% +/- 0.042%519.76531.96
q4_0-turbo3_tcq1984.0024.2%0.00413199.59%0.0000320.0090620.0134010.0314350.11229789.38%2.13901697.078% +/- 0.047%671.63418.13
q4_0-turbo31952.0023.8%0.00548899.45%0.0000400.0120730.0176440.0400470.14515886.49%1.54579596.549% +/- 0.050%695.78404.37
q4_0-turbo2_tcq1728.0021.1%0.01332998.68%0.0000900.0298050.0427930.0968110.30663073.60%6.77972694.713% +/- 0.062%678.10414.30
turbo3_tcq1664.0020.3%0.00570899.43%0.0000450.0127480.0194410.0453200.15014486.06%2.07951096.591% +/- 0.050%647.62432.43
turbo31600.0019.5%0.00833499.17%0.0000570.0190230.0285960.0661570.20746881.27%2.45483495.861% +/- 0.055%691.41406.63
turbo3_tcq-turbo2_tcq1408.0017.2%0.01434498.58%0.0000860.0322430.0467370.1052690.34395170.90%4.01086694.530% +/- 0.063%648.06432.26
turbo3-turbo21344.0016.4%0.02046897.97%0.0001540.0455140.0659760.1477450.47441562.23%11.93838793.540% +/- 0.068%686.57409.17
turbo2_tcq1152.0014.1%0.01985798.03%0.0001220.0454910.0683980.1587660.45376163.53%4.37008593.807% +/- 0.067%656.84426.67
turbo21088.0013.3%0.03263196.79%0.0002030.0738330.1117650.2616070.83811343.25%4.73564292.033% +/- 0.075%698.21402.96