GitHub Issue
No Difference in tokens/sec - Ministral3 8B Q5_K_M
I used the repo to rebuild llama-cpp from scratch into a different destination than my original llama-cpp install. I am comparing the performance of the same base model, run with identical command-line parameters via llama-server -m, for turbo3 and turbo4. I am not seeing any improvement in tokens/second between the two; in fact, generation speed is better before the rebuild than after. I am running on a Mac M1 with 32 GB RAM.
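A minimal sketch of how the two builds might be compared without server overhead, assuming each tree produces a llama-bench binary and using a hypothetical model path (adjust paths, thread, and GPU-layer flags to your setup):

```sh
# Hypothetical path to the quantized model; adjust to your local file.
MODEL=~/models/ministral-8b-q5_k_m.gguf

# Benchmark the original (turbo3) build: 512 prompt tokens, 128 generated tokens.
~/llama.cpp-turbo3/llama-bench -m "$MODEL" -p 512 -n 128

# Benchmark the rebuilt (turbo4) tree with identical settings for a fair comparison.
~/llama.cpp-turbo4/llama-bench -m "$MODEL" -p 512 -n 128
```

llama-bench reports prompt-processing and generation tokens/second directly, which takes server and client overhead out of the comparison.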