I mean, it's not the worst #ai result on my computer, but it's far from good

$ llama-bench -m qwen2.5-coder-3b-instruct-q6_k.gguf -t 3 --cpu-strict 1
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = Intel(R) HD Graphics 630 (KBL GT2) (Intel open-source Mesa driver) | uma: 1 | fp16: 1 | bf16: 0 | warp size: 32 | shared memory: 65536 | int dot: 0 | matrix cores: none
| model                          |       size |     params | backend    | ngl | threads | cpu_strict |            test |                  t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | ---------: | --------------: | -------------------: |
| qwen2 3B Q6_K                  |   2.60 GiB |     3.40 B | Vulkan     |  99 |       3 |          1 |           pp512 |         25.32 ± 0.01 |
| qwen2 3B Q6_K                  |   2.60 GiB |     3.40 B | Vulkan     |  99 |       3 |          1 |           tg128 |          4.51 ± 0.00 |

build: unknown (7709)
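
To put those two rows in wall-clock terms, here's a rough sketch. It's just my own arithmetic on the t/s values from the table, assuming throughput stays steady for the whole run; the helper name `seconds_for` is made up for illustration.

```python
# Throughput figures copied from the llama-bench table above (tokens per second).
PP512_TPS = 25.32  # prompt processing, 512-token prompt
TG128_TPS = 4.51   # text generation, 128 tokens


def seconds_for(tokens: int, tokens_per_second: float) -> float:
    """Seconds needed to process `tokens` at a steady rate (hypothetical helper)."""
    return tokens / tokens_per_second


prompt_s = seconds_for(512, PP512_TPS)
gen_s = seconds_for(128, TG128_TPS)

print(f"prompt (512 tokens): {prompt_s:.1f} s")
print(f"generation (128 tokens): {gen_s:.1f} s")
```

So roughly 20 seconds to read a 512-token prompt and close to half a minute to write 128 tokens.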