informatique:ai_lm:gpu_bench
Differences
Below are the differences between two revisions of the page.
| informatique:ai_lm:gpu_bench [20/04/2026 16:50] – [GPU Bench] cyrille | informatique:ai_lm:gpu_bench [30/04/2026 17:45] (current version) – [GPU Bench] cyrille | ||
|---|---|---|---|
| Line 3: | Line 3: | ||
| * [[https:// | * [[https:// | ||
| + | * Gigabyte Windforce OC 12 GB GeForce RTX 3060, **€354 incl. VAT**, new, 2025-11 | ||
| + | * PNY OC 16 GB GeForce RTX 5060 Ti, **€450 incl. VAT**, new, 2025-11 | ||
| AI benchmark for [[https:// | AI benchmark for [[https:// | ||
| Line 19: | Line 21: | ||
| * Prompt processing: b128, b256, b512 : '' | * Prompt processing: b128, b256, b512 : '' | ||
| - | ^ models | + | ^ models |
| - | ^ ^ | + | ^ |
| - | | Qwen2.5-coder-7b-instruct-q5_k_m | tg128 | 5.47 | 57.65 | | + | | Qwen2.5-coder-7b-instruct-q5_k_m |
| - | | //size: 5.07 GiB// | + | | //size: 5.07 GiB// | tg256 | ... | |
| - | | | tg512 | | + | | |
| - | | | b128 | | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | Qwen2.5-coder-7b-instruct-q8_0 | + | | Qwen2.5-coder-7b-instruct-q8_0 |
| - | | //size: 7.54 GiB// | + | | //size: 7.54 GiB// | tg256 | ... | |
| - | | | tg512 | | + | | |
| - | | | b128 | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | EuroLLM-9B-Instruct-Q4_0 | + | | EuroLLM-9B-Instruct-Q4_0 |
| - | | //size: 4.94 GiB// | + | | //size: 4.94 GiB// | tg256 | ... | |
| - | | | tg512 | | + | | |
| - | | | b128 | | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | Qwen3-14B-UD-Q5_K_XL | + | | Qwen3-14B-UD-Q5_K_XL |
| - | | //size: 9.82 GiB// | + | | //size: 9.82 GiB// | tg256 | ... | |
| - | | | tg512 | | + | | |
| - | | | b128 | | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | Qwen3-4B-UD-Q8_K_XL | + | | Qwen3-4B-UD-Q8_K_XL |
| - | | //size: 4.70 GiB// | + | | //size: 4.70 GiB// | tg256 | |
| - | | | tg512 | 6.24 | 54.56 | | + | | |
| - | | | b128 | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | GemmaCoder3-12B-IQ4_NL.gguf | + | | GemmaCoder3-12B-IQ4_NL.gguf |
| - | | //size: 6.41 GiB// | + | | //size: 6.41 GiB// | tg256 | ... | |
| - | | | tg512 | | + | | |
| - | | | b128 | | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | Gemma3-Code-Reasoning-4B.Q8_0 | + | | Gemma3-Code-Reasoning-4B.Q8_0 |
| - | | //size: 3.84 GiB// | + | | //size: 3.84 GiB// | tg256 | ... | |
| - | | | tg512 | | + | | |
| - | | | b128 | | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | GemmaCoder3-12B-Q5_K_M | + | | GemmaCoder3-12B-Q5_K_M |
| - | | //size: 7.86 GiB// | + | | //size: 7.86 GiB// | tg256 | ... | |
| - | | | tg512 | | + | | |
| - | | | b128 | | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | gpt-oss 20B MXFP4 MoE | tg128 | | + | | gpt-oss 20B MXFP4 MoE |
| - | | gpt-oss-20b-mxfp4.gguf | + | | gpt-oss-20b-mxfp4.gguf |
| - | | //size: 11.27 GiB// | tg512 | | + | | //size: 11.27 GiB// |
| - | | | b128 | | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| - | | gpt-oss 20B Q4_K - Medium | + | | gpt-oss 20B Q4_K - Medium |
| - | | gpt-oss-20b-UD-Q4_K_XL.gguf | + | | gpt-oss-20b-UD-Q4_K_XL.gguf |
| - | | //size: 11.04 GiB// | tg512 | | + | | //size: 11.04 GiB// |
| - | | | b128 | | + | | |
| - | | | b256 | | + | | |
| - | | | b512 | | + | | |
| Line 298: | Line 300: | ||
| </ | </ | ||
| - | **But no**, it worked fine with '' | + | **But no**, it worked fine with '' |
| < | < | ||
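
The two cards added near the top of this revision are listed with a price and a VRAM size, so one rough way to compare them is price per GB of VRAM. The sketch below is only an illustration: the metric, the `CARDS` dictionary and the function names are assumptions introduced here, not something the page computes, and "Go" is read as GB. It says nothing about throughput, which the benchmark table covers.

```python
# Prices (EUR, VAT included) and VRAM sizes as listed on the page for new cards, 2025-11.
# The dictionary layout and the names below are hypothetical, chosen for this sketch.
CARDS = {
    "Gigabyte Windforce OC GeForce RTX 3060": {"price_eur": 354.0, "vram_gb": 12},
    "PNY OC GeForce RTX 5060 Ti": {"price_eur": 450.0, "vram_gb": 16},
}


def euros_per_gb(price_eur: float, vram_gb: int) -> float:
    """Return the purchase price divided by the VRAM capacity."""
    if vram_gb <= 0:
        raise ValueError("vram_gb must be positive")
    return price_eur / vram_gb


def cost_table(cards: dict) -> dict:
    """Map each card name to its price per GB of VRAM."""
    return {name: euros_per_gb(**spec) for name, spec in cards.items()}


if __name__ == "__main__":
    for name, cost in cost_table(CARDS).items():
        print(f"{name}: {cost:.3f} EUR per GB of VRAM")
```

Price per GB only indicates how much model fits in memory for the money; it should be read alongside the tg and b figures measured for each model.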
informatique/ai_lm/gpu_bench.1776696605.txt.gz · Last modified: by cyrille
