{"id":266,"date":"2026-01-20T00:00:00","date_gmt":"2026-01-19T23:00:00","guid":{"rendered":"https:\/\/helloblog.io\/hu\/wp-bench-wordpress-ai-benchmark\/"},"modified":"2026-01-20T00:00:00","modified_gmt":"2026-01-19T23:00:00","slug":"wp-bench-wordpress-ai-benchmark","status":"publish","type":"post","link":"https:\/\/helloblog.io\/hu\/wp-bench-wordpress-ai-benchmark\/","title":{"rendered":"WP-Bench: v\u00e9gre egy hivatalos WordPress AI benchmark, ami t\u00e9nyleg WordPress-tud\u00e1st m\u00e9r"},"content":{"rendered":"\n<p>Ha dolgozt\u00e1l m\u00e1r AI-alap\u00fa k\u00f3dasszisztenssel WordPress-projekten, val\u00f3sz\u00edn\u0171leg ismer\u0151s a jelens\u00e9g: a modell sz\u00e9pen meg\u00edrja a PHP-t, azt\u00e1n kider\u00fcl, hogy rossz hook-ot v\u00e1lasztott, nem WP-szer\u0171 a megold\u00e1s, vagy elcs\u00faszik a biztons\u00e1gi alapokon (nonce, capability check, sanitization\/escaping). A klasszikus programoz\u00e1si benchmarkok erre nem adnak j\u00f3 k\u00e9pet, mert \u00e1ltal\u00e1nos feladatokra vannak kital\u00e1lva.<\/p>\n\n\n\n<p>Erre \u00e9rkezett v\u00e1laszk\u00e9nt a WordPress projekt r\u00e9sz\u00e9r\u0151l a <strong>WP-Bench<\/strong>, ami a bejelent\u00e9s szerint az els\u0151 <em>hivatalos WordPress AI benchmark<\/em>: c\u00e9lja, hogy m\u00e9rhet\u0151 legyen, mennyire \u00e9rtik a nyelvi modellek a WordPress fejleszt\u00e9st \u2013 nem elm\u00e9letben, hanem futtathat\u00f3 k\u00f3ddal, WordPress k\u00f6rnyezetben.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Mi az a WP-Bench, \u00e9s mire j\u00f3?<\/h2>\n\n\n\n<p>A <a href=\"https:\/\/github.com\/WordPress\/wp-bench\">WP-Bench<\/a> egy ny\u00edlt forr\u00e1sk\u00f3d\u00fa benchmark-keretrendszer, ami kifejezetten WordPress feladatokon \u00e9rt\u00e9kel nyelvi modelleket. A f\u00f3kusz nem csak a core API-k ismerete, hanem a <strong>WordPress-specifikus fejleszt\u0151i val\u00f3s\u00e1g<\/strong>: k\u00f3dol\u00e1si szabv\u00e1nyok, b\u0151v\u00edtm\u00e9ny-architekt\u00fara, biztons\u00e1gi bevett mint\u00e1k \u00e9s a modern API-k haszn\u00e1lata.<\/p>\n\n\n\n<p>A WordPress a web jelent\u0151s r\u00e9sz\u00e9t hajtja, m\u00e9gis sok AI-\u00e9rt\u00e9kel\u00e9s \u201e\u00e1ltal\u00e1nos programoz\u00e1sk\u00e9nt\u201d kezeli a ter\u00fcletet. A WP-Bench l\u00e9nyege pont az, hogy a WordPress-tud\u00e1s ne mell\u00e9kes szempont legyen, hanem k\u00fcl\u00f6n m\u00e9rhet\u0151 k\u00e9pess\u00e9g.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Mit m\u00e9r pontosan? K\u00e9t dimenzi\u00f3: tud\u00e1s \u00e9s v\u00e9grehajt\u00e1s<\/h2>\n\n\n\n<p>A WP-Bench k\u00e9t nagy tengelyen \u00e9rt\u00e9kel:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n\n<li><strong>Knowledge (tud\u00e1s):<\/strong> feleletv\u00e1laszt\u00f3s k\u00e9rd\u00e9sek WordPress fogalmakr\u00f3l, API-kr\u00f3l, hook-okr\u00f3l (WordPress-es esem\u00e9nypontok \u00e9s sz\u0171r\u0151k), biztons\u00e1gi mint\u00e1kr\u00f3l \u00e9s k\u00f3dol\u00e1si szabv\u00e1nyokr\u00f3l. A le\u00edr\u00e1s alapj\u00e1n k\u00fcl\u00f6n hangs\u00falyt kapnak az \u00fajabb fejleszt\u00e9sek, p\u00e9ld\u00e1ul az Abilities API \u00e9s az Interactivity API.<\/li>\n\n\n<li><strong>Execution (v\u00e9grehajt\u00e1s):<\/strong> k\u00f3dgener\u00e1l\u00f3s feladatok, ahol a modell \u00e1ltal \u00edrt k\u00f3dot egy val\u00f3di WordPress runtime \u00e9rt\u00e9keli statikus elemz\u00e9ssel \u00e9s fut\u00e1s k\u00f6zbeni ellen\u0151rz\u00e9sekkel (assertion-\u00f6k).<\/li>\n\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">A legfontosabb r\u00e9sz: WordPress \u00e9rt\u00e9kel WordPress-t<\/h2>\n\n\n\n<p>A benchmark egyik er\u0151s \u00f6tlete, hogy nem egy \u201ek\u00fcls\u0151\u201d pontoz\u00f3 pr\u00f3b\u00e1lja kital\u00e1lni, j\u00f3-e a k\u00f3d, hanem <strong>maga a WordPress futtat\u00f3k\u00f6rnyezete<\/strong> \u00e9rt\u00e9kel. A folyamat r\u00f6viden:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n\n<li>A harness (futtat\u00f3) promptot k\u00fcld a modellnek, \u00e9s WordPress-k\u00f3dot k\u00e9r.<\/li>\n\n\n<li>A gener\u00e1lt k\u00f3d WP-CLI-n kereszt\u00fcl eljut a WordPress runtime-hoz.<\/li>\n\n\n<li>A runtime statikus ellen\u0151rz\u00e9seket futtat (szintaxis, k\u00f3dst\u00edlus, biztons\u00e1gi szab\u00e1lyok).<\/li>\n\n\n<li>A k\u00f3d sandboxolt k\u00f6rnyezetben lefut, \u00e9s teszt-assertion-\u00f6k ellen\u0151rzik az elv\u00e1rt m\u0171k\u00f6d\u00e9st.<\/li>\n\n\n<li>Az eredm\u00e9ny JSON-k\u00e9nt \u00e9rkezik vissza pontsz\u00e1mokkal \u00e9s r\u00e9szletes visszajelz\u00e9ssel.<\/li>\n\n<\/ol>\n\n\n\n<p>Fejleszt\u0151i szemmel ez az\u00e9rt fontos, mert a \u201ej\u00f3l kin\u00e9z\u0151\u201d k\u00f3d helyett a m\u0171k\u00f6d\u0151, szabv\u00e1nyos \u00e9s biztons\u00e1gos megold\u00e1sok fel\u00e9 tolja a m\u00e9rc\u00e9t.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Gyorsind\u00edt\u00e1s: hogyan pr\u00f3b\u00e1lhatod ki a WP-Bench-et?<\/h2>\n\n\n\n<p>A projekt \u00fagy van \u00f6sszerakva, hogy a benchmark futtat\u00f3 r\u00e9sze Pythonb\u00f3l indul, a WordPress-es \u00e9rt\u00e9kel\u00e9s pedig egy k\u00fcl\u00f6n runtime mapp\u00e1ban megy.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">1) Telep\u00edt\u00e9s (Python virtualenv)<\/h3>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>python3 -m venv .venv &amp;&amp; source .venv\/bin\/activate\npip install -e .\/python\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#B392F0\">python3<\/span><span style=\"color:#79B8FF\"> -m<\/span><span style=\"color:#9ECBFF\"> venv<\/span><span style=\"color:#9ECBFF\"> .venv<\/span><span style=\"color:#E1E4E8\"> &#x26;&#x26; <\/span><span style=\"color:#79B8FF\">source<\/span><span style=\"color:#9ECBFF\"> .venv\/bin\/activate<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">pip<\/span><span style=\"color:#9ECBFF\"> install<\/span><span style=\"color:#79B8FF\"> -e<\/span><span style=\"color:#9ECBFF\"> .\/python<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">2) API kulcsok be\u00e1ll\u00edt\u00e1sa (.env)<\/h3>\n\n\n\n<p>A modell-szolg\u00e1ltat\u00f3k kulcsait egy <code>.env<\/code> f\u00e1jlba tudod tenni. A p\u00e9lda alapj\u00e1n t\u00f6bb szolg\u00e1ltat\u00f3 is t\u00e1mogatott:<\/p>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>OPENAI_API_KEY=sk-...\nANTHROPIC_API_KEY=sk-ant-...\nGOOGLE_API_KEY=...\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#E1E4E8\">OPENAI_API_KEY<\/span><span style=\"color:#F97583\">=<\/span><span style=\"color:#9ECBFF\">sk-...<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">ANTHROPIC_API_KEY<\/span><span style=\"color:#F97583\">=<\/span><span style=\"color:#9ECBFF\">sk-ant-...<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">GOOGLE_API_KEY<\/span><span style=\"color:#F97583\">=<\/span><span style=\"color:#9ECBFF\">...<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">3) WordPress runtime ind\u00edt\u00e1sa (Node.js)<\/h3>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>cd runtime\nnpm install\nnpm start\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#79B8FF\">cd<\/span><span style=\"color:#9ECBFF\"> runtime<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">npm<\/span><span style=\"color:#9ECBFF\"> install<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">npm<\/span><span style=\"color:#9ECBFF\"> start<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">4) Benchmark futtat\u00e1sa<\/h3>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>cd ..\nwp-bench run --config wp-bench.example.yaml\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#79B8FF\">cd<\/span><span style=\"color:#9ECBFF\"> ..<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">wp-bench<\/span><span style=\"color:#9ECBFF\"> run<\/span><span style=\"color:#79B8FF\"> --config<\/span><span style=\"color:#9ECBFF\"> wp-bench.example.yaml<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<p>Az eredm\u00e9nyek alap\u00e9rtelmezetten az <code>output\/results.json<\/code> f\u00e1jlba ker\u00fclnek, a r\u00e9szletes, tesztenk\u00e9nti log pedig <code>output\/results.jsonl<\/code> form\u00e1ban.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">T\u00f6bb modell \u00f6sszehasonl\u00edt\u00e1sa egy fut\u00e1ssal<\/h2>\n\n\n\n<p>Ha nem csak egy modellt akarsz futtatni, a konfigur\u00e1ci\u00f3ban t\u00f6bb modellt is felsorolhatsz. A WP-Bench egym\u00e1s ut\u00e1n lefuttatja \u0151ket, \u00e9s \u00f6sszehasonl\u00edt\u00f3 t\u00e1bl\u00e1t ad a v\u00e9g\u00e9n. A modellnevek a LiteLLM konvenci\u00f3it k\u00f6vetik (r\u00e9szletek: <a href=\"https:\/\/docs.litellm.ai\/docs\/providers\">LiteLLM providers<\/a>).<\/p>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>models:\n  - name: gpt-4o\n  - name: gpt-4o-mini\n  - name: claude-sonnet-4-20250514\n  - name: claude-opus-4-5-20251101\n  - name: gemini\/gemini-2.5-pro\n  - name: gemini\/gemini-2.5-flash\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#85E89D\">models<\/span><span style=\"color:#E1E4E8\">:<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">  - <\/span><span style=\"color:#85E89D\">name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">gpt-4o<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">  - <\/span><span style=\"color:#85E89D\">name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">gpt-4o-mini<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">  - <\/span><span style=\"color:#85E89D\">name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">claude-sonnet-4-20250514<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">  - <\/span><span style=\"color:#85E89D\">name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">claude-opus-4-5-20251101<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">  - <\/span><span style=\"color:#85E89D\">name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">gemini\/gemini-2.5-pro<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">  - <\/span><span style=\"color:#85E89D\">name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">gemini\/gemini-2.5-flash<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">Konfigur\u00e1ci\u00f3: mit \u00e9rdemes testre szabni?<\/h2>\n\n\n\n<p>A kiindul\u00e1si alap a <code>wp-bench.example.yaml<\/code>, amit lem\u00e1solsz \u00e9s m\u00f3dos\u00edtasz. A l\u00e9nyegi r\u00e9szek:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n\n<li><strong>dataset:<\/strong> honnan j\u00f6n a feladatsor (lok\u00e1lis vagy Hugging Face).<\/li>\n\n\n<li><strong>models:<\/strong> melyik modelleket futtatod.<\/li>\n\n\n<li><strong>grader:<\/strong> hogyan indul a WordPress-es \u00e9rt\u00e9kel\u0151 (a p\u00e9ld\u00e1ban Docker \u00e9s egy <code>wp-env<\/code>-es runtime).<\/li>\n\n\n<li><strong>run:<\/strong> melyik suite menjen, mennyi teszt, \u00e9s milyen p\u00e1rhuzamoss\u00e1ggal.<\/li>\n\n\n<li><strong>output:<\/strong> hova \u00edrja az eredm\u00e9nyt.<\/li>\n\n<\/ul>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>dataset:\n  source: local              # 'local' or 'huggingface'\n  name: wp-core-v1           # suite name\n\nmodels:\n  - name: gpt-4o\n\ngrader:\n  kind: docker\n  wp_env_dir: .\/runtime      # path to wp-env project\n\nrun:\n  suite: wp-core-v1\n  limit: 10                  # limit tests (null = all)\n  concurrency: 4\n\noutput:\n  path: output\/results.json\n  jsonl_path: output\/results.jsonl\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#85E89D\">dataset<\/span><span style=\"color:#E1E4E8\">:<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  source<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">local<\/span><span style=\"color:#6A737D\">              # 'local' or 'huggingface'<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">wp-core-v1<\/span><span style=\"color:#6A737D\">           # suite name<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">models<\/span><span style=\"color:#E1E4E8\">:<\/span><\/span>\n<span class=\"line\"><span style=\"color:#E1E4E8\">  - <\/span><span style=\"color:#85E89D\">name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">gpt-4o<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">grader<\/span><span style=\"color:#E1E4E8\">:<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  kind<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">docker<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  wp_env_dir<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">.\/runtime<\/span><span style=\"color:#6A737D\">      # path to wp-env project<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">run<\/span><span style=\"color:#E1E4E8\">:<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  suite<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">wp-core-v1<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  limit<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#79B8FF\">10<\/span><span style=\"color:#6A737D\">                  # limit tests (null = all)<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  concurrency<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#79B8FF\">4<\/span><\/span>\n<span class=\"line\"><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">output<\/span><span style=\"color:#E1E4E8\">:<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  path<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">output\/results.json<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  jsonl_path<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">output\/results.jsonl<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">Hasznos CLI parancsok<\/h3>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>wp-bench run --config wp-bench.yaml          # futtat\u00e1s config f\u00e1jllal\nwp-bench run --model-name gpt-4o --limit 5   # gyors, egy-modeles pr\u00f3ba\nwp-bench dry-run --config wp-bench.yaml      # config ellen\u0151rz\u00e9s modellh\u00edv\u00e1s n\u00e9lk\u00fcl\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#B392F0\">wp-bench<\/span><span style=\"color:#9ECBFF\"> run<\/span><span style=\"color:#79B8FF\"> --config<\/span><span style=\"color:#9ECBFF\"> wp-bench.yaml<\/span><span style=\"color:#6A737D\">          # futtat\u00e1s config f\u00e1jllal<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">wp-bench<\/span><span style=\"color:#9ECBFF\"> run<\/span><span style=\"color:#79B8FF\"> --model-name<\/span><span style=\"color:#9ECBFF\"> gpt-4o<\/span><span style=\"color:#79B8FF\"> --limit<\/span><span style=\"color:#79B8FF\"> 5<\/span><span style=\"color:#6A737D\">   # gyors, egy-modeles pr\u00f3ba<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">wp-bench<\/span><span style=\"color:#9ECBFF\"> dry-run<\/span><span style=\"color:#79B8FF\"> --config<\/span><span style=\"color:#9ECBFF\"> wp-bench.yaml<\/span><span style=\"color:#6A737D\">      # config ellen\u0151rz\u00e9s modellh\u00edv\u00e1s n\u00e9lk\u00fcl<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">Hogyan \u00e9p\u00fcl fel a repo? (gyors t\u00e9rk\u00e9p)<\/h2>\n\n\n\n<p>A strukt\u00fara el\u00e9g tiszta, k\u00fcl\u00f6n van a futtat\u00f3 (Python), k\u00fcl\u00f6n a WordPress runtime \u00e9s k\u00fcl\u00f6n a feladatsorok:<\/p>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>.\n\u251c\u2500\u2500 python\/          # Benchmark harness (pip installable)\n\u251c\u2500\u2500 runtime\/         # WordPress grader plugin + wp-env config\n\u251c\u2500\u2500 datasets\/        # Test suites (local JSON + Hugging Face builder)\n\u251c\u2500\u2500 notebooks\/       # Results visualization and reporting\n\u2514\u2500\u2500 output\/          # Benchmark results (gitignored)\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#79B8FF\">.<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">\u251c\u2500\u2500<\/span><span style=\"color:#9ECBFF\"> python\/<\/span><span style=\"color:#6A737D\">          # Benchmark harness (pip installable)<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">\u251c\u2500\u2500<\/span><span style=\"color:#9ECBFF\"> runtime\/<\/span><span style=\"color:#6A737D\">         # WordPress grader plugin + wp-env config<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">\u251c\u2500\u2500<\/span><span style=\"color:#9ECBFF\"> datasets\/<\/span><span style=\"color:#6A737D\">        # Test suites (local JSON + Hugging Face builder)<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">\u251c\u2500\u2500<\/span><span style=\"color:#9ECBFF\"> notebooks\/<\/span><span style=\"color:#6A737D\">       # Results visualization and reporting<\/span><\/span>\n<span class=\"line\"><span style=\"color:#B392F0\">\u2514\u2500\u2500<\/span><span style=\"color:#9ECBFF\"> output\/<\/span><span style=\"color:#6A737D\">          # Benchmark results (gitignored)<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">Feladatsorok (test suites): tud\u00e1s + k\u00f3d, kateg\u00f3ri\u00e1k szerint<\/h2>\n\n\n\n<p>A suite-ok a <code>datasets\/suites\/&lt;suite-name&gt;\/<\/code> alatt vannak, \u00e9s k\u00e9t k\u00f6nyvt\u00e1rra bontanak:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n\n<li><code>execution\/<\/code> \u2013 k\u00f3dgener\u00e1l\u00f3s feladatok assertion-\u00f6kkel (kateg\u00f3ri\u00e1nk\u00e9nt k\u00fcl\u00f6n JSON).<\/li>\n\n\n<li><code>knowledge\/<\/code> \u2013 feleletv\u00e1laszt\u00f3s k\u00e9rd\u00e9sek (szint\u00e9n kateg\u00f3ri\u00e1nk\u00e9nt k\u00fcl\u00f6n JSON).<\/li>\n\n<\/ul>\n\n\n\n<p>Az alap\u00e9rtelmezett suite neve <code>wp-core-v1<\/code>, \u00e9s a le\u00edr\u00e1s alapj\u00e1n WordPress core API-kat, hook-okat, adatb\u00e1zis-m\u0171veleteket \u00e9s biztons\u00e1gi mint\u00e1kat fed le.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Suite bet\u00f6lt\u00e9se Hugging Face-r\u0151l<\/h3>\n\n\n\n<p>A dataset forr\u00e1s\u00e1t \u00e1t tudod kapcsolni Hugging Face-re is:<\/p>\n\n\n\n<div class=\"wp-block-kevinbatdorf-code-block-pro\" data-code-block-pro-font-family=\"Code-Pro-JetBrains-Mono\" style=\"font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)\"><span style=\"display:block;padding:16px 0 0 16px;margin-bottom:-1px;width:100%;text-align:left;background-color:#24292e\"><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"54\" height=\"14\" viewBox=\"0 0 54 14\"><g fill=\"none\" fill-rule=\"evenodd\" transform=\"translate(1 1)\"><circle cx=\"6\" cy=\"6\" r=\"6\" fill=\"#FF5F56\" stroke=\"#E0443E\" stroke-width=\".5\"><\/circle><circle cx=\"26\" cy=\"6\" r=\"6\" fill=\"#FFBD2E\" stroke=\"#DEA123\" stroke-width=\".5\"><\/circle><circle cx=\"46\" cy=\"6\" r=\"6\" fill=\"#27C93F\" stroke=\"#1AAB29\" stroke-width=\".5\"><\/circle><\/g><\/svg><\/span><span role=\"button\" tabindex=\"0\" style=\"color:#e1e4e8;display:none\" aria-label=\"Copy\" class=\"code-block-pro-copy-button\"><pre class=\"code-block-pro-copy-button-pre\" aria-hidden=\"true\"><textarea class=\"code-block-pro-copy-button-textarea\" tabindex=\"-1\" aria-hidden=\"true\" readonly>dataset:\n  source: huggingface\n  name: WordPress\/wp-bench-v1\n\n<\/textarea><\/pre><svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" style=\"width:24px;height:24px\" fill=\"none\" viewBox=\"0 0 24 24\" stroke=\"currentColor\" stroke-width=\"2\"><path class=\"with-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4\"><\/path><path class=\"without-check\" stroke-linecap=\"round\" stroke-linejoin=\"round\" d=\"M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2\"><\/path><\/svg><\/span><pre class=\"shiki github-dark\" style=\"background-color:#24292e;color:#e1e4e8\" tabindex=\"0\"><code><span class=\"line\"><span style=\"color:#85E89D\">dataset<\/span><span style=\"color:#E1E4E8\">:<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  source<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">huggingface<\/span><\/span>\n<span class=\"line\"><span style=\"color:#85E89D\">  name<\/span><span style=\"color:#E1E4E8\">: <\/span><span style=\"color:#9ECBFF\">WordPress\/wp-bench-v1<\/span><\/span>\n<span class=\"line\"><\/span><\/code><\/pre><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">Jelenlegi \u00e1llapot \u00e9s korl\u00e1tok (amit \u00e9rdemes \u00e9szben tartani)<\/h2>\n\n\n\n<p>A bejelent\u00e9s alapj\u00e1n a WP-Bench m\u00e9g korai kiad\u00e1s, \u00e9s t\u00f6bb ponton is l\u00e1tszik, merre kell fejl\u0151dnie:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n\n<li><strong>Kicsi dataset:<\/strong> a jelenlegi feladatsor m\u00e9g nem el\u00e9g nagy ahhoz, hogy \u201emindent lefed\u0151\u201d k\u00e9pet adjon. T\u00f6bb WordPress API-ra \u00e9s t\u00f6bb val\u00f3s mint\u00e1ra lesz sz\u00fcks\u00e9g.<\/li>\n\n\n<li><strong>Verzi\u00f3-bias:<\/strong> a suite jelenleg er\u0151sen a WordPress 6.9 k\u00f6r\u00fcli \u00fajdons\u00e1gok (Abilities API, Interactivity API) fel\u00e9 h\u00faz. Ez r\u00e9szben sz\u00e1nd\u00e9kos, mert ezeken szoktak elv\u00e9rezni a modellek, de torz\u00edt is, mivel ezek az API-k sok modell tr\u00e9ningadat\u00e1ban m\u00e9g nem szerepeltek.<\/li>\n\n\n<li><strong>Benchmark-szatur\u00e1ci\u00f3:<\/strong> a r\u00e9gebbi WordPress-t\u00e9m\u00e1kb\u00f3l t\u00f6bb modell t\u00fal magas pontsz\u00e1mot hozott, vagyis ezek a k\u00e9rd\u00e9sek m\u00e1r nem adnak el\u00e9g inform\u00e1ci\u00f3t. Neh\u00e9z olyan feladatot tal\u00e1lni, ami t\u00e9nyleg kih\u00edv\u00e1s, \u00e9s nem csak \u201e\u00faj\u201d.<\/li>\n\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Mi\u00e9rt \u00e9rdekes ez WordPress-fejleszt\u0151k\u00e9nt?<\/h2>\n\n\n\n<p>A WP-Bench k\u00e9t ir\u00e1nyb\u00f3l lehet hasznos a mindennapi d\u00f6nt\u00e9seidhez:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n\n<li><strong>Eszk\u00f6zv\u00e1laszt\u00e1s:<\/strong> ha AI-t haszn\u00e1lsz k\u00f3dol\u00e1sra (vagy AI-funkci\u00f3t \u00e9p\u00edtesz b\u0151v\u00edtm\u00e9nybe), nem mindegy, melyik modell mennyire \u00e9rti a WordPress-es konvenci\u00f3kat. A benchmark c\u00e9lja, hogy err\u0151l legyen \u00f6sszehasonl\u00edthat\u00f3 k\u00e9ped.<\/li>\n\n\n<li><strong>Nyom\u00e1s a modellgy\u00e1rt\u00f3k fel\u00e9:<\/strong> a c\u00e9l, hogy a WP-Bench egy olyan standard legyen, amit a nagy labok is lefuttatnak pre-release \u00e9rt\u00e9kel\u00e9sk\u00e9nt. Ha a WordPress teljes\u00edtm\u00e9ny m\u00e9rhet\u0151 \u00e9s l\u00e1that\u00f3, az \u00f6nmag\u00e1ban \u00f6szt\u00f6nz\u0151 lehet optimaliz\u00e1l\u00e1sra.<\/li>\n\n<\/ul>\n\n\n\n<p>A csapat emellett egy ny\u00edlt, publikus leaderboard (ranglista) ir\u00e1ny\u00e1ba is dolgozik, ahol \u00e1tl\u00e1that\u00f3an k\u00f6vethet\u0151 lesz, melyik modell hogyan teljes\u00edt WordPress feladatokon.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">K\u00f6z\u00f6ss\u00e9gi projekt: mit\u0151l lesz igaz\u00e1n j\u00f3 a WP-Bench?<\/h2>\n\n\n\n<p>Egy WordPress-specifikus benchmark min\u0151s\u00e9ge alapvet\u0151en a teszteseteken \u00e9s az \u00e9rt\u00e9kel\u00e9s szigor\u00e1n \u00e1ll vagy bukik. A bejelent\u00e9s szerint a WP-Bench kifejezetten sz\u00e1m\u00edt k\u00f6z\u00f6ss\u00e9gi hozz\u00e1j\u00e1rul\u00e1sokra, t\u00f6bbek k\u00f6z\u00f6tt:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n\n<li>\u00faj tesztesetek hozz\u00e1ad\u00e1sa (k\u00fcl\u00f6n\u00f6sen a tr\u00fckk\u00f6s, gyakran f\u00e9lre\u00e9rtett WordPress mint\u00e1kb\u00f3l),<\/li>\n\n\n<li>benchmarkok futtat\u00e1sa \u00e9s eredm\u00e9nyek megoszt\u00e1sa,<\/li>\n\n\n<li>a grading logika (\u00e9rt\u00e9kel\u00e9s) szigor\u00edt\u00e1s\u00e1nak, pontos\u00edt\u00e1s\u00e1nak jav\u00edt\u00e1sa,<\/li>\n\n\n<li>eredm\u00e9nyek bek\u00fcld\u00e9se a k\u00e9sz\u00fcl\u0151 publikus ranglist\u00e1hoz.<\/li>\n\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Hasznos linkek (hivatalos forr\u00e1sok)<\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n\n<li>WP-Bench repo: https:\/\/github.com\/WordPress\/wp-bench<\/li>\n\n\n<li>AI Building Blocks for WordPress: https:\/\/make.wordpress.org\/ai\/2025\/07\/17\/ai-building-blocks\/<\/li>\n\n\n<li>WordPress Slack #core-ai: https:\/\/wordpress.slack.com\/archives\/C08TJ8BPULS<\/li>\n\n<\/ul>\n\n\n<div class=\"references-section\">\n                <h2>Hivatkoz\u00e1sok \/ Forr\u00e1sok<\/h2>\n                <ul class=\"references-list\"><li><a href=\"https:\/\/make.wordpress.org\/ai\/2026\/01\/14\/introducing-wp-bench-a-wordpress-ai-benchmark\/\" target=\"_blank\" rel=\"noopener noreferrer\">Introducing WP-Bench: A WordPress AI Benchmark<\/a><\/li><li><a href=\"https:\/\/github.com\/WordPress\/wp-bench\" target=\"_blank\" rel=\"noopener noreferrer\">WP-Bench GitHub README<\/a><\/li><\/ul>\n            <\/div>","protected":false},"excerpt":{"rendered":"<p>A k\u00f3dasszisztensek egyre jobbak, de WordPress-specifikus feladatokn\u00e1l m\u00e9g gyakran mell\u00e9fognak. A WP-Bench erre ad v\u00e1laszt: hivatalos benchmark, ami WordPress runtime-ban \u00e9rt\u00e9keli a modellek tud\u00e1s\u00e1t \u00e9s k\u00f3dj\u00e1t.<\/p>\n","protected":false},"author":3,"featured_media":265,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[113],"tags":[64,114,115,10],"class_list":["post-266","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai","tag-ai","tag-benchmark","tag-fejlesztes","tag-wordpress"],"_links":{"self":[{"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/posts\/266","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/comments?post=266"}],"version-history":[{"count":0,"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/posts\/266\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/media\/265"}],"wp:attachment":[{"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/media?parent=266"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/categories?post=266"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/helloblog.io\/hu\/wp-json\/wp\/v2\/tags?post=266"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}