Amamodeli aphezulu ayi-7 okufaka amakhodi ongawagijima endaweni ngo-2026

# Isingeniso
Amamodeli ekhodi endawo agcina eba bucayi. Ngibe umlandeli omkhulu waleli gagasi elisha lamamodeli ezilimi ezinkulu zasendaweni (ama-LLM), ikakhulukazi amamodeli avuliwe kanye nokukhishwa komphakathi kwe-GGML Universal File (GGUF) okuwenza kube lula ukusebenza ku-hardware yabathengi. Manje sesisezingeni lapho amanye alawa mamodeli angasebenza kuma-GPU njenge-RTX 3090, akhiqize ngokushesha ngokwanele ukuze azizwe ewusizo, futhi empeleni axazulule izinkinga zangempela zokubhala ikhodi ne-ejenti. Hhayi nje amademo. Hhayi nje imigilingwane.
Uma ufuna ukusethwa kwekhodi yendawo ngokugcwele futhi ube okungenani ne-16GB ye-Video Random Access Memory (VRAM), lawa mamodeli angakusiza ukuthi usuke ekwethembeni kuphela. Claude Ikhodi, Gemininoma abanye abasizi bekhodi abasingathiwe. Ayashesha, ayakwazi, ayimfihlo, futhi alungele ukugeleza komsebenzi wangempela wokuthuthukisa.
Usuvele ulubona lolu shintsho lwenzeka kuwo wonke umphakathi wasendaweni we-AI. I-Reddit's r/LocalLLaMA igcwele onjiniyela abasebenzisa ama-ejenti okubhala amakhodi wendawo, amamodeli e-GGUF ahlolayo, akha amaseva wendawo ahambisana ne-OpenAI, futhi axhumanise lawa mamodeli kubahleli, amatheminali, nabasizi bokubhala amakhodi.
# 1. Qwen3.6 27B MTP
Qwen3.6 27B MTP ingenye yamamodeli ami engiwathandayo wokufaka amakhodi endawo njengamanje. Ngiyihlolile, ngayisebenzisa, futhi ngayihlola kuzo zonke izinhlelo ezihlukene, futhi izwakala njengebhalansi engcono kakhulu phakathi kosayizi, isivinini, nekhono langempela lokubhala ikhodi.
Ingxenye engcono kakhulu ukuthi ngezinguqulo ze-GGUF ezilinganiselwe, ungayisebenzisa ku-hardware yabathengi esikhundleni sokudinga ukusetha okugcwele kwamafu. Ngisho noma usebenza nge-16GB kuya ku-24GB VRAM GPU, izinguqulo ze-4-bit zikwenza kube ngokoqobo kakhulu ukusetshenziswa endaweni.
Umphakathi we-r/LocalLLaMA ku-Reddit usuvele ugcwele abantu abahlola i-Qwen3.6 27B MTP ukuthola ikhodi yendawo, ukunquma okusheshayo, llama.cpp ukusetha, namaseva asendaweni ahambisana ne-OpenAI. Futhi ngokweqiniso, i-hype inengqondo.
Amamodeli we-Qwen ngokuvamile anamandla ekubhaleni ikhodi ngoba ahlanganisa ukucabanga, ukulandela imiyalelo, ukuqonda ngezilimi eziningi, ukusetshenziswa kwamathuluzi, nokusekelwa komongo omude. Lokho kwenza i-Qwen3.6 27B MTP ibe imodeli yendawo eqinile yomjikelezo wonke yabasizi bokubhala amakhodi, ingxoxo ye-repo, ukulungisa amaphutha, imiyalo yegobolondo, nokugeleza komsebenzi kwe-ejenti.
# 2. Gemma 4 31B IT QAT
IGemma 4 31B IT QAT enye imodeli engicabanga ukuthi ifanelwe indawo engathi sína kunoma yikuphi ukusethwa kwekhodi yendawo. Amamodeli e-Gemma avuliwe e-Google abelokhu elungile kubantu abafuna ukusebenzisa amamodeli anekhono endaweni, futhi le nguqulo ye-GGUF yokuqeqeshwa kwe-quantization-aware (QAT) iyenza isebenze nakakhulu.
Uthola imodeli enkulu ye-31B ngefomethi ye-4-bit quantized okulula kakhulu ukuyilayisha ku-hardware yabathengi, kuyilapho igcina ikhwalithi eqinile. Akuyona nje i-hype futhi. Ngibhale ngamamodeli we-Gemma, ngawasebenzisa, ngawahlola ekuhambeni komsebenzi okuhlukene, futhi azizwa esondelene kakhulu nochungechunge lwe-Qwen uma kuziwa ekubhaleni amakhodi wendawo nokucabanga.
Isizathu esikhulu sokuthi i-Gemma 4 31B igqame ukuthi akuyona nje imodeli yokubhala amakhodi. Futhi iyi-multimodal, okusho ukuthi ingasiza ngezithombe-skrini, izinkinga ze-UI, imidwebo, izithombe zemibhalo, nezakhiwo zohlelo lokusebenza lwewebhu kuyilapho zisasebenziseka ekwenzeni amakhodi, ukulungisa iphutha, nokuhlela.
Izinombolo zebhentshimakhi ezisemthethweni nazo zenza kube nzima ukuziba, ngemiphumela eqinile yokubhala ikhodi ku-LiveCodeBench naku-Codeforces. Uma ufuna imodeli yendawo engakwazi ukusingatha ukubhala kanye nemisebenzi yokuthuthukisa okubukwayo, i-Gemma 4 31B IT QAT ingenye yezinketho ezingcono kakhulu ongazizama.
# 3. I-DiffusionGemma 26B A4B
I-DiffusionGemma 26B A4B ingenye yamamodeli amasha kakhulu futhi athakazelisa kakhulu kulolu hlu. Inamandla, ihlola, futhi yakhiwe ngokuhlukile kumamodeli wolimi wethokheni nethokheni evamile.
Esikhundleni sokukhiqiza umbhalo ngendlela evamile ye-autoregressive, isebenzisa indlela ye-block-diffusion, eklanyelwe ukuthuthukisa isivinini sokukhiqiza ngokukhipha amabhulokhi amathokheni ngokuhambisana.
Kungakho le modeli ijabulisa ekubhalweni kwasendaweni: izwakala njengohlobo lwezakhiwo ezingenza abasizi bendawo basheshe kakhulu, ikakhulukazi ukukhiqizwa kwekhodi, okuphumayo okuhlelekile, nemisebenzi yokucabanga esheshayo.
Isikhalazo esikhulu ukusebenza kahle. I-DiffusionGemma inamapharamitha angu-25B aphelele kodwa azungeze amapharamitha angu-3.8B asebenzayo kuphela, ukuze uthole inzuzo yemodeli yesitayela se-Mixture of Experts (MoE) enkudlwana ngaphandle kokukhokha izindleko eziphelele zemodeli engu-26B eminyene.
# 4. I-Nemotron Cascade 2 30B A3B
I-Nemotron Cascade 2 30B A3B enye imodeli ebukeka iyinqaba ephepheni kodwa eyenza umqondo omkhulu ekubhalweni kwasendaweni.
Kuyimodeli yesitayela se-30B ye-MoE, kodwa cishe amapharamitha angu-3B kuphela asebenzayo ngesikhathi sokuqagela. Ngakho awukhokhi izindleko ezigcwele zemodeli eminyene engu-30B njalo. Yilolo kanye uhlobo lwemodeli engiluthandayo ekusetheni kwasendaweni: inkulu ngokwanele ukucabanga kahle, kodwa isasebenza kahle ngokwanele ukuthi ikwazi ukugijima nokuhlola emshinini wakho.
Okwenza le modeli ijabulise ukuthi izwakala njengemodeli yokucabanga kunemodeli elula yokuqedela ngokuzenzakalela. I-NVIDIA iyichaza njengeqinile emisebenzini yokucabanga neye-agent, enazo zombili izindlela zokucabanga nezokufundisa, futhi ifuna ngisho nokusebenza kwezinga lendondo yegolide ku-International Mathematics Olympiad (IMO) 2025 kanye ne-International Olympiad in Informatics (IOI) 2025.
Konjiniyela, lokho kubalulekile ngoba ukubhala ngekhodi akuseyona nje imisebenzi yokubhala. Ufuna imodeli isuse iphutha, ihlele, ibuyekeze ikhodi, iqonde izinkinga zezinyathelo eziningi, futhi icabange ngemininingwane yokusebenzisa.
# 5. Qwen3.5 9B MTP
Qwen3.5 9B MTP iyimodeli encane kulolu hlu, kodwa ungayithathi kancane.
Ngesigaba sayo sesisindo, ilinganisa kahle kakhulu futhi ikunikeza umsizi ofanele wesimanje wokubhala ikhodi wesitayela se-Qwen ngaphandle kokudinga indawo yokusebenza enkulu. Uma unokusethwa kwasendaweni okuncane, le modeli iyigugu. Iyashesha, iyasebenza, futhi kulula kakhulu ukuyisebenzisa kunezinhlobo ezingama-27B noma ezingama-31B.
Inguqulo ye-GGUF yiyo eyenza isebenziseke nakakhulu konjiniyela bansuku zonke. Awudingi ukusetha okuyinkimbinkimbi noma isibonelo samafu esibizayo ukuze nje ukuhlole. Ungayiqhuba endaweni, uyixhume kusihleli sakho noma ukuhamba komsebenzi kwetheminali, futhi ukusebenzise njengomsizi oyimfihlo wokubhala amakhodi.
Ngeke yehlule amamodeli amakhulu ekucabangeni okuyinkimbinkimbi, kodwa emisebenzini yansuku zonke yokubhala ikhodi kungaphezu kokwanele. Ungayisebenzisela imibhalo emincane, ukulungisa iphutha, izincazelo zekhodi, imiyalo yegobolondo, nokugeleza komsebenzi komsizi wendawo okusheshayo. Kubantu abaqala ngamamodeli wendawo wokubhala amakhodi, i-Qwen3.5 9B MTP cishe ingenye yezinketho eziphephe kakhulu nezisebenzayo.
# 6. HLOLA 4.5 33B
EXAONE 4.5 33B enye imodeli engicabanga ukuthi abathuthukisi akufanele bayizibe, ikakhulukazi uma umsebenzi wakho uhilela okungaphezu nje kwekhodi engenalutho.
Kuyimodeli ye-LG AI Research's open-weight multimodal, futhi lokho kuyenza isebenziseke ngempela ekuhambeni komsebenzi wokufaka amakhodi wendawo lapho udinga futhi ukuqonda izithombe-skrini, ama-PDF, imidwebo, imibhalo, nezakhiwo ze-UI.
Kulapho i-EXAONE iba nentshisekelo khona. Umsebenzi omningi wokubhala amakhodi manje awukona nje ukubhala imisebenzi yePython. Ufunda amadokhumenti, ubheka amaphutha ezithombeni-skrini, uqonda imidwebo yezakhiwo, futhi usebenza ngamafayela ephrojekthi angcolile. Imodeli ekwazi ukuphatha kokubili umbhalo nokufaka okubukwayo iba usizo kakhulu.
Uma ufuna imodeli yendawo yekhodi kanye namadokhumenti, izithombe-skrini, nokugeleza komsebenzi kwesitayela sebhizinisi, i-EXAONE 4.5 33B inketho eqinile ongayizama.
# 7. Ikhodi Encane YaseNyakatho 1.0
Ikhodi Encane yaseNyakatho 1.0 ingenye yamamodeli amasha kakhulu kulolu hlu, futhi kuhle ukubona i-Cohere ekugcineni ingena endaweni yemodeli yekhodi yendawo ngendlela efanele.
Lena akuyona i-chatbot evamile eyenzeka futhi ekubhaleni ikhodi. Yakhelwe ukwenziwa kwekhodi, ubunjiniyela besoftware ye-agent, kanye nemisebenzi esekwe ekugcineni. Lokho kukwenza kuthakaseleke kakhulu konjiniyela abafuna imodeli yendawo yokuhlelwa kwe-repo, usizo lomugqa womyalo, ukubuyekezwa kwekhodi, nokugeleza komsebenzi komenzeli wekhodi.
Futhi iyimodeli ye-30B-A3B, okusho ukuthi inamapharamitha aphelele angu-30B kodwa cishe amapharamitha asebenzayo angu-3B kuphela ngesikhathi sokunquma. Ngakho futhi, uthola leyo bhalansi enhle: ukucabanga okunamandla kunamamodeli amancane, kodwa kusasebenza kahle kakhulu kunemodeli eminyene egcwele engu-30B.
Ingase ingabi banzi njenge-Qwen3.6 27B noma i-Gemma 4 31B, kodwa ngomsebenzi oqondene nekhodi, i-North Mini Code 1.0 ibukeka njengemodeli ewusizo kakhulu ongayizama.
# Imicabango yokugcina
Leli thebula likunikeza ukubuka okusheshayo kokuthi iyiphi imodeli yekhodi yendawo ongakhetha kuyo ngokusekelwe kuzingxenyekazi zekhompuyutha zakho, ukuhamba komsebenzi, kanye necala lokusebenzisa amakhodi.
| Imodeli | Usayizi / Uhlobo | Ikesi elingcono kakhulu lokusebenzisa | Kungani Uyikhetha |
|---|---|---|---|
| Qwen3.6 27B MTP | 27B MTP | Ukubhala amakhodi kwasendaweni okuqinile, ukucabanga, nokugeleza komsebenzi kwe-ejenti | Imodeli yekhodi yendawo yonke eyindilinga |
| IGemma 4 31B IT QAT | 31B, 4-bit QAT, multimodal | Ukubhala ngekhodi kanye nezithombe-skrini, iziphazamisi ze-UI, imidwebo, nomsebenzi onomongo omude | Amabhentshimakhi ekhodi aqinile nosekelo lwe-multimodal |
| I-DiffusionGemma 26B A4B | 26B / ~4B esebenzayo | Ukubhala amakhodi kwasendaweni okusheshayo, kokuhlola nokucabanga | Izakhiwo ezintsha zigxile ekukhiqizeni ngempumelelo |
| I-Nemotron Cascade 2 30B A3B | 30B / ~3B esebenzayo | Ukubhala amakhodi kwe-agent, ukulungisa iphutha, ukuhlela, kanye nemisebenzi enzima yokucabanga | Kuzwakala njengomenzeli wokucabanga kunokuqedela ngokuzenzakalela |
| Qwen3.5 9B MTP | 9B MTP | Imishini yasendaweni emincane nosizo lokufaka amakhodi nsuku zonke | Kuyashesha, kuyasebenziseka, futhi kuhle ngesigaba sayo sesisindo |
| EXAONE 4.5 33B | I-33B ye-multimodal | Ikhodi, amadokhumenti, izithombe-skrini, ama-PDF, nemidwebo | Kuhle kakhulu kumadokhumenti esindayo nokugeleza kwekhodi ebonakalayo |
| Ikhodi Encane yaseNyakatho 1.0 | 30B / ~3B imodeli yekhodi esebenzayo | Ama-ejenti amakhodi wendawo, ukuhlelwa kwe-repo, imisebenzi yetheminali, nokubuyekezwa kwekhodi | Iningi lemodeli eqondene nekhodi kuhlu |
Amamodeli wekhodi wendawo manje aselungile ngokwanele ukuthi ungakwazi ukuwasebenzisa emsebenzini wokuthuthukisa wangempela, hhayi nje ukuhlola noma ukudlala. Uma une-GPU enhle njenge-RTX 3090 noma i-4090, ngingamane ngincome ukuthi uqale nge-Qwen3.6 27B MTP ku-4-bit. Kuyinketho engcono kunazo zonke yokubhala ikhodi yendawo, ukucabanga, nokugeleza komsebenzi kwe-ejenti. Ngokweqiniso, zama lokho kuqala ngaphambi kokumosha isikhathi ngokugxuma phakathi kwamamodeli amaningi kakhulu.
Uma ufuna isizukulwane sasendaweni esishesha kakhulu ku-hardware efanayo, khona-ke i-DiffusionGemma 26B A4B iyona okufanele uyibuke. Yintsha futhi iyahlola, kodwa i-architecture ikwenza kuthakazelise ngempela konjiniyela abanendaba nesivinini kanye nokuqagela okusebenzayo.
Uma ufuna ukuqonda kwe-multimodal, ukucabanga okungcono, kanye nekhono lokusebenza ngekhodi nezithombe-skrini, izakhiwo ze-UI, imidwebo, kanye nemibhalo, khona-ke i-Gemma 4 31B IT QAT iyisinqumo esihle. Ingaphezu nje kwemodeli yokubhala amakhodi, futhi lokho kuyenza isebenziseke ekuthuthukisweni komsebenzi wesimanjemanje.
Futhi uma ungenayo i-GPU enkulu, i-Qwen3.5 9B MTP cishe iyimodeli engcono kakhulu yesigaba sayo sesisindo. Ngisho nokusetha okulula kwendawo kanye ne-RAM yesistimu eyanele, isengasebenza kahle njengomsizi wokubhala amakhodi wansuku zonke ukuze uthole izincazelo, ukulungisa amaphutha, imibhalo, imiyalo yegobolondo, nosizo olujwayelekile lokuhamba komsebenzi.
Amanye amamodeli nawo afanele ukuhlolwa, kuye ngokuthi yini oyikhathalelayo.
I-Nemotron Cascade 2 30B A3B yinhle uma ufuna imodeli yendawo yokucabanga yokubhala ikhodi ye-ejenti, ukuhlela, ukulungisa iphutha, nokuxazulula izinkinga ezihlelekile.
I-EXAONE 4.5 33B iwusizo uma umsebenzi wakho uhlanganisa amadokhumenti, ama-PDF, izithombe-skrini, nokugeleza komsebenzi wokubhala amakhodi wesitayela sebhizinisi.
I-North Mini Code 1.0 iyindlela egxile kakhulu ekubhalweni kwekhodi, futhi ibukeka ithembisa ama-ejenti asendaweni wokubhala amakhodi, ukuhlelwa kwe-repo, imisebenzi yetheminali, nokubuyekezwa kwekhodi. Kungase kungabi ukukhetha kwami kokuqala kwawo wonke umuntu, kodwa ngayinye inesizathu esicacile sokuba khona.
Abid Ali Awan (@1abidiawan) uchwepheshe wesayensi yedatha othanda amamodeli wokufunda womshini wokwakha. Njengamanje, ugxile ekudaleni okuqukethwe nasekubhaleni amabhulogi ezobuchwepheshe ekufundeni komshini kanye nobuchwepheshe besayensi yedatha. U-Abid uneziqu ze-Master's in technology management kanye neziqu ze-bachelor's in telecommunication engineering. Umbono wakhe uwukwakha umkhiqizo we-AI esebenzisa i-graph neural network yabafundi abanenkinga yokugula ngengqondo.



