Zibuu Ai just released GLM-4.5 series

The state of the AI Foundation models appear immediately, but a few installs are important in 2025 as a series of Z.Ai's GLM-4.5: GLM-4.5 as well as its simple child GLM-4.5-Air. Revealations by Zuruu Ai, these models set high quality energy levels with integrated Avonto and open access, which aims to close the gap between consultation, codes, and two scales.
Model Architecture and parameters
| Statue | Complete parameters | Active parameters | Suck |
|---|---|---|---|
| GLM-4.5 | 355b | 32b | Between the largest massive weight, the work of Benchmark. |
| GLM-4.5-Air | 106b | 12 | Compact, which works well, referred to in a MainSream computer configuration |
GLM-4.5 is designed to A mixture of experts (moe) The buildings, and 355 billion parameters (32 billion acted at a time). This model is designed for cutting, directing the requirements of Agentic applications. GLM-4.5-Air, with active 106b parameters and 12Bs, provides similar restructured skills and computing Footprint.
Integrated consultation: Two types of one frame
Both models inform a Hybrid consultation method:
- Imaginative mode: Enabling the complex action, the use of tools, plans that are variable, and private agents.
- Untonable mode: Fast, countless answers, which makes flexible models for converted and quick answering charges.
The Dual-Mode project has considered the flow of the integrated work of a comprehension work and joint venture requirements within one model, enabling the following agents.
The benches of work
Z.ai Benchmatic FLM-4.5 on General Inspections of Industry (including Imbal, GSM8K, Hostereval):
- GLM-4.5: Benchmark Score Score of 63.2, placed on a third party (second worldwide, top between all open models).
- GLM-4.5-Air: Brings 59.8 competition as a leader between ~ 100B-parameter models.
- Antergorms are noteworthy Outperforms in specific areas: Successful Level 90.6%, Outperform Claude 3.5 Sonnet and Kimi Kim.
- Effects mainly in Chinese language activities and codes, through unsoted side effects in all open benches.

Agentic Power and Building
GLM-4.5 progress “Agent-native“Design: Basic agentitic performance (Reasoning, planning, actions) formed directly in the construction of the model. This means:
- Distancing of many jobs and planning
- Use of tools and co-authors with an external API
- The observation of the complex data and operating management of the work
- Traditional Support of Conservation Minds of Understanding Complances


These power enables the finalized last storage and stored e-stored ePut-stored eggs, which have thousands or apartments closed.
Efficiency, speed, and cost
- The imaginary guess & many predictions of Token (MTP): Through features such as MTP, GLM-4.5 Access to 2.5 × -8 × 2 × 2 × 2 × 2 × 2 × -8.
- Memory & Hardware: The active GL-4.5-Air's 12B design is compatible with Consumer GPUS (32-64GB VRAM) and can be measured to fit the broader hardware. This enables higher performance llms to accompany advanced users.
- Putting higher property: API Calls Starts as $ 0.11 for the installation tokens for each million and $ 0.28 with Imizing Output Tokens-pricing in the rate industry and quality donated.
Access to open source and ecosystem
Keyst of the GLM-4.5 series are License open to source: Basic Models, Hybrid Models (Thinking / Defense), and FP8 types are all issued with prohibited commercial and second advancement. Code, PARSERS Tools, and consultation engines are included in large llm organizations, including transformers, VLLM, and Sgglang, with detailed repostories.
Models can be used in large self-sacrifices, with good support and completely shipment. This level of openness and variable is very comparable with the closed rival and closed rival.
Important Technologies
- Multi-Token Prediction (MTP) The base of thinking predicted, highly strengthening the measurement speed in the CPUS and GPUS.
- Integrated construction of consultation, codes, and mealmodal action-action.
- He has been trained 15 billion tokens, with 128k installation support and 96k windows.
- Quick compliance with reproduction and production tools, including redemption and synchronizing models for new cases for use.
In summary, GLM-4.5 and GLM-4.5-AIR They represent a great source of open source, agentic, and basic form models. They set new access levels, work, and integrated understanding skills – to provide the main spinal core of the next generation of intelligent providers and developer applications.
Look GLM 4.5, GLM 4.5 Air, GitHub page and technical details. All credit for this study goes to research for this project. Also, feel free to follow it Sane and don't forget to join ours 100K + ml subreddit Then sign up for Our newspaper.
Asphazzaq is a Markteach Media Inc. According to a View Business and Developer, Asifi is committed to integrating a good social intelligence. His latest attempt is launched by the launch of the chemistrylife plan for an intelligence, MarktechPost, a devastating intimate practice of a machine learning and deep learning issues that are clearly and easily understood. The platform is adhering to more than two million moon visits, indicating its popularity between the audience.



![Black Forest Labs Releases FLUX.2 [klein]: Integrated Flow Models for Interactive Visual Intelligence Black Forest Labs Releases FLUX.2 [klein]: Integrated Flow Models for Interactive Visual Intelligence](https://i2.wp.com/www.marktechpost.com/wp-content/uploads/2026/01/blog-banner23-30-1024x731.png?w=390&resize=390,220&ssl=1)
