Z.ai: GLM 4.7 Flash

Provided by OpenRouter

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Specifications

Context Length

202,752 tokens

Input Price

$0.060/M

Output Price

$0.400/M

Capabilities

TextFast

About Z.ai: GLM 4.7 Flash

Strengths

•Large context window (203k tokens) for long conversations
•Fast response times for real-time interactions

Use Cases

•Content creation and writing assistance
•General conversations and Q&A

Limitations

Performance may vary based on query complexity, context length, and task type. Consider using higher-tier models for production-critical applications.

Sample Prompts

Try these prompts to explore Z.ai: GLM 4.7 Flash's capabilities:

Explain quantum computing in simple terms like I'm 10 years old

Write a compelling email asking for a meeting to discuss a project proposal

Help me brainstorm creative solutions for improving team productivity

Tip: Customize these prompts to fit your specific needs and use cases.

Credits required

Z.ai: GLM 4.7 Flash uses tiered credit pricing. Subscribe for a monthly credit allowance, connect your own provider API key (BYOK), or browse lower-cost models on the catalog.

Credit cost per message is shown in the model picker. Economy models typically cost 1 credit; frontier models cost more.

Related Models

Similar models you might be interested in

DeepSeek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

inclusionAI: Ling-2.6-flash

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

StepFun: Step 3.5 Flash

Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token....

Xiaomi: MiMo-V2-Flash

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, adopting hybrid attention architecture. MiMo-V2-Flash supports a...