Granite
Open, performant and trusted AI models built for business.

Start building with Granite 4.0, our family of open, performant and trusted AI models, tailored for business and optimized to scale your AI applications.

Our Price: Request a Quote

Please Note: All Prices are Inclusive of GST

Overview:

Meet Granite

Build and scale AI faster with customizable, open-source models optimized for enterprise workloads, cost efficiency, and flexible deployments.

Open

Open source under Apache 2.0, Granite ensures transparency while enabling full customizability and deployment flexibility across any infrastructure.

Performant

The small, high-performing models are designed to maximize efficiency and scalability for essential enterprise tasks.

Trusted

Eliminate the risk of “black box” AI with transparency into training data and processes, harm detection capabilities and built-in guardrails.

Features:

Introducing Granite 4.0

Meet the models

Granite 4.0 Nano

For lightweight, local, and edge AI tasks where compute and connectivity are limited.

Granite 4.0 Micro & Tiny

For high-volume, low-complexity tasks where speed, cost, and efficiency are the top priority.

Granite 4.0 Small

For enterprise workflows that require stronger performance without the cost of frontier models.

By the Numbers

70%+ reduction in memory requirements

2x faster inferencing speeds

Performance and efficiency

Granite 4.0 is engineered for efficiency, using less memory while delivering faster speeds and high performance. This balance allows enterprises to reduce costs and scale solutions faster across critical workloads.

Memory Usage

Granite 4.0 models are designed to do more with less. They use over 70% less memory than similar models, so organizations can run powerful AI on more affordable hardware. That means lower infrastructure costs, faster performance, and the ability to scale AI more easily across the business.
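
To make the memory figure concrete, here is a small arithmetic sketch of what a 70% reduction implies. The 80 GB baseline is a hypothetical number chosen for illustration, not a measured result, and the function name is made up for this example. Because the stated reduction is "over 70%", the result is an upper bound on the memory needed.

```python
def memory_after_reduction(baseline_gb: float, reduction: float = 0.70) -> float:
    """Return the memory needed after cutting `baseline_gb` by `reduction` (a fraction)."""
    if not 0.0 <= reduction <= 1.0:
        raise ValueError("reduction must be a fraction between 0 and 1")
    return baseline_gb * (1.0 - reduction)


# Hypothetical baseline: a comparable model that needs 80 GB for a given workload.
baseline_gb = 80.0
reduced_gb = memory_after_reduction(baseline_gb)

# A 70% reduction leaves 30% of the baseline.
print(f"{baseline_gb:.0f} GB -> {reduced_gb:.0f} GB")  # prints "80 GB -> 24 GB"
```

Under that assumption, a workload that previously needed an 80 GB accelerator would fit in 24 GB or less, which is the kind of shift to more affordable hardware the paragraph above describes.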

Inference Speed

Granite 4.0 delivers consistently high throughput as workloads scale, handling larger batch sizes with ease while other models slow down. This ensures enterprises can maintain reliable performance for applications that need to serve many users or complex tasks at once.

General Accuracy

Granite 4.0 delivers stronger accuracy with far lower memory requirements than competing models, even at smaller sizes. That efficiency translates into cost savings, greater accessibility, and the ability to deploy enterprise AI more widely and flexibly.

RAG Performance

Granite 4.0 outperforms both similarly sized and larger open models on RAG tasks. By delivering higher accuracy without demanding extra infrastructure, Granite helps enterprises build more reliable, knowledge-grounded applications while keeping deployments efficient and cost-effective.

Instruction Following

Granite 4.0 demonstrates industry-leading instruction-following performance among open models, an essential capability for agentic workflows. By balancing strong accuracy with smaller size, Granite provides enterprises with high-quality outputs for complex tasks at lower infrastructure costs than larger open models.

Pricing Notes:

Pricing and product availability subject to change without notice.