The On-Device AI Platform

The most reliable way for developers to run foundation models locally.
Ship real-time, private, predictable AI with the Argmax SDK.

Go On-Device

Argmax SDK

Start with Open-source. Scale with Pro.

WhisperKit PRO

Ship frontier speech-to-text models on device with WhisperKit Pro

WhisperKit Pro is a turn-key SDK that brings frontier speech-to-text capabilities to your user's devices, faster and more accurate than most cloud APIs.

Read our research paper.
Try it on our Playground app.

SpeakerKit PRO

Ship state-of-the-art speaker recognition on device

SpeakerKit Pro is an evolution of Pyannote optimized for speed without compromising accuracy.

Read the research paper.
Try it on our demo App.

DiffusionKit

DiffusionKit Pro allows you to run the most advanced diffusion models on device.

We’ve converted the most advanced diffusion models available in order to run on Apple Silicon with CoreML and MLX. DiffusionKit Pro is currently undegoing development.

COPY

Copied

COPY

Copied

COPY

Copied

COPY

Copied

Subscribe for Updates

Articles

Read more
iPhone 17

Apple has redesigned the iPhone for the on-device AI age. In our real-world benchmarks, iPhone 17 Pro is already up to 3.1x faster than iPhone 16 Pro on iOS 26 for large Transformer model inference on the GPU with the new Neural Accelerators. Despite the fact that the Neural Engine improved only by 25% for the same workloads, it remains the clear choice for on-device inference due to faster inference with better energy-efficiency, all-day battery life and no resource contention with traditional workloads.

September 21, 2025

Benchmarks

Interspeech 2025

Argmax is a research-driven on-device AI developer tools company. At Interspeech 2025, Argmax had an oral presentation at the Speaker Diarization track, highlighting the leading speed and accuracy of SpeakerKit.

August 17, 2025

Research

Argmax Local Server

Argmax Local Server brings the on-device AI capabilities of Argmax SDK to an even wider market without SDK-level integration! The server is ideal for apps migrating from cloud APIs for real-time transcription and feature-complete for AI Meeting Notes apps. Available starting today with Python, JavaScript, and Rust clients!

August 8, 2025

Product