No rate limits. No usage limits. No concurrency limits.
Basic Plan is free and does not require an account or payment method.
Enterprise Plan customers may request custom payment terms and schedules.
Pro Plan is a monthly (or annual) auto-renewing subscription with a 14-day trial period. You will be prompted to add a payment method while signing up at https://app.argmaxinc.com.
During the initial sign-up:
1) You will be charged $14 to validate the payment method and initiate the 14-day trial period.
2) You will also initiate a monthly (or annual) subscription for a 1,000 device license pack. Your first monthly subscription fee will be charged 14 days after initial sign-up unless cancelled.
After your subscription starts:
1) Billing periods for monthly subscriptions follow calendar months and the initial month will be prorated based on the subscription date.
2) Your monthly 1,000 device license pack is applied as a credit to your account balance and each license pack expires after 12 months.
3) Subscription cancellations take effect at the start of the next billing cycle and require 3 days' notice.
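As a rough illustration of the proration mentioned in item 1, here is a minimal sketch. It assumes day-based proration that counts the subscription date itself, which the text above does not specify, and the function name and the fee used in the example are hypothetical placeholders rather than actual Argmax billing logic or prices.

```python
import calendar
from datetime import date


def prorated_first_month_cents(subscription_date: date, monthly_fee_cents: int) -> int:
    """Prorate the first calendar month by days remaining (assumed rule).

    Assumption: the subscription date itself counts as a billable day and
    the result is rounded down to a whole cent.
    """
    days_in_month = calendar.monthrange(subscription_date.year, subscription_date.month)[1]
    days_remaining = days_in_month - subscription_date.day + 1
    return monthly_fee_cents * days_remaining // days_in_month


# Hypothetical fee of $100.00, subscription starting on June 16 (15 of 30 days remain).
first_charge = prorated_first_month_cents(date(2025, 6, 16), 10_000)
print(f"${first_charge / 100:.2f}")
```

Amounts are kept in integer cents to avoid floating-point rounding surprises.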
When you outgrow your 1,000-device license packs (congrats on your success!) in a given billing month, you will be billed in arrears an additional $0.42 for each additional active device license, with the count rounded down to the nearest multiple of 1,000 units. Note that the unit license fee is the same in either case. If you would like to discuss volume-based discounts, please reach out to sales@argmaxinc.com.
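The overage arithmetic can be sketched as follows. This is one reading of the paragraph above, not an official billing formula: it assumes the rounding applies to the number of additional active devices, and it ignores any unused license credit carried over from earlier months. The function and parameter names are hypothetical.

```python
PACK_SIZE = 1_000              # devices per license pack (from the text above)
OVERAGE_CENTS_PER_DEVICE = 42  # $0.42 per additional active device (from the text above)


def overage_charge_cents(active_devices: int, licensed_devices: int = PACK_SIZE) -> int:
    """Charge for devices beyond the licensed count, in cents.

    Assumption: the additional device count is rounded down to the nearest
    multiple of PACK_SIZE before the per-device fee is applied.
    """
    extra = max(0, active_devices - licensed_devices)
    billable = (extra // PACK_SIZE) * PACK_SIZE
    return billable * OVERAGE_CENTS_PER_DEVICE


# 3,400 active devices against one pack: 2,400 extra, 2,000 billable.
print(overage_charge_cents(3_400) / 100)
```

Under this reading, a month with 1,900 active devices incurs no overage charge, because the 900 additional devices round down to zero.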
Argmax takes data security extremely seriously. We are currently in the SOC 2 observation period and will provide our SOC 2 Type II report upon request starting August 2025. Please see our Trust Center for more information.
This is a common misconception because most on-device inference frameworks in the market primarily target the CPU or GPU of the user's device. In order to avoid slowdowns for demanding non-inference workloads such as video conference meetings and games, on-device inference should avoid the CPU and GPU.
In sharp contrast, on-device inference with Argmax SDK targets the NPU (Neural Processing Unit) and avoids resource contention with other workloads. For example, running WhisperKit (NPU) while attending a Zoom meeting (CPU+GPU) should not slow down the Zoom meeting.
TL;DR: If you observed slowdowns in the past with others, try again with Argmax SDK! You should not observe a slowdown. If you do, please contact us on Discord so we can help troubleshoot.
Basic Plan can be used without an internet connection after the initial download of the model files.
Pro Plan (Monthly) requires an internet connection only once every 30 days in order to renew the device license.
Pro Plan (Annual) requires an internet connection only once every 365 days in order to renew the device license.
Enterprise Plan may request custom connectivity requirements (including fully offline).
In sharp contrast, all server-side and most on-device inference providers require internet connectivity for each and every inference request.
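To make the renewal windows above concrete, here is a minimal sketch of how an app might estimate whether a device is due to reconnect. It is illustrative only and is not how the Argmax SDK checks licenses. The plan keys and the function name are hypothetical, and only the 30-day and 365-day figures come from the text above.

```python
from datetime import datetime, timedelta

# Renewal windows taken from the plan descriptions above; the keys are hypothetical.
RENEWAL_WINDOW_DAYS = {"pro_monthly": 30, "pro_annual": 365}


def license_renewal_due(plan: str, last_renewed: datetime, now: datetime) -> bool:
    """Return True once the plan's renewal window has fully elapsed (assumed rule)."""
    window = timedelta(days=RENEWAL_WINDOW_DAYS[plan])
    return now - last_renewed >= window


last_renewed = datetime(2025, 1, 1)
print(license_renewal_due("pro_monthly", last_renewed, datetime(2025, 1, 20)))
print(license_renewal_due("pro_monthly", last_renewed, datetime(2025, 2, 5)))
```

An app with long offline sessions could use a check like this to prompt the user to reconnect before the window closes.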
Pro Plan is priced per-device or per-user per month with unlimited usage, making your costs predictable. In sharp contrast, server-side inference providers follow usage-based pricing, introducing unpredictability.
Enterprise Plan includes volume discount tiers with significantly reduced unit costs at scale.
For each end-user device with a valid license: There is no rate limit. There is no concurrency limit. There is no usage volume limit.
Pro SDK is continually benchmarked for regression testing on a large fleet of real devices.
Open-source SDK is also benchmarked for each release and the results are uploaded to WhisperKit Benchmarks.
As new devices and operating system versions are introduced to the market, we systematically update our testing fleet to ensure that we remain current with the dynamic end-user device footprint. This enables developers to validate the Pro SDK performance against their specific user base.
Argmax is committed to maintaining open-source projects such as WhisperKit to increase the adoption of FMOD technology by everyone. Argmax is also committed to serving its customers with cutting-edge performance improvements and advanced features built on top of open-source exclusively as part of the Pro SDK.
Pro SDK follows an open-core architecture in which it extends the Open-source SDK. This architecture was explicitly designed to facilitate seamless upgrades and downgrades between the Basic Plan (Open-source SDK) and the Pro Plan (Pro SDK). Parts of the Pro SDK may be open-sourced over time.
For details, please see https://app.argmaxinc.com/docs/wiki/open-source-vs-pro-sdk.
Argmax believes that privacy is a fundamental human right, so every Argmax product and service is designed to minimize the collection and use of your data and use on-device processing whenever possible.
Pro Plan end-user devices may communicate with our API for software licensing and performance telemetry purposes. The data schema is available to active subscribers of the Pro Plan upon request.
Enterprise Plan may request custom behavior such as air-gapped deployment with zero data collection.
Across all plans, Argmax does not collect any personally identifiable information (PII).
Argmax does not use any data it collects for training any models.
Basic Plan is best for academics and hobbyists to tinker with, build on top of and/or even commercialize open-source Argmax FMOD technology. The MIT license is friendly to this cohort.
We are continuously improving existing products to maintain our bleeding-edge performance and accuracy. We are also actively working on real-time text-to-speech and language model framework products.
Please inquire at info@argmaxinc.com if you have a use case or a roadmap request!