No free plan. The cheapest way in is the Command R API at $0.15 input / $0.60 output per 1M tokens. Consider free alternatives in the enterprise agents category if budget is tight.
North uses retrieval-augmented generation (RAG) exclusively: it generates answers only from documents you connect, and every response includes inline citations linking to specific source paragraphs. Users can click any citation to verify the original document. Rerank 4 Pro's relevance scoring filters the retrieved documents so that only the most pertinent ones inform each answer, which reduces hallucination risk. This citation-first approach lets compliance teams trace any AI-generated response back to its source data.
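To make the retrieve, rank, and cite flow concrete, here is a minimal toy sketch. It is not Cohere's implementation: the word-overlap score is a simple stand-in for a real reranker such as Rerank 4 Pro, and every name in it (`Passage`, `overlap_score`, `select_sources`, `answer_with_citations`, the `top_k` and `min_score` parameters) is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Passage:
    doc_id: str      # hypothetical document identifier
    paragraph: int   # paragraph number within the document
    text: str


def overlap_score(query: str, text: str) -> float:
    """Toy relevance score: fraction of query words found in the passage.

    Assumption: a stand-in for a real reranking model.
    """
    q = set(query.lower().split())
    t = set(text.lower().split())
    return len(q & t) / len(q) if q else 0.0


def select_sources(query, passages, top_k=2, min_score=0.3):
    """Keep only passages above the threshold, best first, at most top_k."""
    scored = [(overlap_score(query, p.text), p) for p in passages]
    kept = [(s, p) for s, p in scored if s >= min_score]
    kept.sort(key=lambda sp: sp[0], reverse=True)
    return [p for _, p in kept[:top_k]]


def answer_with_citations(query, passages):
    """Build an answer only from selected passages, each tagged with a citation."""
    sources = select_sources(query, passages)
    if not sources:
        # No grounding document: decline instead of generating freely.
        return "No supporting document found.", []
    body = " ".join(f"{p.text} [{i}]" for i, p in enumerate(sources, start=1))
    citations = [
        f"[{i}] {p.doc_id}, paragraph {p.paragraph}"
        for i, p in enumerate(sources, start=1)
    ]
    return body, citations
```

The point of the sketch is the shape of the pipeline: an answer is assembled only from passages that clear the relevance filter, and each statement carries a marker that resolves to a document and paragraph, which is what makes an audit trail possible.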
Yes. Through private deployment options and the Dell Technologies hardware partnership, North can run entirely on customer infrastructure with zero external network connectivity. This air-gapped mode supports defense, intelligence, and classified government environments. The on-premises package includes Command models, Compass search, Agent Studio, and all enterprise connectors, with fine-tuning capabilities for proprietary data. Implementation typically takes 8-12 weeks including hardware provisioning, model deployment, and data source integration.
Microsoft Copilot integrates deeply with Office 365 but requires Azure and Microsoft's cloud infrastructure, locking organizations into a specific ecosystem. North offers deployment flexibility across cloud, hybrid, and on-premises environments, making it better suited for regulated industries with data residency requirements. North also provides native multilingual support across 100+ languages without separate deployments and uses citation-grounded responses for verifiability. However, Copilot has a larger integration ecosystem within the Microsoft stack and lower friction for organizations already standardized on Microsoft tools.
North is powered by Cohere's Command family. Command R+ handles complex reasoning at $2.50 per million input tokens and $10.00 per million output tokens, while Command R serves high-volume queries at $0.15 per million input tokens and $0.60 per million output tokens. Both models support 256K context windows and 100+ languages. Rerank 4 Pro adds semantic search at $2.00 per 1,000 searches. For predictable billing, Model Vault dedicated instances run $4-5 per hour or $2,500-$3,250 per month. Full North platform pricing (including Agent Studio and Compass) requires a custom enterprise quote.
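For a rough sense of how the usage-based rates add up, the sketch below estimates a monthly API bill from the prices quoted above. The workload volumes in the usage note are purely illustrative, and the function name and dictionary keys are hypothetical labels, not official model identifiers.

```python
# Per-1M-token prices (USD) as quoted on this page; verify before budgeting.
# Keys are illustrative labels, not official API model IDs.
PRICES = {
    "command-r": {"input": 0.15, "output": 0.60},
    "command-r-plus": {"input": 2.50, "output": 10.00},
}
RERANK_PER_1K_SEARCHES = 2.00  # Rerank 4 Pro price quoted on this page


def monthly_cost(model, input_tokens, output_tokens, searches=0):
    """Estimate monthly spend in USD for token usage plus rerank searches."""
    p = PRICES[model]
    token_cost = (
        (input_tokens / 1_000_000) * p["input"]
        + (output_tokens / 1_000_000) * p["output"]
    )
    rerank_cost = (searches / 1_000) * RERANK_PER_1K_SEARCHES
    return round(token_cost + rerank_cost, 2)
```

With an assumed workload of 50M input tokens, 10M output tokens, and 20,000 searches per month, `monthly_cost("command-r", 50_000_000, 10_000_000, 20_000)` comes to $53.50, while the same volume on `"command-r-plus"` comes to $265.00. In both cases $40.00 of the total is rerank searches.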
Standard cloud deployments typically complete within 4-6 weeks, including data connector setup, agent configuration, and user training. Hybrid deployments run 6-8 weeks due to additional infrastructure coordination. Full on-premises or air-gapped deployments take 8-12 weeks, accounting for hardware provisioning, network configuration, model deployment, and security certification. Cohere provides dedicated implementation engineers and a structured onboarding process for all enterprise deployments, with ongoing technical support post-launch.
Last verified March 2026