A natural language processing (NLP) service that uses machine learning to find insights and relationships in text, including sentiment analysis, entity recognition, key phrase extraction, language detection, and PII redaction.
Amazon Comprehend is a fully managed natural language processing (NLP) service from AWS that uses machine learning to extract insights from unstructured text without requiring any ML expertise. It analyzes text to identify sentiment, entities, key phrases, language, syntax, topics, and personally identifiable information (PII), making it a versatile building block for document processing pipelines, customer feedback analysis, and compliance workflows.
The service offers both pre-trained general-purpose models and the ability to train custom classification and entity recognition models using your own labeled data. Pre-trained APIs cover sentiment analysis (positive, negative, neutral, mixed with confidence scores), named entity recognition (people, organizations, locations, dates, quantities, and more), key phrase extraction, language detection across 100+ languages, syntax/POS tagging, and PII detection and redaction for over 30 entity types. Custom models allow teams to build domain-specific classifiers and entity extractors by simply uploading labeled training data — no ML code required.
Amazon Comprehend Medical is a specialized HIPAA-eligible variant designed for clinical and biomedical text. It extracts medical entities such as conditions, medications, dosages, procedures, and anatomical terms, and links them to standard ontologies including ICD-10-CM, RxNorm, and SNOMED CT. This makes it particularly valuable for healthcare organizations processing clinical notes, discharge summaries, and clinical trial documentation.
Pricing follows a pay-as-you-go model starting at $0.0001 per unit (1 unit = 100 characters) for standard NLP APIs, with a generous 12-month free tier covering 50,000 units per month. Custom model training runs at $3 per hour, and real-time custom inference endpoints cost $0.50 per hour. The service integrates natively with the broader AWS ecosystem including S3, Lambda, CloudWatch, KMS, IAM, and Step Functions, enabling teams to build end-to-end NLP pipelines entirely within AWS infrastructure.
Was this helpful?
Teams can train custom text classification models and custom entity recognition models by uploading labeled training data in CSV or augmented manifest format. Comprehend handles all ML pipeline steps automatically — feature engineering, training, hyperparameter optimization, and model evaluation. Models can be deployed to real-time endpoints for synchronous inference or used in asynchronous batch jobs. Built-in metrics (precision, recall, F1) help evaluate model quality before deployment.
Identifies over 30 types of personally identifiable information — including names, addresses, Social Security numbers, credit card numbers, phone numbers, email addresses, dates of birth, bank account numbers, and driver's license numbers. Supports both detection mode (returns entity locations and types with confidence scores) and redaction mode (replaces PII with placeholder tags in the output text), enabling compliance with GDPR, CCPA, and other privacy regulations.
A specialized variant that extracts medical entities such as conditions, medications, dosages, procedures, test results, and anatomical terms from clinical text. Links extracted entities to standard medical ontologies including ICD-10-CM (diagnoses), RxNorm (medications), and SNOMED CT (clinical terms). Operates under AWS BAA for HIPAA-eligible workloads and supports processing of clinical notes, discharge summaries, pathology reports, and clinical trial documentation.
Goes beyond document-level sentiment to identify sentiment expressed toward specific entities mentioned in the text. For example, in a product review mentioning both battery life and screen quality, targeted sentiment can separately classify the sentiment toward each attribute. Returns entity-level sentiment scores (positive, negative, neutral, mixed) with confidence values, enabling granular aspect-based sentiment analysis without custom model training.
Processes large collections of documents stored in Amazon S3 via asynchronous batch jobs, supporting up to 5 GB of input data per job with individual documents up to 100 KB. Results are written back to S3 in JSON format. Supports all standard NLP APIs in batch mode, enabling cost-effective processing of millions of documents without managing infrastructure. Jobs can be monitored via CloudWatch and orchestrated using Step Functions or Lambda triggers.
Free for 12 months
From $0.0001/unit
From $3/hr training + $0.50/hr endpoint
From $0.01/unit
Ready to get started with Amazon Comprehend?
View Pricing Options →We believe in transparent reviews. Here's what Amazon Comprehend doesn't handle well:
Weekly insights on the latest AI tools, features, and trends delivered to your inbox.
In early 2026, Amazon Comprehend added support for enhanced PII detection with improved accuracy for financial document types, expanded language support for sentiment analysis to include additional Southeast Asian languages, and introduced a new flywheel feature for continuous custom model improvement that automatically retrains models as new labeled data becomes available.
No reviews yet. Be the first to share your experience!
Get started with Amazon Comprehend and see if it's the right fit for your needs.
Get Started →Take our 60-second quiz to get personalized tool recommendations
Find Your Perfect AI Stack →Explore 20 ready-to-deploy AI agent templates for sales, support, dev, research, and operations.
Browse Agent Templates →