🎓 AWS DVA-C02 STUDY GUIDE - COMPREHENSIVE EDITION

Question

Q1: Troubleshooting Scenario

Scenario: Your Lambda function processes S3 image uploads. Users report that some large images (> 5MB) aren't being processed, but CloudWatch shows no errors. What's the most likely issue?

💭 Think first, then expand answer

Answer 1

🎯 Correct Answer:

Lambda timeout is too short (default 3s)

🔍 Why other answers wrong:

IAM permissions: Would show errors in CloudWatch ❌
S3 event not configured: No images would be processed ❌
Memory too low: Would show OOM (Out of Memory) errors ❌
Concurrent limit: Would show throttling errors ❌

📝 Exam Tip:

"Silent failures" in async Lambda = timeout issue

Large files take longer → timeout before completion → no error logged

🛠️ How to fix:

1. CloudWatch Logs → Search "Task timed out"
2. Lambda Configuration → Increase timeout (e.g., 60s)
3. OR optimize code to process faster
4. OR use Step Functions for long-running tasks

Answer 2

🎯 Best Answers (Multiple correct):

Request limit increase via AWS Support (Recommended)
- Simple, direct solution
- Usually approved within 24-48h
- Can request up to tens of thousands
Use SQS to buffer requests (Alternative)
- Throttle processing rate
- Lambda pulls from queue at manageable rate
- Good for variable traffic
Reserved concurrency for critical functions
- Guarantee capacity for important functions
- Other functions share remaining capacity

❌ Wrong Answers:

Split into multiple AWS accounts: Overkill, management overhead ❌
Use EC2 instead: Defeats serverless purpose ❌
Provisioned concurrency: Keeps warm, doesn't increase limit ❌

📝 Exam Keywords:

If question says...	Answer likely involves...
"Exceeding concurrent limit"	Request limit increase OR SQS buffer
"Throttling errors 429"	Increase limit OR reserved concurrency
"Variable traffic patterns"	SQS buffering

Answer 3

💰 Current Cost:

Compute: 1M × 2s × 512MB = 1M GB-seconds
After free tier (400K): 600K GB-seconds
Cost: 600K × $0.0000166667 = ~$10/month

Requests: 1M requests (free tier covers 1M)
Total: ~$10/month

🎯 Optimization Strategies:

Strategy	Impact	Trade-off
INCREASE memory to 1024MB	✅ Faster execution (1s) = 50% cost saving!	Counterintuitive but works
Optimize code	✅ Reduce runtime directly	Dev time investment
Use Lambda Layers	✅ Smaller package = faster cold start	None, best practice
Batch processing	✅ Fewer invocations	Higher latency per request
❌ Decrease memory	❌ Slower = MORE cost	Don't do this!

📝 Exam Answer:

"Increase memory allocation" - more CPU = faster = cheaper overall

Lambda pricing = memory × time, so reducing time can offset memory cost

🧮 Proof:

512MB × 2s = 1024 MB-seconds
1024MB × 1s = 1024 MB-seconds (same!)

But 1024MB × 0.8s = 819 MB-seconds (20% cheaper!)

Answer 4

🎯 Complete Solution:

Lambda in VPC
- Configure VPC, subnets, security groups
- Lambda gets ENI in your VPC
RDS Security Group
- Allow inbound from Lambda security group
- Port 3306 (MySQL) or 5432 (PostgreSQL)
NAT Gateway for internet
- Lambda in private subnet
- Route table: 0.0.0.0/0 → NAT Gateway
- NAT Gateway in public subnet
IAM Execution Role
- EC2:CreateNetworkInterface
- EC2:DescribeNetworkInterfaces
- EC2:DeleteNetworkInterface

📊 Architecture:

┌─────────────────────────────────────────────┐
│                 VPC                          │
│  ┌──────────────────┐  ┌──────────────────┐│
│  │ Public Subnet    │  │ Private Subnet   ││
│  │                  │  │                  ││
│  │  NAT Gateway ────┼──▶ Lambda ────────┐ ││
│  │       │          │  │       │         │ ││
│  └───────┼──────────┘  └───────┼─────────┼─┘│
│          │                     │         │  │
│     Internet                   ▼         ▼  │
│     Gateway                   RDS    External│
│                                         API  │
└─────────────────────────────────────────────┘

❌ Common Mistakes:

Lambda in public subnet: Still can't reach internet without NAT ❌
No NAT Gateway: Can't call external API ❌
Missing IAM ENI permissions: Lambda can't create network interface ❌

💡 Cost Optimization:

VPC Endpoints for AWS services (no NAT needed for S3, DynamoDB, etc.)

S3 VPC Endpoint → No NAT charges
DynamoDB VPC Endpoint → No NAT charges
Only external APIs need NAT Gateway

📝 Exam Keywords:

"VPC + internet access" = NAT Gateway required
"Private RDS access" = Lambda in VPC, security group rules
"Minimize cost" = VPC Endpoints for AWS services

Trigger	Type	Use Case
API Gateway	Sync	REST APIs
S3	Async	File processing
DynamoDB Streams	Stream	Data replication
SQS	Poll-based	Queue processing
EventBridge	Async	Scheduled tasks, event routing
SNS	Async	Fan-out notifications
CloudWatch Logs	Async	Log processing

Type	Retry Behavior	Triggers	Use When
Sync	Client retries	API Gateway, ALB, direct invoke	Need immediate response
Async	Lambda retries 2x (3 total)	S3, SNS, EventBridge, SES	Fire-and-forget, background jobs
Stream	Retry until success/expire	Kinesis, DynamoDB Streams, SQS	Ordered processing, queue polling

Mistake	Why Wrong	Correct Approach	Exam Keyword
Hardcode credentials in env vars	Security risk, rotation breaks	Use IAM roles + Secrets Manager/Systems Manager	"Store DB password" → ❌ env vars
Not setting timeout correctly	Default 3s too short	Set timeout > expected duration, max 15min	"Silent failures" = timeout issue
Ignoring cold starts	Latency spikes hurt UX	Provisioned concurrency OR optimize package size	"Latency-sensitive" = provisioned
VPC Lambda without NAT	Cannot reach internet endpoints	Add NAT gateway OR use VPC endpoints for AWS services	"VPC + internet" = NAT gateway
Not using DLQ for async	Failed events lost after retries	Configure SQS/SNS DLQ to capture failures	"Prevent data loss" = DLQ
Putting too much in /tmp	/tmp cleaned between invocations	Use S3 or EFS for persistent storage	"Share data between invocations" = S3/EFS
Not handling throttling	429 errors crash application	Exponential backoff + retry logic OR SQS buffer	"Too many requests" = throttling
Using wrong invocation type	Sync when should be async	Async for long-running, sync for immediate response	"> 29s processing" = must use async

Red Flag	Why Wrong	Correct Answer
"Store credentials in env vars"	Security risk	Use IAM roles + Secrets Manager
"Store large files in /tmp"	/tmp ephemeral, 512MB-10GB limit	Use S3 or EFS
"Use provisioned concurrency for cost"	More expensive	Provisioned = performance, not cost
"Lambda in public subnet"	Still need NAT for internet	Private subnet + NAT Gateway
"Increase memory to reduce cost"	Sounds wrong but...	✅ CORRECT! Faster = cheaper overall

Limit	Value	Exam Frequency
Max timeout	900s (15 min)	⭐⭐⭐⭐⭐ Very High
Default timeout	3s	⭐⭐⭐⭐⭐ Very High
Memory range	128MB - 10GB	⭐⭐⭐⭐ High
Account concurrency	1000 (default)	⭐⭐⭐⭐ High
Max layers	5	⭐⭐⭐ Medium
Package size	250MB unzipped	⭐⭐⭐ Medium
/tmp storage	512MB - 10GB	⭐⭐⭐ Medium
Async retries	2 (3 total attempts)	⭐⭐⭐⭐ High
API Gateway timeout	29s	⭐⭐⭐⭐⭐ Very High

Class	Use Case	Retrieval Time
Standard	Frequent access	Milliseconds
Intelligent-Tiering	Unknown access patterns	Milliseconds
Standard-IA	Infrequent access	Milliseconds
One Zone-IA	Infrequent, non-critical	Milliseconds
Glacier Instant	Archive, immediate access	Milliseconds
Glacier Flexible	Archive, min to hours	Minutes to hours
Glacier Deep	Long-term archive	12 hours

Feature	User Pools	Identity Pools
Purpose	Authentication	Authorization (AWS access)
Output	JWT tokens	Temporary AWS credentials
Use Case	Login to app	Access AWS services from app

Scenario	Use Lambda	Use ECS/Fargate	Use EC2
Event-driven, short tasks	✅ Perfect fit	❌ Overkill	❌ Too much overhead
Long-running processes (> 15min)	❌ 15min limit	✅ No time limit	✅ No time limit
Microservices architecture	✅ Good (simple)	✅ Better (complex)	⚠️ Manual setup
Need custom runtime/libraries	⚠️ Layers or container	✅ Full Docker support	✅ Total control
Cost optimization priority	✅ Pay per request	⚠️ Always running	⚠️ Always running
Predictable, steady traffic	⚠️ Can be expensive	✅ Better cost/performance	✅ Reserved instances

Scenario	Use S3	Use EFS	Use EBS
Object storage (images, videos)	✅ Perfect fit	❌ Wrong use case	❌ Wrong use case
Shared file system (multiple instances)	⚠️ Not a file system	✅ NFS protocol	❌ Single instance only
Lambda needs persistent storage	✅ Simple integration	✅ Mount as file system	❌ Can't attach
Database storage (EC2)	❌ Not designed for this	⚠️ Can work but slow	✅ Block storage
Serverless application	✅ Native integration	✅ Lambda can mount	❌ Not serverless

Scenario	Use DynamoDB	Use RDS	Use ElastiCache
Simple key-value access	✅ Fast, scalable	⚠️ Overkill	⚠️ For caching only
Complex SQL queries, JOINs	❌ No SQL support	✅ Full SQL	❌ Not a database
Need ACID transactions	✅ Has transactions	✅ Full ACID	❌ Not transactional
Millisecond latency required	✅ Single-digit ms	⚠️ 5-10ms typical	✅ Sub-ms with cache
Reduce database load	⚠️ Not for caching	⚠️ Not for caching	✅ Cache layer
Session storage	✅ Can work	⚠️ Overkill	✅ Redis perfect
Unpredictable scaling	✅ Auto-scales	⚠️ Manual scaling	⚠️ Manual scaling

Scenario	Use SQS	Use SNS	Use EventBridge	Use Kinesis
Decouple components	✅ Pull model	✅ Push model	✅ Event routing	⚠️ Overkill
Fan-out (1 to many)	❌ 1:1 only	✅ Perfect for this	✅ With rules	✅ Multiple consumers
Message ordering required	✅ FIFO queue	⚠️ With FIFO topic	❌ No guarantee	✅ Per shard
Real-time stream processing	❌ Not streaming	❌ Not streaming	⚠️ Simple events	✅ High throughput
Replay messages	❌ Deleted after read	❌ No replay	❌ No replay	✅ 24h-365 days
Multiple consumers same data	❌ Deleted after read	✅ Each gets copy	✅ Multiple targets	✅ Each reads stream
Email/SMS notifications	❌ No built-in	✅ Native support	⚠️ Via SNS	❌ No built-in
Event-driven architecture	✅ Simple	✅ Simple	✅ Advanced routing	⚠️ For streams

Feature	REST API	HTTP API	WebSocket API
Use case	Full-featured APIs	Simple, low-cost APIs	Real-time bidirectional
Cost	$$$ (Higher)	$ (70% cheaper)	$$ (Per connection)
Request validation	✅ Built-in	❌ Manual	❌ Manual
Caching	✅ Built-in	❌ Not available	❌ Not available
Usage plans & API keys	✅ Yes	❌ No	❌ No
Performance	Good	Better (lower latency)	Persistent connection
Exam recommendation	Default choice	If "cost-effective" mentioned	If "real-time", "chat", "push"

Aspect	Global Secondary Index (GSI)	Local Secondary Index (LSI)
Partition Key	✅ Different from base table	❌ SAME as base table
Sort Key	✅ Different or none	✅ Different from base table
When to create	✅ Anytime (add/delete)	❌ Table creation ONLY
Capacity	✅ Own RCU/WCU (separate)	❌ Shares with base table
Consistency	❌ Eventually consistent only	✅ Eventually OR strongly
Max per table	20	5
Use case	Query by different attributes	Query same PK, different sort
Exam default	✅ Use this unless specified	⚠️ Rare, specific scenarios

If scenario mentions...	Answer likely involves...	Why
Sporadic traffic, unpredictable	Lambda (not EC2)	Pay per request vs always running
Simple API, no advanced features	HTTP API (not REST API)	70% cheaper
Infrequent access data	S3-IA or Glacier	Lower storage cost
Reserved capacity, predictable	Provisioned (not on-demand)	Upfront commitment = discount

If scenario mentions...	Answer likely involves...	Why
Database queries slow	ElastiCache (Redis/Memcached)	In-memory cache
DynamoDB slow queries	DAX (DynamoDB Accelerator)	Microsecond latency
Lambda cold starts	Provisioned concurrency	Keep functions warm
Global users, slow content	CloudFront CDN	Edge caching

If scenario mentions...	Answer likely involves...	Why
Database availability	RDS Multi-AZ	Automatic failover
Lambda reliability	Multiple AZs (automatic)	Lambda is multi-AZ by default
Application resilience	Multi-region deployment	Region failure protection
Load balancing	ALB + multiple AZs	Distribute traffic

Keyword	Think
"Real-time"	Kinesis, Lambda, WebSocket API
"Serverless"	Lambda, API Gateway, DynamoDB
"Cost-effective"	On-demand pricing, auto-scaling, S3 lifecycle
"High availability"	Multi-AZ, DynamoDB global tables, S3
"Decouple"	SQS, SNS, EventBridge
"Ordered processing"	SQS FIFO, Kinesis (per shard)
"Temporary credentials"	STS, IAM roles, Cognito Identity Pools
"Least privilege"	IAM policies with specific actions/resources
"Audit trail"	CloudWatch Logs, CloudTrail, X-Ray
"Rollback"	Lambda aliases, CodeDeploy blue/green

Operation	Capacity
1 RCU	1 strongly consistent read/s (≤4KB)
1 RCU	2 eventually consistent reads/s (≤4KB)
1 WCU	1 write/s (≤1KB)

Limit	Value
Throttle	10,000 RPS
Burst	5,000 requests
Timeout	29 seconds
Payload	10MB

If scenario mentions...	Answer likely involves...	Why
Components shouldn't wait	SQS between services	Async processing
One producer, many consumers	SNS fan-out	Publish/subscribe
Service failures shouldn't cascade	SQS + DLQ	Buffer + retry
Event-driven architecture	EventBridge	Event routing

If scenario mentions...	Answer likely involves...	Red flags to avoid
Store database credentials	Secrets Manager or Systems Manager	❌ Environment variables
API authentication	IAM or Cognito	❌ API keys alone
Encrypt data at rest	KMS encryption	❌ Client-side only
Access AWS resources	IAM roles (not keys)	❌ Hardcoded access keys

Feature	Kinesis	SQS
Ordering	Per shard	FIFO queue only
Retention	Up to 365 days	Up to 14 days
Consumers	Multiple read same data	Message deleted after read
Use Case	Real-time analytics, log streaming	Decouple components

Limit	Value
Memory	128MB - 10,240MB
Timeout	900s (15 min)
/tmp storage	512MB - 10GB
Deployment package	50MB (zipped), 250MB (unzipped)
Concurrent executions	1000/region (default)

Limit	Value
Message size	256KB
Visibility timeout	0s - 12h (default 30s)
Retention	1 min - 14 days (default 4 days)
Delay	0s - 15 min
FIFO throughput	300 TPS (3000 with batching)

Domain	Weight	Questions (~)	Pass Target (720/1000)
Development with AWS Services	32%	~21	15+ correct (71%)
Security	26%	~17	12+ correct (71%)
Deployment	24%	~16	11+ correct (69%)
Troubleshooting & Optimization	18%	~11	8+ correct (73%)

🎓 AWS DVA-C02 STUDY GUIDE - COMPREHENSIVE EDITION

📊 EXECUTIVE SUMMARY - Tổng quan bài thi

🎯 Exam Overview

📈 Score Distribution

⚡ Top 5 Services (80% Score)

🎓 Study Strategy

🔑 Service Dependency Map

🧭 NAVIGATION HUB - Lộ trình học tập

📊 Study Progress Tracker

💡 Tip: Sử dụng checklist này

TIER 1 - CRITICAL SERVICES (65-70% điểm)

1. AWS LAMBDA

🔷 AWS LAMBDA - Core Overview

🎯 Exam Weight

⚡ Core Purpose

🔑 Must-Know Topics

🎓 Study Priority

💡 Mental Model: Lambda như "Function Vending Machine"

🎭 Analogy dễ hiểu:

📚 Core Concepts

⚙️ Function Configuration

🔄 Execution Environment

Environment Variables

Layers

Versions & Aliases

Versions:

Aliases:

Concurrency

Reserved Concurrency:

Provisioned Concurrency:

Error Handling

Synchronous invocation:

Asynchronous invocation:

Stream-based invocation:

Dead Letter Queue (DLQ)

Common Triggers

IAM Permissions

Execution Role: Lambda needs này để access AWS resources

Resource-based Policy: Who can invoke Lambda

🧠 Mnemonics & Memory Tricks

Lambda Limits - "LAMBDA TIME"

Invocation Types - "SAS Model"

Concurrency - "RIP" Model

VPC Lambda - "ENI" Rule

⚠️ Common Mistakes & Exam Traps

🔗 Integration Patterns với Lambda

Pattern 1: API-Driven (Synchronous)

Pattern 2: Event-Driven (Asynchronous)

Pattern 3: Queue-Driven (Poll-based)

Pattern 4: Stream Processing (Real-time)

Pattern 5: Scheduled Jobs (Cron-like)

❓ Self-Check Questions - Lambda

Q1: Troubleshooting Scenario

🎯 Correct Answer:

🔍 Why other answers wrong:

📝 Exam Tip:

🛠️ How to fix:

Q2: Concurrency & Scaling

🎯 Best Answers (Multiple correct):

❌ Wrong Answers:

📝 Exam Keywords:

Q3: Cost Optimization

💰 Current Cost:

🎯 Optimization Strategies:

📝 Exam Answer:

🧮 Proof:

Q4: VPC Integration

🎯 Complete Solution:

📊 Architecture:

❌ Common Mistakes:

💡 Cost Optimization:

📝 Exam Keywords:

📝 Exam-Specific Notes: Lambda

🎯 High-Frequency Question Types:

🚩 Red Flag Keywords - Lambda

⚠️ If answer suggests these → Usually WRONG!

⏱️ Time Management Tips:

🎓 Must Memorize Numbers:

2. AMAZON DYNAMODB

Core Concepts