S3 Glacier, Lifecycle policies, and Replication
S3 Glacier is for data you need to retain but rarely access. S3 Glacier Instant Retrieval delivers data within milliseconds for archives accessed occasionally. S3 Glacier Flexible Retrieval offers expedited retrieval in 1-5 minutes, standard in 3-5 hours, or bulk in 5-12 hours, at a lower per-GB cost. S3 Glacier Deep Archive targets data that must be kept for regulatory reasons but is almost never accessed: retrieval takes 12 hours for standard and 48 hours for bulk, but the storage cost is the lowest in AWS, less than a dollar per terabyte per month.
Lifecycle policies move objects between storage classes automatically based on age. You define rules: objects in the Standard class transition to Standard-IA after 30 days, then to Glacier Flexible Retrieval after 90 days, then expire (delete) after 7 years. This is how you implement data retention policies at scale without writing any code. Lifecycle policies apply to current versions, previous versions (when versioning is enabled), and incomplete multipart uploads.
S3 Replication copies objects between buckets automatically. Cross-Region Replication (CRR) copies to a bucket in a different region, useful for disaster recovery, compliance with data residency requirements in multiple geographies, or serving users closer to a secondary region. Same-Region Replication (SRR) copies to a bucket in the same region, useful for maintaining a separate log aggregate or keeping test environments in sync with production. Replication requires versioning enabled on both source and destination buckets. Only new objects created after replication is configured are replicated by default.
S3 Object Lock, EFS, FSx, and Storage Gateway
S3 Object Lock implements WORM (Write Once, Read Many) storage. Once locked, an object version cannot be deleted or overwritten for a specified retention period. This is a regulatory requirement for financial records, healthcare data, and legal archives. Governance mode allows users with special IAM permissions to bypass locks. Compliance mode allows no one, not even the root account, to delete locked objects before expiration. Vault Lock in Glacier provides equivalent functionality for Glacier vaults.
EFS (Elastic File System) is a managed NFS (Network File System) file share that multiple EC2 instances across multiple Availability Zones can mount simultaneously. EFS grows and shrinks automatically as you add and remove files. It is ideal for shared storage between Linux instances: web servers sharing uploaded content, analytics jobs reading the same dataset, or containerized applications needing persistent storage. FSx for Windows File Server provides managed Windows file shares using the SMB protocol, with Active Directory integration, for Windows workloads that need a native Windows file server. FSx for Lustre is a high-performance parallel file system for HPC, machine learning, and data processing workloads that need hundreds of gigabytes per second of throughput.
Storage Gateway bridges on-premises environments to AWS storage. File Gateway presents an NFS or SMB interface locally; files written to it are stored as objects in S3. Volume Gateway creates iSCSI block storage volumes locally backed by AWS. Tape Gateway replaces physical tape libraries with a virtual tape library interface that stores data in S3 and Glacier. The use case for all three is hybrid environments: organizations that want cloud storage economics while keeping existing on-premises applications and workflows.
How to choose the correct answer
Glacier Instant Retrieval: millisecond access for quarterly archives. Flexible Retrieval: minutes to hours for less frequent access. Deep Archive: 12-48 hour retrieval, lowest cost, regulatory long-term retention.
Lifecycle policies: automate storage class transitions and object expiration based on object age.
CRR: cross-region replication for DR, compliance, or geographic distribution. SRR: same-region, log aggregation, dev/test sync.
Object Lock Compliance mode: no one can delete, not even root, before expiration. Governance mode: privileged users can override.
EFS: shared NFS for Linux instances across multiple AZs, auto-scales. Use for multi-instance shared file access.
FSx for Windows: managed SMB shares with Active Directory for Windows workloads.
FSx for Lustre: high-performance parallel file system for HPC and ML. Can integrate with S3 as a data repository.
Storage Gateway: hybrid cloud storage bridge. File, Volume, or Tape depending on interface needed.