# Motivation

## The AWS S3 Durability Model
AWS S3 is renowned for providing 11 nines (99.999999999%) of durability. This guarantee is achieved through a robust architecture that maintains at least 3 copies of your data across different Availability Zones (AZs) within a region. Each AZ consists of one or more physically separate data centers with independent power, cooling, and networking.
```mermaid
graph TB
    User[User]
    subgraph Region["AWS Region"]
        Endpoint[Regional Endpoint]
        subgraph AZ1["Availability Zone 1"]
            DC1A[Data Center 1A]
            DC1B[Data Center 1B]
        end
        subgraph AZ2["Availability Zone 2"]
            DC2A[Data Center 2A]
            DC2B[Data Center 2B]
        end
        subgraph AZ3["Availability Zone 3"]
            DC3A[Data Center 3A]
            DC3B[Data Center 3B]
        end
    end
    User -->|Request| Endpoint
    Endpoint -->|Write Data| AZ1
    Endpoint -->|Write Data| AZ2
    Endpoint -->|Write Data| AZ3
```
This architecture ensures that if an entire data center experiences a catastrophic failure, your data remains safe and accessible. For even greater protection, AWS also offers cross-region replication, allowing data to be replicated across geographically distant regions.
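For reference, cross-region replication can be enabled on an existing bucket with a single API call. The following is a minimal sketch using boto3's `put_bucket_replication`; the bucket names and IAM role ARN are placeholders, and versioning must already be enabled on both the source and destination buckets.

```python
import boto3

s3 = boto3.client('s3')

# Minimal sketch: replicate every object in the source bucket to a bucket in
# another region. Bucket names and the IAM role ARN are placeholders; both
# buckets must already have versioning enabled.
s3.put_bucket_replication(
    Bucket='my-source-bucket',
    ReplicationConfiguration={
        'Role': 'arn:aws:iam::123456789012:role/my-replication-role',
        'Rules': [{
            'ID': 'replicate-all',
            'Priority': 1,
            'Filter': {},  # empty filter = apply to all objects
            'Status': 'Enabled',
            'DeleteMarkerReplication': {'Status': 'Disabled'},
            'Destination': {'Bucket': 'arn:aws:s3:::my-dr-bucket'},
        }],
    },
)
```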
## The Budget Provider Model
In contrast, budget-friendly S3-compatible providers like Backblaze B2 typically achieve durability through erasure coding within a single data center rather than replicating complete copies across multiple physical locations.
```mermaid
graph TB
    User[User]
    subgraph DC["Single Data Center"]
        Endpoint[Storage Endpoint]
        subgraph Vault["Backblaze Vault (20 Storage Pods)"]
            Pod1[Pod 1<br/>Shard 1]
            Pod2[Pod 2<br/>Shard 2]
            Pod3[Pod 3<br/>Shard 3]
            PodDots[...]
            Pod17[Pod 17<br/>Shard 17]
            Pod18[Pod 18<br/>Parity 1]
            Pod19[Pod 19<br/>Parity 2]
            Pod20[Pod 20<br/>Parity 3]
        end
    end
    User -->|Request| Endpoint
    Endpoint -->|17 Data Shards| Pod1
    Endpoint -->|+| Pod2
    Endpoint -->|+| Pod3
    Endpoint -->|+| PodDots
    Endpoint -->|+| Pod17
    Endpoint -->|3 Parity Shards| Pod18
    Endpoint -->|+| Pod19
    Endpoint -->|+| Pod20
```
For example, Backblaze's architecture uses Reed-Solomon erasure coding (17 data shards + 3 parity shards) to achieve 11 nines of durability [3]. Your file is split into 17 pieces, and 3 additional parity pieces are calculated from the original data. The file can be reconstructed from any 17 of the 20 shards, so the system can tolerate up to 3 simultaneous drive or pod failures.
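To get a feel for why tolerating 3 of 20 shard failures translates into such high durability, a rough binomial estimate helps. The sketch below is illustrative only: the per-shard failure probability is an assumed toy value, and real durability calculations also account for how quickly failed drives are detected and rebuilt.

```python
from math import comb

# Assumed toy value: probability that a given shard is lost before it can be
# rebuilt. Real figures depend on drive failure rates and rebuild speed.
p_shard = 0.01
n_shards, parity = 20, 3

# Data is lost only if MORE than 3 of the 20 shards fail at the same time.
p_loss = sum(
    comb(n_shards, k) * p_shard**k * (1 - p_shard) ** (n_shards - k)
    for k in range(parity + 1, n_shards + 1)
)
print(f"P(unrecoverable loss) ≈ {p_loss:.1e}")  # ≈ 4.3e-05 with these toy numbers
```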
While this provides excellent protection against individual hardware failures, all shards exist within a single physical location. If the entire data center experiences a catastrophic event, all 20 shards could be lost simultaneously.
## The Cost vs. Durability Trade-off
While AWS S3 provides exceptional durability, it comes at a premium price. Many S3-compatible storage providers have emerged offering significantly cheaper alternatives:
- Backblaze B2
- Cloudflare R2
- Hetzner Object Storage
- OVH Object Storage
- MinIO (allows self-hosting)
- And many others
These providers are often considerably more affordable than AWS S3. However, these savings come with a trade-off: reduced protection against data center-level failures.
### Single-Location Storage
As shown in the Backblaze example above, budget-friendly S3-compatible providers typically use erasure coding or RAID within a single data center rather than maintaining complete copies across multiple physical locations. While this provides excellent protection against individual hardware failures, all data remains in one geographic location.
### What It Takes to Lose Data
A catastrophic failure means damage severe enough that the stored object data cannot be reconstructed. The difference in disaster resilience becomes clear when comparing what must fail for permanent data loss to occur:
- Single Data Center: If that one DC suffers catastrophic failure, your data is gone
- Multi-AZ Architecture: Requires catastrophic failures across at least 3 different data centers (affecting all 3 AZs) for data loss to occur
```mermaid
graph TB
    subgraph SingleDC["Single Data Center Model"]
        DC1["❌ Data Center<br/>(Catastrophic Failure)"]
        style DC1 fill:#ff6b6b,stroke:#c92a2a,stroke-width:4px,color:#fff
        Note1["Data cannot be reconstructed<br/>from anywhere else"]
        style Note1 fill:#fff,stroke:#c92a2a,stroke-width:2px
        DC1 -.-> Note1
    end
    subgraph MultiAZ["Multi-AZ Model"]
        subgraph AZ1M["Availability Zone 1"]
            DC1A["❌ DC 1A<br/>(Catastrophic)"]
            DC1B["DC 1B"]
            style DC1A fill:#ff6b6b,stroke:#c92a2a,stroke-width:3px,color:#fff
            style DC1B fill:#51cf66,stroke:#2f9e44,stroke-width:1px,color:#000
        end
        style AZ1M fill:#ffe0e0,stroke:#c92a2a,stroke-width:2px
        subgraph AZ2M["Availability Zone 2"]
            DC2A["❌ DC 2A<br/>(Catastrophic)"]
            DC2B["DC 2B"]
            style DC2A fill:#ff6b6b,stroke:#c92a2a,stroke-width:3px,color:#fff
            style DC2B fill:#51cf66,stroke:#2f9e44,stroke-width:1px,color:#000
        end
        style AZ2M fill:#ffe0e0,stroke:#c92a2a,stroke-width:2px
        subgraph AZ3M["Availability Zone 3"]
            DC3A["❌ DC 3A<br/>(Catastrophic)"]
            DC3B["DC 3B"]
            style DC3A fill:#ff6b6b,stroke:#c92a2a,stroke-width:3px,color:#fff
            style DC3B fill:#51cf66,stroke:#2f9e44,stroke-width:1px,color:#000
        end
        style AZ3M fill:#ffe0e0,stroke:#c92a2a,stroke-width:2px
        Note2["Catastrophic failures in at least<br/>3 different DCs (one per AZ)<br/>= Data cannot be reconstructed"]
        style Note2 fill:#fff,stroke:#c92a2a,stroke-width:2px
        DC1A -.-> Note2
        DC2A -.-> Note2
        DC3A -.-> Note2
    end
```
With single-location storage, a catastrophic failure of one data center means total data loss; there is nowhere else to reconstruct from. With a multi-AZ architecture, your data remains safe even if an entire AZ is destroyed; it takes the highly unlikely scenario of simultaneous catastrophic failures across at least 3 data centers in geographically separated locations before data becomes unrecoverable.
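The same comparison can be made with back-of-the-envelope arithmetic. Assuming, purely for illustration, that each data center independently suffers an unrecoverable disaster with some small annual probability, requiring three independent sites to be destroyed multiplies those probabilities together:

```python
# Illustrative only: assumed annual probability that one data center suffers
# a catastrophic, unrecoverable event. Independence between sites is assumed.
p_dc = 1e-4

p_loss_single_site = p_dc       # everything lives in one building
p_loss_three_sites = p_dc ** 3  # all three separated sites must be destroyed

print(f"single data center: {p_loss_single_site:.0e}")  # 1e-04
print(f"three-AZ layout:    {p_loss_three_sites:.0e}")  # 1e-12
```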
### Risk of Data Loss
If the data center hosting your data experiences a catastrophic failure (fire, flood, power loss, etc.), you could face permanent data loss. Unlike AWS S3's multi-AZ architecture, there are no additional copies in separate physical locations to fall back on.
This is not a theoretical risk: in March 2021, a fire at an OVH data center in Strasbourg destroyed servers and resulted in permanent data loss for customers who did not have off-site backups [1][2].
## Limitations of Native Replication
Some S3-compatible providers do offer native replication features. For example, Backblaze B2 provides bucket replication [4]. However, these solutions have significant limitations:
### Async-Only Replication
Native replication is typically asynchronous: there is a delay between when data is written to the primary location and when it appears in the replicas, which may be up to several hours [4]. During this window, you are vulnerable to data loss if the primary fails.
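Because the length of that window is opaque to the application, it can be worth spot-checking that critical objects have actually reached the replica. The sketch below uses standard boto3 calls; the endpoints and bucket name are hypothetical, not part of any provider's API.

```python
import boto3
from botocore.exceptions import ClientError

# Hypothetical endpoints: one client for the primary bucket, one for the replica.
primary = boto3.client('s3', endpoint_url='https://primary.example.com')
replica = boto3.client('s3', endpoint_url='https://replica.example.com')

def is_replicated(bucket: str, key: str) -> bool:
    """Return True if the object has already appeared in the replica bucket."""
    try:
        replica.head_object(Bucket=bucket, Key=key)
        return True
    except ClientError as err:
        if err.response['Error']['Code'] == '404':
            return False  # still waiting out the asynchronous replication window
        raise
```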
### Single-Cloud Restriction
Native replication features are confined to the same cloud provider. For example, Backblaze can only replicate to other Backblaze buckets [4]. You cannot replicate from Backblaze to MinIO, or from Hetzner to OVH.
### No Cross-Cloud Disaster Recovery
If you want to protect against a provider-level failure (e.g., provider goes out of business, widespread service outage, compliance issues), native replication cannot help you because all copies remain with the same vendor.
## The Need for Manual Replication
To achieve AWS-like durability with budget storage providers, you need to manually implement replication as a backup strategy. This increases your effective durability by maintaining copies across multiple independent storage locations or providers.
### Option 1: Dual Writes in Your Application
You can implement replication directly in your application code:
```python
# Pseudocode: the application itself writes every object to both backends
def upload_file(file, key):
    s3_client_primary.put_object(Bucket='primary', Key=key, Body=file)
    s3_client_backup.put_object(Bucket='backup', Key=key, Body=file)
```
```mermaid
graph TB
    App1[Application<br/>with Replication Logic]
    style App1 fill:#ffd43b,stroke:#fab005,stroke-width:2px,color:#000
    Backend1A[Backblaze B2]
    Backend1B[MinIO]
    Backend1C[Hetzner Storage]
    App1 -->|Write| Backend1A
    App1 -->|Write| Backend1B
    App1 -->|Write| Backend1C
    Note1[Application must handle:<br/>• Replication logic<br/>• Error handling<br/>• Consistency]
    style Note1 fill:#fff,stroke:#fab005,stroke-width:2px
    App1 -.-> Note1
```
Drawbacks:
- Requires modifying application code
- Must be implemented consistently across all applications
- Increases application complexity
- Difficult to change replication strategies
- Error handling becomes complicated (see the sketch after this list)
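To make that last point concrete, here is a sketch of the failure handling every dual-writing application ends up owning. The client objects and bucket names are placeholders; the awkward part is deciding what to do when only one of the two writes succeeds.

```python
from botocore.exceptions import ClientError

def upload_file_safely(primary, backup, key: str, body: bytes):
    # First write succeeds...
    primary.put_object(Bucket='primary', Key=key, Body=body)
    try:
        # ...but if the second write fails, the two backends have diverged.
        backup.put_object(Bucket='backup', Key=key, Body=body)
    except ClientError:
        # The application must now choose: undo the primary write (losing the
        # upload) or record the key somewhere for a later repair pass.
        primary.delete_object(Bucket='primary', Key=key)
        raise
```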
### Option 2: Use a Transparent Proxy (ReplicaT4)
ReplicaT4 acts as a proxy layer between your application and storage backends:
```python
# No code changes needed! Point the standard S3 client at the proxy instead.
s3_client = boto3.client('s3', endpoint_url='http://replicat4:3000')
s3_client.put_object(Bucket='my-bucket', Key=key, Body=file)
# ReplicaT4 automatically replicates to all configured backends
```
```mermaid
graph TB
    App2[Application<br/>Standard S3 Client]
    style App2 fill:#51cf66,stroke:#2f9e44,stroke-width:2px,color:#000
    Proxy[ReplicaT4 Proxy]
    style Proxy fill:#339af0,stroke:#1864ab,stroke-width:2px,color:#fff
    Backend2A[Backblaze B2]
    Backend2B[MinIO]
    Backend2C[Hetzner Storage]
    App2 -->|Standard S3 API| Proxy
    Proxy -->|Replicate| Backend2A
    Proxy -->|Replicate| Backend2B
    Proxy -->|Replicate| Backend2C
    Note2[ReplicaT4 handles:<br/>• Replication logic<br/>• Error handling<br/>• Consistency]
    style Note2 fill:#fff,stroke:#339af0,stroke-width:2px
    Proxy -.-> Note2
```
Benefits:
- Zero application code changes: your apps continue using standard S3 APIs
- Centralized replication logic: change strategies without touching application code
- Consistent replication across all applications automatically
- Flexible consistency models: choose between async (fast) and sync (consistent) replication
- Mix and match providers: combine different storage backends seamlessly
## Why ReplicaT4?
ReplicaT4 solves these challenges by providing:
- Provider-agnostic replication: works with any S3-compatible storage
- Cross-cloud capability: replicate across different providers (Backblaze → MinIO → Hetzner)
- Flexible consistency models: choose async for speed or sync for strong consistency
- Application transparency: no code changes required
- Unified control: manage all replication from a single configuration
Whether you're using budget providers to reduce costs or implementing a defense-in-depth strategy against vendor lock-in, ReplicaT4 enables you to achieve the durability you need without sacrificing flexibility or breaking the bank.
## The Name
The name ReplicaT4 reflects its architecture and purpose:
- Replica - The core function: replicating data across multiple storage backends
- T4 - The 4 connections needed to achieve S3-like resilience:
  - 1 connection from your application to the ReplicaT4 proxy
  - 3 connections from the proxy to storage backends (mirroring AWS S3's 3-AZ architecture)