NIP-XX: Responsive Image Variants

draft optional

This NIP extends NIP-94 (File Metadata) to support multiple resolution variants of an image, enabling bandwidth-efficient responsive image delivery while preserving content-addressed integrity via Blossom.

Motivation

Modern devices have vastly different display capabilities:

Thumbnails in feeds need only ~128px width
Mobile phones typically display 512-1024px images
Laptops and desktops benefit from 1536-2560px images
Original files may be 4000px+ from modern cameras

Currently, Nostr clients either:

Serve full-resolution images to all devices (wasteful)
Rely on server-side transforms that break content addressing
Use a single thumbnail that looks poor on larger displays

Additionally, images from cameras contain EXIF metadata that may leak:

GPS coordinates (location privacy)
Camera serial numbers (device fingerprinting)
Timestamps and other identifying information

This NIP enables clients to:

Generate multiple resolution variants client-side
Strip EXIF metadata before upload
Publish a binding event linking all variants by their content hashes
Allow other clients to select the appropriate variant for their viewport

Specification

Binding Event (Kind 1063)

A responsive image set is represented by a kind 1063 event (per NIP-94) containing multiple imeta tags, one per variant. This follows the pattern established by NIP-71 for video variants.

Each imeta tag MUST include:

url - Blossom URL for this variant
x - SHA-256 hash of this variant's file content
m - MIME type (same as original)
dim - Dimensions as <width>x<height>
variant - Size category identifier

The variant field identifies the resolution category:

Variant	Target Width	Use Case
`thumb`	128px	Previews, galleries, feed thumbnails
`mobile-sm`	512px	Small mobile portrait
`mobile-lg`	1024px	Large mobile, small tablets
`desktop-sm`	1536px	Laptops
`desktop-md`	2048px	Standard desktops
`desktop-lg`	2560px	Large/HiDPI displays
`original`	native	Full resolution, EXIF stripped

Variant Generation Rules

No upscaling: Only generate variants smaller than the original image width
Preserve aspect ratio: Scale height proportionally to maintain aspect ratio
Preserve format: Output format MUST match input (JPEG→JPEG, PNG→PNG)
Strip metadata: Remove all EXIF, IPTC, and XMP metadata from all variants
Blurhash: The thumb variant SHOULD include a blurhash for placeholder display

Quality Settings

Recommended JPEG quality settings per variant:

thumb: 70
mobile-sm: 75
mobile-lg: 80
desktop-sm: 85
desktop-md: 88
desktop-lg: 90
original: 92

For PNG images, use maximum compression without quality loss.

Variant Selection Rule

Clients SHOULD use next-larger selection: pick the smallest variant >= target width.

targetWidth = containerWidth * devicePixelRatio
selectedVariant = smallest variant where variant.width >= targetWidth

This ensures the client only needs to downscale slightly (or not at all), rather than upscaling which would cause blur. If no variant is large enough, use the largest available.

Event Structure

Example: Large Original (4032x3024)

All variants generated:

{
  "kind": 1063,
  "pubkey": "<publisher-pubkey>",
  "created_at": 1234567890,
  "content": "Sunset over the mountains",
  "tags": [
    ["imeta",
      "url https://blossom.example.com/abc123def456.jpg",
      "x abc123def456789...",
      "m image/jpeg",
      "dim 4032x3024",
      "variant original"
    ],
    ["imeta",
      "url https://blossom.example.com/def456abc789.jpg",
      "x def456abc789012...",
      "m image/jpeg",
      "dim 2560x1920",
      "variant desktop-lg"
    ],
    ["imeta",
      "url https://blossom.example.com/789abc123def.jpg",
      "x 789abc123def345...",
      "m image/jpeg",
      "dim 2048x1536",
      "variant desktop-md"
    ],
    ["imeta",
      "url https://blossom.example.com/012def456abc.jpg",
      "x 012def456abc678...",
      "m image/jpeg",
      "dim 1536x1152",
      "variant desktop-sm"
    ],
    ["imeta",
      "url https://blossom.example.com/234abc567def.jpg",
      "x 234abc567def890...",
      "m image/jpeg",
      "dim 1024x768",
      "variant mobile-lg"
    ],
    ["imeta",
      "url https://blossom.example.com/345abc789def.jpg",
      "x 345abc789def901...",
      "m image/jpeg",
      "dim 512x384",
      "variant mobile-sm"
    ],
    ["imeta",
      "url https://blossom.example.com/678def012abc.jpg",
      "x 678def012abc234...",
      "m image/jpeg",
      "dim 128x96",
      "variant thumb",
      "blurhash eVF$^OI:${M{o#*0-nNFxakD"
    ],
    ["x", "abc123def456789..."],
    ["x", "def456abc789012..."],
    ["x", "789abc123def345..."],
    ["x", "012def456abc678..."],
    ["x", "234abc567def890..."],
    ["x", "345abc789def901..."],
    ["x", "678def012abc234..."]
  ],
  "id": "<event-id>",
  "sig": "<signature>"
}

Note: The separate x tags duplicate the hashes from the imeta tags. This redundancy enables standard NIP-01 tag queries (#x) to discover the binding event by any variant hash, while the imeta tags provide the full metadata for each variant.

Example: Smaller Original (1200x900)

Only smaller variants generated (no desktop variants):

{
  "kind": 1063,
  "pubkey": "<publisher-pubkey>",
  "created_at": 1234567890,
  "content": "Quick snapshot",
  "tags": [
    ["imeta",
      "url https://blossom.example.com/small123.jpg",
      "x small123456789...",
      "m image/jpeg",
      "dim 1200x900",
      "variant original"
    ],
    ["imeta",
      "url https://blossom.example.com/small456.jpg",
      "x small456789012...",
      "m image/jpeg",
      "dim 1024x768",
      "variant mobile-lg"
    ],
    ["imeta",
      "url https://blossom.example.com/small789.jpg",
      "x small789012345...",
      "m image/jpeg",
      "dim 512x384",
      "variant mobile-sm"
    ],
    ["imeta",
      "url https://blossom.example.com/small012.jpg",
      "x small012345678...",
      "m image/jpeg",
      "dim 128x96",
      "variant thumb",
      "blurhash eVF$^OI:${M{o#*0"
    ]
  ],
  "id": "<event-id>",
  "sig": "<signature>"
}

Client Behavior

Publishing Client

Load the image file and extract pixel data (discarding EXIF)
Determine which variants to generate based on original dimensions
Generate each variant using canvas-based scaling
Upload each variant to Blossom server(s)
Collect SHA-256 hashes and URLs for each uploaded blob
Publish kind 1063 event with all imeta tags
Reference the binding event in notes (via e tag or URL)

Consuming Client

Fetch kind 1063 event by event ID or by querying for the x hash
Parse imeta tags to extract available variants
Select appropriate variant based on:

- Current viewport width - Device pixel ratio - Network conditions (optional)

Display blurhash placeholder while loading
Load selected variant's URL
On load failure, fall back to next larger variant
Verify SHA-256 hash matches x tag (optional but recommended)

Variant Selection Algorithm

function selectVariant(variants, viewportWidth, pixelRatio = 1):
    targetWidth = viewportWidth * pixelRatio

    # Sort variants by width ascending
    sorted = variants.sortBy(v => v.width)

    # Find smallest variant >= target width
    for variant in sorted:
        if variant.width >= targetWidth:
            return variant

    # If none large enough, return largest available
    return sorted.last()

Relay Behavior

Relays SHOULD index kind 1063 events by all x hashes present in imeta tags. This enables clients to discover the binding event when they only have one variant's hash.

Query example:

["REQ", "sub1", {"kinds": [1063], "#x": ["<any-variant-hash>"]}]

Discovery Model

Critical: Blossom blob hashes alone are meaningless without the binding event. A client finding a hash on a Blossom server cannot determine:

Whether it's a thumbnail, mobile, or original variant
What other variants exist
Who published it

The binding event (kind 1063) is the authoritative source. Discovery flow:

Client has a hash (from note content or imeta tag)
Client queries relays for kind 1063 events containing that hash
Binding event reveals all variants and their relationships
Client can then fetch appropriate variant from Blossom

Security Considerations

Hash Verification

Clients SHOULD verify that downloaded content matches the x hash in the imeta tag. This prevents:

Server-side image manipulation
CDN corruption
Man-in-the-middle attacks

No Server-Side Transforms

Unlike NIP-96's ?w= parameter, this NIP requires all transforms to happen client-side before upload. This preserves the content-addressing guarantee: the hash always matches the file content.

EXIF Stripping

Publishing clients MUST strip EXIF and other metadata to protect user privacy. This includes:

GPS coordinates
Camera make/model/serial
Timestamps
Lens information
Software used

Immutability

Kind 1063 events are immutable. To update an image (e.g., add a missing variant), publish a new event and update references. Consider using addressable events (kind 31063 with d tag) if updates are needed frequently.

Backward Compatibility

Relays that don't understand variant will still store and serve the events
Clients that don't support responsive images can use any variant URL
The original variant ensures full-resolution access is always available
Existing NIP-94 clients will see the first imeta tag as the primary file

NIP-XX-responsive-images.md raw