Metadata is the difference between a book that shows up as "The Great Gatsby by F. Scott Fitzgerald" with a crisp cover image, sorted into the correct series and genre, and one that appears as "Unknown" by "Unknown" with no cover, floating unsorted in a library. Every reading app, every bookstore, every library catalog depends on metadata to organize and display books.
Most authors and publishers treat metadata as an afterthought — fill in the title and author, slap on a cover, done. But incomplete or incorrect metadata has real consequences: books that don't appear in searches, broken series sorting, missing covers on certain devices, and library management that requires manual fixes for every book. Getting metadata right once saves hours of fixing it later.
Dublin Core: The Metadata Foundation
Dublin Core is the international metadata standard (ISO 15836) that EPUB uses for book description. Originally designed for web resource cataloging in 1995, it provides 15 core elements that map well to book metadata:
| Element | EPUB Tag | Required? | Purpose |
|---|---|---|---|
| Title | dc:title | Yes | Book title as displayed |
| Creator | dc:creator | No (strongly recommended) | Author, editor, illustrator |
| Language | dc:language | Yes | BCP 47 language code (en, en-US, fr, ja) |
| Identifier | dc:identifier | Yes | ISBN, UUID, URI, DOI |
| Publisher | dc:publisher | No | Publishing entity name |
| Date | dc:date | No | Publication date (ISO 8601) |
| Subject | dc:subject | No | Genre, topic, keywords |
| Description | dc:description | No | Synopsis/blurb |
| Rights | dc:rights | No | Copyright statement |
| Type | dc:type | No | Resource type (typically omitted for ebooks) |
| Format | dc:format | No | MIME type (application/epub+zip) |
| Source | dc:source | No | Original work identifier (for adaptations) |
| Relation | dc:relation | No | Related resource identifier |
| Coverage | dc:coverage | No | Spatial/temporal scope |
| Contributor | dc:contributor | No | Secondary contributors |
Three elements are required by the EPUB spec: dc:title, dc:language, and dc:identifier. In practice, you should always include dc:creator (author) and aim to fill in publisher, date, subject, and description. Every missing field is a missed opportunity for discoverability and organization.
EPUB OPF Metadata In Practice
In EPUB, metadata lives in the OPF (Open Packaging Format) file — typically content.opf. Here's a comprehensive real-world example:
<metadata xmlns:dc="http://purl.org/dc/elements/1.1/">
<dc:title id="title">Dune</dc:title>
<meta refines="#title" property="title-type">main</meta>
<dc:creator id="author">Frank Herbert</dc:creator>
<meta refines="#author" property="role" scheme="marc:relators">aut</meta>
<meta refines="#author" property="file-as">Herbert, Frank</meta>
<dc:language>en</dc:language>
<dc:identifier id="isbn">urn:isbn:9780441013593</dc:identifier>
<dc:publisher>Ace Books</dc:publisher>
<dc:date>1965-08-01</dc:date>
<dc:subject>Science Fiction</dc:subject>
<dc:subject>Space Opera</dc:subject>
<dc:description>Set on the desert planet Arrakis...</dc:description>
<dc:rights>Copyright 1965 Frank Herbert</dc:rights>
<meta property="belongs-to-collection" id="series">Dune</meta>
<meta refines="#series" property="collection-type">series</meta>
<meta refines="#series" property="group-position">1</meta>
<meta property="dcterms:modified">2026-01-15T12:00:00Z</meta>
</metadata>
Creator Roles with MARC Relators
EPUB 3 uses MARC relator codes to distinguish different types of creators. The most common codes for ebooks:
aut— Authoredt— Editorill— Illustratortrl— Translatornrt— Narrator (for audiobook-ebook hybrids)aui— Author of introductionbkd— Book designer
The file-as property provides the sort name ("Herbert, Frank" instead of "Frank Herbert"). This controls alphabetical sorting in library apps. Without it, books sort by first name, which is wrong for most Western naming conventions. Libraries with thousands of books become unmanageable without correct file-as values.
Series Metadata
EPUB 3 handles series via the belongs-to-collection meta property with a group-position refinement for the book's position in the series. This is how library apps know "Dune" is book 1 of the Dune series.
Calibre uses its own series metadata scheme (stored in <meta name="calibre:series"> and <meta name="calibre:series_index">). Amazon uses EPUB 3's standard series metadata for Kindle display. Apple Books reads both. For maximum compatibility, include both the EPUB 3 standard and Calibre's custom meta elements.
Series metadata directly affects how your book appears in stores. Amazon groups series titles and shows "Book X of Y" on the product page. Kobo and Apple Books use series information for recommendation and navigation. Missing series metadata means your sequel doesn't link to the original.
MOBI/AZW Metadata: EXTH Records
MOBI and AZW files store metadata in EXTH (Extended Header) records — a flat key-value system using numeric type codes. The most relevant records:
| EXTH Type | Name | Equivalent DC |
|---|---|---|
| 100 | Author | dc:creator |
| 101 | Publisher | dc:publisher |
| 103 | Description | dc:description |
| 104 | ISBN | dc:identifier |
| 105 | Subject | dc:subject |
| 106 | Publication Date | dc:date |
| 108 | Contributor | dc:contributor |
| 201 | Cover Offset | (cover image index) |
| 202 | Thumbnail Offset | (thumbnail index) |
| 503 | Updated Title | dc:title |
When converting MOBI to EPUB (convert here), Calibre maps these EXTH records to Dublin Core elements automatically. The reverse conversion (EPUB to MOBI) does the same mapping in the opposite direction. The main metadata loss in conversion is EPUB 3's extended refinements (role types, file-as, series position) which have no direct MOBI EXTH equivalent.
Cover Image Specifications by Platform
Cover requirements vary by platform, but designing for the largest (Amazon KDP) covers all of them:
Platform-Specific Requirements
| Platform | Min Dimensions | Ideal Dimensions | Aspect Ratio | Format | Color |
|---|---|---|---|---|---|
| Amazon KDP | 625x1000 | 2560x1600 | 1.6:1 | JPEG, TIFF | RGB |
| Apple Books | 1400x1873 | 1600x2133+ | 4:3 | JPEG, PNG | RGB |
| Google Play | 640 (short side) | 2560x1600 | Flexible | JPEG, PNG | RGB |
| Kobo | 1400x1873 | 1600x2400 | 3:4 | JPEG | RGB |
| Barnes & Noble | 750x1200 | 2500x3500 | 5:7 | JPEG, PNG | RGB |
The universal safe spec: 2560x1600 pixels, RGB, JPEG, quality 95. This meets or exceeds every platform's requirements. Some platforms (Apple, B&N) prefer taller aspect ratios (4:3 or 5:7), but a 1.6:1 cover works everywhere — platforms that expect taller covers add letterboxing or crop slightly.
Embedding the Cover in EPUB
The cover image in EPUB is referenced in the OPF manifest with the cover-image property:
<item id="cover-img" href="images/cover.jpg"
media-type="image/jpeg" properties="cover-image"/>Additionally, most readers expect a cover page — an XHTML document that displays the cover image, included as the first spine item:
<!-- cover.xhtml -->
<html xmlns="http://www.w3.org/1999/xhtml">
<head><title>Cover</title>
<style>body{margin:0;padding:0;text-align:center}
img{max-width:100%;max-height:100%}</style>
</head>
<body>
<img src="images/cover.jpg" alt="Book Cover"/>
</body>
</html>Include both the cover-image manifest property AND a cover page in the spine. Some readers use one, some use the other. Including both guarantees the cover displays everywhere. The cover image file should be the same high-resolution image you upload to the store — don't embed a low-res thumbnail in the EPUB and a high-res version on the store page.
Subject and Genre Classification
Genre metadata affects where your book appears in store categories and search results. Different platforms use different classification systems:
- Amazon KDP: Uses BISAC (Book Industry Standards and Communications) codes and their own category tree. You select up to 3 categories during KDP publishing. Amazon also reads
dc:subjectfrom the EPUB for keyword matching - Apple Books: Uses their own category taxonomy (similar to BISAC). Categories are selected during submission
- EPUB metadata:
dc:subjectaccepts freeform text. Include genre terms, subgenre terms, and relevant keywords. Multipledc:subjectelements are allowed - ONIX: The publishing industry standard for metadata exchange. Uses BISAC, BIC, or Thema subject codes. Required by distributors like IngramSpark
For EPUB dc:subject, be specific: "Science Fiction," "Hard Science Fiction," "Space Opera" is better than just "Fiction." Library management software and some stores use these terms for filtering and recommendation. The more precise your subjects, the better your book gets categorized.
Tools for Managing Ebook Metadata
Calibre is the most comprehensive metadata tool. Its metadata editor reads and writes metadata for EPUB, MOBI, AZW3, PDF, and FB2. You can: edit all Dublin Core fields, manage series and series index, set custom columns for personal tracking, download metadata from online sources (Amazon, Google Books, Open Library), bulk-edit metadata across multiple books, and embed/replace cover images. For a library of hundreds or thousands of ebooks, Calibre's metadata management is indispensable.
Sigil provides a metadata editor for EPUB files with direct access to the OPF XML. It's more precise than Calibre for EPUB-specific metadata but doesn't handle other formats.
exiftool can read metadata from EPUB, MOBI, and PDF files from the command line: exiftool book.epub. Useful for quick metadata inspection without opening a GUI application.
Metadata Survival During Format Conversion
When converting between ebook formats, metadata transfer is generally good but not perfect:
- EPUB to MOBI/AZW3: Title, author, publisher, date, description, ISBN, subject, and cover transfer cleanly. Series metadata depends on the converter — Calibre handles it well. MARC relator codes (author roles) are lost
- MOBI to EPUB: EXTH records map to Dublin Core. Cover image transfers if properly indexed. Series metadata in MOBI is non-standard and may require manual verification after conversion
- FB2 to EPUB: FB2's rich metadata (title-info, document-info, publish-info) maps well to Dublin Core. Genre taxonomy converts to
dc:subjecttext. Cover and series metadata transfer - PDF to EPUB: PDF metadata (Title, Author, Subject, Keywords in the document properties) transfers to Dublin Core. But PDF metadata is often incomplete or absent — many PDFs have no metadata at all
After any format conversion, always verify metadata in the output file. Open in Calibre, check title/author/series/cover, and fix any missing or mangled fields before distributing.
Metadata is infrastructure. It's not glamorous, and readers never see it directly. But it determines whether your book shows up in searches, sorts correctly in libraries, displays its cover on store pages, and links to its sequels. Five minutes of thorough metadata entry during EPUB creation saves hours of troubleshooting later.
The minimum viable metadata is: title, author (with file-as sort name), language, identifier (ISBN or UUID), cover image, and series information if applicable. The maximum useful metadata adds publisher, date, multiple subjects/genres, description, and rights. Fill in everything you can — storage is free, and every field improves discoverability.