Guides & Resources

In-depth guides on file formats, conversion, and digital media

Video
4K Video Conversion: Codecs, Settings, and File Sizes
4K (3840x2160) video has 4x the pixels of 1080p, requiring 2-4x the bitrate and storage. At good quality, 4K needs 35-68 Mbps with H.264 (25-50GB/hr), 15-30 Mbps with H.265 (10-22GB/hr), or 10-20 Mbps with AV1 (7-15GB/hr). H.265 or AV1 are effectively required for practical 4K file sizes. For sharing, downscaling to 1080p reduces file size by 75% with minimal visible quality loss on most screens.
Mar 19, 2026 8 min read
Archive
7z: Maximum Compression with 7-Zip
7z is an open archive format created by Igor Pavlov that achieves the highest compression ratios of any mainstream format, typically 30-70% smaller than ZIP. It uses LZMA2 compression with solid archiving and supports AES-256 encryption including optional filename encryption.
Mar 19, 2026 7 min read
Audio
AAC and M4A: The Modern Lossy Audio Standard
AAC (Advanced Audio Coding) is the successor to MP3, delivering better audio quality at the same bitrate through more advanced psychoacoustic modeling and spectral processing. M4A is simply AAC audio stored in an MPEG-4 container. AAC is the default audio codec for Apple devices, YouTube, Instagram, and most MP4 video. At 256 kbps (Apple's iTunes Plus standard), AAC is transparent to the source for virtually all listeners.
Mar 19, 2026 8 min read
Document
File Formats for Academic Papers and Research
Academic papers move through multiple formats: written in LaTeX or Word, submitted as PDF or DOCX (journal-dependent), references managed in BibTeX or Zotero, preprints uploaded to arXiv as TeX source or PDF. The format choice depends on your discipline — STEM fields strongly favor LaTeX; humanities and social sciences typically use Word.
Mar 19, 2026 9 min read
Document
Creating Accessible Documents: Formats and Standards
Document accessibility means people with disabilities can read, navigate, and understand your content using assistive technology. HTML is the most natively accessible format. DOCX with proper styles is accessible by default. PDF requires tagged structure (PDF/UA) to be accessible. Creating accessible documents from scratch is far easier than remediating inaccessible ones after the fact.
Mar 19, 2026 9 min read
Data
API Response Formats: JSON, XML, GraphQL, and Beyond
REST + JSON is the default for public APIs. SOAP + XML persists in enterprise and government. GraphQL lets clients request exactly the data they need. gRPC + Protocol Buffers is the standard for high-performance microservice communication. The format choice depends on your consumers, performance requirements, and ecosystem.
Mar 19, 2026 8 min read
Archive
Archive Encryption: Securing Your Compressed Files
For encrypted archives, use 7z with AES-256 and filename encryption enabled. ZIP's ZipCrypto is broken — crackable in minutes with known-plaintext attacks. ZIP's AES-256 is secure but doesn't encrypt filenames. RAR5 uses AES-256 with optional filename encryption. Password strength matters more than algorithm choice: 12+ random characters minimum.
Mar 19, 2026 6 min read
Archive
Archive Formats for Developers: Source Code, Artifacts, and Deployment
Archive formats are embedded throughout the software development lifecycle. Source code distributes as .tar.gz/.tar.xz. Docker image layers are TAR files. npm packages are tarballs. Java JARs and Python wheels are ZIP files. Debian packages are ar archives containing tarballs. Understanding these formats helps debug build failures and optimize CI/CD pipelines.
Mar 19, 2026 7 min read
Archive
Archive Formats Compared: ZIP vs 7Z vs TAR.GZ
Use ZIP when sharing files with anyone — it's natively supported by Windows, macOS, and Linux with no extra software. Use 7Z when maximum compression matters and the recipient can install 7-Zip. Use TAR.GZ for Linux/Unix environments and source code distribution. TAR.XZ gives the best compression ratio for tarballs but is slower to create.
Mar 19, 2026 7 min read
Audio
Audio Bitrate Explained: What the Numbers Actually Mean
Audio bitrate is the number of bits used per second of audio. Higher bitrate means more data preserved, generally meaning better quality. For MP3, transparency (indistinguishable from CD) is around 192-256 kbps. For AAC, it's around 192 kbps. For Opus, it's around 128 kbps. Above these thresholds, extra bitrate is wasted data that adds file size without audible improvement.
Mar 19, 2026 7 min read
Audio
Audio Formats for Music Production and Distribution
Record and mix in WAV or AIFF at 48 kHz / 24-bit. Deliver stems as WAV. Master to WAV for the CD master, FLAC for hi-res distribution, and let distributors handle lossy encoding for streaming. Keep session files and bounces in lossless formats throughout the production chain — lossy encoding happens once, at the very end.
Mar 19, 2026 9 min read
Audio
Audio Formats for Podcasting: Recording to Distribution
Record podcasts in WAV or FLAC at 48 kHz / 24-bit. Edit in the same format. Export for distribution as MP3 at 128 kbps mono (speech-only) or 192 kbps stereo (with music). Normalize loudness to -16 LUFS for Apple Podcasts and -14 LUFS for Spotify. MP3 is the universal distribution format — every podcast app supports it without exception.
Mar 19, 2026 8 min read
Audio
Audio Formats Explained: MP3 vs WAV vs FLAC vs AAC
MP3 is universally compatible but technically outdated. AAC sounds better at the same bitrate and is the default in Apple devices and MP4 video. FLAC is lossless — bit-perfect reconstruction at 50-70% of WAV's size. WAV is uncompressed raw audio, used in professional production. For the best quality-to-size ratio in 2026, Opus wins at every bitrate, but compatibility is narrower.
Mar 19, 2026 8 min read
Audio
Audio Metadata and ID3 Tags: What You Need to Know
Audio metadata (tags) stores information about the audio content — artist, title, album, track number, cover art — within the audio file itself. MP3 uses ID3 tags (v1 or v2). FLAC and OGG use Vorbis Comments. M4A/AAC uses MP4 atoms. Metadata survives as long as you stay within the same format family, but converting between formats (MP3 to FLAC, etc.) may lose or alter metadata depending on the conversion tool.
Mar 19, 2026 7 min read
Audio
Audiobook Formats: M4B, MP3, and Beyond
M4B is the standard audiobook format — AAC audio in an MPEG-4 container with chapter markers and bookmark support. Apple Books and most audiobook apps use M4B natively. MP3 is the universal fallback that plays everywhere but lacks native chapter support. Audible uses proprietary AA/AAX formats with DRM. For creating audiobooks, encode as M4B with chapters at 64-128 kbps AAC.
Mar 19, 2026 7 min read
Ebook
Audiobooks vs Ebooks: Formats, Platforms, and Conversion
Audiobooks (M4B, AA, AAX, MP3) and ebooks (EPUB, MOBI, PDF) are fundamentally different media sharing the same content. Text-to-speech can bridge them but isn't a replacement for professional narration. AI-generated TTS (via ElevenLabs, OpenAI, Google) has dramatically closed the quality gap since 2024.
Mar 19, 2026 6 min read
Video
AVI: Microsoft's Legacy Video Format
AVI (Audio Video Interleave) is Microsoft's original video container format, introduced in 1992. It lacks native streaming support, modern metadata features, and subtitle tracks. While AVI can technically hold modern codecs like H.264, it wasn't designed for them and adds no value over MP4. AVI persists in legacy camera systems, industrial equipment, and old file archives — but any new video should use MP4 or MKV instead.
Mar 19, 2026 8 min read
Image
AVIF Format: Next-Gen Image Compression Explained
AVIF is an image format based on the royalty-free AV1 video codec, producing files 20-30% smaller than WebP and 50%+ smaller than JPEG at equivalent quality. It supports 10-bit and 12-bit HDR, wide color gamuts, and alpha transparency. The tradeoff is encoding speed — 10-50x slower than JPEG — making it best suited for pre-built assets rather than real-time conversion.
Mar 19, 2026 9 min read
Archive
Best Archive Formats for Backups
TAR.ZSTD is the best backup format for most scenarios: fast compression, good ratio, and Unix metadata preservation. TAR.XZ trades speed for the smallest archive size. ZIP is the right choice for cross-platform backups where Windows users need access. Always test restoration from backups regularly.
Mar 19, 2026 6 min read
General
Base64 Encoding: When and When Not to Use It
Base64 converts binary data to ASCII text, adding a 33% size overhead. Use it for small inline images (under 2KB via data URIs), embedding binary in JSON, and email attachments. Avoid it for large files, cacheable assets, and anywhere Blob URLs or ArrayBuffer would work better.
Mar 19, 2026 7 min read
General
Batch Conversion: Processing Thousands of Files Efficiently
Batch conversion means automating format conversion across hundreds or thousands of files using tools like FFmpeg, ImageMagick, LibreOffice headless, and Pandoc with command-line scripting. The key to reliable batch processing: preserve originals, handle errors gracefully, name outputs predictably, and monitor progress.
Mar 19, 2026 6 min read
Document
Batch Document Conversion: Processing Hundreds of Files
Batch document conversion uses command-line tools — LibreOffice headless for office formats (DOC, DOCX, ODT, RTF to PDF), Pandoc for markup formats (Markdown, HTML, LaTeX), and Ghostscript for PDF operations (compression, merging, PDF/A conversion). The key challenges are error handling, font consistency, and encoding issues at scale.
Mar 19, 2026 6 min read
Image
BMP, TIFF, and Legacy Image Formats: When You Still Need Them
BMP is an uncompressed raster format producing predictably huge files (a 1920x1080 image is exactly 5.7MB). TIFF is a flexible tag-based format supporting lossless compression, CMYK, 16-bit depth, and multi-page — the standard for print and archival. ICO is a multi-resolution container for Windows icons and favicons. These formats aren't obsolete; they serve specific professional workflows where modern web formats can't.
Mar 19, 2026 7 min read
Ebook
CBR, CBZ, and Digital Comic Formats
CBR and CBZ are renamed RAR and ZIP archives containing sequentially numbered image files. CBZ (ZIP-based) is preferred for new archives due to universal tool support. ComicInfo.xml provides optional metadata. For readable comics, images should be at least 1800x2700 pixels.
Mar 19, 2026 7 min read
General
CDN Image Transformation: Cloudflare, imgix, and Cloudinary
CDN image transformation services resize, reformat, and optimize images on-the-fly at the edge. Cloudflare Image Resizing integrates with Workers for maximum control, imgix uses URL parameters for simplicity, and Cloudinary offers the richest transformation API. All three support auto-format detection via the Accept header, eliminating the need for build-time image generation.
Mar 19, 2026 7 min read
Privacy
Why Client-Side File Conversion Is Better for Privacy
Client-side file conversion processes files entirely in your browser using JavaScript and WebAssembly. Your file data stays in local memory (RAM) and never travels over a network connection. This makes it faster than server-based conversion (no upload/download time) and fundamentally more private, since the conversion service never has access to your files.
Mar 19, 2026 8 min read
Font
Color Fonts and Emoji: Formats and Rendering
Emoji render differently across platforms because each OS ships its own color emoji font using different formats: Apple uses SBIX (bitmaps), Google uses CBDT (bitmaps) and COLRv1 (vectors), and Microsoft/Mozilla use COLR/CPAL (vectors). There is no universal color font format yet, though COLRv1 is emerging as the standard with vector support and broad browser adoption.
Mar 19, 2026 6 min read
General
Color Spaces and File Formats: sRGB, Adobe RGB, and P3
A color space defines which colors are available and how numeric values map to visible colors. sRGB is the web standard (35% of visible colors). Adobe RGB is for print workflows (50%). DCI-P3 is for modern displays (45%). ProPhoto RGB is for editing headroom (90%). File formats vary in color space support: JPEG and PNG support any ICC profile, WebP is limited to sRGB, and AVIF supports HDR wide color gamut natively.
Mar 19, 2026 8 min read
Video
How to Compress Video for Email (Under 25MB)
To get a video under 25MB for email, reduce resolution to 720p, use H.264 with CRF 28-32, lower the frame rate to 24fps, and use AAC audio at 96kbps. A 2-minute 720p video at these settings produces a 10-15MB file. For longer videos, trim to the essential clip. For anything over 3 minutes, use a cloud sharing link instead of an attachment.
Mar 19, 2026 8 min read
Archive
How Compression Algorithms Work: From Huffman to LZMA
Lossless compression works by finding and eliminating redundancy in data. Dictionary-based methods (LZ77, LZ78, LZW) replace repeated byte sequences with short references. Entropy coders (Huffman, arithmetic, ANS) assign shorter codes to more frequent symbols. Modern algorithms like DEFLATE, LZMA, Brotli, and Zstandard combine these techniques for maximum compression.
Mar 19, 2026 8 min read
Data
Configuration File Formats Compared: INI, JSON, YAML, TOML, ENV
Use ENV for secrets and 12-factor apps. Use TOML for application config (explicit types, comments, no indentation bugs). Use YAML for infrastructure config (deep nesting, DevOps ecosystem). Use JSON only when the toolchain requires it. Avoid INI for new projects (no standard, no types). HCL is Terraform-specific.
Mar 19, 2026 9 min read
Document
Converting Scanned Documents: OCR and Format Options
A scanned document is an image of text, not actual text. Converting it to an editable or searchable format requires OCR (Optical Character Recognition) to recognize characters in the image. OCR accuracy depends on scan quality (300 DPI minimum), image contrast, language, and the OCR engine used. Tesseract 5 handles clean printed text well; complex layouts and handwriting still challenge all OCR engines.
Mar 19, 2026 8 min read
Video
How to Convert Videos Without Losing Quality
Truly lossless video conversion is only possible when changing containers without changing codecs (e.g., MKV to MP4 with H.264 — called remuxing). When you must re-encode, use CRF 18 for visually lossless quality, avoid re-encoding more than once, never upscale resolution, and keep the original file as your master.
Mar 19, 2026 7 min read
General
Cross-Platform File Compatibility: Windows, macOS, Linux
Files created on one operating system frequently break on another due to filename case sensitivity, CRLF vs LF line endings, character encoding mismatches, archive permission handling, and codec availability differences. Understanding these platform-specific behaviors prevents the most common cross-platform file headaches.
Mar 19, 2026 7 min read
Data
CSV: Simple, Broken, and Everywhere
CSV (Comma-Separated Values) is the simplest and most widely supported tabular data format, but it has no real standard, no type system, and rampant encoding and delimiter inconsistencies. RFC 4180 is the closest thing to a specification, but most real-world CSV files violate it.
Mar 19, 2026 9 min read
Data
CSV Encoding and Delimiters: Why Your Data Looks Wrong
CSV encoding problems stem from the format having no standard way to declare its character encoding or delimiter. UTF-8 without BOM causes garbled text in Excel. European CSVs use semicolons because commas are decimal separators. The fixes: UTF-8 with BOM for Excel, explicit delimiter specification, and TSV as a locale-independent alternative.
Mar 19, 2026 8 min read
General
CSV for Data Analysis: Best Practices and Pitfalls
CSV is the universal interchange format for tabular data, but it's deceptively tricky for analysis. Leading zeros are silently stripped, types must be manually specified, and memory usage when loading is 5-10x the file size. For datasets over 1GB, switch to Parquet (columnar, compressed, typed) or Arrow (fast interchange). For everything else, specify encoding (UTF-8), define dtypes explicitly, and use chunked reading for large files.
Mar 19, 2026 7 min read
Data
CSV vs JSON vs XML: Choosing the Right Data Format
Use CSV for flat, tabular data (spreadsheets, ML training sets, simple exports). Use JSON for structured data with nesting (APIs, config files, document databases). Use XML when you need formal schemas, namespaces, or are working with enterprise/legacy systems. CSV is the simplest but has no type information; JSON has types and nesting but no schema enforcement; XML has both but is verbose.
Mar 19, 2026 8 min read
Data
Data Migration Between Formats: A Practical Guide
Every format conversion involves potential data loss, type changes, or structural transformation. CSV to JSON gains structure (lossless). JSON to CSV loses nesting (lossy). XML to JSON loses attributes and namespaces. YAML to JSON is lossless. Understanding what's preserved and what's lost is the key to safe data migration.
Mar 19, 2026 8 min read
Data
Data Serialization Formats: JSON, Protocol Buffers, MessagePack, and More
Text formats (JSON, XML, YAML) prioritize readability and debugging. Binary formats (Protocol Buffers, MessagePack, CBOR, Avro, Thrift) prioritize size and speed. Protocol Buffers are ~70% smaller and 5-10x faster than JSON. Use binary for high-throughput APIs, IoT, and game networking. Use text for everything else.
Mar 19, 2026 10 min read
General
Digital Preservation: File Formats That Stand the Test of Time
For files that must remain accessible decades from now, choose formats with open specifications, wide tool support, and institutional adoption: PDF/A for documents, TIFF for images, WAV or FLAC for audio, MKV with FFV1 for video, and plain text for anything that doesn't need formatting. Proprietary formats die when the company does.
Mar 19, 2026 7 min read
Document
DOC vs DOCX: Why You Should Finally Migrate
DOC is a proprietary binary format from Word 97 that stores data as opaque binary structures. DOCX is a ZIP archive of XML files, typically 50-75% smaller, safer (can't hide macros under the .docx extension), and backed by an ISO standard. There's no technical reason to keep files in DOC format unless you're supporting systems from before 2007.
Mar 19, 2026 6 min read
Document
How to Convert Documents Between Any Format
Document conversion quality depends on the direction and the formats involved. DOCX-to-PDF is nearly lossless. PDF-to-DOCX is lossy because PDF stores visual positions, not document structure. DOCX and ODT convert between each other well. Converting anything to TXT strips all formatting. For best results, always convert from the original editable source format, not from a PDF.
Mar 19, 2026 7 min read
Document
DOCX: Inside Microsoft Word's Modern Format
DOCX is not a single file but a ZIP archive containing XML documents, style definitions, media assets, and relationship metadata. Introduced with Office 2007 as part of the OOXML standard (ECMA-376, later ISO/IEC 29500), it replaced the proprietary binary DOC format with something inspectable, repairable, and dramatically smaller.
Mar 19, 2026 10 min read
General
DPI vs PPI: Understanding Image Resolution
PPI (pixels per inch) describes digital images — how pixel dimensions relate to physical output size. DPI (dots per inch) describes printer output — how many ink dots the printer lays down per inch. A 3000x2000 pixel image has no inherent DPI; it becomes 300dpi only when printed at 10"x6.67". For screen display, DPI/PPI metadata is meaningless — only pixel dimensions matter.
Mar 19, 2026 6 min read
Ebook
Ebook Accessibility: Standards and Implementation
EPUB accessibility combines WCAG 2.1 with ebook-specific requirements: semantic HTML, complete navigation (TOC, page list, landmarks), image alt text, proper reading order, and accessibility metadata. The EU European Accessibility Act (effective June 2025) makes compliance a legal requirement for ebooks sold in Europe.
Mar 19, 2026 8 min read
Ebook
Ebook DRM and Format Conversion: What You Can and Can't Do
DRM (Digital Rights Management) encrypts ebook files and ties them to specific accounts or devices. The three major systems are Adobe ADEPT (most EPUB retailers), Amazon DRM (Kindle), and Apple FairPlay (Apple Books). DRM-protected files cannot be legitimately converted between formats. DRM-free ebooks convert freely.
Mar 19, 2026 7 min read
Ebook
Ebook Formats: EPUB vs MOBI vs PDF for Every Device
EPUB is the universal ebook standard supported by every major reader except Kindle. For Kindle, use AZW3 (Kindle Format 8) or just send EPUB directly via Send-to-Kindle, which has accepted EPUB since 2022. PDF is wrong for ebooks unless layout matters more than readability — textbooks, sheet music, and technical docs with diagrams are the exceptions.
Mar 19, 2026 8 min read
Ebook
Ebook Metadata and Cover Images: Getting Them Right
Ebook metadata (title, author, series, ISBN, language) determines how your book appears in stores, libraries, and readers. Cover images need to be at least 2560x1600 pixels RGB JPEG for universal platform compatibility. Missing or wrong metadata means invisible books and broken library organization.
Mar 19, 2026 7 min read
Ebook
EPUB: Inside the Open Ebook Standard
EPUB is a ZIP archive containing XHTML content, CSS stylesheets, and XML metadata. EPUB 3 adds HTML5, JavaScript, MathML, audio/video embeds, and fixed-layout support. Understanding the internals lets you debug formatting issues, build EPUBs from scratch, and know exactly what breaks during conversion.
Mar 19, 2026 9 min read
Ebook
EPUB vs PDF for Reading: Reflowable vs Fixed Layout
EPUB reflows text to fit any screen size, letting readers control font, size, and spacing. PDF preserves exact page layout but is miserable to read on phones and small e-readers. Use EPUB for prose, PDF for layout-dependent content like textbooks, sheet music, and technical manuals.
Mar 19, 2026 7 min read
General
Excel and CSV: The 10 Pitfalls That Corrupt Your Data
Excel silently corrupts CSV data in at least 10 documented ways: stripping leading zeros from ZIP codes, auto-formatting gene names as dates (MARCH1 becomes March 1), truncating long numbers to scientific notation, mangling UTF-8 encoding, misinterpreting delimiters, breaking on newlines in cells, executing formula injection attacks, losing timezone information, swapping decimal separators by locale, and introducing invisible characters. Each has a specific prevention strategy.
Mar 19, 2026 8 min read
Image
Favicon and App Icon Formats: ICO, PNG, SVG, and More
In 2026, the minimal favicon set is: a 32x32 favicon.ico at the site root, a 180x180 Apple Touch Icon PNG, a 192x192 and 512x512 PNG in your Web App Manifest, and optionally an SVG favicon for scalability and dark mode support. ICO remains required because browsers still request /favicon.ico by default. SVG favicons support prefers-color-scheme for automatic dark mode adaptation. Skip the dozen-size icon sets from older guides — four files cover everything.
Mar 19, 2026 7 min read
Ebook
FB2: The FictionBook Format for Ebooks
FB2 (FictionBook 2) is an XML-based ebook format that's the dominant standard in Russia and Eastern Europe. It stores all content — text, metadata, and images — in a single well-structured XML file with explicit semantic tags for sections, epigraphs, poems, and citations.
Mar 19, 2026 6 min read
General
The Complete Guide to File Conversion
File conversion reads data structured in one format and writes it in another. Some conversions are lossless (CSV to JSON -- just repackaging the same data), while others are lossy (WAV to MP3 -- discarding audio frequencies to shrink the file). The nine major format categories are images, video, audio, documents, data/config, spreadsheets, ebooks, archives, and fonts, each with different tradeoffs around quality, compatibility, and file size.
Mar 19, 2026 10 min read
General
File Conversion Troubleshooting: Why It Failed and How to Fix It
File conversion fails for predictable reasons: corrupted source files, unsupported format variations, character encoding mismatches, missing fonts, codec incompatibilities, DRM protection, and file size limits. Most failures can be diagnosed by checking the file's first few bytes (magic number), verifying the actual format matches the extension, and reading the error message carefully.
Mar 19, 2026 8 min read
General
File Encryption: Protecting Sensitive Files Before Sharing
File-level encryption protects sensitive data using AES-256 encryption built into formats like 7Z, PDF, and DOCX. The encryption is only as strong as the password — a 12+ character passphrase with AES-256 is practically unbreakable, while a 6-character password with legacy ZipCrypto is useless.
Mar 19, 2026 6 min read
General
A Brief History of File Formats
File formats have evolved over six decades, from ASCII text (1963) through the GIF/JPEG/PNG era of the 1990s to modern codecs like AVIF and AV1. The pattern is consistent: proprietary formats emerge first, open standards eventually win, and patent disputes accelerate the adoption of royalty-free alternatives.
Mar 19, 2026 8 min read
General
File Formats by Use Case: A Decision Guide
The right file format depends on the use case: JPG for email photos, WebP/AVIF for web images, PDF for documents, MP4 H.264 for shareable video, FLAC for audio archival, and CSV/JSON for data exchange. This guide provides decision matrices for every combination of use case and media type.
Mar 19, 2026 8 min read
General
File Formats Every Web Developer Should Know
Web developers need to know roughly 20 file formats across six categories: images (WebP/AVIF for delivery, SVG for icons, PNG for screenshots), video (MP4 H.264 for universal playback, WebM VP9 for modern browsers), audio (MP3/AAC for broad support, Opus for quality-per-bit), fonts (WOFF2 only), data (JSON for APIs, CSV for exports), and documents (PDF for downloads, ZIP for bundles).
Mar 19, 2026 9 min read
General
File Handoff Between Design Tools: Format Compatibility
Design tool handoff depends on the destination. Figma to development: SVG for icons/vectors, PNG 2x for complex graphics, CSS code inspect for styles. Photoshop to web: Export As WebP/PNG. Illustrator to web: SVG export optimized with SVGO. Canva to anything: PDF for print, PNG for digital. Cross-tool exchange is lossy — use intermediate formats (SVG, PNG, PDF) rather than relying on native format compatibility.
Mar 19, 2026 6 min read
General
File Integrity and Checksums: Verifying Your Conversions
Checksums are fingerprints for files: fixed-length values computed from file contents that change if even a single bit is altered. SHA-256 is the standard for verifying file integrity after conversions, downloads, and transfers. If the checksum matches, the file is identical; if it doesn't, something changed.
Mar 19, 2026 5 min read
General
File Size Limits by Platform: A Complete Reference
Every platform enforces file size limits: Gmail caps at 25MB, Discord at 25MB (free) or 500MB (Nitro), WhatsApp at 2GB for video, Instagram at 650MB for video, and Slack at 1GB. Knowing these limits before sharing saves you from failed uploads and lets you choose the right format and compression level upfront.
Mar 19, 2026 6 min read
General
File Upload Best Practices for Web Developers
Validate file type by magic bytes (not just extension), enforce size limits on both client and server, use presigned URLs for direct-to-storage uploads, implement chunked uploads for large files, and always verify files server-side regardless of client-side checks. Drag-and-drop with progress tracking is table-stakes UX.
Mar 19, 2026 7 min read
Audio
FLAC: Lossless Audio That Actually Saves Space
FLAC (Free Lossless Audio Codec) compresses audio to 50-70% of WAV's size while preserving every sample bit-for-bit. Decode a FLAC file and you get the exact original PCM data. It's open-source, royalty-free, supports metadata and album art, and streams on Tidal, Amazon Music, and Apple Music. For anyone building a music library, FLAC is the format.
Mar 19, 2026 8 min read
Font
Font Fallback Strategies: Bulletproof Typography
A good fallback font stack shows readable text instantly, minimizes visual disruption when the web font swaps in, and handles edge cases (monospace, emoji, CJK) without manual intervention. The modern approach: match your fallback font's metrics to the web font using size-adjust and ascent-override CSS properties, eliminating Cumulative Layout Shift from font swaps.
Mar 19, 2026 7 min read
Font
Font Formats: WOFF2 vs TTF vs OTF — When to Use Each
Use WOFF2 for web fonts — it's 30% smaller than WOFF and supported by 97%+ of browsers. Use TTF for desktop installation and application embedding when you need maximum compatibility. Use OTF when you need advanced OpenType features (stylistic alternates, ligatures, small caps) for professional design work. That's it.
Mar 19, 2026 7 min read
Font
Font Licensing Explained: What You Can and Can't Do
Font licenses control how you can use a font file — desktop installation, web embedding, app bundling, and redistribution are typically licensed separately. Most Google Fonts use the SIL Open Font License (OFL), which allows free use including web, app, and commercial projects. Commercial fonts from foundries require separate web licenses (often priced per domain or per pageview).
Mar 19, 2026 7 min read
Font
Font Subsetting: Strip What You Don't Need
Font subsetting removes unused glyphs from font files. A full Noto Sans CJK weighs 16 MB; subsetted to Latin-only, it's 30 KB — a 99.8% reduction. For most English-language sites, Latin subsetting cuts font files by 65-85%. Use pyftsubset to subset, always preserve OpenType layout features, and combine with unicode-range in CSS for multilingual sites.
Mar 19, 2026 7 min read
General
File Formats for 3D Printing: STL, OBJ, 3MF, and More
STL is the most widely used 3D printing format — a triangulated surface mesh without color or material data. OBJ adds material and texture support. 3MF is the modern replacement, supporting colors, materials, metadata, and build information in a single package. For CAD interchange, STEP and IGES transfer precise mathematical surfaces rather than approximated triangle meshes.
Mar 19, 2026 6 min read
General
File Formats for Email: Attachments and Embedded Content
Email has strict format constraints: attachment limits (Gmail 25MB, Outlook 20MB), HTML image support limited to JPG, PNG, and GIF (no WebP or AVIF), and Base64 encoding adds 33% overhead to attachment file sizes. ICS for calendar invites, vCard (.vcf) for contacts, and EML/MBOX for email archival are the key specialized formats.
Mar 19, 2026 6 min read
General
File Formats for Graphic Designers: A Complete Guide
Graphic designers work across three format categories: source files (PSD, AI, INDD, Figma) for editing, print deliverables (PDF/X, TIFF at 300dpi CMYK), and screen deliverables (PNG, WebP, SVG in RGB). Choosing the wrong format at any stage costs hours of rework or produces unusable output.
Mar 19, 2026 9 min read
General
File Formats for Musicians: Recording to Release
Musicians work with three format tiers: uncompressed (WAV/AIFF at 24-bit/48kHz+) for recording and editing, lossless compressed (FLAC) for distribution and archival, and lossy compressed (MP3/AAC/OGG) for streaming and casual listening. The recording standard is WAV 24-bit/48kHz. The mastering output depends on the destination platform.
Mar 19, 2026 7 min read
General
File Formats for Photographers: Shooting to Delivery
Photographers move through four format stages: capture (RAW: CR3, NEF, ARW, DNG), editing (TIFF 16-bit or PSD), delivery (JPEG for clients, WebP for web), and archival (DNG for RAW, TIFF for processed). Each stage has specific color space and bit-depth requirements that directly affect image quality.
Mar 19, 2026 8 min read
General
Presentation File Formats: PPTX, PDF, HTML, and Beyond
PPTX (PowerPoint) is the universal presentation format with the broadest tool support. PDF is the safest for view-only sharing (no layout shifts, no missing fonts). Keynote (.key) produces the best-looking presentations on Apple hardware. Google Slides is the best for real-time collaboration. Each format has specific conversion limitations — animations and custom fonts are the most common casualties.
Mar 19, 2026 6 min read
General
File Formats for Video Editors: Ingest to Export
Video editing involves three distinct format stages: ingest (camera-native MP4/MOV with H.264/H.265 codecs), editing (ProRes 422 or DNxHR intermediate codecs for smooth timeline performance), and export (H.264 MP4 for web, ProRes for handoff, H.265 for archival). Understanding the container-vs-codec distinction eliminates 90% of video format confusion.
Mar 19, 2026 8 min read
General
The Future of File Formats: What's Coming and What's Dying
BMP, AVI, Flash/SWF, and WMA are effectively dead. AVIF and AV1 are the clear winners of the next-gen codec wars. JPEG XL is technically superior but faces an uncertain future without Chrome support. The overarching trend: everything moves toward open, royalty-free, web-native formats.
Mar 19, 2026 6 min read
Image
GIF in 2026: When to Use It and When to Move On
GIF is a 39-year-old image format limited to 256 colors with LZW compression. It survives in 2026 purely because of universal animation support — every messaging app, email client, and social platform plays GIFs automatically. Technically, animated WebP (64% smaller), AVIF, and MP4 (90% smaller) are all superior. Use GIF only when you need guaranteed universal playback; convert to MP4 or WebP for everything else.
Mar 19, 2026 7 min read
Font
Self-Hosting Google Fonts: Why and How
Self-hosting Google Fonts is now faster than using the Google Fonts CDN. Chrome 86 (2020) introduced cache partitioning, which means fonts cached from one site aren't shared with another. Self-hosting eliminates DNS lookup and third-party connection overhead (200-300ms savings), gives you full caching control, and removes GDPR concerns.
Mar 19, 2026 7 min read
General
Google Sheets Export Formats: XLSX, CSV, PDF, and More
Google Sheets exports to XLSX, CSV, TSV, PDF, ODS, and HTML via File > Download. XLSX preserves formulas, formatting, and charts but loses Google-specific functions (QUERY, IMPORTRANGE, GOOGLEFINANCE). CSV exports values only from the active sheet. PDF creates a visual snapshot. ODS preserves most features except Google-specific functions. The Google Sheets API enables programmatic export with fine-grained control over format and range.
Mar 19, 2026 6 min read
Archive
GZIP vs BZIP2 vs XZ vs ZSTD: Compression Algorithms Compared
GZIP is the universal default with fast speed and good compression. BZIP2 is obsolete — XZ compresses better and decompresses faster. XZ gives the best ratios but compresses slowly. Zstandard (ZSTD) is the modern choice: it matches XZ's ratios at GZIP-like speeds and decompresses 3-5x faster than everything else.
Mar 19, 2026 8 min read
Video
H.264 vs H.265 vs AV1: Video Codec Showdown
H.264 is universally compatible but the least efficient. H.265 compresses 50% better but has a fragmented patent landscape that limits adoption. AV1 compresses 30% better than H.265 and is royalty-free, but encoding is extremely slow. Choose H.264 for compatibility, H.265 for file size on controlled playback, and AV1 for web delivery where you encode once and serve millions of times.
Mar 19, 2026 8 min read
Image
HEIC and HEIF: Apple's Image Format Decoded
HEIC (High Efficiency Image Container) is Apple's default camera format since iOS 11, using HEVC (H.265) video compression to produce photos 50% smaller than JPEG at equivalent quality. It supports 10-bit color, Live Photos, depth maps, and burst sequences. The problem: no web browser supports HEIC natively, and Windows/Android support is limited. You must convert HEIC to JPEG or PNG before sharing outside Apple's ecosystem.
Mar 19, 2026 8 min read
Video
How Video Compression Works and Why It Matters
Video compression reduces file size by eliminating redundant visual information between frames (inter-frame) and within each frame (intra-frame). The codec (H.264, H.265, VP9, AV1) handles the compression math, while the container (MP4, MKV, WebM) packages the compressed streams together. Understanding this distinction is the key to converting video files without unnecessary quality loss.
Mar 19, 2026 9 min read
Document
HTML as a Document Format: Underrated and Powerful
HTML is the most universally renderable document format in existence — every device with a screen has a browser that can display it. With semantic markup, it's accessible by default. With print stylesheets, it handles paper output. With data URIs, it can be a single self-contained file. HTML is an underrated choice for documents that need to be responsive, searchable, and accessible.
Mar 19, 2026 8 min read
Font
Icon Fonts vs SVG Icons: The Definitive Comparison
SVG icons are objectively better than icon fonts in 2026. They're accessible (title, aria-label), treeshakeable (ship only icons you use), immune to font loading issues (no FOUT flash), and styleable with CSS and JS. Icon fonts were a clever hack for HTTP/1.1 limitations. Those limitations are gone. Use inline SVG or SVG sprites.
Mar 19, 2026 7 min read
Image
Best Image Formats for Photography: From Camera to Print
Photography uses different formats at each stage of the workflow. RAW (CR2, NEF, ARW, DNG) captures maximum sensor data (12-14 bit, 25-60MB). TIFF (16-bit) preserves full quality during editing. JPEG (quality 85-92) is the standard delivery format with universal compatibility. WebP and AVIF provide 25-50% smaller files for web delivery. Print requires 300 DPI in CMYK TIFF or high-quality JPEG. For archival, use DNG (open RAW) or 16-bit TIFF with embedded color profiles.
Mar 19, 2026 9 min read
Image
Image Formats and Sizes for Every Social Platform in 2026
Every social platform re-compresses uploaded images, so start with high-quality sources (JPEG 90-95 or PNG for text-heavy graphics). Instagram supports 1080x1080 (feed square), 1080x1350 (feed portrait, highest engagement), and 1080x1920 (Stories/Reels). X/Twitter uses 1200x675 for link cards and 1600x900 for timeline images. Facebook recommends 1200x630 for shared links. LinkedIn uses 1200x627. Upload at 2x the display size for Retina sharpness. PNG preserves text and graphic clarity better than JPEG for infographics and screenshots.
Mar 19, 2026 9 min read
Image
Every Image Format Explained: The Complete Reference
There are 12+ image formats in common use, each built for a specific job. JPEG is for photographs (lossy, 24-bit, universal). PNG is for lossless graphics with transparency. WebP replaces both for web delivery at 25-35% smaller files. GIF is for simple animations. AVIF is the next-gen format with HDR support. HEIC is Apple's default camera format. SVG is for scalable vector graphics. Use this guide to pick the right one.
Mar 19, 2026 10 min read
General
Building an Image Optimization Pipeline for the Web
An image optimization pipeline starts with high-quality source files, processes them at build time (resize, compress, generate modern formats), serves responsive variants via srcset, and uses a CDN for edge delivery with content negotiation. The key tools are sharp for Node.js processing, the <picture> element for format selection, and Cloudflare/imgix/Cloudinary for on-the-fly transformation.
Mar 19, 2026 8 min read
Image
JPEG: Everything About the World's Most Used Image Format
JPEG is a lossy image format using DCT (Discrete Cosine Transform) compression, created in 1992 and still the most universally supported image format in existence. It works by converting images to the YCbCr color space, applying chroma subsampling, dividing pixels into 8x8 blocks, transforming them via DCT, and quantizing the coefficients. JPEG supports quality levels from 0-100, progressive and baseline encoding modes, and EXIF/IPTC metadata. It's the right format for photographs when universal compatibility matters, though WebP and AVIF produce smaller files.
Mar 19, 2026 9 min read
Data
JSON: The Data Format That Runs the Internet
JSON (JavaScript Object Notation) is a lightweight text-based data format with six types: string, number, boolean, null, array, and object. Defined by RFC 8259, it is the default format for web APIs, config files, and data interchange. It deliberately lacks comments, trailing commas, and a native date type.
Mar 19, 2026 10 min read
Data
JSON vs YAML for Configuration Files
Use YAML for config files humans edit (comments, readability, multiline strings). Use JSON for config files machines generate and consume (strict syntax, guaranteed parse consistency, universal support). YAML has more footguns; JSON has more limitations. For greenfield projects, TOML often beats both.
Mar 19, 2026 7 min read
Data
JSONL and NDJSON: Streaming JSON for Large Datasets
JSONL (JSON Lines) and NDJSON (Newline Delimited JSON) are the same format: one complete JSON object per line, separated by newlines, with no wrapping array. This enables streaming (process line by line with constant memory), appending (add records without re-serializing), and parallel processing of large JSON datasets.
Mar 19, 2026 7 min read
Ebook
Kindle Publishing Formats: From Manuscript to KDP
Amazon KDP accepts EPUB (preferred), DOCX, KPF, and PDF. All uploads are converted internally to KFX for delivery. EPUB gives the most predictable results. Cover images must be 2560x1600 RGB JPEG/TIFF. Always test with Kindle Previewer 3 before publishing.
Mar 19, 2026 7 min read
General
File Formats for Large Datasets: Beyond CSV and Excel
CSV and Excel hit hard limits at scale: CSV has no types and uses 5-10x its file size in memory; Excel caps at 1,048,576 rows. For large datasets, Parquet (columnar, compressed, schema-aware) is the default choice for analytics. Arrow/Feather is fastest for in-memory interchange. SQLite provides queryable single-file databases. HDF5 handles multidimensional scientific data. JSONL enables streaming JSON processing. The right format depends on your access pattern: full scans, column selection, random access, or streaming.
Mar 19, 2026 9 min read
General
Lazy Loading and Modern Image Formats: A Performance Guide
Native lazy loading via loading="lazy" handles 90% of cases. Combine it with the <picture> element for AVIF/WebP/JPG format cascading, explicit width/height attributes to prevent CLS, and fetchpriority="high" on LCP images. Use LQIP or BlurHash placeholders for premium perceived performance.
Mar 19, 2026 7 min read
Document
File Formats for Legal Documents: Compliance and Best Practices
Legal documents have strict format requirements: PDF/A for court filings and long-term archival, PAdES digital signatures for authenticated documents, proper redaction that removes data from all layers (not just visual masking), and metadata scrubbing before sharing to prevent inadvertent disclosure of hidden information.
Mar 19, 2026 8 min read
Audio
FLAC vs ALAC vs WAV vs AIFF: Lossless Audio Compared
All four formats preserve audio bit-for-bit, but they differ in everything else. FLAC: best compression, widest non-Apple support, open-source. ALAC: Apple's lossless, native Apple device support, open-sourced in 2011. WAV: uncompressed, universal, minimal metadata, 4 GB limit. AIFF: Apple's uncompressed format, better metadata than WAV, same quality and size.
Mar 19, 2026 7 min read
General
Lossy vs Lossless: The Complete Guide to Data Compression
Lossy compression permanently discards data the human senses are unlikely to notice, producing dramatically smaller files at the cost of irreversible quality loss. Lossless compression reduces file size by finding and encoding patterns more efficiently, preserving every bit of the original. The right choice depends on whether you need a bit-perfect copy or a perceptually identical one.
Mar 19, 2026 9 min read
Ebook
Digital Manga Formats: From Scans to Official Releases
Digital manga uses CBZ/CBR for fan scans and personal archives, EPUB fixed layout for official publisher releases, and proprietary streaming formats for platform apps. Key requirements: right-to-left reading direction, minimum 1200x1800 pixel resolution per page, and B&W-optimized image compression.
Mar 19, 2026 6 min read
Document
Markdown: The Writer's Plaintext Format
Markdown is a lightweight markup language created by John Gruber in 2004 that uses plain-text formatting syntax (headings with #, bold with **, links with []()), designed to be readable without rendering. CommonMark standardized the ambiguous parts of the original spec, and GitHub Flavored Markdown (GFM) added tables, task lists, and strikethrough. Markdown has become the default format for documentation, README files, note-taking apps, and static site generators.
Mar 19, 2026 10 min read
General
Metadata in Files: What's Hidden Inside Your Documents
Every file carries invisible metadata: EXIF data in photos reveals your camera model and GPS coordinates, ID3 tags in audio store artist and album info, and documents embed author names, edit history, and software versions. This metadata can expose private information and may or may not survive format conversion depending on the target format.
Mar 19, 2026 7 min read
General
MIME Types and Content-Type: A Developer's Reference
MIME types (type/subtype format like image/jpeg or application/json) tell browsers how to handle files. The Content-Type header sets the MIME type for HTTP responses. Always set it correctly, pair it with X-Content-Type-Options: nosniff to prevent MIME sniffing, and use Content-Disposition to control inline display vs download behavior.
Mar 19, 2026 7 min read
Video
MKV (Matroska): The Everything Container
MKV (Matroska Video) is an open-source container format that can hold virtually any video codec, any audio codec, unlimited subtitle tracks, and chapter markers in a single file. It's the preferred format for media servers, Blu-ray rips, and archival because it sacrifices nothing — but browsers can't play it natively, which limits its use for web delivery and sharing.
Mar 19, 2026 8 min read
Ebook
MOBI, AZW, and KFX: Amazon's Kindle Format Timeline
Amazon has cycled through four ebook formats in 18 years: MOBI (2007, PalmDOC-based), AZW (MOBI + DRM), AZW3/KF8 (2012, HTML5/CSS3), and KFX (2015, enhanced typesetting). Since 2022, Send-to-Kindle accepts EPUB directly, making the proprietary format stack less relevant for personal documents.
Mar 19, 2026 8 min read
Video
MOV: Apple's QuickTime Container
MOV is Apple's QuickTime container format and the direct ancestor of MP4. iPhones record video as MOV, Final Cut Pro exports ProRes in MOV, and professional editing workflows revolve around it. Since MOV and MP4 share the same MPEG-4 container base, converting between them is often a remux — instant and lossless. MOV is the right format inside Apple's ecosystem; MP4 is the right format for everything else.
Mar 19, 2026 8 min read
Audio
MP3: The Format That Changed Music Forever
MP3 (MPEG-1 Audio Layer 3) uses psychoacoustic modeling to discard audio data humans can't perceive, shrinking files to roughly 1/10th their uncompressed size. At 192 kbps or above, most listeners can't distinguish MP3 from the original. Its patents expired in 2017, making it fully free — but newer codecs like AAC and Opus outperform it at every bitrate.
Mar 19, 2026 8 min read
Video
MP4: The Universal Video Container Explained
MP4 (MPEG-4 Part 14) is the most universally supported video container format. It wraps H.264, H.265, or AV1 video with AAC audio into a single file that plays on every device, browser, and platform. Its combination of efficient compression, streaming support via fragmented MP4, and universal compatibility makes it the default choice for sharing, uploading, and distributing video.
Mar 19, 2026 8 min read
Archive
Nested Archives: Handling Archives Within Archives
Nested archives — archives inside archives — occur from email workflows, legacy distribution, and software packaging (JARs in WARs, Docker layers). Recursive extraction requires multiple passes or specialized tools. Watch for zip bombs (archives that decompress to petabytes) and always estimate disk space requirements before extracting nested archives.
Mar 19, 2026 5 min read
General
ODS: The Open Spreadsheet Format
ODS (OpenDocument Spreadsheet) is the ISO/IEC 26300 standard for spreadsheets, the default format in LibreOffice Calc and Apache OpenOffice. It uses a ZIP of XML files (similar to XLSX) and supports formulas, formatting, charts, and pivot tables. Government mandates in the EU, UK, and several countries require ODS for public documents. XLSX compatibility is good for data and basic formulas but degrades for charts, macros, and advanced conditional formatting.
Mar 19, 2026 6 min read
Document
ODT and OpenDocument: The Open Standard Alternative
ODT (OpenDocument Text) is part of the ODF standard (ISO/IEC 26300), the format LibreOffice uses by default. It's a ZIP archive of XML files — structurally similar to DOCX but governed by an independent standards body (OASIS) rather than one vendor. ODT is the format of choice when vendor independence and long-term archival matter more than Microsoft Office compatibility.
Mar 19, 2026 8 min read
Audio
OGG, Vorbis, and Opus: The Open Source Audio Formats
Vorbis and Opus are royalty-free, open-source audio codecs from the Xiph.Org Foundation. Vorbis (2000) beats MP3 at matched bitrates and is used by Spotify for desktop streaming. Opus (2012) is the best lossy audio codec available — transparent at 128 kbps for music, excellent at 32 kbps for voice — and powers WebRTC, Discord, Zoom, and Teams. OGG is the container format that wraps both.
Mar 19, 2026 8 min read
General
Open vs Proprietary File Formats: Why It Matters
Open file formats have publicly available specifications that anyone can implement without license fees. Proprietary formats are controlled by a single company, creating vendor lock-in and obsolescence risk. For long-term data access, interoperability, and tool ecosystem breadth, open formats are almost always the better choice.
Mar 19, 2026 6 min read
Font
OpenType Features: Ligatures, Small Caps, and Beyond
OpenType features are hidden capabilities built into professional fonts — ligatures, small caps, old-style figures, swashes, and up to 20 stylistic sets of alternate character designs. Activate them in CSS with font-variant-* properties (preferred) or font-feature-settings (low-level). Most web fonts include features that developers never activate.
Mar 19, 2026 8 min read
Image
The Complete Guide to Image Optimization for the Web
Web image optimization involves three layers: format (WebP/AVIF instead of JPEG/PNG saves 25-50%), sizing (responsive images via srcset/sizes serve the right resolution per device), and delivery (lazy loading, CDN caching, and preloading critical images). Together, these can reduce image bandwidth by 60-80% while improving Core Web Vitals scores. The biggest single win is converting unoptimized PNGs and JPEGs to WebP — often a 30-50% reduction with zero visible quality loss.
Mar 19, 2026 10 min read
Document
How to Edit a PDF: Methods, Tools, and Format Realities
PDFs store text as individually positioned characters on a fixed canvas, not as flowing paragraphs. Direct editing is limited to small changes (fixing typos, adding signatures). For substantial edits, the reliable workflow is: convert PDF to DOCX, edit in a word processor, then convert back to PDF.
Mar 19, 2026 9 min read
General
Generating PDFs in Web Applications
Client-side PDF generation (jsPDF, html2pdf.js) works for simple documents without server round-trips. Server-side generation (Puppeteer, WeasyPrint, wkhtmltopdf) handles complex layouts with full CSS support. Choose based on complexity: jsPDF for programmatic documents, html2pdf.js for rendering visible HTML, and Puppeteer for pixel-perfect HTML-to-PDF with modern CSS.
Mar 19, 2026 7 min read
General
Pivot Tables and Export: What Survives Format Conversion
Pivot tables store both their configuration (field layout, aggregation functions, filters) and a cached copy of the computed results. When exporting to CSV, only the flattened values survive — all pivot functionality is lost. PDF preserves the visual snapshot. ODS partially preserves pivot structure but refresh behavior may differ. XLSX preserves everything within the Excel ecosystem. For cross-platform work, convert pivot output to a flat table before exporting.
Mar 19, 2026 6 min read
Image
PNG: The Definitive Guide to Lossless Images
PNG (Portable Network Graphics) is a lossless image format using DEFLATE compression, supporting 5 color types (grayscale, RGB, indexed, grayscale+alpha, RGBA), bit depths up to 16 per channel, and full 8-bit alpha transparency. It's the standard choice for screenshots, logos, graphics with text, and any image requiring pixel-perfect accuracy. For web photos, WebP or AVIF is smaller; for print, TIFF offers more features. PNG's sweet spot is lossless web graphics.
Mar 19, 2026 9 min read
Image
PNG vs JPG vs WebP: Which Image Format Should You Use?
Use WebP for web delivery — it produces files 25-35% smaller than JPG and 80%+ smaller than PNG at equivalent quality. Use PNG when you need lossless quality with transparency (screenshots, graphics with text, logos). Use JPG when you need universal compatibility and are working with photos (email, print, legacy systems).
Mar 19, 2026 8 min read
Document
Print-Ready File Formats: From Screen to Paper
Print-ready files use PDF/X (the ISO standard for print), CMYK color (not RGB), 300 DPI minimum resolution, embedded fonts (or outlined text), and include bleed and crop marks. Getting these right means your printed output matches what you see on screen. Getting them wrong means reprints, delays, and wasted money.
Mar 19, 2026 8 min read
General
Print vs Digital: Choosing the Right Format for Your Output
Print and digital outputs have fundamentally different format requirements. Print needs CMYK color, 300dpi resolution, and formats that preserve color fidelity (PDF/X, TIFF). Digital needs RGB color, specific pixel dimensions, and formats optimized for file size (WebP, PNG, SVG). Getting these wrong wastes money on reprints or delivers broken assets to screens.
Mar 19, 2026 6 min read
General
Progressive Enhancement for Media: Images, Video, and Audio
Progressive enhancement for media means serving AVIF to browsers that support it, WebP to the rest, and JPG/MP4 as universal fallbacks. Respect user preferences: reduced motion via prefers-reduced-motion, data saving via the Save-Data header and Network Information API, and contrast needs via prefers-contrast. Every media element should work without JavaScript.
Mar 19, 2026 7 min read
Archive
RAR: The Proprietary Archive You Can't Avoid
RAR is a proprietary archive format created by Eugene Roshal with good compression, solid archive support, and a unique recovery record feature for repairing damaged archives. Despite being proprietary and requiring paid WinRAR to create, RAR remains widespread due to legacy use in file-sharing communities and its genuinely useful error-recovery capability.
Mar 19, 2026 6 min read
General
RAW Photo Formats: CR3, NEF, ARW, DNG, and More
RAW is not a single format but a category of camera-specific formats (Canon CR3/CR2, Nikon NEF, Sony ARW, Fuji RAF, and others) that store unprocessed sensor data at 12-14 bit depth. Each manufacturer uses a proprietary container and compression scheme. Adobe DNG is the open-standard alternative designed for long-term archival and cross-tool compatibility.
Mar 19, 2026 7 min read
Image
How to Reduce Image File Size Without Losing Quality
The fastest way to reduce image file size is to change the format: converting PNG to WebP cuts file size by 80-90% for photographs, and JPG to WebP saves 25-35%. After format, lower the JPG/WebP quality to 80-85% (visually identical to 100% but 60-70% smaller). Strip EXIF metadata for another 50-500KB savings. These three steps alone can reduce a 25MB image to under 500KB with no visible quality loss.
Mar 19, 2026 7 min read
General
Responsive Images: srcset, sizes, and Format Selection
Responsive images use srcset with width descriptors (400w, 800w) to offer multiple resolutions, the sizes attribute to tell the browser the display width, and the <picture> element for format selection (AVIF/WebP/JPG) and art direction. The browser picks the optimal combination of format and resolution before downloading anything.
Mar 19, 2026 7 min read
Document
Best File Formats for Resumes and CVs
Submit your resume as a text-based PDF unless the application specifically requests DOCX. Modern ATS (Applicant Tracking Systems) parse PDF just as well as DOCX. PDF preserves your layout exactly, while DOCX may render differently on the recruiter's machine. Always have both formats ready.
Mar 19, 2026 8 min read
Audio
Ringtone Formats: M4R, MP3, OGG, and Device Compatibility
iPhone uses M4R (renamed M4A/AAC, max 40 seconds). Android uses MP3 or OGG natively. Creating a ringtone from any audio file requires trimming to 30-40 seconds, converting to the right format, and transferring to your device. M4R is just AAC audio in an MPEG-4 container with a different file extension — no special encoding needed.
Mar 19, 2026 6 min read
Document
RTF: The Cross-Platform Document Format That Refuses to Die
RTF (Rich Text Format) is a Microsoft-created document format from 1987 that stores formatted text as plain-text control codes. It can't carry macros (making it inherently safer than DOC/DOCX), works on every platform, and still fills a niche: cross-platform document exchange where basic formatting is sufficient and security matters.
Mar 19, 2026 6 min read
Audio
Sample Rate and Bit Depth: What Musicians Need to Know
Sample rate determines the highest frequency captured (44.1 kHz captures up to 22.05 kHz, covering all of human hearing). Bit depth determines dynamic range (16-bit = 96 dB, 24-bit = 144 dB). For playback, 44.1 kHz/16-bit is perfect. For recording and production, 48 kHz/24-bit gives useful headroom. Higher rates (96/192 kHz) are measurably different but inaudible on finished music.
Mar 19, 2026 8 min read
Video
Screen Recording Formats: Which One to Use
For screen recordings, record to MKV with hardware H.264 encoding, then remux to MP4 afterward. MKV is crash-safe (recoverable if the recording stops unexpectedly), and hardware encoding keeps CPU usage under 5% so your computer stays responsive. After recording, remux MKV to MP4 for sharing — it takes seconds and preserves quality perfectly.
Mar 19, 2026 8 min read
Image
Best Formats for Screenshots: PNG, JPG, or WebP?
PNG is the best default format for screenshots because its lossless compression preserves text clarity and sharp UI edges. JPEG creates visible artifacts around text that make screenshots look unprofessional. WebP lossless is 25% smaller than PNG with identical quality — the best choice for web-shared screenshots. macOS and Windows both default to PNG. Use JPEG only for screenshot-heavy content that's primarily photographic (game captures, movie scenes).
Mar 19, 2026 6 min read
Archive
Self-Extracting Archives (SFX): When and How to Use Them
Self-extracting archives (SFX) are executables that contain compressed files plus extraction code, requiring no archive software from the recipient. They're created with 7-Zip or WinRAR and can include installation-like features. However, antivirus software frequently flags SFX files, email providers block .exe attachments, and macOS/Linux recipients can't run Windows SFX files.
Mar 19, 2026 5 min read
Ebook
Self-Publishing File Formats: Amazon, Apple, Google, and More
EPUB is the universal self-publishing format accepted by every major platform. Amazon KDP accepts EPUB (preferred), DOCX, and PDF. Apple Books requires EPUB3. Google Play and Kobo accept EPUB and PDF. One well-crafted EPUB file covers all platforms except Amazon, which converts it internally to KFX.
Mar 19, 2026 9 min read
General
Caching Files with Service Workers: Strategies by Format
Different file types need different caching strategies. Fonts and versioned static assets use cache-first (fastest, cached indefinitely). Images use stale-while-revalidate (instant display, background refresh). API responses use network-first (fresh data, cached fallback). Precache critical assets during service worker install, and use runtime caching for everything else.
Mar 19, 2026 7 min read
General
Export Settings for Social Media: Platform-by-Platform Guide
Every social media platform re-compresses uploaded images and videos, so upload the highest quality source within their size limits. Instagram: 1080px wide JPG/PNG for feed, 1080x1920 for stories/reels. YouTube: 4K H.264 MP4 if possible. TikTok: 1080x1920 MP4. Twitter/X: 1200x675 JPG. LinkedIn: 1200x627. All platforms strip color profiles and re-encode to their internal codecs.
Mar 19, 2026 8 min read
Archive
Splitting Large Archives: Multi-Volume Archives Explained
Multi-volume archives split a large archive into fixed-size parts for email attachments, FAT32 drives (4GB limit), or upload restrictions. 7z and RAR have native multi-volume support. ZIP's multi-volume support is limited. The Unix split command works with any format. All parts are required to extract the archive.
Mar 19, 2026 5 min read
General
Spreadsheet Compatibility: Excel, Google Sheets, LibreOffice
Excel, Google Sheets, and LibreOffice Calc all read XLSX, but they render differently. Core features (basic formulas, data types, simple formatting) are universal. Charts, conditional formatting, macros, and advanced functions diverge significantly. Excel has the most features, Google Sheets has the best collaboration, and LibreOffice has the best open standards compliance. For cross-platform documents, stick to the shared feature set and test in all target applications before distributing.
Mar 19, 2026 8 min read
General
Spreadsheet Formulas in Format Conversion
Converting XLSX to CSV replaces all formulas with their last computed values — permanently. XLSX to ODS preserves most formulas but Excel-specific functions (XLOOKUP, LET, LAMBDA, dynamic arrays) become errors. ODS to XLSX preserves most formulas but LibreOffice-specific functions become errors. Google Sheets functions like QUERY and IMPORTRANGE become static values in every export format. Always audit formulas before converting to catch functions that won't translate.
Mar 19, 2026 6 min read
General
Migrating Spreadsheets to Databases: Format Considerations
CSV is the standard bridge format between spreadsheets and databases, but the migration is never just an import. You must handle type inference (CSV has no types), NULL vs. empty string ambiguity, date format standardization, schema design from flat spreadsheet structure, and data cleaning during migration. Tools like pgloader, DBeaver, and pandas simplify the mechanics, but the hard part is defining the target schema and cleaning the data to fit it.
Mar 19, 2026 7 min read
Image
SVG: When and How to Use Scalable Vector Graphics
SVG (Scalable Vector Graphics) is an XML-based image format that stores shapes, paths, and text as mathematical descriptions rather than pixels. SVGs scale to any resolution without quality loss, can be styled with CSS, manipulated with JavaScript, and are typically 2-10x smaller than equivalent PNGs for logos and icons. Use SVG for logos, icons, illustrations, charts, and UI elements. Use raster formats (PNG, WebP) for photographs and complex visual content.
Mar 19, 2026 9 min read
Archive
TAR: The Unix Tape Archive That Runs Linux
TAR (Tape Archive) is a Unix bundling format from 1979 that combines multiple files into a single stream without compressing them. It preserves Unix permissions, ownership, symlinks, and timestamps. Paired with GZIP, XZ, or Zstandard for compression, TAR is the standard archive format for Linux software distribution and system backups.
Mar 19, 2026 6 min read
Data
TOML: The Configuration Format That Gets Out of Your Way
TOML (Tom's Obvious Minimal Language) is a configuration file format created by Tom Preston-Werner (GitHub co-founder) designed for clarity and simplicity. It has explicit types including native dates, no indentation-based structure, and is the standard config format for Rust (Cargo.toml), Python (pyproject.toml), and Go modules.
Mar 19, 2026 8 min read
Image
Transparent Images: Which Formats Support Alpha Channels
PNG supports full 8-bit alpha transparency (256 levels of opacity per pixel). WebP supports 8-bit alpha with the unique ability to compress it losslessly while the RGB data is lossy. AVIF supports alpha with 8, 10, or 12-bit depth. GIF only supports binary (on/off) transparency. JPEG has no transparency support whatsoever. SVG supports vector transparency natively. For most transparent image needs, PNG is the safe default and WebP is the optimized choice.
Mar 19, 2026 7 min read
General
TSV: Tab-Separated Values and When Tabs Beat Commas
TSV (Tab-Separated Values) uses tab characters as delimiters instead of commas. This eliminates the quoting problems that plague CSV: commas in addresses, semicolons in European locales, and nested quoted strings. TSV is the standard in bioinformatics (BLAST, BED, VCF formats), the native clipboard format when copying from spreadsheets, and the most Unix-pipeline-friendly tabular format. For data containing free-form text, TSV is reliably cleaner than CSV.
Mar 19, 2026 6 min read
Document
Understanding PDF: Editing, Converting, and Accessibility
PDF is a page description language descended from PostScript that stores visual positions of elements on a page, not semantic document structure. This is why converting a PDF to Word often produces mangled results: the PDF has no concept of "paragraphs" or "headings" — just text drawn at specific coordinates. Tagged PDFs add a semantic layer that dramatically improves conversion quality and accessibility.
Mar 19, 2026 9 min read
Font
Variable Fonts: One File to Replace Them All
Variable fonts store a continuous range of styles (weight, width, slant, and custom axes) in a single file using interpolation between master designs. One variable font replaces 8-20 static files, typically reducing total payload by 70%+ while enabling precise typographic control like weight 450 or width 87%.
Mar 19, 2026 9 min read
General
Vector vs Raster Graphics: When to Use Which
Raster graphics (JPG, PNG, WebP, TIFF) store images as pixel grids and are resolution-dependent — enlarge them and they blur. Vector graphics (SVG, AI, EPS, PDF) store images as mathematical curves and are infinitely scalable. Use raster for photographs and complex imagery, vector for logos, icons, illustrations, and anything that needs to work at multiple sizes.
Mar 19, 2026 6 min read
Video
Video Codecs Explained: From H.264 to AV1
A video codec compresses raw video frames into a manageable file size by exploiting redundancy within and between frames. The codec (H.264, H.265, VP9, AV1) handles compression; the container (MP4, MKV, WebM) packages the compressed streams. Understanding frame types, profiles, encoding modes, and the hardware vs software tradeoff lets you choose the right settings for any conversion task.
Mar 19, 2026 8 min read
Video
Video Formats and Settings for Every Social Platform
Every social platform re-encodes your video regardless of what you upload. The goal is to upload the highest quality source possible (MP4, H.264, AAC) so the platform's re-encoding starts from the best material. Key specs: YouTube accepts up to 256GB; Instagram Reels wants 9:16 at 30fps under 90 seconds; TikTok wants 9:16 under 10 minutes; Twitter/X allows 2:20 at up to 512MB; Facebook prefers H.264 MP4; LinkedIn caps at 10 minutes.
Mar 19, 2026 8 min read
Video
Video Metadata and Chapters: What Survives Conversion
Video files carry more than pixels: metadata (title, date, GPS), chapter markers, subtitle tracks, multiple audio tracks, and HDR color data. During format conversion, some of this data is preserved automatically, some requires explicit flags, and some gets silently dropped. MKV preserves the most; MP4 preserves the basics; AVI and GIF preserve almost nothing. Understanding what survives is essential for archival and professional workflows.
Mar 19, 2026 8 min read
Video
Video Subtitles and Captions: Formats and Conversion
SRT is the universal subtitle format — plain text with timestamps, works everywhere. ASS/SSA adds styling (fonts, colors, positioning) and is standard for anime. VTT is the web standard for HTML5 video. PGS is image-based from Blu-rays. Soft subtitles stay as a separate track (toggleable); burned-in subtitles are rendered into the video pixels permanently. MKV supports all subtitle formats; MP4 only supports basic text subtitles.
Mar 19, 2026 8 min read
Video
How to Convert Video to GIF (And Why You Might Not Want To)
GIF is the worst format for video content — a 10-second 480p GIF can be 20MB while the same clip as MP4 is 500KB. GIF is limited to 256 colors per frame and has no inter-frame compression, making files 20-50x larger than equivalent video. Use HTML5 video (autoplay, loop, muted) instead for web use. GIF remains necessary only where video isn't supported: messaging apps, GitHub READMEs, email bodies, and Slack.
Mar 19, 2026 8 min read
Audio
Best Audio Formats for Voice Recording and Dictation
Voice has a narrower frequency range than music (primarily 300 Hz to 3.4 kHz for intelligibility, up to 8 kHz for natural sound), so it compresses extremely well. For maximum quality, use WAV at 48 kHz / 16-bit. For maximum efficiency, Opus at 32-64 kbps is clear and natural. Phone voice memos typically record in M4A (AAC) at 128 kbps — more than adequate for speech.
Mar 19, 2026 6 min read
Audio
WAV: Uncompressed Audio for Professionals
WAV stores raw PCM audio with zero compression — what goes in comes out bit-for-bit identical. CD-quality WAV (44.1 kHz, 16-bit, stereo) produces files at 10.1 MB per minute. It's the standard format for professional audio production because there's no decoding overhead, no generation loss across edits, and universal DAW support.
Mar 19, 2026 7 min read
General
Web Accessibility for Media: Images, Video, and Audio
Every image needs alt text (descriptive for informative images, empty for decorative). Every video needs captions (<track> element with WebVTT files). Every audio-only content needs a transcript. Use figure/figcaption for images with context, role and aria-label for SVGs, and prefers-reduced-motion to respect motion sensitivity. These are WCAG requirements, not nice-to-haves.
Mar 19, 2026 7 min read
General
Audio Formats for Web Applications
MP3 is universally supported and the safe default for web audio. AAC offers better quality at the same bitrate. Opus is the best lossy codec by quality metrics and the default for WebRTC. WAV is for low-latency sound effects via the Web Audio API. Use the <audio> element with <source> fallbacks for format negotiation.
Mar 19, 2026 6 min read
Font
Web Font Performance: Loading Strategies That Actually Work
Self-host WOFF2. Subset aggressively. Use font-display: swap. Preload your critical font with crossorigin. Use size-adjust to minimize CLS. That's the entire web font performance strategy in five steps.
Mar 19, 2026 9 min read
General
Embedding Video on the Web: Formats, Codecs, and Performance
Embed web video with the HTML5 <video> element using MP4 H.264 as the universal format and WebM VP9 as a smaller alternative for modern browsers. Use preload="metadata" for non-critical video, always include a poster image, and only autoplay when muted. For long-form content, use HLS or DASH adaptive streaming.
Mar 19, 2026 7 min read
Video
WebM: Google's Open Video Format for the Web
WebM is Google's royalty-free video format built for the web. It pairs VP8, VP9, or AV1 video with Vorbis or Opus audio in a Matroska-based container. Every major browser plays WebM natively, and VP9 WebM files are typically 30-50% smaller than H.264 MP4 at the same visual quality. The tradeoff: encoding is slower, and device support outside browsers is inconsistent.
Mar 19, 2026 8 min read
Image
WebP: The Complete Guide to Google's Image Format
WebP is Google's web-optimized image format that produces files 25-35% smaller than JPEG and PNG at equivalent quality. It supports both lossy and lossless compression, animation, and alpha transparency. With 97%+ browser support in 2026, WebP is the default recommendation for web image delivery unless you need HDR (use AVIF) or universal email/print compatibility (use JPEG).
Mar 19, 2026 9 min read
General
When Not to Convert: Keeping Files in Their Original Format
Sometimes the best conversion is no conversion. Lossy-to-lossy transcoding degrades quality. Converting already-optimized formats wastes time. Source files (PSD, RAW, project files) should stay in their native format for editability. The golden rule: always keep the original, and only convert a copy when there's a clear reason.
Mar 19, 2026 5 min read
Font
WOFF2: The Only Web Font Format You Need
WOFF2 compresses font files with Brotli, producing files 30% smaller than WOFF1 and 50-60% smaller than raw TTF/OTF. With 97%+ browser support, it's the only web font format you need in 2026. Serve WOFF2 with font-display: swap, preload your critical font, and you're done.
Mar 19, 2026 7 min read
General
XLS vs XLSX: Binary vs XML Spreadsheets
XLS is the legacy binary spreadsheet format (BIFF8) used by Excel 97-2003, limited to 65,536 rows and 256 columns. XLSX is the modern Open XML format (a ZIP of XML files) introduced in Excel 2007, supporting 1,048,576 rows and 16,384 columns. XLSX is smaller, safer (no macros by default), and the universal standard. XLS persists in legacy systems and regulated industries that haven't migrated.
Mar 19, 2026 6 min read
General
XLSX: Inside Excel's Modern Spreadsheet Format
XLSX is a ZIP archive containing XML files that define worksheets, styles, shared strings, and metadata. It replaced the binary XLS format in Office 2007 with the Open XML standard, supporting up to 1,048,576 rows and 16,384 columns per sheet. Understanding its internals explains why files bloat, why some features survive format conversion, and why .xlsm exists separately for macros.
Mar 19, 2026 8 min read
Data
XML: The Extensible Markup Language That Won't Retire
XML (Extensible Markup Language) is a self-describing, schema-validatable markup language descended from SGML. Despite losing the API format war to JSON, XML remains irreplaceable for document markup (DOCX, SVG), enterprise integration (SOAP, SAML), data validation (XSD), and transformation (XSLT). It's verbose, but that verbosity buys power no other format matches.
Mar 19, 2026 10 min read
Data
XML Namespaces and Schemas: A Practical Guide
XML namespaces prevent element name collisions when combining multiple vocabularies in one document. XML schemas (XSD, DTD, RelaxNG) define and enforce document structure at the parser level. Together, they enable XML's unique strength: composable, validated documents from multiple independent sources.
Mar 19, 2026 8 min read
Data
YAML: Human-Friendly Data That Bites Back
YAML (YAML Ain't Markup Language) is a human-readable data serialization format that's technically a JSON superset. It powers Kubernetes, Docker Compose, GitHub Actions, and most DevOps tooling. Its indentation-based syntax is clean but unforgiving, and its implicit type coercion — the infamous Norway problem — has caused production outages.
Mar 19, 2026 9 min read
Data
YAML vs TOML: Which Config Format Should You Choose?
TOML is simpler, more explicit, and has fewer surprises — ideal for application config with 1-3 levels of nesting. YAML is more expressive and handles deep nesting naturally — required for Kubernetes, CI/CD, and infrastructure tooling. Choose based on nesting depth and ecosystem alignment.
Mar 19, 2026 7 min read
Archive
ZIP: The Universal Archive Format
ZIP is the most universally supported archive format, readable by every major operating system without extra software. It uses per-file Deflate compression, supports AES-256 encryption, and handles archives up to 16 exbibytes with ZIP64. Use ZIP whenever the recipient might not have specialized archive software.
Mar 19, 2026 8 min read