M
MJJ Tools

🔤 UTF-8 Encoder/Decoder

Encode text to UTF-8 byte sequences or decode UTF-8 bytes back to readable text. Support multiple output formats including hexadecimal, decimal, and binary.

Input Text

Output Format

About UTF-8 Encoding

What is UTF-8?

  • UTF-8: Unicode Transformation Format - 8-bit
  • Variable-length: 1-4 bytes per character
  • ASCII compatible: First 128 characters match ASCII
  • Universal: Can represent any Unicode character
  • Web standard: Default encoding for HTML, XML, JSON

Encoding Examples

  • 'A': 0x41 (1 byte)
  • '€': 0xE2 0x82 0xAC (3 bytes)
  • '中': 0xE4 0xB8 0xAD (3 bytes)
  • '🌍': 0xF0 0x9F 0x8C 0x8D (4 bytes)
  • Emoji: Usually 4 bytes in UTF-8

Features

  • ✅ Encode text to UTF-8 bytes in multiple formats
  • ✅ Decode UTF-8 bytes back to readable text
  • ✅ Support for hexadecimal, decimal, and binary formats
  • ✅ Handle multi-byte characters (Chinese, Japanese, emoji)
  • ✅ Real-time error checking and validation
  • ✅ Character and byte count analysis
  • ✅ Copy to clipboard functionality

Use Cases

  • Web Development: Debug character encoding issues
  • Data Analysis: Analyze byte patterns in text data
  • Network Programming: Understand data transmission
  • File Processing: Handle international text files
  • Education: Learn about Unicode and character encoding