🔤 UTF-8 Encoder/Decoder
Encode text to UTF-8 byte sequences or decode UTF-8 bytes back to readable text. Support multiple output formats including hexadecimal, decimal, and binary.
Input Text
Output Format
About UTF-8 Encoding
What is UTF-8?
- • UTF-8: Unicode Transformation Format - 8-bit
- • Variable-length: 1-4 bytes per character
- • ASCII compatible: First 128 characters match ASCII
- • Universal: Can represent any Unicode character
- • Web standard: Default encoding for HTML, XML, JSON
Encoding Examples
- • 'A': 0x41 (1 byte)
- • '€': 0xE2 0x82 0xAC (3 bytes)
- • '中': 0xE4 0xB8 0xAD (3 bytes)
- • '🌍': 0xF0 0x9F 0x8C 0x8D (4 bytes)
- • Emoji: Usually 4 bytes in UTF-8
Features
- ✅ Encode text to UTF-8 bytes in multiple formats
- ✅ Decode UTF-8 bytes back to readable text
- ✅ Support for hexadecimal, decimal, and binary formats
- ✅ Handle multi-byte characters (Chinese, Japanese, emoji)
- ✅ Real-time error checking and validation
- ✅ Character and byte count analysis
- ✅ Copy to clipboard functionality
Use Cases
- • Web Development: Debug character encoding issues
- • Data Analysis: Analyze byte patterns in text data
- • Network Programming: Understand data transmission
- • File Processing: Handle international text files
- • Education: Learn about Unicode and character encoding