Unicode Character Lookup: Code Points
Look up any Unicode character — find code point, block, category, and HTML entity by searching name or symbol.
Published:
Tags: Unicode character lookup, Unicode code point search, character info finder
Unicode Character Lookup: Code Points Every character in the Unicode Standard has a unique code point, an official name, a block assignment, and a general category. A Unicode character lookup tool lets you find all of this by searching for the symbol, its name, or its hex code point value. --- What a Unicode Character Lookup Shows For any character, a lookup tool returns: | Property | Example | Description | |----------|---------|-------------| | Code Point | U+1F525 | Hex position in the Unicode Standard | | Name | FIRE | Official ALL-CAPS Unicode name | | Block | Miscellaneous Symbols and Pictographs | Contiguous range of code points | | General Category | So (Symbol, Other) | Broad character type | | Script | Common | Script assignment | | UTF-8 bytes | F0 9F 94 A5 | Hex byte sequence…
Frequently Asked Questions
How do I look up a Unicode code point?
Paste the character into a Unicode lookup tool, or search by its official name. The tool returns the code point in U+XXXX hex notation, the character's block, general category, and HTML entity. You can also look up by hex value directly — enter U+1F525 to retrieve the FIRE emoji.
What is a Unicode block?
A Unicode block is a named, contiguous range of code points assigned to a particular script or character set. Examples: Basic Latin (U+0000–U+007F), Latin Extended-A (U+0100–U+017F), CJK Unified Ideographs (U+4E00–U+9FFF), Emoticons (U+1F600–U+1F64F). Blocks help locate characters in the Unicode Standard and understand which script or purpose they serve.
What is the Unicode character category?
Every Unicode character is assigned a general category — a two-letter code like Lu (Letter, Uppercase), Ll (Letter, Lowercase), Nd (Number, Decimal Digit), Ps (Punctuation, Open), So (Symbol, Other). Categories are used in regex property escapes: \p{Lu} matches uppercase letters, \p{Nd} matches decimal digits across all scripts.
How do I find a character by name?
Every Unicode character has an official name in ALL CAPS — for example, LATIN SMALL LETTER E WITH ACUTE, FIRE, or COMBINING ACUTE ACCENT. Search the name in a Unicode lookup tool to find the character and its code point. Python's unicodedata.lookup('FIRE') returns the character; unicodedata.name(char) returns the name.
What is the difference between U+0041 and A?
They are the same character. U+0041 is the Unicode code point notation for LATIN CAPITAL LETTER A. The U+ prefix denotes Unicode, and 0041 is the hexadecimal code point value. The character itself — the letter A — is what's stored and rendered. Code points are the standard way to unambiguously identify a character independent of its visual representation.
All articles · theproductguy.in