Hexadecimal
Numeral systems by culture | |
---|---|
Hindu-Arabic numerals | |
Western Arabic Eastern Arabic Khmer |
Indian family Brahmi Thai |
East Asian numerals | |
Chinese Counting rods |
Korean Japanese |
Alphabetic numerals | |
Abjad Armenian Cyrillic Ge'ez |
Hebrew Ionian/Greek Sanskrit |
Other systems | |
Attic Babylonian Colombian Egyptian |
Etruscan Mayan Roman Urnfield |
List of numeral system topics | |
Positional systems by base | |
Decimal (10) | |
2, 4, 8, 16, 32, 64 | |
3, 9, 12, 24, 30, 36, 60, more… | |
In mathematics and computer science, hexadecimal, base-16, or simply hex, is a numeral system with a radix, or base, of 16, usually written using the symbols 0–9 and A–F, or a–f. Its primary purpose is to represent the binary code in a format easier for humans to read, and acts as a form of shorthand, in which one hexadecimal digit stands in place of four binary bits. For example, the decimal numeral 79, whose binary representation is 01001111, is 4F in hexadecimal (4 = 0100, F = 1111). IBM introduced the current hexadecimal system to the computing world; an earlier version, using the digits 0–9 and u–z, had been introduced in 1956, and had been used by the Bendix G-15 computer.
Contents |
Uses
0hex | = | 0dec | = | 0oct | 0 | 0 | 0 | 0 | |||
1hex | = | 1dec | = | 1oct | 0 | 0 | 0 | 1 | |||
2hex | = | 2dec | = | 2oct | 0 | 0 | 1 | 0 | |||
3hex | = | 3dec | = | 3oct | 0 | 0 | 1 | 1 | |||
4hex | = | 4dec | = | 4oct | 0 | 1 | 0 | 0 | |||
5hex | = | 5dec | = | 5oct | 0 | 1 | 0 | 1 | |||
6hex | = | 6dec | = | 6oct | 0 | 1 | 1 | 0 | |||
7hex | = | 7dec | = | 7oct | 0 | 1 | 1 | 1 | |||
8hex | = | 8dec | = | 10oct | 1 | 0 | 0 | 0 | |||
9hex | = | 9dec | = | 11oct | 1 | 0 | 0 | 1 | |||
Ahex | = | 10dec | = | 12oct | 1 | 0 | 1 | 0 | |||
Bhex | = | 11dec | = | 13oct | 1 | 0 | 1 | 1 | |||
Chex | = | 12dec | = | 14oct | 1 | 1 | 0 | 0 | |||
Dhex | = | 13dec | = | 15oct | 1 | 1 | 0 | 1 | |||
Ehex | = | 14dec | = | 16oct | 1 | 1 | 1 | 0 | |||
Fhex | = | 15dec | = | 17oct | 1 | 1 | 1 | 1 | |||
Hexadecimal is primarily used in computing to represent a byte. Representing the 256 possible values has a number of problems: first, there are unprintable control characters; second, ASCII itself stops at 7 bits with the remainder being system-specific extensions; and finally, even if all characters in the machine's set were displayable as something, neither users nor input methods are generally prepared to handle 256 unique characters.
Hex triplets
HTML and CSS use hexadecimal notation (hex triplets) to specify colors on web pages, with "#" standing for hexadecimal. Twenty-four-bit color is represented in the format #RRGGBB: where RR specifies the value of the Red component of the color, GG the Green component, and BB the Blue component. For example, a shade of red that is (238,9,63) in decimal is coded as #EE093F. This syntax is borrowed from the X Window System.
Example of conversion from hexadecimal triplet to decimal triplet: Hexadecimal triplet: FFCF4B
Step 1: Separate the triplets: FF CF 4B
Step 2: Convert each hexadecimal value to a decimal representation:
- FF = 15*16 + 15*1 = 255
- CF = 12*16 + 15*1 = 207
- 4B = 4*16 + 11*1 = 75
Result: Hexadecimal triplet FFCF4B = Decimal triplet 255,207,75
Other common uses
In URLs, all characters can be coded hexadecimally, even those not normally permitted. This is specified in RFC 3986. Each 2-digit (1 byte) hexadecimal sequence is preceded by a percent sign and refers to a specific UTF-8 character code. For example, in the URL http://en.wikipedia.org/wiki/Main%20Page, the (hexadecimal) UTF-8 character code for a space (" ") is 20.
Some software programs will create unique order numbers by using a hexadecimal representation of the exact second the order was taken, based on the total number of seconds since the start of the 20th century. For example, C9BCE0F5 represents April 2, 2007 14:19:32.
Page numbers on teletext are written in hexadecimal, with available numbers being in the range of 100-8FF. However, page numbers with letters are only used for "hidden" and engineering pages.
Representing hexadecimal
Some hexadecimal representations are indistinguishable from decimal representations (to humans and computers); therefore, some convention is usually used to flag them.
In typeset text, hexadecimal is often indicated by a subscripted suffix such as 5A316, 5A3SIXTEEN, or 5A3HEX. In computer programming languages (which are nearly always plain text without such typographical distinctions as subscript and superscript),a wide variety of alternative notations are used to indicate hexadecimal numbers.
The following are some of the most common representations:
- Ada and VHDL enclose hexadecimal numerals in based "numeric quotes", e.g.
16#5A3#
. (Note: Ada accepts this notation for all bases from 2 through 16 and for both integer and real types.) - C and languages with a similar syntax (such as C++, C#, Java and JavaScript) prefix hexadecimal numerals with
0x
, e.g.0x5A3
. The leading0
is used so that the parser can simply recognize a number, and thex
stands for hexadecimal. Thex
in0x
can be either in upper or lower case but is almost always seen written in lower case. In some languages other than C or C++,0b
is used to prefix Binary numbers, and occasionally (in Haskell, for example)0o
is used for Octal. - *nix shells use an escape character form
\x0FF
in expressions and0xFF
for constants. - In HTML, hexadecimal character references also use the x:
֣
should give the same as֣
– with your browser ֣ and ֣ respectively (Hebrew accent munah). Hexadecimal color references are prefixed with#
, e.g.#FFFFFF
(white). - Some assemblers indicate hex by an appended
h
(if the numeral starts with a letter, then also with a preceding 0, to indicate that it is a number), e.g.,0A3Ch
,5A3h
. The syntax may vary per assembly language. For example, the 6502 assembly language uses a $ as prefix, e.g. #$10FF (the # indicates an immediate numeric value). - Postscript indicates hex by a prefix
16#
. - Common Lisp use the prefixes
#x
and#16r
. - Pascal, other assemblers (AT&T, Motorola), and some versions of BASIC and Forth use a prefixed
$
, e.g.$5A3
. - The Smalltalk programming language uses the prefix
16r
. Note Smalltalk accepts the format<radix>r<digits>
where radix is a number base from 2 upwards (i.e. 2r1110 is 10r14 or 16rE), with the practical limitation being within the ASCII character set range 0-9 and A-Z used to represent the digits. Some versions of Smalltalk allow fractional digits following a period character,.
, enabling hexadecimal (and other base) representation of floating point numbers. - Some versions of BASIC, notably Microsoft's variants including QBasic and Visual Basic, prefix hexadecimal numerals with
&H
, e.g.&H5A3
; others such as BBC BASIC just used&
(used for octal in Microsoft's BASIC). - TI 89 and 92 series designate 0h (ex 0hA3)
- Notations such as
X'5A3'
are sometimes seen; PL/I uses such notation. This is the most common format for hexadecimal on IBM mainframes (zSeries) and minicomputers (iSeries) running the traditional OS's (zOS, zVSE, zVM, TPF, OS/400), and is used in Assembler, PL/1, Cobol, JCL, scripts, commands and other places. The most common exceptions are when using a language with a different native convention (C, Java, etc.). This format was common on other (and now obsolete) IBM systems as well. - Microchip's MPASM assembler uses
H'5A'
to represent hexadecimal numbers as well as the more common 0x prefix. - Donald Knuth introduced the use of different fonts to represent radices in his book The TeXbook. In his notation, hexadecimal representations are written in a typewriter type, e.g. 5A3
There is no single agreed-upon standard, so all the above conventions are in use, sometimes even in the same paper. However, as they are quite unambiguous, little difficulty arises from this.
The most commonly used (or encountered) notations are the ones with a prefix "0x" or a subscript-base 16, for hex numbers. For example, both 0x2BAD and 2BAD16 represent the decimal number 11181 (or 1118110).
The choice of the letters A through F to represent the additional digits was not universal in the early history of computers. During the 1950s, some installations favored using the digits 0 through 5 with a macron to indicate the values 10-15. Users of Bendix G-15 computers used the letters U through Z.
One solution how to write hexadecimal numbers distinctively is the use of figures that are made for the hexadecimal system but are not yet representable in Unicode.
Verbal representations
Not only are there currently no proper digits to represent the quantities from ten to fifteen (so letters are used as a substitute), but English also lacks a proper nomenclature to name hexadecimal numbers. Names such as "thirteen" and "fourteen" are decimal-based, and even though English has a few names for non-decimal powers—pair for the first binary power; score for the first vigesimal power; dozen, gross and great gross for the first three duodecimal powers—no English name currently exists for any of the hexadecimal powers (corresponding to the decimal values 16, 256, 4096, etc.). So people have resorted to reading hexadecimal numbers by naming their digits (or digit-letters) individually in sequence in the same way as reading phone numbers (i.e., 4DA as "four-dee-aye"). However, the letter 'A' sounds similar to '8', 'C' sounds similar to '3', and 'D' can easily be mistaken for the 'ty' suffix as in "forty"; so 4DA could be mistaken for 48. To avoid misunderstandings, some convention must be established when exchanging hexadecimal numbers verbally, at least until a proper hexadecimal nomenclature is developed (if ever). To avoid confusion, the digits A-F are commonly pronounced with the NATO phonetic alphabet ("four-delta-alpha"), the World War II era Joint Army/Navy Phonetic Alphabet ("four-dog-able"), or some approximation of one of those systems.
Fractions
As with other numeral systems, the hexadecimal system can be used to represent rational numbers, although recurring digits are common since 16 has only a single prime factor:
1⁄2 |
|
0.8 | 1⁄6 |
|
0.2AAAAAAAA... | 1⁄A |
|
0.1999999999... | 1⁄E |
|
0.1249249249... |
1⁄3 |
|
0.5555555555... | 1⁄7 |
|
0.2492492492... | 1⁄B |
|
0.1745D1745D... | 1⁄F |
|
0.1111111111... |
1⁄4 |
|
0.4 | 1⁄8 |
|
0.2 | 1⁄C |
|
0.1555555555... | 1⁄10 |
|
0.1 |
1⁄5 |
|
0.3333333333... | 1⁄9 |
|
0.1C71C71C71... | 1⁄D |
|
0.13B13B13B1... | 1⁄11 |
|
0.0F0F0F0F0F... |
Because the radix 16 is a perfect square (4²), fractions expressed in hexadecimal have an odd period much more often than decimal ones, and there are no cyclic numbers (other than trivial single digits). Recurring digits occur when the denominator in lowest terms has a prime factor not found in the radix; thus, when using hexadecimal notation all fractions with denominators that are not a power of two will result in an infinite string of recurring digits (e.g., thirds and fifths). This makes hexadecimal (and binary) less convenient than decimal (not to mention duodecimal and sexagesimal) for the purpose of representing rational numbers, since a larger proportion of them lie outside its range of finite representation. All rational numbers finitely representable in hexadecimal are also finitely representable in decimal, duodecimal and sexagesimal (i.e., any hexadecimal number with a finite number of digits will have a finite number of digits when expressed in those other bases), whereas only a fraction of those finitely representable in the latter ones are finitely representable in the former (e.g., decimal 0.1, that is, the fraction one tenth, corresponds to the infinite recurring representation 0.199999999999... in hexadecimal). Although hexadecimal is more efficient than those other bases for the particular case of representing fractions with powers of two as the denominator (cf. one sixteenth is 0.1 in hexadecimal, 0.09 in duodecimal, 0;3,45 in sexagesimal and 0.0625 in decimal).
Mapping to binary
Sometimes it is necessary to use binary data when working with computers, but it is difficult for humans to work with the large number of digits in binary. Although most humans are more familiar with the base 10 system, it is much easier to map binary to hexadecimal than to decimal because each hexadecimal digit maps to a whole number of bits (410). The following is an example of converting 11112 to base 10. Since each position in a binary numeral can only contain either a 1 or 0, its value may be easily determined by its position from the right:
- 00012 = 110
- 00102 = 210
- 01002 = 410
- 10002 = 810
Therefore:
11112 | = 810 + 410 + 210 + 110 |
= 1510 |
This example shows addition of 4 numbers; but with some practice, 11112 can be mapped directly to F16 in one step (see table in Representing hexadecimal). The advantage of using hexadecimal rather than decimal increases with the size of the number. When the number becomes large, conversion to decimal becomes much more tedious; however, when mapping to hexadecimal, it is simple to divide the binary string into blocks of 4 positions and map each block of 4 bits to a single position hexadecimal digit.
This example shows the conversion of a binary number to decimal, mapping each digit to the decimal value, and adding the results.
010111101011010100102 | = 26214410 + 6553610 + 3276810 + 1638410 + 819210 + 204810 + 51210 + 25610 + 6410 + 1610 + 210 |
= 38792210 |
Compare this to the conversion to hexadecimal, where each group of four digits can be considered independently, and converted directly:
010111101011010100102 | = | 0101 | 1110 | 1011 | 0101 | 00102 |
= | 5 | E | B | 5 | 216 | |
= | 5EB5216 |
Conversion from hexadecimal back to binary is just as direct.
The octal system can also be useful as a tool for people who need to deal directly with binary computer data, as in reading and understanding it. Compared to hexadecimal, octal represents data in blocks of 3 bits each, rather than 4.
One advantage of hexadecimal is that every unique 2-digit pair (or octet) always represents the same byte value. To "translate" a hexadecimal value into bytes, one needs only to separate the value into individual 2-digit groups, translate each group into its respective byte value, and then combine the results together to form an accurate translation of the entire original hexadecimal word. Conversely, bytes can also be easily translated into hexadecimal values by translating each byte individually into its hexadecimal 2-digit value, and then recombining the hexadecimal values into a "word". The resulting "word" will be an accurate hexadecimal representation of the original string of bytes.
Converting from other bases
Division-remainder in source base
As with all bases there is a simple algorithm for converting a representation of a number to hexadecimal by doing integer division and remainder operations in the source base. Theoretically this is possible from any base but for most humans only decimal and for most computers only binary (which can be converted by far more efficient methods) can be easily handled with this method.
Let d be the number to represent in hexadecimal, and the series hihi-1...h2h1 be the hexadecimal digits representing the number.
- i := 1
- hi := d mod 16
- d := (d-hi) / 16
- If d = 0 (return series hi) else increment i and go to step 2
"16" may be replaced with any other base that may be desired.
The following is a JavaScript implementation of the above algorithm for converting any number to a hexadecimal in String representation. Its purpose is to illustrate the above algorithm. To work with data seriously however, it is much more advisable to work with bitwise operators.
function toHex(d) { var r = d % 16; var result; if(d-r==0) result = toChar(r); else result = toHex( (d-r)/16 )+toChar(r); return result; } function toChar(n) { var alpha = "0123456789ABCDEF"; return alpha.charAt(n); }
Addition and multiplication in hexadecimal
It is also possible to make the conversion by assigning each place in the source base the hexadecimal representation of its place value and then performing multiplication and addition to get the final representation.
Conversion via binary
As computers generally work in binary the normal way for a computer to make such a conversion would be to convert to binary first (by doing multiplication and addition in binary) and then make use of the direct mapping from binary to hexadecimal.
Etymology
It was IBM that decided on the prefix of "hexa" rather than the proper Latin prefix of "sexa". The word "hexadecimal" is strange in that hexa is derived from the Greek έξ (hex) for "six" and decimal is derived from the Latin for "tenth". It may have been derived from the Latin root, but Greek deka is so similar to the Latin decem that some would not consider this nomenclature inconsistent. An older term was the incorrect Latin-like "sexidecimal" (correct Latin is "sedecim" for 16), but that was changed because some people thought it too risqué, and it also had an alternative meaning of "base 60". However, the word "sexagesimal" (base 60) retains the prefix. The earlier Bendix documentation used the term "sexadecimal". Donald Knuth has pointed out that the etymologically correct term is "senidenary", from the Latin term for "grouped by 16". (The terms "binary", "ternary" and "quaternary" are from the same Latin construction, and the etymologically correct term for "decimal" arithmetic should be "denary".)[1] Schwartzman notes that the expected purely Latin form would be "sexadecimal", but then computer hackers would be tempted to shorten the word to "sex".[2] Incidentally, the etymologically proper Greek term would be hexadecadic (although in Modern Greek deca-hexadic (δεκαεξαδικός) is more commonly used).
Humor
Hexadecimal is sometimes used in programmer jokes because certain words can be formed using only hexadecimal digits. Some of these words are "dead", "beef", "babe", and with appropriate substitutions "c0ffee". Since these are quickly recognizable by programmers, debugging setups sometimes initialize memory to them to help programmers see when something has not been initialized. Some people add an H after a number if they want to show that it is written in hexadecimal. In older Intel assembly syntax, this is sometimes the case. "Hexspeak" may be the forerunner of the modern web parlance of "1337speak"
An example is the magic number in FAT Mach-O files and java programs, which is "CAFEBABE
". Single-architecture Mach-O files have the magic number "FEEDFACE
" at their beginning.
A Knuth reward check is one hexadecimal dollar, or $2.56.
The following table shows a joke in hexadecimal:
3x12=36 2x12=24 1x12=12 0x12=18
The first three are interpreted as multiplication, while in the last one "0x12" in hex is 18.
0xdeadbeef is sometimes put into uninitialized memory.
Microsoft Windows XP clears its locked index.dat files with the hex codes: "0BADF00D"
Trivia
- Hexadecimal is used in the Traveller family of role playing games as a form of shorthand for character attributes and world specifications.
- Hexadecimal is a villain in the CGI animated series ReBoot
See also
- Base 32
- Base 64
- Bubble Babble
- Hex editor
- Hexadecimal time
- Hexspeak
- Nibble — one hexadecimal digit can exactly represent one "nibble"
- Numeral system — a list of other base systems
- Binary numeral system
- HTML
References
External links
- Online Converter for Decimal/Hexadecimal Numerals (JavaScript, GPL)
- Intuitor Hex Headquarters - A site dedicated to changing the traditional base 10 (decimal) standard to hexadecimal.
- The hexadecimal metric system at hexadecimal.florencetime.net
- base42 - A site proposing an alternate symbol set for hexadecimal numbers.
- Leet Key, a Firefox extension that supports ASCII/Hex conversions and typing
- (Hexadecimal) web colors explained