this post was submitted on 06 Sep 2024
1370 points (99.6% liked)

Programmer Humor

32041 readers
959 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

founded 5 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 68 points 1 week ago (1 children)

Everything is 0s and 1s to a computer. What a pattern of 0s and 1s encodes is decided by people--often arbitrarily. Over the years there have been attempts to standardize encodings but, for legacy reasons, older encodings are still valid.

The 0s and 1s that encode ' in UTF-8 (a standardized encoding) are the same 0s and 1s that encode ’ in CP-1252 (a legacy encoding).

The � symbol is shown when the 0s and 1s don't encode anything of meaning.

[–] [email protected] 55 points 1 week ago* (last edited 1 week ago) (1 children)

= e2 80 99 (3 bytes)
’ = e2 80 99 (3 separate bytes)

[–] [email protected] 14 points 1 week ago

Good to see it "spelt out" like that