
How the heck do you use combining Unicode characters???

Published:

Explaining why instead of Â you keep getting A◌̂

  • Random stuff

Ever go to the Wikipedia page on Combining Diacritical Marks, copy the combining characters, and paste them after letters, only to find they didn’t combine with anything?

tl;dr

◌ is a character too (U+25CC, the dotted circle that the marks are shown sitting on). Its UTF-8 hex is E2 97 8C, and if you remove it, the combining mark will work just fine, i.e.:

      Instead of this      It should be this
Hex   41 E2 97 8C CC 82    41 CC 82
Text  A◌̂                   Â

You can remove those 3 bytes with a hex editor, something like HxD.
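If you'd rather not open a hex editor, the same byte-level fix can be sketched in JavaScript. This is only a rough sketch, assuming an environment where TextEncoder and TextDecoder are available as globals (modern browsers, recent Node). The names toHex and stripDottedCircleBytes are made up for this example.

```javascript
// UTF-8 encoding of U+25CC (the dotted circle), as listed in the table above.
const DOTTED_CIRCLE_BYTES = [0xe2, 0x97, 0x8c];

// Hypothetical helper: format bytes like a hex editor would, e.g. "41 CC 82".
function toHex(bytes) {
  return Array.from(bytes, (b) =>
    b.toString(16).toUpperCase().padStart(2, '0')
  ).join(' ');
}

// Hypothetical helper: copy the bytes, skipping every E2 97 8C sequence.
function stripDottedCircleBytes(bytes) {
  const out = [];
  for (let i = 0; i < bytes.length; i++) {
    const isMatch = DOTTED_CIRCLE_BYTES.every((b, j) => bytes[i + j] === b);
    if (isMatch) {
      i += DOTTED_CIRCLE_BYTES.length - 1; // skip the rest of the sequence
    } else {
      out.push(bytes[i]);
    }
  }
  return Uint8Array.from(out);
}

// 'A' + dotted circle + combining circumflex, written with escapes
// so the source stays unambiguous.
const before = new TextEncoder().encode('A\u25CC\u0302');
const after = stripDottedCircleBytes(before);

console.log(toHex(before)); // 41 E2 97 8C CC 82
console.log(toHex(after));  // 41 CC 82
console.log(new TextDecoder().decode(after)); // A followed by the combining circumflex
```

Working on bytes mirrors what you would do by hand in the hex editor. For text you already have as a string, filtering characters is less fiddly.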


In JavaScript you can do

'A◌̂'.split('').filter((letter) => letter !== '◌').join('')

because split('') splits on UTF-16 code units rather than on what you see, so ◌̂ comes apart into ◌ and ̂, surprisingly.
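Here is a slightly more defensive sketch of the same idea, not the code from the xkcd implementation. It assumes the only thing you want gone is U+25CC, and stripDottedCircle is a name invented for this example. It splits on the dotted circle itself instead of on every code unit, so characters outside the Basic Multilingual Plane (emoji and friends) are never cut in half.

```javascript
const DOTTED_CIRCLE = '\u25CC';

// Hypothetical helper: remove every dotted circle from a string.
// Splitting on the character itself leaves surrogate pairs intact.
function stripDottedCircle(text) {
  return text.split(DOTTED_CIRCLE).join('');
}

// 'A' + dotted circle + combining circumflex, as pasted from the table.
const pasted = 'A\u25CC\u0302';

// split('') yields one entry per UTF-16 code unit.
console.log(pasted.split('').length); // 3

const fixed = stripDottedCircle(pasted);
console.log(fixed.length); // 2 (the letter plus the combining mark)

// NFC normalization composes the pair into the single code point U+00C2.
console.log(fixed.normalize('NFC') === '\u00C2'); // true

// Why split('') can bite: an emoji is two code units but one code point.
console.log('\u{1F600}'.split('').length); // 2
console.log([...'\u{1F600}'].length);      // 1
```

For the A◌̂ case both approaches give the same result. The difference only shows up once the input contains surrogate pairs.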


Had to find that out for my implementation of xkcd 3054.

