Note: Click on "Kernel" > "Restart Kernel and Clear All Outputs" in JupyterLab before reading this notebook to reset its output. If you cannot run this file on your machine, you may want to open it in the cloud .
We refer to the official Unicode HOWTO in the Python documentation. Furthermore, the unicodedata
module in the standard library
provides a lot of utility functions around the Unicode standard.
Next is a brief summary video by the YouTube channel Computerphile titled "Characters, Symbols and the Unicode Miracle".
from IPython.display import YouTubeVideo
YouTubeVideo("MijmeoH9LT4", width="60%")
In his PyCon Australia 2018 talk titled "Unicode and Python: The absolute minimum you need to know" Raphaël Merx explains some caveats and best practices regarding Unicode.
YouTubeVideo("oXVmZGN6plY", width="60%")
In a similar talk at PyCon 2017 titled "Unicode what is the big deal" Łukasz Langa provides further lessons learned regarding Unicode.
YouTubeVideo("7m5JA3XaZ4k", width="60%")
In a "classic" talk from PyCon 2012 titled "Pragmatic Unicode, or, How do I stop the pain?" Ned Batchelder explains among others the concept of a "Unicode Sandwich."
YouTubeVideo("sgHbC6udIqc", width="60%")
Lastly, in his entertaining talk at PyCon.DE 2019 titled "Your Name is Invalid!" Miroslav Šedivý shows how hard it actually is to write software that can process any name a human can possibly have. Miroslav also gave a lightning talk where he shows how he uses only one keyboard for the 12 (!!!) languages he speaks.
YouTubeVideo("pBuS7EUPnQA", width="60%")
YouTubeVideo("-4QjII981sM", width="60%")