If I try to type in Dutch characters such as Ã« in, say, a product description, the text is truncated at the first such character.
This appears to be a common MySQL problem that occurs when character sets don’t jive up as they should. I don’t know if it has anything to do with the Magento admin code (when I type in Ã«, does that get properly interpreted and converted to utf-8? beyond my level of expertise) or if it lies elsewhere, but my database is 100% utf8_general_ci.
ETA: I do see that the tinyMCE files use charset=ISO-8859-1. If the input captured here isn’t converted to utf-8, that could be a source of the problem
ETA #2: After reading up on it, I’m pretty sure it is indeed a problem with the tinyMCE charset encoding. The tinyMCE documentation even says “Please try using entities if possible. Like & aring; etc for non a-z characters.” (from magento/js/tiny_mce/langs/readme.txt). So I tried using & euml; for Ã« in the product description input box, and voila, it worked just fine. Displays properly as Ã« in the storefront, too.
While this is workable, it’s hardly ideal for European users. It will be so clunky having to type html entities for letters we’re used to typing using just a keyboard stroke or two. I don’t know how much work is involved in changing the tiny_mce distribution with Magento to accept the “international keyboard” input method (e.g., double quotation mark + e = Ã«), but I’ll look further tomorrow. It’s way past time for this 6-months-pregnant lady to go to bed .