Is there any 'standard'/'convention' for writing plain text in many languages? Interleaving the different languages somehow?

Β· Β· 2 Β· 0 Β· 1

There's a (depreciated & withdrawn) Unicode standard (& RFC) for embedding language tags in plain text! eg: β€œLanguage XX starts now:....”

It was defined in RFC2482 (in Jan '99), but depreciated in RFC6082 (Nov '10). Alas. I might still use it. πŸ™‚

Show thread

@ebel When you say plain text, I assume you mean long form text, as in documents, and not something like UI elements?

@emacsen correct. Old fashioned plain text files. I wanna write multiple languages in my readme.txt (or

@ebel The only standard I'm aware of would be the filename (minus extension) then a separator, then a two letter code, ala:


If you have many files, I've also seen the directory route of:


Then you can offer links between docs.

This is imperfect, but I think it's what I've seen and probably good enough for most uses, without getting very complicated.

@ebel this just sounds like yet another way to trash Unicode parsers
@ebel conspiracy headcanon: unicode is designed by the five eyes to allow an easy exploitation vector into systems processing foreign languages.
Sign in to participate in the conversation

The social network of the future: No ads, no corporate surveillance, ethical design, and decentralization! Own your data with Mastodon!