== ISO 639 : 2 and 3 latter languag ecodes. Process: Download http://www.loc.gov/standards/iso639-2/php/English_list.php This file is in ISO-8859-1 Run the perl script to produce the Java fragment Convert to UTF-8 iconv --from-code=ISO-8859-1 --to-code=UTF-8 < IN > OUT Check: Gwich'in Bokmċl, Norwegian Provençal Volapük (this text file is ISO-8859-1) Misc: 1/ There are duplicate 3-letter codes - take the first only. 2/ The "Undetermined" language code needs special handling