bug 51351: more progress with WordToFoExtractor: support for hyperlinks, common fields and code cleanup