determine getPageLanguage via ContentHandler