Text extractor for non pdf documents

WebViewer Version:

Do you have an issue with a specific file(s)? No
Can you reproduce using one of our samples or online demos? Yes
Are you using the WebViewer server? No
Does the issue only happen on certain browsers? No
Is your issue related to a front-end framework? yes
Is your issue related to annotations? yes

Please give a brief summary of your issue:
I am trying to extract the text under an annotation (highlight, underline etc). For pdfs, I am using textextractor for PDFNet library. Is there such a capability existing for html, and MS office documents as well?

Please describe your issue and provide steps to reproduce it:
(The more descriptive your answer, the faster we are able to help you)

Please provide a link to a minimal sample where the issue is reproducible: