Can I use PDFNet for data extraction and conversion to XML?


I need our application to read in a PDF provided by the user, reformat
it to meet the company standards and output in the format required by
our own application, which is basically XML.

Does PDFNet SDK provide this functionality? If so, how complex would
the code be, and which of your sample shows this functionality?


Using PDFNet SDK you can extract all information present in a PDF
document and use this for reformatting or conversion to XML. A good
starting point to look at are ElementReader and ElementReaderAdv
sample projects:

As you can see the PDF content extraction code is quite
straightforward, however the overall complexity would depend on your
application requirements.