DataExtractionModule.DetectAndAddFormFieldsToPDF - can I correct the output?

Product: apryse-SDK (Python, Windows)

Product Version: latest

Please give a brief summary of your issue:
Form field detection misses a few fields and assigns the wrong type to others (radio button vs. checkbox). Is manual correction possible?

Please describe your issue and provide steps to reproduce it:
I’m using your sample code for DataExtractionModule.DetectAndAddFormFieldsToPDF and have tried both options (“keep old” and “keep new”).
The detection very consistently misses a few small text fields (which are admittedly difficult to identify), confuses checkboxes with radio buttons, and sometimes places a checkbox where a text field belongs.

Is there an opportunity to correct the result manually, if needed?
Is there an intermediate JSON generated that I could modify before creating the ‘new’ PDF output?

Thanks.

Please provide a link to a minimal sample where the issue is reproducible:
N/A


Hello, I’m AiRon, an automated tech support bot :robot:
While you wait for one of our customer support representatives, I can help you with your question.


It seems like you’re experiencing issues with the automatic detection of form fields using the DetectAndAddFormFieldsToPDF function in Apryse’s SDK. Unfortunately, there isn’t an intermediate JSON file that you can modify directly before creating the final PDF output. However, you can manually adjust the form fields after the initial detection.

Some things you can try:

  • Use the DetectAndAddFormFieldsToPDF function with the “Form Extraction Engine” option set to “Form Field Key-Value Extraction” for better semantic mapping.

  • Manually inspect and correct the form fields after detection using the Apryse SDK’s form editing capabilities.

  • Programmatically adjust field types (e.g., checkbox vs. radio button) using the Apryse SDK’s form field manipulation methods.

  • Preserve existing form fields to avoid overwriting correct fields by using the appropriate options in the function.



Hopefully, this helps you solve your problem while you wait for a human to get back to you.


Is there an intermediate JSON generated that I could modify before creating the ‘new’ PDF output?

No, but you can edit the fields afterwards.
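
Although no intermediate JSON is produced, one way to get a reviewable listing is to dump the widgets from the detected PDF yourself. The following is a minimal sketch, not an official workflow. It assumes the SDK is importable as `apryse_sdk` (older installs use `PDFNetPython3`), that `PDFNet.Initialize(...)` has already been called with your key, and that the binding accepts `Widget(annot)`. The helper names `widget_record` and `dump_detected_fields`, the JSON layout, and the type labels are hypothetical and chosen only for illustration.

```python
import json


def widget_record(page_no, name, field_type, rect):
    """Build one JSON-serialisable record for a detected widget."""
    x1, y1, x2, y2 = rect
    return {
        "page": page_no,
        "name": name,
        "type": field_type,
        "rect": [round(x1, 2), round(y1, 2), round(x2, 2), round(y2, 2)],
    }


def dump_detected_fields(pdf_path, json_path):
    """Write every widget annotation in pdf_path to json_path for review."""
    # Imported lazily so widget_record() works without the SDK installed.
    # Assumption: PDFNet.Initialize(...) was called before this function.
    from apryse_sdk import PDFDoc, Annot, Widget, Field

    # Hypothetical labels for the SDK's field type enum values.
    labels = {
        Field.e_text: "text",
        Field.e_check: "checkbox",
        Field.e_radio: "radio",
        Field.e_choice: "choice",
        Field.e_button: "button",
        Field.e_signature: "signature",
    }
    doc = PDFDoc(pdf_path)
    records = []
    for page_no in range(1, doc.GetPageCount() + 1):
        page = doc.GetPage(page_no)
        for i in range(page.GetNumAnnots()):
            annot = page.GetAnnot(i)
            if not annot.IsValid() or annot.GetType() != Annot.e_Widget:
                continue  # skip links, markup, and other non-form annotations
            field = Widget(annot).GetField()
            r = annot.GetRect()
            records.append(widget_record(
                page_no,
                field.GetName(),
                labels.get(field.GetType(), "unknown"),
                (r.x1, r.y1, r.x2, r.y2),
            ))
    doc.Close()
    with open(json_path, "w", encoding="utf-8") as fh:
        json.dump(records, fh, indent=2)
    return records
```

Running `dump_detected_fields("detected.pdf", "fields.json")` on the output of the detection step (both file names are placeholders) gives a file you can inspect to spot missing or mistyped fields before deciding what to correct.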

Please see our InteractiveFormsTest sample.

Following that sample, you can edit each Widget's Rect to change or adjust its position on the page.
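
As a rough illustration of editing widget rectangles, the sketch below moves widgets whose field names appear in a corrections table. It is only a sketch under assumptions: the module name `apryse_sdk`, a prior call to `PDFNet.Initialize(...)`, and the `Widget(annot)` constructor should be checked against the InteractiveFormsTest sample for your SDK version. The helpers `shifted_rect` and `adjust_widget_rects` and the shape of `corrections` are hypothetical, and the new coordinates have to come from you.

```python
def shifted_rect(rect, dx=0.0, dy=0.0, grow=0.0):
    """Return (x1, y1, x2, y2) moved by (dx, dy) and enlarged by grow on each side."""
    x1, y1, x2, y2 = rect
    return (x1 + dx - grow, y1 + dy - grow, x2 + dx + grow, y2 + dy + grow)


def adjust_widget_rects(in_path, out_path, corrections):
    """Apply corrections, a dict mapping field name to a new (x1, y1, x2, y2).

    Coordinates are in PDF page units. Returns the number of widgets changed.
    """
    # Imported lazily so shifted_rect() works without the SDK installed.
    # Assumption: PDFNet.Initialize(...) was called before this function.
    from apryse_sdk import PDFDoc, Annot, Widget, Rect, SDFDoc

    doc = PDFDoc(in_path)
    changed = 0
    for page_no in range(1, doc.GetPageCount() + 1):
        page = doc.GetPage(page_no)
        for i in range(page.GetNumAnnots()):
            annot = page.GetAnnot(i)
            if not annot.IsValid() or annot.GetType() != Annot.e_Widget:
                continue  # only form field widgets are of interest
            name = Widget(annot).GetField().GetName()
            if name in corrections:
                annot.SetRect(Rect(*corrections[name]))
                annot.RefreshAppearance()  # redraw the widget at its new size
                changed += 1
    doc.Save(out_path, SDFDoc.e_linearized)
    doc.Close()
    return changed
```

For example, `shifted_rect((10, 20, 110, 40), dx=5, dy=-2)` computes a rectangle nudged 5 units right and 2 units down, which can then be placed in `corrections` under the relevant field name.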

If you need further assistance, please explain how you determine what the correct (new) coordinates should be.
