Retrieve all text from a document
Wrk Product avatar
Written by Wrk Product
Updated over a week ago

Retrieve all text from an image or PDF.

Supported languages: English, Spanish, Italian, Portuguese, French, German

Application

  • Intelligent Document Processing

Inputs (what you have)

Name

Description

Data Type

Required?

Example

Document

Document to retrieve text from. Accepted document formats are PNG, JPEG, TIFF, or PDF

File

Yes

SOP_123.pdf

Note: the value of inputs can either be a set value in the configuration of the Wrk Action within the Wrkflow, or a variable from the Data library section. These variables in the Data library section are the outputs of previous Wrk Actions in the Wrkflow.

Outputs (what you get)

Name

Description

Data Type

Required?

Example

Text captured

Text(Long)

Yes

Note: This Wrk Action will retrieve all text from the document grouped in the order that it would normally be read. This grouping will simplify using the text from the document once it has been retrieved.

Example Document:

Output Text:

123 Supplies 789 Oak Avenue Villagetown, State Country, 54321 INVOICE # INV-003 Bill To: Mark Johnson 246 Maple Lane Citytown, State Country, 98765 Ship To: Emily Brown 654 Pine Street Townsville, State Country, 43210 Date: 2023-05-15 Payment Terms: Net 30 Due Date: 2023-06-14 PO Number: PO-9876 Balance Due: CA$183.53 Item Quantity Rate Amount Widget X 2 CA$25.99 CA$51.98 Premium Widget X for enhanced performance. Widget Y 5 CA$12.50 CA$62.50 Versatile Widget Y suitable for various applications. Widget Z 3 CA$18.75 CA$56.25 Advanced Widget Z with cutting-edge features. Subtotal: CA$170.73 Tax (7.5%): CA$12.80 Total: CA$183.53 Notes: Please handle with care during shipping. Terms: Payment is due within 30 days of the invoice date.

Outcomes

Name

Description

Success

This status is selected if details are extracted from the Document provided.

No result

This status is selected in the event of the following scenarios:

  • No details were found

Impossible to Complete

This status is selected in the event of the following scenarios:

  • Outlined in Application Design

  • The document provided is greater than 500mb

Requirements

  • The document provided can not exceed 500MB.

Did this answer your question?