Extraction fields

Extraction fields are used to define which data to extract from documents.
To ensure consistent training and data extraction typless uses 3 different field data types.



Supplier field

📘

Every document type will always have a supplier_name field

It is the only required data value you need to send when you train your documents.

Every document type will always have a field supplier_name.
When training the value of this field must be the name of your supplier who issued the document.



String type

The string field type is used for general string fields like document numbers, address, company names, payment references, IBANs, ...

Value StandardBehaviourTraining - Response format
Any string of characters written on document.It returns string value read by the OCR.Any sequence of up 250 characters

Examples

Value on documentReturned valueValue for training
Invoice number 123-55ABCD123-55ABCD123-55ABCD
city 369, street 42city 359, street 42city 369, street 42

As you can see from the examples you can extract full strings or just parts to get the value that you want.



Number type

The number field type is used for numbers you want to perform calculations with like total amount, net amount, ...

Value standardBehaviourTraining - Response format
Number written in many different decimal and thousands separators.Always returns numbers as float string rounded to 4 decimal places.Any float or integer with a single . separator and up to 4 decimal points.

Examples

Value on documentReturned valueValue for training
1.11.10001.1000
1"234.121234.12001234.1200
1,000.23451000.23451000.2345
1212.000012.0000


Date type

The date field type is used for dates like issue date, pay due, date of service, delivery date, contract date, ...

Value StandardBehaviourTraining - Response format
Date written in 100+ languages and 50+ formatsIt captures only date information without the time stamp.String in ISO 8601 format YYYY-MM-DD
For example 2020-01-01.

Examples

Value on documentReturned valueValue for training
1.1.212021-01-012021-01-01
10 Jan 20212021-01-102021-01-10
10/31/20212021-10-312021-10-31
February 4, 20212021-02-042021-02-04

Typless will figure out which format the date is written on your documents and return the correct value in the standard ISO 8601 format.