Extraction fields

Extraction fields are used to define which data to extract from documents.
To ensure consistent training and data extraction typless uses 3 different field data types.



Supplier field

📘

Every document type will always have a supplier_name field

It is the only required data value you need to send when you train your documents.

Every document type will always have a field supplier_name.
When training the value of this field must be the name of your supplier who issued the document.



String type

The string field type is used for general string fields like document numbers, address, company names, payment references, IBANs, ...

Value StandardBehaviourTraining - Response format
Any string of characters written on document.It returns string value read by the OCR.Any sequence of up 250 characters

Examples

Value on documentReturned valueValue for training
Invoice number 123-55ABCD123-55ABCD123-55ABCD
city 369, street 42city 359, street 42city 369, street 42

As you can see from the examples you can extract full strings or just parts to get the value that you want.



Number type

The number field type is used for numbers you want to perform calculations with like total amount, net amount, ...

Value standardBehaviourTraining - Response format
Number written in many different decimal and thousands separators.Always returns numbers as float string rounded to 4 decimal places.Any float or integer with a single . separator and up to 4 decimal points.

Examples

Value on documentReturned valueValue for training
1.11.10001.1000
1"234.121234.12001234.1200
1,000.23451000.23451000.2345
1212.000012.0000


Date type

The date field type is used for dates like issue date, pay due, date of service, delivery date, contract date, ...

Value StandardBehaviourTraining - Response format
Date written in 100+ languages and 50+ formats.It captures only date information without the time stamp.String in ISO 8601 format YYYY-MM-DD
For example 2020-01-01.

Examples

Value on documentReturned valueValue for training
1.1.212021-01-012021-01-01
10 Jan 20212021-01-102021-01-10
10/31/20212021-10-312021-10-31
February 4, 20212021-02-042021-02-04

Typless will figure out which format the date is written on your documents and return the correct value in the standard ISO 8601 format.

Constant type

The constant field type is used for fields like currency, unit...
When a piece of information related to the supplier is always the same you can use constant field type.

🚧

Only metadata fields support constant type

It is not possible to use constant type on line items fields.

Value StandardBehaviourTraining - Response format
Any string of characters. String doesn't have to be on document.It returns the same string value that was used during training. If different values were used for different documents during training it returns the most frequently trained value.Any sequence of up to 250 characters.

Examples

📘

Value doesn't have to be on the document.

Returned value is not dependent on any value from document. Returned value is the same as value used for training.

Value on documentReturned valueValue for training
/USDUSD
/kmkm
/2021-01-102021-01-10