Bank Cheque Dataset (Document AI)

Synthetic Bank Cheque

Use Case: OCR

Format: .jpg

Count: 2023

Annotation: No

Description: The Bank Cheque Dataset (Document AI) consists of synthetic, artificially generated cheque images designed to replicate the appearance and content of real cheques. The images include elements such as payee names, amounts, dates, signatures, and cheque numbers. The dataset is used for training and evaluating Document AI systems on tasks such as optical character recognition (OCR), cheque processing, and automated data extraction, and it provides a controlled environment for model development without the privacy concerns of real cheques.

Recording Condition: - Clicked Images - Scanned - Web scraper

Bank Statement Dataset (Document AI)

Synthetic Bank Statements

Use Case: OCR

Format: .jpg, .png

Count: 5366

Annotation: No

Description: The Bank Statement Dataset (Document AI) consists of synthetic, artificially generated bank statements designed to simulate real financial documents. The statements contain transaction records, dates, amounts, and account details, structured to mirror real-world formats and content. The dataset is used for training and evaluating Document AI systems on tasks such as optical character recognition (OCR), data extraction, and document analysis, and it provides a controlled environment without the privacy issues of actual financial data.

Recording Condition: - Scanned - Bank_Statement - Web scraper

Chinese Bills Dataset

Bounding box+Text

Use Case: OCR

Format: Image

Count: 6k

Annotation: Yes

Description: The Chinese Bills Dataset includes images or text samples of various types of bills, such as invoices, receipts, and statements, written in Chinese. It features diverse formats and content, including item descriptions, amounts, and dates. This dataset is used for tasks like optical character recognition (OCR), financial document processing, and automated data extraction.

Pay Slips Dataset (Document AI)

Use Case: OCR

Format: .jpg

Count: 2010

Annotation: No

Description: The Pay Slips Dataset (Document AI) consists of images of synthetic, artificially generated pay slips, provided without annotations. It covers a range of pay slip formats and includes details such as employee names, salaries, and dates. The dataset is used for training and testing Document AI systems on tasks such as OCR and document processing.

Recording Condition: - Scanned - Web scraper

Printed Regular/Cursive Text Dataset (Document AI)

Use Case: Document AI

Format: HEIC (images) & .mov (videos)

Count: 23930

Annotation: No

Description: Live Photos of handwritten text in Japanese, Korean, and Russian.

Recording Device: iPhone & iPad Camera

Recording Condition: - Aggressive Lighting/Glare - Camera Flash On - Colored Light - Low Light, No Camera Flash - Normal
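
The Format and Count fields listed above can be checked against a downloaded copy of a dataset. The following is a minimal sketch, assuming each dataset is unpacked into its own local folder. The function names `count_by_extension` and `matches_listing` and the folder layout are hypothetical and not part of this catalog.

```python
from collections import Counter
from pathlib import Path


def count_by_extension(root):
    """Count files under root, grouped by lowercased file extension."""
    counts = Counter()
    # Assumption: the dataset is a plain folder tree of files on local disk.
    for path in Path(root).rglob("*"):
        if path.is_file():
            counts[path.suffix.lower()] += 1
    return counts


def matches_listing(root, expected_extensions, expected_count):
    """Return True if the files with the listed extensions add up to expected_count.

    expected_extensions are lowercase with a leading dot, e.g. [".jpg", ".png"].
    Files with other extensions (readme files, metadata) are ignored.
    """
    counts = count_by_extension(root)
    found = sum(counts[ext] for ext in expected_extensions)
    return found == expected_count
```

For example, `matches_listing("bank_cheques", [".jpg"], 2023)` would compare a hypothetical `bank_cheques` folder with the Bank Cheque listing. Approximate counts such as 6k cannot be verified exactly this way.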