Bank Cheque Dataset (Document AI)
Synthetic Bank Cheque
Use Case: OCR
Format: .jpg
Count: 2023
Annotation: No
Description: The Bank Cheque Dataset (Document AI): Synthetic bank cheques consists of artificially generated cheque images designed to replicate the appearance and content of real cheques. It includes various elements such as payee names, amounts, dates, signatures, and cheque numbers. This dataset is used for training and evaluating Document AI systems in tasks like optical character recognition (OCR), cheque processing, and automated data extraction, providing a controlled environment for model development without the privacy concerns of real cheques.
Recording Condition: - Clicked Images - Scanned - Web scrapper
Bank Statement Dataset (Document AI)
Synthetic Bank Statements
Use Case: OCR
Format: .jpg, png
Count: 5366
Annotation: No
Description: The Bank Statement Dataset (Document AI): Synthetic bank statements includes artificially generated bank statements designed to simulate real financial documents. It features various transaction records, dates, amounts, and account details, structured to mirror real-world formats and content. This dataset is used for training and evaluating Document AI systems in tasks such as optical character recognition (OCR), data extraction, and document analysis, offering a controlled environment without the privacy issues of actual financial data.
Recording Condition: - Scanned - Bank_Statement - Web scrapper
Chinese Bills Dataset
Bounding box+Text
Use Case: OCR
Format: Image
Count: 6k
Annotation: Yes
Description: The Chinese Bills Dataset includes images or text samples of various types of bills, such as invoices, receipts, and statements, written in Chinese. It features diverse formats and content, including item descriptions, amounts, and dates. This dataset is used for tasks like optical character recognition (OCR), financial document processing, and automated data extraction.
Pay Slips Dataset (Document AI)
Use Case: OCR
Format: .jpg
Count: 2010
Annotation: No
Description: The Pay Slips Dataset (Document AI): Synthetic Pay Slips consists of images of artificially generated pay slips without any annotations. It features various pay slip formats and details such as employee names, salaries, and dates, used for training and testing Document AI systems in tasks like OCR and document processing.
Recording Condition: - Scanned - Web scrapper
Printed Regular/Cursive Text Dataset (Document AI)
Use Case: Document AI
Format: HEIC (images) & .mov (videos)
Count: 23930
Annotation: No
Description: Live Photos with Handwritten text for Japanese, Korean & Russian
Recording Device: iPhone & iPad Camera
Recording Condition: - Aggressive Lighting/Glare - Camera Flash On - Colored Light - Low Light, No Camera Flash - Normal