๐Ÿ“ฆ Business Process Automation Datasets

Curated datasets for document AI, workflow automation, and enterprise chatbot development.

fahmiaziz/dataset-donut-v1-receipt-200-img

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2023-09-09

View โ†’
task_categories:feature-extraction task_categories:token-classification language:en license:openrail size_categories:n<1K format:parquet modality:image modality:text library:datasets library:pandas ...truncated...

๐Ÿ’ก Relevant because it matches: receipt, finance.

deeptools-ai/test-document-invoice

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2022-06-23

View โ†’
size_categories:n<1K format:parquet modality:image library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: invoice, document.

Jasondeepmusic/receipt-invoice-training-dataset

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2024-02-07

View โ†’
license:mit region:us

๐Ÿ’ก Relevant because it matches: invoice, receipt.

UniqueData/ocr-receipts-text-detection

Use case: Information Extraction

Source: Hugging Face | Type: Text | Updated: 2025-10-01

View โ†’
task_categories:image-to-text task_categories:object-detection language:en license:cc-by-nc-nd-4.0 modality:text region:us OCR receipts reading text NLP ...truncated...

๐Ÿ’ก Relevant because it matches: ocr, retail.

alcamilo2/Bitext-customer-support-1-column

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-11-03

View โ†’
license:apache-2.0 size_categories:1K<n<10K format:json modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: customer, support.

amitkedia/Financial-Fraud-Dataset

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-12-19

View โ†’
task_categories:text-classification language:en license:apache-2.0 size_categories:n<1K format:csv modality:text library:datasets library:pandas library:mlcroissant library:polars ...truncated...

๐Ÿ’ก Relevant because it matches: finance, fraud.

bitext/Bitext-customer-support-llm-chatbot-training-dataset

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-07-18

View โ†’
task_categories:question-answering task_categories:table-question-answering language:en license:cdla-sharing-1.0 size_categories:10K<n<100K format:csv modality:text library:datasets library:pandas library:mlcroissant ...truncated...

๐Ÿ’ก Relevant because it matches: retail, customer, support.

bitext/Bitext-retail-ecommerce-llm-chatbot-training-dataset

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-08-05

View โ†’
task_categories:question-answering task_categories:table-question-answering language:en license:cdla-sharing-1.0 size_categories:10K<n<100K format:csv modality:text library:datasets library:pandas library:mlcroissant ...truncated...

๐Ÿ’ก Relevant because it matches: retail, ecommerce.

FunDialogues/customer-service-robot-support

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-12-18

View โ†’
task_categories:question-answering language:en license:apache-2.0 size_categories:n<1K format:csv modality:tabular modality:text library:datasets library:pandas library:mlcroissant ...truncated...

๐Ÿ’ก Relevant because it matches: customer, support.

Kaludi/Customer-Support-Responses

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-03-27

View โ†’
size_categories:n<1K format:csv modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: customer, support.

t4tiana/store-sales-time-series-forecasting

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-07-05

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: sales, forecasting.

chainyo/rvl-cdip-invoice

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2022-04-06

View โ†’
license:other size_categories:10K<n<100K format:parquet modality:image library:datasets library:dask library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: invoice.

cheongmyeong17/contract

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2023-02-13

View โ†’
size_categories:1K<n<10K format:parquet modality:text library:datasets library:pandas library:polars library:mlcroissant region:us

๐Ÿ’ก Relevant because it matches: contract.

honlzl/invoice

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2023-04-04

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: invoice.

kiddothe2b/contract-nli

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2022-07-27

View โ†’
license:cc-by-nc-sa-4.0 size_categories:10K<n<100K modality:text library:datasets library:mlcroissant region:us

๐Ÿ’ก Relevant because it matches: contract.

NitishKarra/invoice-bills

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2022-08-02

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: invoice.

ProjectsbyGaurav/Donut-Receipt-training

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2024-01-04

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: receipt.

skang187/contract

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2023-02-13

View โ†’
size_categories:1K<n<10K format:parquet modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: contract.

ClarusC64/ai-procurement-carbon-clauses-delivery-coherence-risk-v0.1

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2026-02-17

View โ†’
task_categories:text-classification language:en license:mit size_categories:n<1K format:csv modality:text library:datasets library:pandas library:polars library:mlcroissant ...truncated...

๐Ÿ’ก Relevant because it matches: procurement.

akshatshah1103/retail-faq

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-10-01

View โ†’
license:apache-2.0 size_categories:n<1K format:json modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: retail.

bitext/Bitext-retail-banking-llm-chatbot-training-dataset

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-07-15

View โ†’
task_categories:question-answering task_categories:table-question-answering language:en license:cdla-sharing-1.0 size_categories:10K<n<100K format:parquet modality:text library:datasets library:pandas library:mlcroissant ...truncated...

๐Ÿ’ก Relevant because it matches: retail.

David-Egea/Creditcard-fraud-detection

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2024-02-12

View โ†’
license:mit size_categories:100K<n<1M format:csv modality:tabular library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: fraud.

dineshjulakanti/retail

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-07-09

View โ†’
license:apache-2.0 size_categories:n<1K format:csv modality:tabular modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: retail.

Falah/ads-retail

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-09-16

View โ†’
size_categories:10K<n<100K format:parquet modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: retail.

heqi511/fraud

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2024-04-15

View โ†’
size_categories:100K<n<1M format:csv modality:tabular modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: fraud.

ikuldeep1/vehicle-damage-fraud-image

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2024-02-10

View โ†’
size_categories:1K<n<10K format:parquet modality:image library:datasets library:dask library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: fraud.

ikuldeep1/vehicle-damage-fraud-image-balanced

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2024-02-11

View โ†’
size_categories:10K<n<100K format:parquet modality:image library:datasets library:dask library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: fraud.

Ingrid0693/data-crm-guanaco

Use case: Workflow Automation

Source: Hugging Face | Type: Text | Updated: 2024-02-02

View โ†’
license:mit size_categories:n<1K format:csv modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: crm.

Ingrid0693/guanaco-llama2-CRM

Use case: Workflow Automation

Source: Hugging Face | Type: Text | Updated: 2024-02-02

View โ†’
license:mit size_categories:n<1K format:parquet modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: crm.

Kshitijbhatt1998/ieee-fraud-detection-pipeline-features

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2026-03-30

View โ†’
task_categories:tabular-classification task_ids:tabular-multi-class-classification language:en license:apache-2.0 size_categories:100K<n<1M modality:tabular modality:text region:us fraud-detection fintech ...truncated...

๐Ÿ’ก Relevant because it matches: fraud.

letsrecap/Crm

Use case: Workflow Automation

Source: Hugging Face | Type: Text | Updated: 2025-02-13

View โ†’
size_categories:10K<n<100K format:json modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: crm.

liberatoratif/Credit-card-fraud-detection

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2023-10-13

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: fraud.

LightningRodLabs/supply-chain-predictions

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2026-04-03

View โ†’
task_categories:time-series-forecasting size_categories:n<1K format:parquet format:optimized-parquet modality:text library:datasets library:pandas library:polars library:mlcroissant arxiv:2604.01298 ...truncated...

๐Ÿ’ก Relevant because it matches: forecasting.

m-a-p/COIG-P-CRM

Use case: Workflow Automation

Source: Hugging Face | Type: Text | Updated: 2025-04-09

View โ†’
size_categories:100K<n<1M format:parquet modality:text library:datasets library:dask library:mlcroissant library:polars arxiv:2504.05535 region:us

๐Ÿ’ก Relevant because it matches: crm.

maryampirjamaat/crm

Use case: Workflow Automation

Source: Hugging Face | Type: Text | Updated: 2025-04-13

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: crm.

michaelmallari/online-retail

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-04-18

View โ†’
license:cc-by-4.0 region:us

๐Ÿ’ก Relevant because it matches: retail.

michaelmallari/online-retail-ii

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-04-18

View โ†’
license:cc-by-4.0 region:us

๐Ÿ’ก Relevant because it matches: retail.

minhtuyenvp02/crm

Use case: Workflow Automation

Source: Hugging Face | Type: Text | Updated: 2025-02-18

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: crm.

nikrecap/Crm

Use case: Workflow Automation

Source: Hugging Face | Type: Text | Updated: 2025-02-13

View โ†’
size_categories:10K<n<100K format:json modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: crm.

pytorch-lifestream/retailhero-uplift

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-02-19

View โ†’
task_categories:tabular-classification size_categories:10M<n<100M format:csv modality:tabular modality:text library:datasets library:pandas library:mlcroissant library:polars region:us ...truncated...

๐Ÿ’ก Relevant because it matches: finance.

Qdrant/hm_ecommerce_products

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2025-12-16

View โ†’
task_categories:image-classification task_categories:text-classification task_categories:image-to-text task_categories:image-feature-extraction license:cc-by-4.0 size_categories:100K<n<1M format:parquet modality:tabular modality:text library:datasets ...truncated...

๐Ÿ’ก Relevant because it matches: retail.

Salesforce/UniDoc-Bench

Use case: Document AI

Source: Hugging Face | Type: Text | Updated: 2025-12-03

View โ†’
task_categories:question-answering task_categories:text-retrieval task_categories:visual-question-answering task_categories:document-question-answering task_categories:image-text-to-text language:en license:cc-by-nc-4.0 size_categories:1K<n<10K format:parquet modality:image ...truncated...

๐Ÿ’ก Relevant because it matches: document, pdf. Watch-outs: benchmark, vision-language.

SathwikBalu/retail-chatfaq

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-07-23

View โ†’
size_categories:n<1K format:parquet modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: retail.

TheFinAI/en-forecasting-bigdata

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2023-06-25

View โ†’
size_categories:1K<n<10K format:parquet modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: forecasting.

TheFinAI/en-forecasting-portoseguro

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2024-02-11

View โ†’
size_categories:10K<n<100K format:parquet modality:tabular modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: forecasting.

TheFinAI/en-forecasting-taiwan

Use case: Business Analytics

Source: Hugging Face | Type: Text | Updated: 2024-02-11

View โ†’
size_categories:1K<n<10K format:parquet modality:tabular modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: forecasting.

artefactory/Argimi-Ardian-Finance-10k-text

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2026-02-23

View โ†’
task_categories:text-retrieval task_categories:text-generation language:en license:cc-by-4.0 size_categories:1M<n<10M format:webdataset modality:text library:datasets library:webdataset library:mlcroissant ...truncated...

๐Ÿ’ก Relevant because it matches: finance.

beanjar/sp500-business-description-sentence-bert-embeddings

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-02-05

View โ†’
license:mit size_categories:n<1K format:csv modality:tabular modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: business.

CFPB/consumer-finance-complaints

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2024-07-16

View โ†’
task_categories:text-classification task_ids:topic-classification annotations_creators:crowdsourced language_creators:crowdsourced multilinguality:monolingual source_datasets:original language:en license:cc0-1.0 size_categories:1M<n<10M region:us

๐Ÿ’ก Relevant because it matches: finance.

DavidLazer/business

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-08-02

View โ†’
license:cc size_categories:n<1K format:csv modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: business.

dianalogan/Marketing-Budget-and-Actual-Sales-Dataset

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2022-10-21

View โ†’
task_ids:intent-classification task_ids:multi-class-classification task_ids:sentiment-classification annotations_creators:diana_logan multilinguality:monolingual source_datasets:other-generated-datasets language:en license:apache-2.0 arxiv:2010.12421 region:us

๐Ÿ’ก Relevant because it matches: sales.

gbharti/finance-alpaca

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2025-11-24

View โ†’
task_categories:text-generation language:en license:mit size_categories:10K<n<100K format:json modality:text library:datasets library:pandas library:mlcroissant library:polars ...truncated...

๐Ÿ’ก Relevant because it matches: finance.

gbharti/finance-alpaca-csv

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-03-29

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: finance.

Josephgflowers/Finance-Instruct-500k

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2026-02-24

View โ†’
license:apache-2.0 size_categories:100K<n<1M format:json modality:text library:datasets library:pandas library:mlcroissant library:polars region:us finance ...truncated...

๐Ÿ’ก Relevant because it matches: finance.

nickmuchi/trade-the-event-finance

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2022-02-04

View โ†’
size_categories:100K<n<1M format:parquet modality:text library:datasets library:dask library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: finance.

readerbench/ro-business-emails

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-05-18

View โ†’
license:apache-2.0 size_categories:1K<n<10K format:parquet modality:text library:datasets library:pandas library:mlcroissant library:polars region:us

๐Ÿ’ก Relevant because it matches: business.

rucyang/sales

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2022-01-20

View โ†’
region:us

๐Ÿ’ก Relevant because it matches: sales.

scholarly360/salestech_sales_qualification_framework_bant

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-06-11

View โ†’
size_categories:n<1K format:parquet modality:text library:datasets library:pandas library:mlcroissant library:polars region:us salestech sales

๐Ÿ’ก Relevant because it matches: sales.

stampylongmoue/business

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-08-19

View โ†’
license:cc-by-nc-nd-4.0 region:us

๐Ÿ’ก Relevant because it matches: business.

Thaweewat/alpaca-finance-43k-th

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-05-09

View โ†’
task_categories:question-answering task_categories:summarization language:th license:cc-by-sa-3.0 size_categories:10K<n<100K format:parquet modality:text library:datasets library:pandas library:mlcroissant ...truncated...

๐Ÿ’ก Relevant because it matches: finance.

winddude/reddit_finance_43_250k

Use case: Business Operations

Source: Hugging Face | Type: Text | Updated: 2023-05-25

View โ†’
language:en license:gpl-3.0 size_categories:100K<n<1M format:json modality:tabular modality:text library:datasets library:pandas library:mlcroissant library:polars ...truncated...

๐Ÿ’ก Relevant because it matches: finance.