🛡️ No DPA Required
DataUnchain is 100% self-hosted. No data is transmitted to the Author or any third party, so no data processing relationship exists between you and the Author: you act as the data controller and carry out all processing yourself. A Data Processing Agreement (DPA) with the Author is therefore not required.
How is this different from cloud services?
When you use cloud-based document processing (Amazon Textract, Google Document AI, etc.), your data is sent to the provider's servers. Under GDPR (Article 28), this creates a controller-processor relationship that requires a DPA.
With DataUnchain:
- ✅ All AI inference runs on your device
- ✅ Your documents stay on your device
- ✅ Extracted data is stored on your device
- ✅ No network connection is required
- ✅ No sub-processors involved
GDPR Data Flow
📄 Your Documents
↓ (local filesystem)
🐳 DataUnchain (Docker on your machine)
↓ (internal Docker network)
💾 PostgreSQL + Excel (local volumes)
⛔ Nothing leaves your infrastructure
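One way to check the "internal Docker network" step of this flow yourself is to inspect the network that the containers are attached to. The sketch below is an illustration, not part of DataUnchain: it parses the JSON printed by `docker network inspect` and reports whether every listed network has Docker's `Internal` flag set, which means Docker blocks external access for that network. The function name and the sample network name `dataunchain_internal` are hypothetical, and whether your deployment uses an internal network depends on your own Compose configuration.

```python
import json


def network_is_internal(inspect_json: str) -> bool:
    """Return True if every network in `docker network inspect` output is internal.

    `inspect_json` is the raw JSON text printed by the command, which is a
    list with one object per inspected network.
    """
    networks = json.loads(inspect_json)
    # An empty list means nothing was inspected, so report False rather than
    # claiming isolation that was never checked.
    return bool(networks) and all(net.get("Internal", False) for net in networks)


# Hypothetical sample, trimmed to the fields used here. Real output contains
# many more keys, and the network name is an assumption for illustration.
SAMPLE = '[{"Name": "dataunchain_internal", "Driver": "bridge", "Internal": true}]'

if __name__ == "__main__":
    print(network_is_internal(SAMPLE))
```

To run it against a real deployment, save the output of `docker network inspect <your-network-name>` and pass the text to `network_is_internal`. A `False` result does not prove that data leaves the machine. It only shows that Docker itself is not enforcing isolation on that network.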
For regulated industries
DataUnchain is particularly suited for sectors where data cannot leave the premises:
- Healthcare: Patient records and lab results, with air-gapped deployment
- Legal: Attorney-client privilege preserved, with zero cloud exposure
- Finance: Trade secrets and compliance documents, with full data sovereignty
- Government: Classified or sensitive documents, deployed on-premises
For questions, contact the project maintainer via the GitHub issue tracker.