Supported File Types
| Category | Formats | Notes |
|---|---|---|
| PDFs | .pdf | Standard and scanned; OCR fallback activates automatically for scanned files |
| Images | .jpg, .jpeg, .png | Vision AI reads directly from the image |
| Word documents | .docx | Full text extraction |
| Spreadsheets | .xlsx, .csv | Tabular data extraction |
| Text files | .txt, .html | Full text extraction |
| Audio | .mp3, .wav, .m4a | Transcribed locally, then extracted |
For documents with Chinese characters, stamps, or seals — including Fapiao and Business Licenses — RecordEngine uses vision-first extraction rather than text OCR, which handles stamps and handwriting that OCR-only tools miss.
Method 1 — Web Upload (UI)
The most common upload method for individual documents or small batches.
Select your files
Drag and drop files onto the upload zone, or click Browse to select from your file system. You can select multiple files at once — each becomes a separate document.
Assign contact and folder
Set:
- Contact — the client or vendor this document belongs to
- Folder — the folder within that contact (e.g. Q4 Invoices, Documents, Review)
Select an extraction profile
Choose the profile that matches your document type. If the contact has a default profile set, it’s pre-selected automatically.
Method 2 — Email Intake
Send any file as an email attachment to your RecordEngine intake address. No login is required: documents land directly in the System → Inbox folder and are processed automatically. Best for:
- Clients or vendors submitting documents without access to RecordEngine
- Forwarding attachments from other email threads
- Mobile capture (photograph a receipt, email it)
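For automated forwarding, an intake email can also be assembled by a script. The following is a minimal sketch using only the Python standard library; the function name `build_intake_email` and the addresses are hypothetical placeholders, and your real intake address comes from your RecordEngine setup.

```python
import mimetypes
from email.message import EmailMessage
from pathlib import Path


def build_intake_email(sender: str, intake_address: str, attachment: Path) -> EmailMessage:
    """Build an email that carries one file as an attachment."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = intake_address  # hypothetical placeholder; use your real intake address
    msg["Subject"] = f"Document: {attachment.name}"
    msg.set_content("Document attached for processing.")

    # Guess the MIME type from the file name; fall back to a generic binary type.
    ctype, _ = mimetypes.guess_type(attachment.name)
    maintype, subtype = (ctype or "application/octet-stream").split("/", 1)
    msg.add_attachment(
        attachment.read_bytes(),
        maintype=maintype,
        subtype=subtype,
        filename=attachment.name,
    )
    return msg
```

The resulting message can be sent through any mail server you already use, for example with `smtplib.SMTP(host).send_message(msg)`, where `host` is your own SMTP server.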
Method 3 — Hot Folder (Watch Folder)
RecordEngine monitors a designated folder on the server’s file system. Any file copied into that folder is automatically ingested, with no UI interaction needed. Best for:
- Bulk imports from a scanner or MFP (multi-function printer)
- Automated file drops from other software
- Legacy systems that export files to a directory
The hot folder is configured in docker-compose.yml and is active whenever the xr-watcher container is running. Contact your administrator for the hot folder path on your server.
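When another program drops files into the hot folder, one cautious pattern is to copy each file to a staging directory first and then move it into the hot folder in a single step. The sketch below assumes the watcher could pick up a file that is still being written (the source does not say either way) and that the staging directory sits on the same filesystem as the hot folder. The function name and all paths are hypothetical.

```python
import os
import shutil
from pathlib import Path

# Extensions taken from the Supported File Types table.
SUPPORTED_EXTENSIONS = {
    ".pdf", ".jpg", ".jpeg", ".png", ".docx", ".xlsx", ".csv",
    ".txt", ".html", ".mp3", ".wav", ".m4a",
}


def drop_into_hot_folder(source: Path, staging_dir: Path, hot_folder: Path) -> Path:
    """Copy `source` to staging, then move it into the hot folder in one step."""
    if source.suffix.lower() not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"Unsupported file type: {source.suffix}")

    staging_dir.mkdir(parents=True, exist_ok=True)
    staged = staging_dir / source.name
    shutil.copy2(source, staged)  # the slow copy happens outside the hot folder

    target = hot_folder / source.name
    # os.replace is atomic when both paths are on the same filesystem.
    # Note: it overwrites an existing file with the same name.
    os.replace(staged, target)
    return target
```

Because `os.replace` overwrites silently, a caller that needs to keep earlier files would check for an existing name before moving.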
Method 4 — REST API
Upload programmatically from any external system using the API. Ideal for automated pipelines where documents are generated or received by software, for example uploading attachments received in a CRM or reports exported from an ERP.
Duplicate Detection
RecordEngine checks for duplicate filenames within the same folder on upload. If a file with the same name already exists in the target folder, you’re prompted to either:
- Upload as a new version: links the new file to the existing document as version 2
- Upload as a separate document: creates an independent new document
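The two choices can be pictured with a small in-memory model. This is an illustrative sketch only, not RecordEngine's implementation: the `Document` class, the `upload` function, and the exact-match filename comparison are all assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    filename: str
    versions: list[bytes] = field(default_factory=list)


def upload(folder: list[Document], filename: str, data: bytes, as_new_version: bool) -> Document:
    """Add a file to a folder, resolving a same-name clash by the user's choice."""
    # Assumption: duplicates are detected by exact filename match.
    existing = next((doc for doc in folder if doc.filename == filename), None)
    if existing is not None and as_new_version:
        existing.versions.append(data)  # linked to the existing document as the next version
        return existing
    # No clash, or the user chose "separate document".
    doc = Document(filename, [data])
    folder.append(doc)
    return doc
```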
Processing Time
Once uploaded, documents transition through statuses automatically:
| Status | Typical duration |
|---|---|
| New | A few seconds — queued waiting for AI worker |
| Processing | 10–45 seconds for clean PDFs and images |
| Processing | 1–3 minutes for audio files (transcription first) |
| Processing | 2–5 minutes for large multi-page PDFs |
| Needs Review | Ready for human review |
The very first document after a server restart takes longer — typically 2–5 minutes — because the AI model loads into GPU memory. Every subsequent document processes much faster. Run a warmup document after any server restart to pre-load the model.
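If a script needs to wait for a document to finish, a simple polling loop with a generous timeout covers both the typical case and the slower first document after a restart. This is a sketch under assumptions: `get_status` is a hypothetical callable that returns the current status string (how you obtain it depends on your integration), and the 10-minute default timeout is an arbitrary choice that leaves room for the 2–5 minute warmup.

```python
import time
from typing import Callable

# Final status name as listed in the status table.
TERMINAL_STATUS = "Needs Review"


def wait_until_ready(
    get_status: Callable[[], str],
    timeout_s: float = 600.0,
    interval_s: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> float:
    """Poll until the document reaches Needs Review; return the seconds waited."""
    waited = 0.0
    while get_status() != TERMINAL_STATUS:
        if waited >= timeout_s:
            raise TimeoutError(f"Document not ready after {timeout_s:.0f} s")
        sleep(interval_s)
        waited += interval_s
    return waited
```

The `sleep` parameter exists so the loop can be exercised without real delays, for example by passing `lambda s: None`.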
Uploading Multiple Files
You can upload multiple files in a single web upload operation:
- On the Upload page, select multiple files at once using Ctrl+Click or Shift+Click in the file picker
- All selected files share the same contact, folder, and profile assignment
- Each file becomes an independent document and processes in parallel
What Happens After Upload
Once a file is uploaded, RecordEngine automatically:
- Stores the original file in the document archive
- Detects the file type and chooses the appropriate extraction method (vision, text, audio transcription)
- Runs the AI extraction against the selected profile
- Generates an AI summary
- Calculates a confidence score
- Evaluates any matching Rules Engine rules
- Sets status to Needs Review
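The file-type detection step in the list above can be sketched as a lookup from extension to extraction method. The grouping follows the Supported File Types table, but the table and function names are hypothetical, and the mapping is an assumption rather than RecordEngine's actual logic. In particular, the source notes that documents with stamps or seals go vision-first regardless of extension, which this sketch does not model.

```python
from pathlib import Path

# Illustrative mapping only; the method labels come from the list above.
EXTRACTION_METHOD = {
    ".pdf": "text",  # assumption: text first, with OCR fallback for scans
    ".jpg": "vision",
    ".jpeg": "vision",
    ".png": "vision",
    ".docx": "text",
    ".txt": "text",
    ".html": "text",
    ".xlsx": "text",  # assumption: tabular extraction grouped under text
    ".csv": "text",
    ".mp3": "audio transcription",
    ".wav": "audio transcription",
    ".m4a": "audio transcription",
}


def choose_extraction_method(filename: str) -> str:
    """Return the extraction method for a file, based on its extension."""
    suffix = Path(filename).suffix.lower()
    if suffix not in EXTRACTION_METHOD:
        raise ValueError(f"Unsupported file type: {suffix or '(none)'}")
    return EXTRACTION_METHOD[suffix]
```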