How to Extract Data from Spanish Delivery Notes (Albaranes)
Into Excel
Most document extraction tools treat every delivery note the same. The Spanish albarán (delivery note) breaks that assumption in three ways that matter when you run a receiving dock in Barcelona, Valencia, or Zaragoza: it carries fields no generic delivery note template recognizes — like a tax ID in NIF format with a check letter that follows an 8-digit-plus-1-letter pattern — it comes back from the receiver covered in handwritten annotations, and under Spanish commercial law it serves as legal proof of delivery without being a tax document. This article walks through every field on a Spanish albarán, explains why format fragmentation hits Spanish warehouses harder than most, and shows how to get clean, reconciled delivery data into a spreadsheet without manual keystrokes.
Key Takeaways
- Spanish albaranes look like delivery notes but carry a unique legal weight — they are the only supply chain document that leaves the warehouse printed and returns with handwritten annotations that serve as proof of delivery under the 141-year-old Código de Comercio.
- Template-based extraction tools fail at the albarán's two-layer structure — they either discard the handwritten corrections as noise or blend them into the printed data stream, producing quantities that break the three-way reconciliation between purchase order, delivery note, and invoice.
- Column-name extraction reads both layers by meaning — define "Qty Shipped" and "Qty Received" once, and ImageToTable.ai finds each value across every supplier layout without requiring a separate template per format.
A Spanish delivery note carries a legal weight no generic delivery note has — and fields no generic template recognizes.
In most of Europe, a delivery note is a shipping document. It lists what's in the box, the driver hands it over, and the receiving clerk checks it against the purchase order. Nobody’s lawyer ever looks at it. In Spain, the albarán (delivery note) occupies a different category. Under the Código de Comercio (Spanish Commercial Code, Royal Decree of 22 August 1885), the albarán serves as documentary proof of delivery — traditio — in a compraventa (sale of goods) contract. It is not a tax instrument. It does not carry IVA (VAT) amounts. But it is the document that legally confirms goods changed hands.
The albarán is the only shipping document in the European supply chain that is simultaneously a legal delivery receipt and a turnaround document — the supplier prints it, the goods ride with it, the receiver writes on it, and the signed copy returns to the supplier as proof. The Wikipedia entry on albarán defines it clearly: "un documento mercantil que acredita el traslado y la entrega de un pedido" — a commercial document that certifies the transport and delivery of an order. The receiving party must sign it to confirm correct receipt. That signature transforms the document from a simple packing list into a legal record.
This legal role creates structural differences between a Spanish albarán and a generic delivery note. An albarán from a Spanish supplier — whether it’s Mercadona’s logistics center near Valencia or an industrial distributor like RS Components España — typically carries fields a British despatch note or a German Lieferschein does not. The tax ID field appears in NIF format (Número de Identificación Fiscal): 8 digits plus one check letter for individuals, one letter plus 7 digits plus one check letter for companies. Both the supplier’s NIF and the buyer’s NIF appear on the document. The delivery note number follows supplier-specific internal numbering that has no cross-supplier standard. And critically — because the albarán predates the invoice in the commercial workflow — the delivery note number must later be matched against the factura (invoice) that references it, enabling three-way reconciliation with the purchase order.
The Spanish albarán is not a "delivery note plus some extra fields." It is a document category shaped by a 140-year-old commercial code, a multi-step reconciliation workflow, and a physical journey that makes it the only supply chain document that leaves the warehouse printed and returns with handwritten data. Extraction tools trained on generic delivery note layouts will miss the NIF check letter, misplace the delivery note number among the PO reference and carrier tracking number, and ignore the receiver’s handwritten annotations entirely.
Every albarán carries 10–14 fields that matter for warehouse receiving — and their exact field labels vary by supplier.
When a Spanish warehouse receives a shipment, the receiving clerk needs to extract data from the albarán and enter it into either the WMS (Warehouse Management System) for inventory posting or the ERP for purchase order reconciliation. In Spain, the software doing the receiving is likely SAP Business One, Sage 200 (or Sage Murano, widely adopted in Spanish SMEs), Microsoft Dynamics NAV / Business Central, or the Spanish-built Holded SaaS ERP. For warehouse-specific operations, Mecalux Easy WMS — a Spanish WMS from the Barcelona-based intralogistics group Mecalux — is common in mid-to-large warehouses. The WMS expects structured data. The albarán is anything but.
The fields a Spanish warehouse typically extracts from each albarán, and where each one feeds downstream:
| Albarán Field | Typical Supplier Label | Feeds Into |
|---|---|---|
| Delivery Note Number | "Albarán N.º", "Nº Albarán", "Delivery Note Ref" | Goods receipt record; three-way match key |
| Purchase Order Reference | "Su Pedido", "Pedido N.º", "PO Ref." | PO reconciliation in ERP |
| Supplier Name & NIF | "Proveedor", "Razón Social", "NIF/CIF" | Supplier master data verification |
| Buyer NIF | "Cliente", "Destinatario", "NIF" | Internal cost center routing |
| Delivery Date | "Fecha", "Fecha de Entrega" | Receiving log; carrier SLA tracking |
| Product Code / SKU | "Código", "Referencia", "Artículo" | Inventory posting (PGC Grupo 3 Existencias) |
| Item Description | "Descripción", "Concepto" | Receiving verification against PO line items |
| Quantity Shipped | "Cantidad", "Uds.", "Unidades" | Debe (3xx) Existencias entry in Spanish accounting |
| Quantity Received | Handwritten annotation | Discrepancy flag; accounts payable hold |
| Receiver Signature | "Recibí conforme", "Firma" | Legal proof of delivery; supplier confirmation |
| Exception / Damage Notes | Handwritten margin notes | Non-conformance report; supplier claim |
The Spanish Plan General de Contabilidad (PGC) determines where this data lands. When goods are received against an albarán, the accounting entry debits Grupo 3 (Existencias — Inventory, typically cuenta 600 for merchandise purchases or cuenta 601 for raw materials) and credits cuenta 400 (Proveedores). The albarán number is the reference that links the physical receipt to the accounting entry — before the factura arrives. If the albarán number is entered incorrectly, the three-way match between PO, delivery note, and invoice breaks at the accounting level.
Spanish warehouses face the same format fragmentation as any receiving dock — but the Spanish regulatory vacuum makes it permanent.
There is no Spanish law that mandates a standard albarán format. The Código de Comercio describes the document's function but does not prescribe its layout. AECOC (Asociación de Fabricantes y Distribuidores), Spain's GS1 standards body with over 35,000 member companies, has developed an electronic delivery note standard through its AECOC EDI and AECOC TRANSP platforms. But EDI adoption in Spanish logistics follows the same pattern as elsewhere: large retailers like Mercadona, Carrefour Spain, and El Corte Inglés mandate it for their Tier 1 suppliers, while the mid-market — regional distributors, industrial suppliers, local manufacturers — continues to ship with paper albaranes printed from whatever ERP the supplier happens to use.
The result is the same visual chaos a receiving clerk at a German or French warehouse faces, with one additional complication: Spanish suppliers frequently use carbon-copy albarán books. The top copy stays with the buyer, the second goes to the driver, and the third remains in the supplier's book. By the time the buyer's copy reaches the data entry desk, it may be a second or third generation carbon sheet with progressively faded text. Combine that with the handwritten annotations the receiver added at the dock — quantity corrections, damage notes, a signature — and you have a single page carrying three layers of information at different legibility levels.
UNO Logística, Spain's main logistics trade association representing over 350 logistics operators and transport companies, identifies digitalization of document workflows as a core priority in its sector agenda. But the association also notes that the fragmentation of document formats across the supply chain — particularly between large retailers using EDI and mid-market suppliers using paper — remains a structural barrier. The Centro Español de Logística (CEL), Spain's largest logistics professional community since 1978, has published on the document standardization challenge in its supply chain digitalization working groups. The consensus: standardization will come to high-volume corridors first, but the long tail of mid-market suppliers will continue producing heterogeneous paper and PDF albaranes for years.
Format fragmentation in Spain is reinforced by an incentive architecture similar to standard packing slips: the supplier bears the cost of changing their albarán format but captures none of the efficiency gain — that gain goes entirely to the receiving warehouse. A small Spanish manufacturer printing albaranes from Sage Murano has zero motivation to redesign its output template so a customer's WMS can read it faster. The format stays fragmented because the economics of standardization are permanently misaligned.
You don't need a template for every supplier. You define your columns once and let the AI find each field by what it means.
The standard approach to extracting Spanish albarán data — typing every field from every supplier's unique layout into an Excel spreadsheet — takes about 3 minutes per single-page albarán according to industry benchmarks. For a warehouse receiving 30 shipments a day, that's 90 minutes of pure data entry. The alternative that tools like template-based OCR offer is worse in the Spanish context because it requires building a separate extraction template for every supplier layout — and Spanish suppliers, unlike German or French ones where format standardization is more advanced, show extreme variability. An albarán from a Catalan industrial supplier and one from an Andalusian food distributor share almost no visual similarity, even though they carry the same legal function.
What works instead is Custom Column Extraction: a method where you type the field names you want — "Delivery Note Number," "PO Reference," "Supplier NIF," "Product Code," "Quantity Shipped," "Receiver Signature" — and the AI reads each document to locate those values by understanding what each field name means semantically, not by matching a fixed pixel position. The column names you enter become the headers of your output spreadsheet. One column definition works across every supplier because the AI reads the document like a person would — looking for the meaning of each field rather than its coordinates on the page.
Here is the workflow end to end, using a batch of albaranes from multiple Spanish suppliers:
Upload your albaranes — PDFs, scans, or photos
Drop in a batch of albarán PDFs from the supplier portal, scanned carbon copies from the receiving dock, or photos taken at delivery. Digital PDFs and scanned paper can be mixed in the same upload. If your suppliers or drivers send albaranes directly, the Collection Link feature generates a shareable upload page — no login required for the sender — so albaranes land in your processing queue automatically without email attachments or WhatsApp photos.
Define the columns you need across every albarán
Enter the field names your receiving workflow and ERP require: Albarán Number | PO Reference | Supplier Name | Supplier NIF | Delivery Date | Product Code | Item Description | Qty Shipped | Qty Received | Exception Notes | Receiver Signature. For reconciliation automation, add a computed column like Qty Discrepancy (Qty Shipped - Qty Received) and the AI calculates the difference during extraction — flagging mismatches before the data reaches your WMS or AP system. You can also define inferred columns like Delivery Condition (options: Complete/Partial/Damaged) and the AI evaluates the albarán content to assign the correct status.
Download the structured spreadsheet
Export to XLSX, CSV, or JSON. Each albarán becomes one row in the output table. The delivery note number, supplier NIF, product codes, shipped quantities, received quantities, and receiver signature status appear in adjacent columns — ready for WMS goods receipt posting, PO reconciliation in Sage or SAP Business One, or three-way matching against supplier facturas. Processing runs at 5–10 seconds per page. For Google Sheets users, the sidebar add-on extracts results directly into the active sheet without leaving the spreadsheet.
Try it below with a sample albarán. The demo uses a preset — a pre-configured set of column names matching the standard delivery note / packing slip structure — so you can see extraction results immediately without typing column names. Click "Process" and the AI reads both the printed shipment data and any handwritten receiving annotations on the page.
Files are processed securely and not stored.
An albarán is the only logistics document that leaves the warehouse printed and returns with handwritten data — your extraction has to read both.
No other document in the shipping envelope makes a round trip. The purchase order stays with the buyer. The invoice is generated after delivery. The bill of lading moves with the carrier but does not return to the supplier covered in annotations. The albarán is unique: the supplier prints it, the goods travel with it, the receiver writes on it — circling a damaged box, writing "solo 8 uds recibidas" (only 8 units received) next to a line that says 10, signing at the bottom — and the annotated copy goes back to the supplier as proof of delivery condition.
This round-trip nature creates a two-layer document problem that most extraction tools handle poorly. Template-based OCR tools read the printed layer: the supplier's fields, the quantities as-shipped, the reference numbers. The handwritten layer — corrections, condition notes, a signature — is either ignored as noise or mixed into the printed text stream, producing output where "10 unidades" and "8 uds" land in the same cell.
A semantic reader that understands field meaning, rather than matching pixel blocks, can separate the two layers into distinct columns. Define Qty Shipped and the AI reads the printed quantity from the supplier's line-item table. Define Qty Received in the same column setup and the AI reads the handwritten correction the receiver wrote next to it. Define Exception Notes and handwritten margin comments — "caja dañada" (damaged box), "embalaje roto" (broken packaging), "falta 2" (2 missing) — are extracted as structured text rather than ignored. Define Receiver Signature with a format hint like "Y/N" and the AI detects whether the document contains a signature, regardless of where on the page it appears.
The albarán's handwritten layer is not noise — it is the only field-level record of what actually arrived versus what was supposed to. Ignoring it means your WMS receives incorrect quantities and your three-way match fails at the factura reconciliation stage, where the invoice quantity (based on what was shipped) must be reconciled against what was actually received.
Extracted albarán data feeds directly into Spanish accounting — every field maps to a PGC account code or a WMS transaction.
The output spreadsheet from the extraction is not the destination. It is an intermediate format that bridges the albarán and the systems that need its data. In a Spanish warehouse running Sage 200 or SAP Business One, the extracted fields follow a specific downstream path:
| Extracted Field | Destination System | PGC / Transaction |
|---|---|---|
| Product Code, Qty Received | WMS Goods Receipt | Debe (3xx) Existencias / Haber (400) Proveedores |
| Delivery Note Number, PO Reference | ERP Purchase Order Module | PO status: Partially Received / Fully Received |
| Supplier NIF, Delivery Date | ERP Supplier Ledger | Cuenta 400 (Proveedores) subledger posting |
| Qty Discrepancy, Exception Notes | AP Hold / Non-Conformance | Cuenta 606 (Descuentos sobre compras por pronto pago) or supplier debit note |
| Receiver Signature | Document Archive / Legal | Proof of delivery for supplier disputes |
The critical link is the albarán number. Under Spanish commercial practice, the supplier later issues a factura that references the same albarán number. The three-way match — purchase order, albarán (goods received), factura (invoice) — depends on the albarán number and the delivered quantities being correct in the ERP. If manual entry introduces a typo in the albarán number (transposed digits, missing a character), the subsequent factura reconciliation fails and someone has to trace back through paper records to find the correct number. Extraction that pulls the albarán number directly from the document eliminates the most common reconciliation failure point.
For teams that receive albaranes from field staff or drivers rather than at a fixed receiving dock, the Collection Link feature closes the document intake gap. Generate a shareable link, send it to drivers or remote warehouse staff, and albaranes arrive directly in your processing queue — no email attachments, no WhatsApp photos, no registration required for the sender. This is particularly useful for Spanish logistics operations where deliveries happen at multiple sites or where third-party transport companies handle the physical delivery but the receiving data needs to reach a central AP or inventory team.
Frequently Asked Questions
Does this work with albaranes that use carbon-copy paper, not digital PDFs?
Yes, with one caveat. First and second generation carbon copies at standard scan quality (300 dpi or higher) extract reliably for printed fields and legible handwriting. Third and fourth generation carbon copies — where ink pressure has faded significantly — produce lower accuracy on fine-print fields like reference numbers. For best results, scan the top or second copy. The AI still attempts extraction on faded text but may flag low-confidence values. Thermal paper albaranes (used by some Spanish courier services) work well when fresh; thermal prints older than 6–12 months can darken unevenly and benefit from a spot-check of extracted values.
Can the tool distinguish the NIF from the albarán number when both appear in the same header block?
Yes. The AI reads field labels and understands their semantic context rather than matching text by position. When you define a column named Supplier NIF, it searches for the tax identification number specifically — recognizing the 8-digit-plus-check-letter format that distinguishes an NIF from a delivery note number. It also distinguishes the albarán number from the PO reference and from any carrier tracking or consignment number that may appear nearby. Each identifier lands in the correct column regardless of layout, which is essential for the three-way match — a mismatched NIF in the supplier ledger produces a reconciliation error that can take hours to trace.
Can I process albaranes from 20 different Spanish suppliers in one batch without creating 20 templates?
Yes. That is the core difference between column-name extraction and template-based OCR. You define your columns once — Albarán Number | PO Reference | Supplier NIF | Product Code | Qty Shipped | Qty Received | Exception Notes | Receiver Signature — and the AI finds each value across every supplier's albarán by understanding what the column name means. A Mercadona logistics center albarán, an RS Components España delivery note, and a local Andalusian manufacturer's hand-filled albarán book page all produce the same structured output from the same column definitions. Upload them in a single batch, one unified Excel output, one row per document.
Does the extracted data handle the multi-rate IVA breakdown that Spanish documents require?
The albarán itself does not carry IVA amounts — that is the factura's role. Spanish albaranes are commercial documents, not tax instruments. The data you extract from an albarán is non-financial: reference numbers, product codes, quantities, and condition information. The IVA breakdown (21%, 10%, or 4% rates, each with its own base imponible and cuota) appears on the factura, not the albarán. If you need to extract IVA data from Spanish facturas, see the Spanish invoice extraction guide. The albarán extraction covered here feeds the goods-receipt side of the three-way match; the factura extraction feeds the invoice side.
What happens if the receiver wrote on the albarán in Spanish — can the AI read handwritten Spanish notes?
Yes. The AI reads handwritten text regardless of language. Spanish annotations like "recibido conforme" (received in order), "falta 1 caja" (1 box missing), or "producto dañado" (damaged product) are extracted as structured text in the language they were written. The column name you define (e.g., Exception Notes) determines what the AI looks for; the language of the handwritten content does not affect the extraction logic. Standard block handwriting extracts reliably. Heavily rushed cursive — common in driver notes scribbled at the dock — may require manual verification for full-text transcription, though structured detection like "signature present: yes/no" remains reliable even on rushed handwriting.
Spanish albaranes carry the weight of a 140-year-old commercial code, the variety of every supplier's ERP output, and the physical evidence of what actually arrived on the dock. The data extraction problem is not speed — it is that the receiving clerk never builds muscle memory, because every albarán layout is a first-time read. Column-name extraction changes the equation: you define what you need once, and the format war becomes irrelevant.
Related: Batch extract packing slips and delivery notes to Excel · Manual vs AI packing slip extraction in warehouse receiving · Extract packing slip fields from any format into Excel · Delivery note to Excel converter