12 Months of Pay Slips,
One Payroll Spreadsheet
The Japanese payslip encodes more structural detail per page than most payroll documents in the world — and that's exactly what makes batch processing it so punishingly manual.
Key Takeaways
- 50 employees × 12 months × 10+ fields per payslip equals 6,000+ independent data points — and payroll software generates every one correctly but never consolidates them across time.
- The ¥40,000–200,000/month you pay for cloud payroll covers the correctness of each payslip as it leaves the system — it covers zero seconds of the manual re-entry that happens next.
- ImageToTable.ai populates a single spreadsheet from 12 months of payslips in one batch — each row a month, each column a payroll field — without a single keystroke of transcription.
The Hidden Complexity of the Japanese Payslip
In most markets, a pay slip answers one question: how much landed in the bank account. A Japanese kyuyo meisai (給与明細) answers closer to ten. Base salary (kihon kyuyo, 基本給). Commuting allowance (tsukin teate, 通勤手当). Housing allowance (jutaku teate, 住宅手当). Overtime pay (zangyodai, 残業代). Health insurance (kenko hoken, 健康保険). Welfare pension (kosei nenkin, 厚生年金). Income tax (shotokuzei, 所得税). Net pay (sashihiki shikyugaku, 差引支給額). Each is a separate line item, and each changes independently from month to month.
Japan's itemised salary structure is not a formatting choice — it reflects a legal framework where allowances carry distinct tax and social insurance treatment under the Labour Standards Act (労働基準法, Articles 32 through 89). A commuting allowance is tax-exempt up to ¥150,000 per month. A housing allowance is fully taxable and included in social insurance remuneration. Overtime carries statutory premium rates starting at 25% and rising to 60% for late-night holiday work. Misclassifying one allowance cascades into incorrect withholding across health insurance, welfare pension, and income tax.
Now multiply that by 12 months and 50 employees. The result is not a single spreadsheet — it's a data reconciliation exercise where 6,000+ independent data points each carry compliance weight. Most HR teams in Japan know this pain intimately. What fewer realise is that the bottleneck isn't the payslip format itself — it's the assumption that multi-month consolidation has to happen one cell at a time.
What Japan's Payroll Software Can — and Can't — Do for Multi-Month Data
Japan's domestic payroll platforms have solved the compliance side of payslip generation impressively well. SmartHR, with approximately 60,000 companies on its platform, automates social insurance calculation and nenmatsu chosei (年末調整, year-end tax adjustment) workflows. freee HR handles income tax withholding at progressive rates and My Number (マイナンバー) management with a partial English interface that foreign subsidiaries find accessible. MoneyForward Cloud Payroll and Yayoi Kyuyo (弥生給与) each serve distinct segments of the market — the former integrated with accounting, the latter dominant in SMEs.
But these systems were designed to produce individual pay statements, not to consolidate them across time. Export a year of payslip data from any of these platforms and the output is a set of month-by-month files — each with its own column structure, each missing the context of adjacent months. A bonus month (賞与, shoyo) introduces bonus-specific social insurance contribution rates that don't appear in regular monthly exports. An employee whose commuting route changed in April will have a different allowance figure that breaks any copy-paste pattern from previous months. The system has done its job: the payslip is correct. What it hasn't done is give you the consolidated view HR actually needs.
This gap is where the real manual work lives — and it's larger than most teams admit. A USD 2.16 billion Japan HR tech market growing at 6.87% CAGR toward USD 3.93 billion by 2034 has poured immense investment into compliance automation. The data consolidation layer between what payroll software outputs and what HR analysis requires remains stubbornly manual.
From a Stack of Kyuyo Meisai to One Consolidated Spreadsheet
The shift from manual transcription to batch extraction is conceptually simple: instead of typing 10+ fields per payslip, you tell the AI what columns you want once, then upload all 12 months of payslips at the same time. The AI reads each document, locates the corresponding values, and populates one unified table — one row per month, one column per payroll field.
The column definitions are where the Japan-specific knowledge lives. A well-designed extraction template for Japanese payslips includes:
- Employee Name — to distinguish records across a multi-employee batch
- Year-Month — the payslip period, critical for year-end reconciliation
- Basic Salary (基本給) — the fixed monthly base that drives standard monthly remuneration calculations
- Commuting Allowance (通勤手当) — tax-exempt up to ¥150,000/month, tracked separately for audit
- Overtime Pay (残業代) — subject to statutory premium rates under Article 37 of the Labour Standards Act
- Health Insurance (健康保険) — region-dependent rate, 4.72%–5.39% of standard monthly remuneration
- Welfare Pension (厚生年金) — 9.15% of standard monthly remuneration, split between employer and employee
- Income Tax (所得税) — withheld at progressive rates (5%–45%)
- Net Pay (差引支給額) — the final take-home amount after all deductions
This is not template-based extraction where you draw boxes around each field. The AI locates each value based on what it means — "overtime pay" is the line labelled 残業代 regardless of where it appears on the page — so it works across payslips from different payroll software, different months, or even scanned paper originals with slight layout drift.
The batch workflow has three distinct advantages that matter specifically in a payroll context: naming consistency (the file name of each payslip determines the row label, no manual renaming needed), structural tolerance (a bonus month payslip with extra deduction lines does not break extraction of the 9 standard fields), and zero-copy output (the consolidated table exports directly to Excel, not through a copy-paste step that introduces its own error risk).
Files are processed securely and not stored.
A detail that trips up first-time batch users: file names matter. If you upload 12 payslips as 2026-01.pdf, 2026-02.pdf, and so on, those names become row identifiers in the output. But if your payroll software exports them as kyuyo_meisai_0001.pdf, kyuyo_meisai_0002.pdf, you lose the month traceability. Rename the files first — it takes 30 seconds and saves 30 minutes of row matching later.
The Year-End Adjustment Use Case: When Multi-Month Accuracy Becomes a Compliance Obligation
Nenmatsu chosei (年末調整) — Japan's employer-run year-end tax adjustment — is where batch consolidation shifts from a productivity tool to a compliance instrument. Every December, employers must reconcile the full year's income tax withholding against the employee's actual annual tax liability, accounting for dependents, insurance premiums, mortgage deductions, and other adjustments. Getting it wrong means either the employee is over-withheld (and waits months for a refund) or under-withheld (and the employer bears the shortfall).
Reconciling 12 months of withholding data for a single employee means pulling 12 separate payslips, locating the income tax line item on each, and entering it into a working spreadsheet. For 50 employees, that's 600 manual field extractions, each with compliance weight. One misread digit — a 3 typed as an 8 in August's withholding — propagates into the year-end total and creates an error that may not surface until the nenmatsu chosei calculation returns an unexpected result in January.
Batch processing eliminates per-field manual entry. Upload all 12 months for all employees in one batch, define your columns once — Employee Name, Year-Month, Income Tax (所得税), and any other fields you need for your reconciliation sheet — and the output is a clean table where the income tax row for August 2026 sits directly below the row for July 2026, ready for summation or cross-checking. The data goes from payslip to spreadsheet without touching a human keyboard, and that single removal of the transcription step eliminates the most common class of payroll reconciliation errors.
For companies that also need to review social insurance contributions — particularly during the April–June standard monthly remuneration (標準報酬月額, hyojun hoshu gaku) review window, when the MHLW determines the next year's insurance bracket — having three consecutive months of payslip data consolidated in one view makes the bracket verification a five-minute scan instead of a three-spreadsheet cross-reference.
Bonus Months and Structural Shifts: Handling Shoyo in a Batch
Twice a year — typically June or July and December — the payslip structure shifts. Shoyo (賞与, bonus) appears as an additional line item that regular monthly payslips don't contain. Social insurance contributions on bonuses use a special contribution rate distinct from regular monthly rates, and income tax on bonus amounts is withheld at a separate bonus withholding rate calculated by reference to the previous month's regular salary. The payslip for a bonus month is effectively a different document than the one for a non-bonus month, and any template-based extraction approach that assumes structural consistency will fail on bonus months.
This is where extraction based on semantic understanding — reading the field label rather than its position — makes the practical difference. When the AI looks for "Gross Salary (支給額合計)" on a regular payslip and finds it in column A, but on a bonus payslip the same label appears in column B because a new "Bonus Amount (賞与額)" column has been inserted to the left, position-based extraction breaks. Semantic extraction doesn't — it follows the label, not the coordinates.
The batch processing approach handles bonus months naturally: include the bonus-specific fields (賞与額, bonus social insurance, bonus income tax) in your column definitions, and the AI will populate those columns only for the months where those fields exist. Regular months will simply show blank cells in bonus-specific columns — a result that's immediately visible in the consolidated spreadsheet and far cleaner than separate handling of bonus and non-bonus periods.
For a deeper walkthrough of extracting individual Japanese payslip fields — including detailed column setup and format handling — see our guide on extracting Japanese kyuyo meisai data into Excel.
Frequently Asked Questions
Can AI batch processing handle Japanese payslips from different payroll software — SmartHR, freee, MoneyForward, Yayoi — in the same batch?
Yes. Because the extraction works on the field label (text on the page) rather than the layout or software origin, payslips from different systems can be processed in the same batch. A Yayoi payslip and a SmartHR payslip for the same employee in different months will both return values for "Basic Salary" and "Health Insurance" regardless of layout differences. The only requirement is that the field labels are recognisable — and Japanese payroll terminology is standardised enough across systems that cross-platform extraction is reliable in practice.
Does batch extraction handle scanned paper payslips, or only digital PDFs?
It handles both. The AI processes the visual content of the image — whether from a native PDF, a screenshot, or a photo of a printed payslip — and extracts based on what it reads on the page. Scanned paper payslips with slight skew or lighting variation work as long as the text is legible. Handwritten annotation on payslips is also recognised, which matters for companies where payroll corrections are marked by hand on printed copies.
What happens if a payslip uses non-standard terminology for payroll fields?
Most Japanese employers follow a consistent terminology set — 基本給 for base salary, 健康保険 for health insurance, 所得税 for income tax — because these terms align with statutory reporting categories. For uncommon variants (e.g., 給与 instead of 支給額), the AI's contextual understanding typically maps them correctly to the intended column. If you encounter persistent mismatches, rephrasing the column name to include both variants — like "Gross Pay (支給額/給与)" — resolves the ambiguity.
How many payslips can I process in one batch?
Batch limits depend on your subscription plan. The free tier supports a limited number of pages per batch; paid plans scale to higher volumes. For a typical HR team processing monthly payslips for 20–200 employees, a single batch covering 12 months is within standard plan limits. File size matters more than file count — high-resolution scanned PDFs consume more processing resources than digitally generated payslip PDFs from payroll software.
Can I add computed columns — like "Employee Social Insurance Total" summing health insurance + pension — in the same batch extraction?
Yes. Computed columns let you define calculations that execute during extraction. For payroll consolidation, useful computed columns include: Total Social Insurance Deduction (健康保険 + 厚生年金 + 雇用保険), Employer Social Insurance Cost (health insurance + pension + child allowance + workers' accident insurance, using fixed rates for the employer portion), and Taxable Income (支給額合計 − 通勤手当, since commuting allowance is tax-exempt up to the cap). These calculations run per-row and populate the output spreadsheet without post-processing in Excel.
Does the AI understand the difference between employer and employee social insurance contributions on a Japanese payslip?
The AI extracts what is printed on the payslip. Japanese payslips issued to employees show only the employee-side deductions — the employer's matching social insurance contributions do not appear on an employee-facing kyuyo meisai. If you need to calculate the employer's total payroll cost, you can either set up a computed column with the employer contribution rates (health insurance employer share: 4.72%–5.39%, welfare pension: 9.15%, child allowance: 0.36%, workers' accident: 0.25%–8.8% by industry) applied to the gross salary extracted from each payslip, or process the employer-side payroll reports separately.
A year's worth of payroll data carries enough compliance weight without adding manual transcription risk on top. The difference between typing 1,200 field entries into a spreadsheet and uploading 12 files into one batch is not just speed — it's the difference between reconciling errors and never creating them in the first place. Test it on your own payslips — upload a few months and see whether "three minutes per payslip" becomes "one batch per year."