Gemini Executive Synthesis

DeepTable – an API that converts messy Excel files (with merged cells, multi-level headers, multiple tables, totals mixed with data) into SQL-ready relational tables with cell-level provenance.

Technical Positioning
Targets the 'harder, more general problem' of understanding the semantic structure of real-world spreadsheets. According to the authors, LLMs handle the easy cases but fall apart on complex workbooks at scale.
SaaS Insight & Market Implications
DeepTable addresses a common and costly data ingestion problem: turning complex, loosely structured Excel workbooks into structured, queryable data. The authors' claim that LLMs fall apart on 'complex workbooks at scale' points to the gap the product aims to fill. Converting messy spreadsheets into 'SQL-ready relational tables with full cell-level provenance' matters for data engineering, analytics, and compliance, because each output value can be traced back to the cell it came from. If it works as described, it should improve data quality and reduce manual data cleaning. The implications are largest for organizations that exchange or store data in Excel, particularly in finance, supply chain, and operations. Access is offered as an API, which lets it slot into existing data pipelines. The authors themselves note that they do not yet know whether this is a real market or a niche problem.
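To make 'SQL-ready relational tables with full cell-level provenance' concrete, here is a minimal sketch using Python's built-in sqlite3 module. It is not DeepTable's output format, which the source does not describe. The table name `sales`, its columns, the sheet name `Sales`, and the sample rows are all assumptions made for illustration.

```python
import sqlite3

# Hypothetical extracted rows: each value carries the sheet and cell it came from.
ROWS = [
    ("North", 2024, "Q1", 10.0, "Sales", "B3"),
    ("North", 2024, "Q2", 12.0, "Sales", "C3"),
    ("South", 2024, "Q1", 7.0, "Sales", "B4"),
]


def load(rows):
    """Load rows into an in-memory SQLite table with provenance columns."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE sales (
            region       TEXT,
            year         INTEGER,
            quarter      TEXT,
            amount       REAL,
            source_sheet TEXT,  -- provenance: worksheet name
            source_cell  TEXT   -- provenance: A1-style cell address
        )
        """
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?, ?, ?)", rows)
    return conn


def cells_behind(conn, region, year):
    """Return the source cells that contribute to one region/year aggregate."""
    cur = conn.execute(
        "SELECT source_sheet || '!' || source_cell FROM sales "
        "WHERE region = ? AND year = ? ORDER BY source_cell",
        (region, year),
    )
    return [row[0] for row in cur]


conn = load(ROWS)
north_total = conn.execute(
    "SELECT SUM(amount) FROM sales WHERE region = ? AND year = ?",
    ("North", 2024),
).fetchone()[0]
north_cells = cells_behind(conn, "North", 2024)
```

Keeping the cell address on every row means an aggregate such as `north_total` can be audited by listing the cells that fed it, which is the kind of traceability the compliance argument depends on.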
Proprietary Technical Taxonomy
API, semantic structure, relational tables, merged cells, multi-level headers, LLMs, agent-guided compilation pipeline, SQL-ready

Raw Developer Origin & Technical Request

Hacker News, Apr 1, 2026
Show HN: DeepTable – an API that converts messy Excel files into structured data

We tried to build an Excel error checker. To achieve that, we needed to actually understand the semantic structure of a spreadsheet first. So we built that, and it turned out to be the harder, more general problem.

The core issue: most real-world spreadsheets aren't relational tables. Merged cells, multi-level headers, multiple tables per sheet, totals mixed in with data. You can't just dump them to CSV and call it done. LLMs handle the easy cases but fall apart on complex workbooks at scale.

Our approach uses an agent-guided compilation pipeline that produces SQL-ready relational tables with full cell-level provenance. This demo visualizes what we do: storage.googleapis.com/deeptable-public/...

We have a handful of early customers but honestly don't know yet whether this is a real market or a niche problem. We're posting this to hear from people who've dealt with arbitrary spreadsheet ingestion. Whether you solved it, gave up, or are still living with the pain.

If you want to try it on your own files, email me (see my profile for my email) and I'll give you API access.
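The failure modes named in the post (merged cells, multi-level headers, totals mixed in with data) can be illustrated with a small hand-written sketch. This is not DeepTable's pipeline, which the post does not detail. It is a naive, rule-based flattening of one hypothetical sheet, and the grid layout, the function names, and the "Total" label convention are all assumptions.

```python
# Hypothetical sheet as a grid of cell values. The year headers are merged
# across two columns, so only the first cell of each merge holds a value.
GRID = [
    ["Region", "2024", None, "2025", None],   # row 1: top-level header
    [None,     "Q1",   "Q2", "Q1",   "Q2"],   # row 2: second-level header
    ["North",  10,     12,   14,     15],
    ["South",  7,      9,    8,      11],
    ["Total",  17,     21,   22,     26],     # aggregate row mixed in with data
]


def col_letter(idx):
    """Convert a 0-based column index to Excel letters (0 -> A, 26 -> AA)."""
    letters = ""
    idx += 1
    while idx:
        idx, rem = divmod(idx - 1, 26)
        letters = chr(ord("A") + rem) + letters
    return letters


def flatten(grid, total_labels=("total",)):
    """Turn a two-level-header grid into long-format records with cell addresses."""
    # Forward-fill the top header row to undo the effect of merged cells.
    top, last = [], None
    for value in grid[0]:
        last = value if value is not None else last
        top.append(last)
    sub = grid[1]

    records = []
    for r in range(2, len(grid)):
        row = grid[r]
        label = row[0]
        if str(label).strip().lower() in total_labels:
            continue  # drop aggregate rows so they are not counted as data
        for c in range(1, len(row)):
            records.append({
                "region": label,
                "year": top[c],
                "quarter": sub[c],
                "value": row[c],
                "source_cell": f"{col_letter(c)}{r + 1}",  # 1-based row number
            })
    return records


records = flatten(GRID)
```

A fixed rule set like this only works when the layout is known in advance: the header depth, the position of the label column, and the wording of the totals row are all hard-coded. Arbitrary workbooks break each of those assumptions, which is the general problem the post describes.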

Developer Debate & Comments

No active discussions extracted for this entry yet.

Engagement Signals

Upvotes: 7
Comments: 0

Cross-Market Term Frequency

Quantifies the cross-market adoption of foundational terms like API and LLMs by tracking occurrence frequency across active SaaS architectures and enterprise developer debates.

Macro Market Trends

Correlated public search velocity for adjacent technologies.

API, API Security, API Vulnerability Scanning