← Back to Product Feed

Hacker News Show HN: DeepTable – an API that converts messy Excel files into structured data

Solves the 'harder, more general problem' of understanding the semantic structure of real-world spreadsheets, where LLMs fail on complex workbooks at scale.

7
Traction Score
0
Discussions
Apr 1, 2026
Launch Date
View Origin Link

Product Positioning & Context

AI Executive Synthesis
Solves the 'harder, more general problem' of understanding the semantic structure of real-world spreadsheets, where LLMs fail on complex workbooks at scale.
DeepTable addresses a pervasive and costly data ingestion problem for enterprises: transforming complex, unstructured Excel data into usable, structured formats. The explicit mention of LLMs failing on 'complex workbooks at scale' highlights a significant gap this solution aims to fill. The ability to convert messy spreadsheets into 'SQL-ready relational tables with full cell-level provenance' is a critical value proposition for data engineering, analytics, and compliance. This directly impacts data quality, automation, and operational efficiency, reducing manual data cleaning efforts. Market implications are substantial for any organization relying on Excel for data exchange or storage, particularly in finance, supply chain, or operations. This product targets a fundamental data integration challenge, offering a robust, scalable solution where existing methods fall short. The API model facilitates seamless integration into existing data pipelines.
We tried to build an Excel error checker. To achieve that, we needed to actually understand the semantic structure of a spreadsheet first. So we built that, and it turned out to be the harder, more general problem.The core issue: most real-world spreadsheets aren't relational tables. Merged cells, multi-level headers, multiple tables per sheet, totals mixed in with data. You can't just dump them to CSV and call it done. LLMs handle the easy cases but fall apart on complex workbooks at scale.Our approach uses an agent-guided compilation pipeline that produces SQL-ready relational tables with full cell-level provenance. This demo visualizes what we do: https://storage.googleapis.com/deeptable-public/deeptable_an...We have a handful of early customers but honestly don't know yet whether this is a real market or a niche problem. We're posting this to hear from people who've dealt with arbitrary spreadsheet ingestion. Whether you solved it, gave up, or are still living with the pain.If you want to try it on your own files, email me (see my profile for my email) and I'll give you API access.
API semantic structure relational tables merged cells multi-level headers LLMs agent-guided compilation pipeline SQL-ready

Community Voice & Feedback

No active discussions extracted yet.

Related Early-Stage Discoveries

Discovery Source

Hacker News Hacker News

Aggregated via automated community intelligence tracking.

Tech Stack Dependencies

No direct open-source NPM package mentions detected in the product documentation.

Media Tractions & Mentions

No mainstream media stories specifically mentioning this product name have been intercepted yet.

Deep Research & Science

No direct peer-reviewed scientific literature matched with this product's architecture.